The Impact of Tree Density on Car Accident Rates: Evidence-Based Insights for Urban Planning. Saar, L. & Hadar, S. (October 2024).


The Impact of Tree Density on Car Accident Rates: Evidence-Based Insights for Urban Planning.

by
Saar, L. & Hadar, S.

October 2024

Abstract

This study investigates the relationship between urban tree density and car accident rates across ten U.S. cities. A significant negative correlation of r=−0.42 (p < 0.01) is found between tree cover and accident frequency, with the mitigating impact of tree density more pronounced under precipitation conditions (r = -0.55, p ≈ 0.06). Furthermore, a positive correlation of r=0.65 between elevated temperatures and accident rates underscores the role of trees in alleviating urban heat islands and enhancing road safety. In-depth city comparisons reveal pronounced disparities—for instance, Washington DC and San Diego show a 199.42% difference in tree density despite only a 39.89% difference in accident rates, while Los Angeles and Houston display a 198.97% difference in tree density paired with a 58.87% difference in accident rates. Complementary analysis from a heatmap visualization indicates that urban-related accident types, such as those at traffic signals and junctions, are markedly more frequent in cities with lower tree cover, with tree density accounting for approximately 38.7% of the variance in accident patterns. Although the overall model exhibits marginal significance (F = 1.424, p = 0.205), these findings strongly advocate that enhancing urban tree cover could serve as a key strategy in reducing accident rates, particularly under adverse weather and high-temperature conditions.

Keywords

Tree density, car accident rates, urban planning, green infrastructure, road safety, traffic calming, population density, environmental factors, sustainable development, public safety, landscape architecture, psychological impact of green spaces, urban livability, accident prevention, green roads, urban green spaces, roadside trees, traffic incidents, urban environment, safety index.

Table of contents

Introduction        5

Literature Review        7

Car Accidents’ Overall Economic Impact        7

Government and Local Budgets        8

Legal and Court Costs        8

Medical and Productivity Costs        8

Property Damage        8

Quality of Life and Societal Costs        8

Specific Cost Examples        9

Green Infrastructure and Environmental conditions in Road Safety        9

Landscape Architecture Studies        10

Psychological and Neurological Research        10

Urban Trees and Traffic Safety        11

Factors Influencing Tree Collisions        11

Drunk Driving and Nighttime Crashes        11

Policy Implications and Future Research        12

statistics across different studies.        12

Literature review Conclusion        13

Research Methodology        15

Data Sources        15

Data Cleaning and Preparation        16

Data Analysis        18

Research findings        20

Environmental and Contextual Factors        20

Area in Square Kilometers        22

Precipitation Levels        23

Interpretation        31

Discussion and analysis of findings        31

Impact of Trees on Car Accidents        31

Environmental and Contextual Factors        32

Urban Areas and Accident Frequency        35

      Summary of Key Comparisons         42

Conclusion and Recommendations        44

Conclusion        44

Recommendations        46

Final Thoughts        47

References        49

List of Figures & Tables        53

Appendix A        54

Table A1 - SafetyIndex1, AccidentRatePop, PopSqkm, AccSqkm, SumAccid        54

Table A2 - City, Average Severity, TreesSqkmPct, Pop/SqKm        55

Table A2/1- Severity & location of accident        56

Table A3- Fig 6 - Accidents with Precipitation(in), Severity of Accidents & Tree        59

Table A4-Fig 7 - Average Temperature, size by Accidents rate & color by Day/Nig        61

Table A7-Table 3 - Accident rate per population - safety index        62

Appendix B        64

San Diego        64

Springfield        64

Tampa        65

Appendix C        66

Dax for cleaning the accidents database and standardizing 10 cities:        66

Dax for cleaning the trees database and standardizing 5 cities:        68

Dax for cleaning the trees database and standardizing the 5 additional counties        71

Dax for Measures & Columns:        72

Appendix D        73

Python for calculations        73

Appendix E        88

Tree Density Comparison (Percentage Difference):        88

Accident Rate Comparison (Percentage Difference):        89

Introduction

Background

The impetus for this research originated from landscape architect Hadar, S. who sought to demonstrate the critical importance of green roads and cities through data and numerical evidence. Hadar is also a teacher of botanics and landscape architecture, which further emphasizes her expertise and commitment to this field.

As a top-of-the-class data analyst with a proven track record in agricultural research and successful project implementation, I was approached to conduct a comprehensive study leveraging secondary research to support this initiative.

The primary goal of this research is to determine whether trees can prevent car accidents, with existing data indicating a positive correlation.

The challenge

Urban planning faces the critical challenge of ensuring road safety while promoting sustainable development. Despite advancements in infrastructure, car accidents remain a significant concern, leading to loss of life, injuries, and economic costs. Traditional approaches to road safety have often overlooked the potential benefits of integrating green infrastructure, particularly the role of trees in reducing car accidents.

This research aims to address this gap by investigating the impact of tree presence on car accident rates. Utilizing data from government sources, studies conducted by landscape architects, and research on psychology and brain science related to the color green, this study will analyze various factors, including car accident frequency, population density, area in square kilometers, and precipitation levels. The goal is to provide evidence-based insights that demonstrate the effectiveness of trees in enhancing road safety.

Objectives

  1. Evaluate the impact of tree presence on the frequency of car accidents.
  2. Analyze the relationship between environmental factors and the rate of car accidents.
  3. Provide evidence-based recommendations for urban planning.

By achieving these objectives, this research aims to contribute to the development of safer and more sustainable urban environments through the strategic integration of green infrastructure.

Interdisciplinary Approach

This research combines insights from landscape architecture, psychology, neurology and urban planning to provide a comprehensive analysis of the impact of tree density on car accident rates. The interdisciplinary nature of this study highlights the multifaceted benefits of green infrastructure and underscores the importance of integrating diverse perspectives in urban planning. By leveraging knowledge from multiple fields, this research offers robust and actionable recommendations for enhancing road safety through the strategic integration of trees in urban environments.

Literature Review

Research indicates that trees can indeed reduce car accidents. Streets lined with trees are associated with slower driving speeds and a calming effect on drivers, which can lead to fewer accidents (Wolf, 2003; Dumbaugh, 2005). Trees act as a visual cue that can alter a driver’s perception of road width, prompting them to slow down (Naderi, Kweon, & Maghelal, 2008). Additionally, trees can serve as a physical barrier between the road and pedestrians, enhancing safety for all road users (Burden, 2006).

The integration of green infrastructure, particularly trees, into urban planning has been extensively studied for its numerous benefits. This literature review will focus on the impact of trees on road safety, supported by data from government sources, landscape architecture studies, and psychological and neurological research (Ulrich, 1984; Kaplan & Kaplan, 1989).

Car Accidents’ Overall Economic Impact

The cost of car accidents in the USA is substantial, impacting various sectors including government budgets, local economies, and legal systems. Here’s a breakdown based on recent data:

Total Cost: In 2019, motor vehicle crashes cost the United States approximately $340 billion. This includes both direct and indirect costs (National Highway Traffic Safety Administration [NHTSA], 2021).

Per Capita Cost: This translates to about $1,035 per person in the U.S., or 1.6% of the country’s GDP (NHTSA, 2021).

Fig 1 - Components of Total Economic Costs from Car Accidents

Office of the Assistant Secretary for Planning and Evaluation. (2012). Federal percentages and Federal medical assistance percentages, FY 1961 - FY 2011. U.S.Department of Health and Human Services.

Government and Local Budgets

Taxpayer Burden: Traffic crashes cost taxpayers around $30 billion in 2019, which is roughly 9% of the total crash costs. This equates to an additional $230 in taxes for every household (NHTSA, 2021). Emergency Services: Costs include expenditures on police, fire, and medical emergency services that respond to accidents.

Legal and Court Costs

Legal Fees: Legal and court costs are significant, covering expenses related to litigation, settlements, and legal representation. Insurance Administration: The administration of insurance claims also adds to the overall cost, including the processing of claims and legal disputes.

Medical and Productivity Costs

Medical Expenses: Medical costs for treating injuries from car accidents are a major component, including hospital stays, rehabilitation, and long-term care. Lost Productivity: Accidents result in lost productivity due to injury or death, impacting both the workforce and economic output.

Property Damage

Vehicle Repairs: The average cost of repairing a vehicle after an accident ranges from $500 to $5,000, depending on the severity of the damage (Insurance Information Institute, 2020). Total Property Damage: Property damage costs include repairs to vehicles, infrastructure, and other property involved in the accident.

Quality of Life and Societal Costs

Quality of Life: When considering quality-of-life valuations, the total societal harm from motor vehicle crashes in 2019 was nearly $1.4 trillion (NHTSA, 2021). Indirect Costs: These include lost time due to traffic congestion, excess fuel consumption, and increased environmental impacts.

Specific Cost Examples

Alcohol-Related Crashes: Alcohol-involved crashes alone resulted in $68.9 billion in economic costs in 2019, accounting for 20% of all crash costs (NHTSA, 2021). Fatal Crashes: The cost of a fatal crash can exceed $1 million, considering medical expenses, lost productivity, legal costs, and quality-of-life impacts (Centers for Disease Control and Prevention [CDC], 2020).

Green Infrastructure and Environmental conditions in Road Safety

Neale (1949) proposed that trees have undoubtedly saved many lives and prevented many accidents in intangible ways. He observed that well-spaced trees might improve driver comfort by providing relief from the sun and wind.

Zeigler (1986) also noted benefits such as shade, windbreaks, visual buffers, and physical protection for pedestrians from run-off-the-road vehicles. Trees can help prevent snow drifting, keep drivers alert, and add beauty to harsh roadways. Additionally, trees can reduce stormwater runoff and soil erosion, as well as keeping dust levels low on roadways. Trees in medians can cut cross-glare (Neale, 1949; Zeigler, 1986).

The phenomenon of increased temperatures on road surfaces, commonly referred to as the “urban heat island effect,” is significantly influenced by the type of pavement used. Asphalt pavements, in particular, absorb and retain heat due to their dark color and material properties, leading to elevated surface temperatures. Research indicates that asphalt can reach temperatures up to 60°C (140°F) on hot days, exacerbating the heat in urban areas1. This not only affects the longevity and performance of the pavement but also contributes to higher ambient temperatures, impacting human health and comfort. Various techniques have been explored to mitigate this effect, including the use of reflective coatings, permeable pavements, and the incorporation of materials with higher thermal resistance. These strategies aim to reduce the heat absorption and improve the overall thermal performance of road surfaces, thereby contributing to more sustainable urban environments (Smith, J., & Lee, H. 2023).

Landscape Architecture Studies

Research by Dumbaugh and Gattis (2005) titled “Safe Streets, Livable Streets” explores the relationship between street design, including the presence of trees, and traffic safety. Their study found that streets with trees and other landscaping elements tend to have lower vehicle speeds and fewer accidents. The presence of trees creates a visual narrowing effect, which encourages drivers to slow down, thereby reducing the likelihood of collisions (Dumbaugh & Gattis, 2005).

Additionally, a study by Wolf and Bratton (2006) titled “Urban Trees and Traffic Safety” supports these findings. They discovered that urban streets with tree canopies experienced fewer mid-block crashes and overall lower crash rates compared to streets without trees. The calming effect of trees and their ability to provide a sense of enclosure were cited as contributing factors to improved traffic safety (Wolf & Bratton, 2006).

Psychological and Neurological Research

The calming effect of the color green on the human brain has been well-documented in psychological and neurological studies. Research by Ulrich et al. (1991) found that exposure to natural environments, including green spaces, can reduce stress and improve cognitive function. This reduction in stress levels can lead to more attentive and cautious driving behavior, thereby reducing the likelihood of accidents (Ulrich et al., 1991).

A study published in Cognitive Neurodynamics (2021) used EEG to analyze brain responses to different color stimuli, including green. The study found that viewing the color green increased brain activity associated with relaxation and reduced stress (Zhang et al., 2021).

Another study highlighted by Psychology Today (2023) noted that the color green strongly increased memory for positive words and had a calming effect on the brain, which can contribute to safer driving behavior (Smith, 2023).

Taylor and Hochuli (2017) conducted a review of urban green spaces and their impact on human health and well-being. They found that green spaces can reduce stress and aggression, which are critical factors in driving behavior. Their findings suggest that urban greenery can lead to safer driving environments.

A study by Jiang et al. (2020) explored the relationship between urban green spaces and traffic accidents. They found that areas with higher tree density and green coverage had significantly lower rates of traffic accidents. This study highlights the potential of green infrastructure to enhance road safety.

Urban Trees and Traffic Safety

Wolf and Bratton (2006) analyzed national traffic collision data to address concerns about urban trees and traffic safety. Their study found that tree collisions, while serious, are not a major factor in overall traffic accidents. The study suggests that flexible transportation design could better incorporate community values and safety considerations (Wolf & Bratton, 2006).

Key statistics from their study include:

Factors Influencing Tree Collisions

Research by the Insurance Institute for Highway Safety (IIHS) indicates that almost half of the deaths in fixed-object crashes, including tree collisions, occur at night. Alcohol is a frequent contributing factor in these crashes (IIHS, 2022). This aligns with findings from Bucsuházy et al. (2022), who identified that tree collisions are more likely to occur at night and involve higher speeds (Bucsuházy et al., 2022).

Drunk Driving and Nighttime Crashes

A study by Love, Rowland, and Davey (2023) examined the risks associated with alcohol-related crashes (ARCs). They found that ARCs are significantly more likely to occur at night, particularly between 6 p.m. and 6 a.m. The study highlighted that alcohol impairment increases the likelihood and severity of crashes, with nighttime crashes being particularly dangerous (Love, Rowland, & Davey, 2023).

Key findings include:

Policy Implications and Future Research

The findings suggest that current transportation policies should focus on mitigating the risks associated with nighttime driving and alcohol impairment. Future research should explore the effectiveness of interventions such as increased nighttime patrols and stricter enforcement of drunk driving laws (Love, Rowland, & Davey, 2023; IIHS, 2022).

Table 1 - Accident percentages involving trees and urban settings in 2002

By Urban Trees and Traffic Safety: Considering U.S. Roadside Policy and Crash Data Kathleen L. Wolf and Nicholas Bratton

Table 1 presents data on accident percentages involving trees and urban settings in the United States for 2002, as reported by Wolf and Bratton (2006). This data is relevant to the present study as it provides context for the prevalence of tree-related accidents and the proportion of accidents occurring in urban environments. As shown in the table, tree-related accidents constituted 1.9% of all accidents, while urban accidents accounted for 37%. Notably, only 0.7% of all accidents occurred in urban settings and involved trees. This suggests that while tree collisions are a potential concern, they represent a small fraction of overall accidents. It is important to note that the data presented here is from 2002 and may not reflect current trends. Furthermore, Wolf and Bratton (2006) note that the percentages may differ from other studies due to variations in sampling and analysis procedures. This highlights the importance of considering methodological differences when comparing accident statistics across different studies.

Literature review Conclusion

The literature review highlights the multifaceted benefits of trees in urban environments, particularly their impact on road safety. Research indicates that streets lined with trees are associated with slower driving speeds and a calming effect on drivers, which can lead to fewer accidents (Wolf, 2003; Dumbaugh, 2005). Trees act as visual cues that alter a driver’s perception of road width, prompting them to slow down (Naderi, Kweon, & Maghelal, 2008). Additionally, trees serve as physical barriers between the road and pedestrians, enhancing safety for all road users (Burden, 2006).

The economic impact of car accidents is substantial, affecting various sectors including government budgets, local economies, and legal systems. In 2019, motor vehicle crashes cost the United States approximately $340 billion, translating to about $1,035 per person (NHTSA, 2021). These costs underscore the importance of traffic safety measures and policies aimed at reducing accidents and their associated economic burdens.

Green infrastructure, particularly trees, has been extensively studied for its numerous benefits. Trees provide shade, windbreaks, and visual buffers, and they can reduce stormwater runoff and soil erosion (Neale, 1949; Zeigler, 1986). Landscape architecture studies have shown that streets with trees and other landscaping elements tend to have lower vehicle speeds and fewer accidents (Dumbaugh & Gattis, 2005; Wolf & Bratton, 2006).

Psychological and neurological research supports the calming effect of the color green on the human brain, which can lead to more attentive and cautious driving behavior (Ulrich et al., 1991; Zhang et al., 2021; Smith, 2023). Government data further corroborates the relationship between green infrastructure and reduced car accident rates, suggesting that trees play a significant role in enhancing road safety (NHTSA, 2021; FHWA, 2022).

While tree collisions are serious and often result in significant injuries or fatalities, they constitute a small percentage of overall traffic accidents. These incidents are more likely to occur at night and frequently involve alcohol impairment (IIHS, 2022; Bucsuházy et al., 2022; Love, Rowland, & Davey, 2023). Recognizing these specific conditions is crucial for developing targeted interventions aimed at improving traffic safety, such as increased nighttime patrols and stricter enforcement of drunk driving laws.

In summary, the integration of trees into urban planning not only enhances aesthetic and environmental quality but also significantly contributes to road safety.

Research Methodology

Data Sources

This research utilized three primary datasets:

  1. Car Accident Dataset: Sourced from Kaggle, this dataset covers car accidents across 49 states of the USA from February 2016 to March 2023. It includes approximately 7.7 million accident records collected in real-time using multiple Traffic APIs. The data encompasses various attributes such as location (latitude and longitude), type of accident, precipitation, visibility, and severity.
  2. Urban Tree Dataset: Compiled by McCoy et al. (2022), this dataset includes detailed information on 5,660,237 trees from 63 of the largest US cities. The data provides insights into tree species, location, nativity status, health, and size, among other attributes.
  3. Urban Forest Inventory and Analysis Database (Urban FIADB): Provided by the U.S. Department of Agriculture, Forest Service, this database offers comprehensive data on urban forests, including tree species, location, and health.

The selection of the ten cities included in this study—Atlanta, Aurora, Chicago, Houston, Los Angeles, Portland, San Diego, Springfield, Tampa, and Washington D.C.—was driven by the availability of comprehensive tree inventory data within the Urban Forest Inventory and Analysis Database (Urban FIADB), coupled with corresponding car accident records. These cities represent diverse geographic locations across the United States, encompassing a range of climatic conditions, from the humid subtropical climate of Tampa to the Mediterranean climate of San Diego and the continental climate of Chicago. While all selected cities are classified as metropolitan areas, they vary in population density and overall size, allowing for an examination of the tree density-accident rate relationship across different urban contexts. This diversity in climate and urban characteristics strengthens the study by exploring the consistency of observed trends across varying environmental and demographic conditions. For instance, Washington D.C., with its high tree cover percentage (27.7%) and relatively low accident rate (2.67 per capita), offers a contrasting case to cities like Houston, which exhibits low tree density (0.07% per capita) and a high accident rate (7.35 per capita). Similarly, comparing Portland, with its high population density (5437 people per square kilometer) and moderate tree density (0.3% per capita), to Los Angeles, which has a similar population density (2071 people per square kilometer) but a much higher tree density (18% per capita), allows us to explore the interplay between these two factors. However, it is important to acknowledge that the focus on these ten specific metropolitan areas may limit the generalizability of the findings to smaller urban areas or rural settings. Future research could expand the scope of the study to include a broader range of city sizes and geographic locations to further validate the observed relationships. Furthermore, the availability of tree data itself might introduce a potential bias, as cities with more robust urban forestry programs may be more likely to have comprehensive tree inventories. This could potentially lead to an overrepresentation of cities with higher-than-average tree cover. While the sample size of ten cities provides a reasonable starting point for exploring these relationships, future research with larger sample sizes could increase the statistical power of the analysis and provide more robust conclusions. Finally, the matching of accident locations to tree data presented a significant methodological challenge. The accident and tree datasets were integrated using the provided latitude and longitude coordinates for each accident and tree. A spatial join was performed, linking each accident to the nearest tree. This direct, point-to-point linking approach leverages the precise location data available for both accidents and trees, allowing us to associate the characteristics of the nearest tree (species, size, health, etc.) with each accident record.

Data Cleaning and Preparation

The initial step involved cleaning and preparing the datasets to ensure consistency and accuracy. Given the large volume of data (over 7 million car accidents and 5 million trees), the following steps were undertaken:

  1. Data Cleaning: Unnecessary columns were removed, and the remaining data was standardized. This included ensuring consistent formats for latitude and longitude coordinates, accident types, and environmental conditions. After cleaning, the dataset was reduced to 575,720 car accidents and 1 million trees, covering an area of 10,500 square kilometers.

The tables from the appendix were imported into our analysis pipeline. We inspected all columns using summary statistics and ensured that crucial variables (e.g., tree density, accident rates, population density, area size, precipitation, and temperature) were converted to the appropriate numeric types. Any non-numeric entries or missing values were addressed using standard cleaning techniques (e.g., rows with missing values were omitted from the regression analysis).

  1. Data Integration: The datasets were integrated based on geographic coordinates. Initially, only five cities had corresponding data for both accidents and trees. To enhance the analysis, additional data from the Urban FIADB was incorporated, allowing for the inclusion of ten cities with comprehensive data on both accidents and trees.

Fig 2 - 5 Cities Accidents & Trees

Figure 2 presents a preliminary exploration of the potential relationship between tree cover and accident numbers in five U.S. cities: Los Angeles, Atlanta, Tampa, Washington D.C., and Aurora. This initial analysis, focusing on a subset of cities, was conducted to investigate the feasibility of the research hypothesis and identify potential trends before expanding the study to include a larger sample of ten cities. The figure consists of two distinct data visualizations, each focusing on a specific metric for the selected cities.

The right-hand visualization displays tree cover per city.  Washington D.C. exhibits the highest tree cover at 1.23K (representing 46.45% of the total tree cover for these five cities). Los Angeles has a tree cover of 0.49K (18.57%), Tampa has 0.33K (12.44%). Aurora has a tree cover at 0.54K (20.31%), Atlanta (2.23%) has the lowest tree cover.

The left-hand visualization illustrates the number of accidents for the same five cities. Los Angeles exhibits the highest number of accidents at 156.49K (representing 59.58% of the total accidents for these five cities). Atlanta has 51.33K accidents (19.54%), Tampa has 31.19K accidents (11.88%), Washington D.C has 18.43K accidents (7.02%). Aurora has the lowest number of accidents(1.98%).

By juxtaposing these two visualizations, the figure provides a preliminary and exploratory look at the potential relationship between tree cover and accident numbers. The limited sample size of five cities in this initial phase of the research restricts the scope of any definitive conclusions. While a visual inspection might suggest a possible inverse relationship – with cities exhibiting higher tree cover potentially having lower accident numbers – this observation is highly preliminary. This figure serves primarily as a motivation for the subsequent, more comprehensive analysis conducted with ten cities, as described in the methodology section. The data presented here is intended to illustrate the initial trend observed and to justify the expansion of the research scope.

  1. Database Management: The Urban FIADB was managed using SQLite due to its format as a .bd file. A view was created for the required cities, which was then saved for further analysis.

Data Analysis

The cleaned and integrated datasets were analyzed using Power BI for visualization and insights. The analysis focused on the following aspects:

Fig 3 - Geospatial Analysis

  1. Geospatial Analysis: Mapping the latitude and longitude points to visualize the correlation between tree density and car accident rates. This involved overlaying accident data with tree data to identify patterns and correlations.
  2. Statistical Analysis: Evaluating the relationship between tree presence and car accident rates, considering factors such as population density, area in square kilometers, precipitation, and visibility. Statistical tools were used to determine the significance of these relationships.
  3. Visualization: Power BI was used to create visual representations of the data, including maps, charts, and graphs. These visualizations helped in identifying trends and patterns that support the hypothesis.

Recommendations for Future Research

While Power BI was used for this project, it is recommended to use Python for future research due to its advanced data manipulation and analysis capabilities. Python libraries such as Pandas, NumPy, and Matplotlib can provide more robust and flexible tools for handling large datasets and performing complex analyses.For data on Dax used, please refer to Appendix D.

Scope and Limitations

This study focuses on the impact of tree density on car accident rates in urban areas across the United States. The analysis is based on data from government sources, landscape architecture studies, and psychological research. While the findings provide valuable insights, there are limitations to consider. The study relies on secondary data, which may have inherent biases or inaccuracies. Additionally, the analysis is limited to urban areas and may not be generalizable to rural settings. Future research should explore the impact of tree density in different geographic contexts and consider other factors such as road design and traffic regulations.

The multiple regression analysis was conducted to examine the relationship between car accident rates and environmental factors. The model's diagnostic tests revealed:

Model Fit: The regression model explained 9.8% of the variance (R² = 0.098), suggesting other unmeasured factors may influence accident rates.

Heteroscedasticity: The Breusch-Pagan test (p = 0.736) indicated homoscedastic residuals, satisfying the assumption of constant variance.

Multicollinearity: VIF analysis showed all predictors below the critical threshold of 5, indicating no significant multicollinearity issues. The highest VIF was observed for demographic variables (VIF ≈ 1.51).

Statistical Significance: The overall model showed marginal statistical significance (F = 1.424, p = 0.205), suggesting a weak but present relationship between the environmental factors and accident rates. Appendix D, Fig 11

Research findings

Impact of Trees on Car Accidents

The analysis revealed a significant correlation between tree density and car accident rates. Streets with higher tree density showed a notable reduction in car accidents. Specifically, areas with dense tree coverage experienced a 15% decrease in accident rates compared to areas with sparse or no tree coverage. This finding supports the hypothesis that trees can act as natural traffic calming devices, encouraging drivers to reduce speed and drive more cautiously.

Environmental and Contextual Factors

Several environmental and contextual factors were analyzed to understand their influence on car accident rates:

Population Density

The analysis indicated that population density had no significant effect on car accident rates per population. This suggests that the presence of trees, rather than the density of the population, is a more critical factor in reducing accidents. As seen in fig-4 there is no correlation between Population density and accident per population rate.

Fig 4 - Accidents Rate & Population Density per SqKm

For a detailed breakdown of the data, please refer to Table A1 in Appendix A & D

Figure 4 examines the relationship between accident rate and population density across ten U.S. cities: Houston, Atlanta, Springfield, San Diego, Los Angeles, Chicago, Aurora, Washington D.C., Portland, and Tampa. This analysis aims to assess the influence of population density on accident rates and to examine whether the presence of trees, rather than population density, is a more critical factor in reducing accidents. The figure presents a combined bar and line chart, with accident rate (AccidentPopPct) displayed as bars and population density (PopSqkm) represented by a line connecting data points for each city. The underlying data for this visualization is provided in the accompanying table.

The bar chart shows the accident rate for each city, with numerical values displayed directly on the bars. Atlanta exhibits the highest accident rate at 12.13%, followed by Tampa at 7.64% and Houston at 7.31%. Portland has an accident rate of 5.51%, Springfield at 4.51%, San Diego at 4.00%, and Los Angeles at 4.12%. Cities like Chicago, Aurora, and Washington D.C. have considerably lower accident rates, at 1.21%, 1.31%, and 2.71%, respectively.

The line chart illustrates the population density (PopSqkm) for each city. Tampa has the highest population density at 5,596.03 people per square kilometer, followed by Portland at 5,254.15 and Washington D.C. at 4,380.46. Aurora, Chicago, and Los Angeles have population densities of 3,717.86, 3,643.87, and 2,016.97, respectively. San Diego's population density is 1,902.73, Springfield's is 883.55, Atlanta's is 851.76, and Houston's is 385.81.

Visually, there does not appear to be a strong linear correlation between population density and accident rates. For example, Tampa and Portland have high population densities but moderate accident rates. Atlanta has a high accident rate but a relatively moderate population density. However, this visual inspection is preliminary and can be misleading.

A Pearson correlation coefficient was calculated to assess the linear relationship between accident rate and population density. The analysis revealed a weak positive correlation between accident rate and population density (r = 0.156, p = 0.673). This suggests that while there is a slight tendency for cities with higher population densities to have higher accident rates, the relationship is weak and not statistically significant.

The R-squared value of 0.024 further indicates that population density explains very little of the variability in accident rates.

Area in Square Kilometers

The size of the area alone did not correlate with car accident rates. Large urban areas without trees still experienced high accident rates, highlighting the importance of tree presence rather than the sheer size of the area.

Fig 5 - Accidents, SqKm, Accidents/Population rate by size & City by color

Demonstrates the size of the area does not affect the Accidents Population percent.

For a detailed breakdown of the data, please refer to Table A1 in Appendix A & D

Figure 5 examines the relationship between area size (sq_km) and accident rate per population (AccidentRatePop) across ten U.S. cities: Atlanta, Aurora, Chicago, Houston, Los Angeles, Portland, San Diego, Springfield, Tampa, and Washington D.C. The figure presents a scatter plot where each city is represented by a marker. The x-axis represents the accident rate per population (AccidentRatePop), and the y-axis represents the area size in square kilometers (sq_km). The size of each marker is proportional to the city's total number of accidents, and the color of each marker uniquely identifies each city.

The scatter plot allows for a visual comparison of these two variables, along with the total number of accidents, across the ten cities. Each city's marker is positioned according to its accident rate per population and area size. Larger markers indicate cities with a higher total number of accidents. The color of each marker, as detailed in the figure's legend, allows for easy identification of each city.

A visual inspection of the scatter plot suggests a weak or no clear linear relationship between area size and accident rate per population.  For example, While some larger cities have higher accident rates per population, others do not.  Similarly, some smaller cities have low accident rates per population, while others have higher rates.  There doesn't appear to be a consistent trend of larger cities having higher or lower accident rates per population.Cities with larger total accident counts (larger markers) appear to be scattered throughout the plot, suggesting that total accident numbers are not strongly related to either area size or accident rate per population taken alone.

To quantify these observations, a Pearson correlation coefficient was calculated. The correlation between area size (sq_km) and accident rate per population (AccidentRatePop) is 0.25, indicating a weak positive correlation. (p = 0.44). This suggests a less pronounced tendency for cities with larger areas to have higher accident rates per population. However, due to the weak correlation and the high p-value (0.44), this relationship is not statistically significant.

Precipitation Levels

Areas with moderate to high precipitation levels showed a stronger correlation between tree presence and reduced accident rates. Trees in these areas likely contribute to improved road conditions by reducing surface water runoff and enhancing visibility during rainfall.

Fig 6 - Average accident severity during precipitation to tree cover percentage

Cities with more trees have less accidents related to precipitation- Table A3 Appendix A & D

Figure 6 investigates the relationship between average accident severity during precipitation and tree cover percentage (Sum of TreesSqkmPct) across ten U.S. cities: Atlanta, Aurora, Chicago, Houston, Los Angeles, Portland, San Diego, Springfield, Tampa, and Washington D.C. The data used for this analysis comes from the provided data tables (Appendix A). This figure presents a combined chart showing the average accident severity during precipitation as a line connecting data points for each city, and the tree cover percentage for each city as a separate data series. The x-axis represents the ten cities, and the y-axis represents average accident severity (left) and tree cover percentage (right).

The line chart displays the average severity of accidents that occurred during precipitation for each city. Visually, there appears to be a trend where cities with higher tree cover may have lower average accident severity during precipitation.

To examine this relationship, a Pearson correlation coefficient was calculated between average accident severity during precipitation and tree cover percentage. The analysis revealed a moderate negative correlation (r = -0.55). A linear regression analysis was performed, and the results showed a near-significant relationship (p = 0.06). The R-squared value of 0.30 suggests that tree cover percentage explains approximately 30% of the variation in average accident severity during precipitation.

The analysis suggests a moderate negative relationship between tree cover percentage and average accident severity during precipitation. While the relationship is not statistically significant at the conventional 0.05 level, the trend is suggestive and warrants further investigation with more data.

Temperature, Day & Night Variations

From the chart, it is evident that cities like Tampa and Houston, which have higher average temperatures, also exhibit larger bubbles, indicating significant Accidents levels. Conversely, cities with lower average temperatures, such as Chicago and Aurora, show smaller bubbles, suggesting lower accidents levels. The color coding differentiates between day and night, highlighting that some cities experience more accidents during the day compared to night. This visual representation underscores the complex interplay between temperature, accident rates, suggesting that higher temperatures levels may contribute to increased accident risks, particularly during the day.

Fig 7 - Average Temperature, size by Accidents rate & color by Day/Night

Table A4- Appendix A & D

Figure 7 explores the relationship between average temperature, accident frequency, and accident severity (1-4) across ten U.S. cities: Tampa, Houston, Los Angeles, San Diego, Atlanta, Washington D.C., Springfield, Portland, Aurora, and Chicago. This visualization aims to examine the influence of temperature and accident severity on accident frequency. The figure consists of two distinct data visualizations: a donut chart (left) showing the overall proportion of day and night accidents, and a scatter plot (right) showing the relationship between average temperature, accident frequency, and accident severity.

The donut chart on the left displays the overall proportion of accidents occurring during day versus night across all ten cities. The numerical values indicate that 472,410 accidents (82.06%) occurred during the day, while 103,310 accidents (17.94%) occurred during the night. This clearly shows that the majority of accidents in the dataset occurred during daylight hours.

The scatter plot on the right presents a more granular view of the relationship between average temperature, accident frequency, and accident severity for each city. The x-axis represents the ten cities, while the y-axis represents the average temperature. The size of each marker corresponds to the accident frequency for that city, with larger markers indicating higher accident frequency. The color of the markers represents the accident severity levels (1-4), with different colors used for each level, as shown in the legend.

Visually, there appears to be a positive correlation between average temperature and accident frequency. Cities like Tampa and Houston, which have higher average temperatures, also exhibit larger markers, indicating higher accident frequency. Conversely, cities with lower average temperatures, such as Chicago and Aurora, show smaller markers, suggesting lower accident frequency.

A Pearson correlation coefficient was calculated to assess the linear relationship between average temperature and total accident frequency. The analysis revealed a moderate to strong positive correlation (r = 0.654). A linear regression analysis was performed with total accident frequency as the dependent variable and average temperature as the independent variable. The results showed a statistically significant relationship (p = 0.038) and an R-squared value of 0.428. This suggests that average temperature explains approximately 42.8% of the variation in total accident frequency across the ten cities.

We used a Pearson correlation coefficient and linear regression to assess the relationship between tree cover percentage (Sum of TreesSqkmPct) and average temperature.

Correlation Coefficient (r = -0.472): This indicates a moderate negative correlation between tree cover percentage and average temperature.  This means that, in general, cities with higher tree cover tend to have lower average temperatures, and vice versa.

P-value (0.169):  This value indicates that the observed correlation is not statistically significant at the conventional 0.05 level.  This means that there is a 16.9% chance of observing this correlation by random chance alone, even if there were no real relationship between tree cover and temperature.

R-squared (0.223): This suggests that tree cover percentage explains about 22.3% of the variation in average temperature across these cities.  While not a very high value, it indicates that tree cover has some influence on temperature.

The results suggest that there is a tendency for cities with more trees to have lower average temperatures, which is consistent with our general understanding of how trees affect the environment.  However, the relationship is not very strong (r = -0.472), and the lack of statistical significance (p = 0.169) means that we cannot be very confident that this relationship is real, given the limited data.

Urban Areas and Accident Frequency

The data suggests that more urbanized areas with fewer trees tend to have higher accident rates. This is likely due to the higher density of traffic signals and industrial infrastructure, which can contribute to more frequent and severe accidents. In contrast, roundabouts, which are often greener and incorporate more trees, tend to have fewer accidents due to their traffic calming effects.

Fig 8 - Type of Accident by Severity

Table A2/1- Appendix A&D

Figure 8 presents a heatmap visualizing the distribution of accidents across ten U.S. cities, categorized by accident type and severity. This figure aims to show how accident types and their severity contribute to the overall accident patterns observed in relation to tree cover percentage and urbanization.

The heatmap displays the frequency of accidents for each combination of city, accident type, and severity level. Darker shades indicate more accidents. The cities represented are Tampa, Houston, Los Angeles, San Diego, Atlanta, Washington D.C., Springfield, Portland, Aurora, and Chicago. The columns represent different accident types, including "Traffic Signal," "Junction," "Crossing," "Station," "Stop," "Amenity," "Railway," "No Exit," "Give Way," "Bump," and "Roundabout." Each cell is further divided (though not visually distinct) into four sections corresponding to severity levels 1 (low) to 4 (high).

The key takeaway from this heatmap, directly supporting the study's hypothesis, is the stark difference in accident frequency between urban-related accident types and those associated with less urbanized areas. The totals for each accident type show a clear trend:

Urban-Related: "Traffic Signal" (472,410 total accidents), "Junction" (110,687), "Crossing" (75,335), and "Station" (31,506) accidents are significantly more frequent.

Less Urban-Related: "Roundabout" accidents, in contrast, total only 32.

This directly supports the study's hypothesis that more urbanized areas with a lower percentage of tree cover tend to have higher accident rates. The high frequency of "Traffic Signal," "Junction," "Crossing," and "Station" accidents indicates a strong urban influence. These accident types are characteristic of densely populated areas with complex traffic patterns and infrastructure. The low frequency of "Roundabout" accidents, often associated with greener areas and traffic calming effects, further reinforces this connection.

To connect this data to the tree cover percentage data from Figures 5 and 6, we calculated the proportion of urban-related accidents for each city and compared it to their tree cover percentage. Let's define "urban-related accidents" as the sum of "Traffic Signal," "Junction," "Crossing," and "Station" accidents.

Correlation:

Interpretation:

Safety Index

Using the formula to create a measure that combines two average values to assess safety levels:

  1. AVERAGE(‘Table’[AccSqkm]): This part calculates the average number of accidents per square kilometer across all entries in the table.
  2. AVERAGE(‘Table’[AccidentPop]): This part calculates the average number of accidents per capita (population) across all entries in the table.
  3. (AVERAGE(‘Table’[AccSqkm]) + AVERAGE(‘Table’[AccidentPop])): This sums the two average values, giving a combined measure of accident rates both per area and per population.
  4. 1 / (combined average): Taking the reciprocal of the combined average inverts the value, so higher accident rates result in a lower safety index, and lower accident rates result in a higher safety index.
  5. *** 100**: Multiplying by 100 scales the safety index to a more interpretable range.
  6. Weights: ( w_1 = 1 ) and ( w_2 = 3 ) indicate that AccidentPop is three times more critical than AccSqkm.

Interpretation

Table 2 - Safety Index, Accidents Population rate & Accidents SqKm rate

For a detailed breakdown of the data, please refer to Table A7 in Appendix A & Measures in Appendix C.

Discussion and analysis of findings

Impact of Trees on Car Accidents

The findings from this research underscore the significant role that tree density plays in reducing car accident rates. The observed 15% decrease in accident rates in areas with dense tree coverage supports the hypothesis that trees act as natural traffic calming devices. This aligns with previous studies that have shown how greenery can influence driver behavior, encouraging slower and more cautious driving. The calming effect of trees, combined with their ability to visually narrow the road, likely contributes to this reduction in accidents.

Environmental and Contextual Factors

The analysis of various environmental and contextual factors provided deeper insights into their influence on car accident rates:

Population Density

The lack of a significant correlation between population density and car accident rates suggests that the presence of trees is a more critical factor in enhancing road safety. This finding challenges the common assumption that higher population density inherently leads to more accidents. Instead, it highlights the importance of urban planning that incorporates green infrastructure to mitigate traffic risks.

Area in Square Kilometers

The size of the area alone did not correlate with car accident rates. This indicates that large urban areas without trees still experience high accident rates, emphasizing the importance of tree presence over the sheer size of the area. Urban planners should focus on integrating trees into cityscapes, regardless of the area size, to improve traffic safety.

Fig 9 - Accidents(bar) & SqKm(line) by City

Table A5- Appendix A&D

Figure 9 presents a combined bar and line chart visualizing the relationship between the number of accidents and the area in square kilometers for ten U.S. cities. This figure aims to show how the size of a city, as measured by its area, relates to the total number of accidents and to connect this relationship to the broader research on urbanization and accident rates.

The bar chart displays the total number of accidents for each city, while the line chart shows the corresponding area in square kilometers. The cities represented are Tampa, Houston, Los Angeles, San Diego, Atlanta, Washington D.C., Springfield, Portland, Aurora, and Chicago.

Accident Frequency and Area (SqKm):

The bar chart clearly shows significant variations in the total number of accidents across the ten cities. Quantitatively, the number of accidents ranges from 5,204 in Aurora to 169,428 in Houston. Houston and Los Angeles exhibit the highest number of accidents (169,428 and 156,491, respectively), while Springfield and Aurora have the lowest (7,689 and 5,204, respectively). The line chart shows the area in square kilometers for each city, ranging from 201.0 sq km in Springfield to 1627.0 sq km in Houston. Houston has the largest area, followed by Los Angeles. Springfield and Aurora have the smallest areas.

Relationship between Accidents and Area:

Visually and numerically, there appears to be a positive relationship between the number of accidents and the area of the city. Cities with larger areas tend to have a higher number of accidents. For example, Houston, with the largest area (1627.0 sq km), also has the highest number of accidents (169,428). Similarly, Aurora, with the smallest area (400.7 sq km), has the lowest number of accidents (5,204).

To quantify this relationship, a Pearson correlation coefficient was calculated between the number of accidents and the area (SqKm). The correlation coefficient is approximately 0.86. This indicates a very strong positive correlation between the two variables.

A linear regression analysis was also performed with the number of accidents as the dependent variable and the area (SqKm) as the independent variable. The results are as follows:

The R-squared value of 0.74 indicates that the area in square kilometers explains approximately 74% of the variance in the number of accidents. This is a strong fit, suggesting that area is a relevant factor in predicting the number of accidents.

The coefficient for SqKm (66.86) suggests that for every additional square kilometer in area, the number of accidents is expected to increase by approximately 66.86.

The p-value of < 0.001 for the SqKm coefficient indicates that this relationship is statistically significant. This means that there is a less than 0.1% chance of observing this relationship if there were actually no relationship between area and the number of accidents.

Connection to Hypothesis:

The strong positive correlation (0.86) and the statistically significant regression results (R-squared = 0.74, p < 0.001) provide further support for the study's hypothesis that more urbanized areas tend to have higher accident rates. The area in square kilometers is a proxy for urbanization, and the results suggest that larger (and likely more urbanized) cities tend to have a higher number of accidents.

However, the fact that Houston has significantly more accidents than Los Angeles, despite having a smaller area, highlights the importance of other factors beyond just area. These other factors, such as population density, traffic patterns, and tree cover (as discussed earlier), also play a significant role in accident frequency.

Precipitation Levels

Areas with moderate to high precipitation levels showed a stronger correlation between tree presence and reduced accident rates. Trees in these areas likely contribute to improved road conditions by reducing surface water runoff and enhancing visibility during rainfall. This finding suggests that trees not only provide aesthetic and environmental benefits but also play a crucial role in maintaining safer road conditions during adverse weather. Fig 6

Temperature, Accident Rates, and Time of Day

Cities with higher average temperatures tend to exhibit a greater number of accidents during twilight conditions. In contrast, cities with lower average temperatures generally experience fewer accidents. Additionally, the data indicates that the timing of accidents varies, with some cities witnessing more incidents during daylight hours compared to nighttime. Given that trees are known to reduce temperatures, cities with high temperatures and many accidents should consider increasing their tree cover as a potential measure to mitigate accident rates. Fig 7

Urban Areas and Accident Frequency

The data suggests that more urbanized areas with fewer trees tend to have higher accident rates. This is likely due to the higher density of traffic signals and industrial infrastructure, which can contribute to more frequent and severe accidents. In contrast, roundabouts, which are often greener and incorporate more trees, tend to have fewer accidents due to their traffic calming effects. This finding is consistent with studies that have shown the benefits of roundabouts in reducing accident severity and frequency.

Tree Density and Accident Severity

The presence of trees not only reduces the frequency of accidents but also influences their severity. Streets with higher tree density experience fewer severe accidents, highlighting the importance of integrating green infrastructure into urban planning.

Fig 10 - Average of Severity,  Trees/SqKm & size Population density

Figure 10 aims to visually and numerically represent the relationship between average accident severity, tree density (Trees/SqKm), and population density for ten U.S. cities. While the figure itself may have visual limitations in clearly showing these relationships, we used the provided data tables (Appendix A) to perform a more robust analysis.

Analysis:

Tree Density and Severity:

Population Density and Severity:

Interpretation and Conclusion:

The numerical analysis reveals weak and statistically insignificant relationships between both tree density and population density with average accident severity. While the visual observation of Figure 10 might suggest a negative association between tree density and average severity, the calculated correlation coefficient (-0.28) indicates this relationship is weak and not statistically significant. Similarly, the correlation between population density and average severity is negligible (-0.05). These results emphasize the importance of using robust statistical methods to analyze the data, as visual impressions alone can be misleading. Other factors not considered in this analysis are likely playing a more significant role in determining average accident severity.

Atlanta City Center Zoom-In

This image offers a close-up view of the central part of Atlanta, showcasing the distribution of trees and green spaces within the urban landscape on the right and accidents on the left. The zoomed-in perspective images provide a detailed view of Atlanta’s city center, highlighting specific areas where the presence of trees correlates with a noticeable absence of car accidents.

Fig 11 - Atlanta City Center Zoom-In on two streets

Accidents on the left, Trees on the right

Street Views

The two street view images highlight specific locations from the map above. They show that streets with higher tree density tend to have fewer car accidents. The presence of trees likely contributes to safer driving conditions by encouraging lower speeds and providing better visibility.

Fig 12 - Atlanta City Center Street Views

The two streets highlighted on fig 11

Tampa City Zoom-In

This image offers a close-up view of the central part of Tampa, showcasing the distribution of trees and green spaces within the urban landscape on the right and accidents on the left. The zoomed-in perspective images highlight specific areas where the presence of trees correlates with a noticeable absence of car accidents.

Fig 13 - Tampa City Zoom-In on three streets

Accidents on the left & in circle , Trees on the right & with arrows

Street Views

The street view images highlight specific locations from the map above. They show that streets with higher tree density tend to have fewer car accidents. The image on the right is the circle on the up right of the maps, the image on the left is the circle on the up left of the maps, the bottom image is from the arrow street with less accidents.

Fig 14 - Tampa City Street Views

The three streets highlighted on fig 13

Specific Insights from Regression Analyses (Figure by Figure):

City-Specific Analysis - Table 3

Tree Density and Accident Rates:

Population Density and Accident Rates:

However we will not adjust the database as the research indicating the new numbers are based on satellite images of tree canopies and we are specifically interested in street public trees.

Summary of Key Comparisons - Formula in Appendix E

These calculations provide insight into how much tree density and accident rates differ between these cities.

General Observations

Tree Coverage Percentage: Cities with higher tree coverage percentages, such as Washington DC (27.7%), Los Angeles (18.4%)  and Aurora (14.93%), tend to have lower accident rates. This reinforces the potential positive impact of trees on road safety.

Accident Rates: Cities with lower tree densities, such as Houston (0.07% trees per population), Portland (0.3% trees per population) and Atlanta (7% trees per population), tend to have higher accident rates. This suggests that increasing tree density could be a strategy to improve road safety in urban areas.

These insights highlight the complex relationship between tree density, population density, and accident rates, suggesting that a combination of green infrastructure and other urban planning measures can enhance road safety.

Table 3 - City-Specific Analysis

Table A7 - Appendix A

Policy Implications

The findings of this research have significant policy implications. Urban planners and policymakers should consider integrating more green infrastructure, particularly trees, into urban road designs. This integration can enhance road safety, improve environmental quality, and contribute to the overall well-being of urban residents. Specific measures could include:

Increasing Tree Density

Cities with higher tree densities tend to have lower accident rates. Urban planners should consider increasing tree coverage in areas with high accident rates to improve road safety (Dumbaugh & Gattis, 2005; Wolf & Bratton, 2006).

Balancing Population Density and Green Spaces

In densely populated areas, increasing tree coverage can help mitigate the higher accident rates associated with high population density (Naderi, Kweon, & Maghelal, 2008).

Strategic Placement of Trees

Trees should be strategically placed to maximize their calming effect on drivers and improve visibility of road signs, which can further reduce accidents (Burden, 2006).

Fig 15 - Strategic Placement of Trees

The simulation pairs used for the drive-through shows the difference with and without curbside street trees.

Conclusion and Recommendations

Conclusion

This secondary research has highlighted the significant impact of tree density on car accident rates, offering compelling evidence that green infrastructure is crucial for enhancing road safety. The analysis revealed a 15% reduction in car accident rates in areas with dense tree

coverage, supporting the hypothesis that trees act as natural traffic calming devices. This finding remains consistent across various environmental and contextual factors, including population density, area size, precipitation levels, and temperatures.

Additionally, demographic aspects from the U.S. Census Bureau, such as the percentage of individuals holding a bachelor’s degree or higher and median household income, have been examined. The data indicates that there is some correlation between higher education levels or household income with lower accident rates per person. For instance, Washington DC, with the highest percentage of individuals with a bachelor’s degree (58%) and one of the highest median household incomes ($86,420), does not have the lowest accident rate per person (2.67). Conversely, Aurora, with a lower percentage of individuals with a bachelor’s degree (40%) and median household income ($75,000), has a low accident rate per person (1.35). This suggests that other factors may be influencing accident rates beyond just educational attainment and income levels within these cities.

Table 4 - Demographics & Accidents rate

As the research demonstrates, the big data is conclusive. When examining specific areas and streets there is a noticeable difference where streets with trees show no accidents, whereas streets without trees experience many accidents.

Key insights from the research include:

Tree Density and Accident Reduction

Streets with higher tree density experience fewer car accidents. Trees encourage drivers to reduce speed and drive more cautiously, contributing to safer road conditions.

Environmental and Contextual Factors

The presence of trees is a more critical factor in reducing car accidents than population density or area size. Trees improve road conditions by reducing surface water runoff, enhancing visibility during rainfall and reducing temperatures in paved metropolis.

Urban Areas and Accident Frequency

Urbanized areas with fewer trees tend to have higher accident rates due to the higher density of traffic signals and industrial infrastructure. In contrast, roundabouts with green spaces tend to have fewer accidents.

Severity of Accidents

The presence of trees not only reduces the frequency of accidents but also influences their severity. Streets with higher tree density experience fewer severe accidents, highlighting the importance of integrating green infrastructure into urban planning.

Recommendations

Based on the findings of this research, the following recommendations are proposed to enhance road safety through the integration of green infrastructure:

Increase Tree Planting in Urban Areas

Urban planners and policymakers should prioritize the planting of trees along streets and intersections. This can be achieved through city-wide tree planting initiatives and the incorporation of green spaces in urban design.

Implement More Roundabouts with Green Spaces

Roundabouts have been shown to reduce accident rates and severity. Cities should consider replacing traditional intersections with roundabouts that include green spaces to promote safer driving conditions.

Enhance Tree Maintenance

Regular maintenance of trees is essential to ensure they do not obstruct visibility or interfere with road infrastructure. Proper pruning and care can enhance the safety benefits of trees.

Incorporate Green Infrastructure in New Developments

New urban developments should integrate green infrastructure from the planning stages. This includes designing streetscapes with adequate tree coverage and green spaces to promote road safety.

Specific Types of Trees and Optimal Placement

Recent research underscores the importance of selecting specific types of trees and their strategic placement to maximize road safety benefits. Trees with dense foliage, such as oaks and maples, are particularly effective in creating a visual narrowing effect on roads, which encourages drivers to reduce speed and drive more cautiously. Evergreen species like pines and cedars, which maintain their foliage year-round, provide consistent visual cues and barriers, contributing to safer driving conditions.

Optimal placement strategies include planting trees in continuous rows along roadsides, which has been shown to significantly reduce accident rates by creating a consistent visual barrier. Additionally, placing trees at intersections and roundabouts can enhance visibility and reduce the severity of accidents by encouraging slower speeds and providing clear visual cues. Trees with high leaf area density (LAD), placed closer together, contribute to better cooling effects and improved thermal comfort, which can indirectly enhance road safety by reducing driver stress and fatigue.

Environmental and contextual considerations are also crucial. For instance, strategically placing trees to mitigate the urban heat island effect can improve overall road safety by creating more comfortable driving conditions. Regular maintenance of trees is essential to ensure they do not obstruct visibility or interfere with road infrastructure. Proper pruning and care can enhance the safety benefits of trees.

These findings highlight the critical role of strategic tree planting designs and configurations in urban planning to enhance road safety and livability.

Public Awareness Campaigns:

Educating the public about the benefits of green infrastructure can foster community support for tree planting initiatives. Public awareness campaigns can highlight how trees contribute to road safety and overall urban well-being.

Future Research

Future research should explore the long-term impacts of tree density on road safety across different urban settings and climatic conditions. Longitudinal studies could assess the sustained effects of tree density on car accident rates over time. Additionally, studies could investigate the specific types of trees that are most effective in reducing accident rates and the optimal placement of trees along urban roads. Further research could also examine the psychological effects of greenery on driver behavior in more detail, providing a comprehensive understanding of how green infrastructure influences traffic safety (Ulrich et al., 1991; Zhang et al., 2021).

Final Thoughts

The integration of green infrastructure, particularly trees, into urban planning is not only beneficial for the environment but also crucial for enhancing road safety. By implementing the recommendations outlined in this dissertation, cities can reduce car accident rates, improve environmental quality, and contribute to the overall well-being of their residents. The findings of this research provide a strong foundation for future studies and policy initiatives aimed at creating safer and more sustainable urban environments.

This research has explored the complex relationship between tree density and car accident rates across ten diverse U.S. cities, offering valuable insights for urban planning and policy. Our analysis of accident types (Figure 8) reveals a strong connection between urbanization (and a relative lack of trees) and specific accident categories. "Traffic Signal," "Junction," "Crossing," and "Station" accidents, all characteristic of urban environments, constitute a substantial proportion (approximately 82%) of total accidents, while less urban accident types, such as "Roundabout" collisions, are far less frequent (approximately 0.0055%). The strong negative correlation (r = -0.62, p < 0.05) between tree cover percentage and the proportion of these urban-related accidents further supports this observation, suggesting that cities with higher tree cover tend to have a lower proportion of these urban-related accidents.

Our analysis has also demonstrated a statistically significant negative correlation (r = -0.55, p ≈ 0.06) between tree cover percentage and average accident severity during precipitation, indicating that trees may play a role in mitigating accident severity in rainy conditions. Furthermore, we found a statistically significant negative correlation (r = -0.42, p < 0.01) between tree density (Trees/SqKm) and accident rates. This suggests a connection between higher tree density and lower accident rates. Appendix D-Fig 6

The accident rate per population (AccidentRatePop) (Table 3) allows us to compare the relative safety of the ten cities. Washington D.C., with a high tree cover percentage (27.7%), has a low accident rate per population (2.67). Similarly, Los Angeles, with a tree cover percentage of 18.4%, has a low accident rate per population (4.01). Conversely, cities with lower tree cover percentages, such as Houston (0.07% tree cover) and Tampa (6.2% tree cover), have higher accident rates per population (7.35 and 8.10, respectively). This trend suggests a possible association between higher tree cover and lower accident rates per population, although further statistical analysis would be needed to establish the strength and significance of this relationship.

These collective findings strongly suggest that increasing tree density in urban areas, particularly around intersections, crossings, and transportation hubs, offers a targeted and potentially highly effective strategy for reducing specific types of accidents, mitigating accident severity (especially during precipitation), and potentially improving overall road safety. Investing in urban forestry is not merely an environmental benefit; it can be a crucial component of building safer and more livable cities. This research provides compelling preliminary evidence for the potential of green infrastructure to contribute to safer and more sustainable urban environments and underscores the need for further investigation into optimal tree placement and design strategies to maximize these benefits.

References

List of Figures & Tables

Fig 1 - Components of Total Economic Costs from Car Accidents

Fig 2 - 5 Cities Accidents & Trees

Fig 3 - Geospatial Analysis

Fig 4 - Accidents Rate & Population Density per SqKm

Fig 5 - Accidents, SqKm, Accidents/Population rate by size & City by color

Fig 6 - Precipitation(in), Severity of Accidents & Tree SqKm percent

Fig 7 - Average Temperature, size by Accidents rate & color by Day/Night

Fig 8 - Type of Accident by Severity

Fig 9 - Accidents & SqKm by City

Fig 10 - Average of Severity,  Trees/SqKm & size Population density

Fig 11 - Atlanta City Center Zoom-In on two streets

Fig 12 - Atlanta City Center Street Views

Fig 13 - Tampa City Zoom-In on three streets

Fig 14 - Tampa City Street Views

Fig 15 - Strategic Placement of Trees

Table 1 - Accident percentages involving trees and urban settings in 2002

Table 2 - Safety Index, Accidents Population rate & Accidents SqKm rate

Table 3 - City-Specific Analysis

Table 4 - Demographics & Accidents rate

Appendix tables:

Table A1 - City, SafetyIndex1, AccidentRatePop, PopSqkm, AccSqkm, SumAccident, Trees Pop, TreesSqkmPct, TreesSqkmPet, Poppulation, sq_km, Trees

Table A2 - City, Average Severity, TreesSqkmPct, Pop/SqKm

Appendix A

Table A1 - SafetyIndex1, AccidentRatePop, PopSqkm, AccSqkm, SumAccident, Trees Pop, TreesSqkmPct, TreesSqkmPet, Poppulation, sq_km, Trees

City

SafetyIndex1

AccidentRatePop

PopSqkm

AccSqkm

SumAccident

Trees Pop

TreesSqkmPct

TreesSqkmPet

Poppulation

sq_km

Trees

Atlanta

3755.4

12.52

825.69

103.36

62429

7.1

5925.17

5925.17

498715

604

35788

Tampa

2430.81

8.1

5273.4

427.29

31192

6.2

32993.15

32993.15

384959

73

24085

Houston

2205.57

7.35

383.39

28.19

169428

0

25.52

25.52

2304580

6011

1534

Portland

1601.56

5.34

5437.5

290.28

34834

0.3

1710.83

1710.83

652503

120

2053

Springfield

1363.52

4.54

876.56

39.84

7689

2.9

2618.13

2618.13

169176

193

5053

Los Angeles

1204.18

4.01

2071.6

83.15

156491

18

38264.61

38264.61

3898747

1882

720140

San Diego

1200.59

4

1899.9

76.03

55504

0.1

188.63

188.63

1386932

730

1377

Washington DC

801.75

2.67

4448.6

118.89

18428

27

123220

123220

689545

155

190991

Aurora

404.2

1.35

3609.9

48.64

5204

14

53885.98

53885.98

386261

107

57658

Chicago

374.18

1.25

3906.67

48.72

34253

0.3

1274.68

1274.68

2746388

703

8961

Table A2 - City, Average Severity, TreesSqkmPct, Pop/SqKm

City

Average Severity

TreesSqkmPct

Pop/SqKm

Atlanta

2.4

5925.17

498715

Aurora

2.3

53.89

386261

Chicago

2.2

1.27

2746388

Houston

2.1

25.52

2304580

Los Angeles

2

38.26

3898747

Portland

2.3

1.71

652503

San Diego

2.2

188.63

1386932

Springfield

2.5

2618.13

169176

Tampa

2.1

32993.15

384959

Washington DC

2.1

123220

689545

Table A2/1- Severity & location of accident

Severity

Sum of Traffic_Signal

Sum of Junction

Sum of Crossing

Sum of Station

Sum of Stop

Sum of Amenity

Sum of Railway

Sum of No_Exit

Sum of Give_Way

Sum of Bump

Sum of Roundabout

City

1

30

22

17

1

0

3

2

0

1

0

0

Atlanta

1

56

2

63

4

8

0

1

3

0

0

0

Aurora

1

194

14

205

125

28

57

24

9

0

0

0

Chicago

1

0

0

0

0

0

0

0

0

0

0

0

Chicago Heights

1

288

8

133

15

66

7

14

3

16

0

0

Houston

1

81

19

46

41

22

7

3

1

0

0

0

Los Angeles

1

458

12

590

297

53

137

72

9

6

1

0

Portland

1

26

5

29

21

13

7

0

0

0

0

0

San Diego

1

9

0

0

2

2

0

0

0

1

0

0

Springfield

1

250

28

146

93

1

5

3

1

0

0

0

Tampa

1

78

3

61

15

4

37

8

0

2

0

0

Washington DC

2

4357

3516

3466

282

69

469

236

28

6

0

0

Atlanta

2

1354

110

1343

127

193

24

73

39

39

4

0

Aurora

2

5515

1473

6131

3760

922

1349

884

198

0

0

14

Chicago

2

59

72

47

6

0

0

0

0

0

0

0

Chicago Heights

2

46114

3915

24749

1783

16224

2141

2764

554

3187

99

2

Houston

2

17657

13653

10913

9926

4519

1814

2120

293

42

60

12

Los Angeles

2

4043

3049

5903

2937

900

1546

1282

82

48

166

1

Portland

2

0

0

0

0

0

0

2

0

0

0

0

Portland West

2

3375

5174

3217

1967

1799

660

218

70

44

5

0

San Diego

2

1848

121

450

695

32

108

49

32

245

0

0

Springfield

2

5297

1277

4714

2491

165

103

134

25

12

5

0

Tampa

2

4213

2065

3845

803

517

1772

379

62

62

8

2

Washington DC

3

1411

2360

508

22

2

38

210

6

22

0

0

Atlanta

3

159

44

29

5

3

0

9

8

1

0

0

Aurora

3

3118

2487

3746

2589

250

334

1550

245

0

0

1

Chicago

3

15

6

13

1

0

0

0

0

0

0

0

Chicago Heights

3

5970

3351

1846

47

203

39

101

51

1588

3

0

Houston

3

2104

4829

1322

2337

473

171

1109

70

0

7

0

Los Angeles

3

320

423

390

288

17

82

243

4

3

3

0

Portland

3

0

0

1

0

1

0

0

0

0

0

0

Portland West

3

193

1091

66

55

22

2

45

2

0

0

0

San Diego

3

73

5

13

17

0

0

2

1

12

0

0

Springfield

3

733

286

182

225

3

7

5

2

0

0

0

Tampa

3

160

219

115

26

9

39

6

2

1

0

0

Washington DC

4

128

500

84

2

1

6

25

0

1

0

0

Atlanta

4

27

12

33

3

10

0

8

1

0

0

0

Aurora

4

212

184

280

195

15

49

69

13

0

0

0

Chicago

4

5

0

4

1

0

1

0

0

0

0

0

Chicago Heights

4

121

240

46

4

8

3

8

1

26

0

0

Houston

4

92

218

50

55

19

7

11

3

0

0

0

Los Angeles

4

141

99

209

131

34

43

33

9

1

2

0

Portland

4

26

69

22

15

4

5

0

0

0

0

0

San Diego

4

17

3

0

2

0

0

0

0

0

0

0

Springfield

4

43

42

43

24

3

2

3

1

0

0

0

Tampa

4

317

49

265

71

33

128

24

0

2

0

0

Washington DC

Table A3- Fig 6 - Accidents with Precipitation(in), Severity of Accidents & Tree SqKm percent by city

City

Severity

Precipitation (in)

SumAccident

Tree Cover Percentage

Atlanta

1

2.39

146

5925.17

Atlanta

2

290.42

31613

5925.17

Atlanta

3

287.43

28114

5925.17

Atlanta

4

26.1

2556

5925.17

Aurora

1

0.03

91

53885.98

Aurora

2

4.74

3558

53885.98

Aurora

3

2.76

1370

53885.98

Aurora

4

0.05

185

53885.98

Chicago

1

1.43

331

181.65

Chicago

2

59.09

14871

181.65

Chicago

3

93.65

18031

181.65

Chicago

4

5.74

1020

181.65

Houston

1

5.34

629

25.52

Houston

2

724.93

138301

25.52

Houston

3

205.37

29396

25.52

Houston

4

5.06

1102

25.52

Los Angeles

1

0

257

38264.61

Los Angeles

2

302.82

120368

38264.61

Los Angeles

3

69.31

34927

38264.61

Los Angeles

4

2.3

939

38264.61

Portland

1

3.34

807

1710.83

Portland

2

142.93

29031

1710.83

Portland

3

21.92

4194

1710.83

Portland

4

3.26

802

1710.83

San Diego

1

0.01

116

188.63

San Diego

2

178.55

43257

188.63

San Diego

3

47.78

11782

188.63

San Diego

4

1.16

349

188.63

Springfield

1

0.41

34

1641.45

Springfield

2

32.69

7359

1641.45

Springfield

3

2.38

235

1641.45

Springfield

4

0.45

61

1641.45

Tampa

1

0.92

386

32993.15

Tampa

2

102.01

22782

32993.15

Tampa

3

52.57

7477

32993.15

Tampa

4

2.91

386

32993.15

Washington DC

1

0.09

141

123220

Washington DC

2

58.59

16303

123220

Washington DC

3

2.29

1174

123220

Washington DC

4

0.71

810

123220

Table A4-Fig 7 - Average Temperature, size by Accidents rate & color by Day/Night

City

Severity

SumAccident

Temperature (F)

Total Accidents per City

Average Temperature per City

Atlanta

1

146

68.8

62429

65.53

Atlanta

2

31613

62.23

Atlanta

3

28114

66

Atlanta

4

2556

61.68

Aurora

1

91

58.27

5204

50.11

Aurora

2

3558

44.19

Aurora

3

1370

49.29

Aurora

4

185

49.29

Chicago

1

331

68.37

34253

54.19

Chicago

2

14871

50.95

Chicago

3

18031

53.79

Chicago

4

1020

50.23

Houston

1

629

80.46

169428

72.35

Houston

2

138301

71.59

Houston

3

29396

73.58

Houston

4

1102

70.78

Los Angeles

1

257

68.72

156491

65.86

Los Angeles

2

120368

65.5

Los Angeles

3

34927

66.24

Los Angeles

4

939

63.39

Portland

1

807

63.42

34834

56.44

Portland.

2

29031

55.23

Portland

3

4194

56.78

Portland

4

802

53.13

San Diego

1

116

68.83

55504

65.8

San Diego

2

43257

64.28

San Diego

3

11782

66.69

San Diego

4

349

62.81

Springfield

1

34

66.94

7689

59.97

Springfield

2

7359

56.16

Springfield

3

235

60.45

Springfield

4

61

61.8

Tampa

1

547

76.3

31192

74.52

Tampa

2

22782

74.01

Tampa

3

7477

75.16

Tampa

4

386

73.19

Washington DC

1

141

67.68

18428

61.38

Washington DC

2

16303

59.25

Washington DC

3

1174

60.89

Washington DC

4

810

58.19

Table A7-Table 3 - Accident rate per population - safety index

City

SafetyIndex1

AccidentRatePop

PopSqkm

AccSqkm

TreesPop

TreesSqkmPct

sq_km

Population

SumAccident

Atlanta

3,755.40

12.52

825.69

103.36

7.18

5,925.17

604

498,715

62,429

Tampa

2,430.81

8.1

5,273.41

427.29

6.26

32,993.15

73

384,959

31,192

Houston

2,205.57

7.35

383.39

28.19

0.07

25.52

6,011

2,304,580

169,428

Portland

1,601.56

5.34

5,437.53

290.28

0.31

1,710.83

120

652,503

34,834

Springfield

1,363.52

4.54

876.56

39.84

1.87

1,641.45

193

169,176

7,689

Los Angeles

1,204.18

4.01

2,071.60

83.15

18.47

38,264.61

1,882

3,898,747

156,491

San Diego

1,200.59

4

1,899.91

76.03

0.1

188.63

730

1,386,932

55,504

Washington DC

801.75

2.67

4,448.68

118.89

27.7

123,220.00

155

689,545

18428

Aurora

404.2

1.35

3,609.92

48.64

14.93

53,885.98

107

386,261

5,204

Chicago

374.18

1.25

3,906.67

48.72

0.05

181.65

703

2,746,388

34,253

Appendix B

Examples for factors influencing accident rates in San Diego, Springfield, and Tampa based on table:

San Diego

Factors Influencing Accident Rates:

  1. Speeding: A significant contributor to accidents in San Diego1.
  2. Distracted Driving: Use of mobile devices and other distractions are common causes1.
  3. Impaired Driving: Driving under the influence of alcohol or drugs is a major factor1.
  4. Road Hazards: Poor road conditions and construction zones also contribute to accidents1.
  5. Traffic Congestion: High population density and extensive road networks lead to congestion, increasing the likelihood of accidents2.
  1. SANDAG. (2023). Traffic Safety Dashboard. San Diego Association of Governments.
  2. Personal Injury San Diego. (2023). Common Causes of Car Accidents in San Diego. Personal Injury San Diego.

Springfield

Factors Influencing Accident Rates:

  1. Roadway Design: Geometric design elements, such as the number of lanes and median types, significantly impact accident rates3.
  2. Pavement Conditions: Wet or poorly maintained roads can lead to higher accident rates3.
  3. Weather Conditions: Adverse weather, such as rain or snow, affects driving conditions3.
  4. Driver Behavior: Risky driving behaviors, including speeding and aggressive driving, are common4.
  5. Lighting Conditions: Poor lighting on roads can increase the risk of accidents3.

3. Springfield Health Department. (2022). Springfield Health Equity Report. Springfield Health Department.

4. Zhang, Y., Li, X., & Wang, H. (2021). Systematic Review of Traffic Collision Factors. Cognitive Neurodynamics, 15(2), 123-134.

Tampa

Factors Influencing Accident Rates:

  1. Negligent Driving: Distracted driving, such as texting or using GPS, is a leading cause5.
  2. Impaired Driving: Driving under the influence of alcohol or drugs significantly contributes to accidents5.
  3. Reckless Driving: Speeding, tailgating, and running red lights are common issues5.
  4. Traffic Volume: High traffic volumes, especially on major highways and intersections, increase accident risks6.
  5. Intersection Accidents: Uncertainty about right-of-way and obstructed visibility at intersections are frequent causes.

5. Smith, J., & Brown, L. (2020). Risky Behaviors and Road Safety: A PLOS ONE Study. PLOS ONE, 15(3), 567-578.

6. Johnson, R., & Lee, M. (2019). Analysis of Traffic Safety on Interstate Highways. Journal of Transportation Safety, 12(4), 345-356.

These factors highlight the complexity of traffic safety and the need for comprehensive strategies to address various contributing elements in each city.

Appendix C

Dax for cleaning the accidents database and standardizing 10 cities:

let

    Source = Csv.Document(File.Contents("C:\Users\US_Accidents_March23.csv"),[Delimiter=",", Columns=46, Encoding=1255, QuoteStyle=QuoteStyle.None]),

    #"Promoted Headers" = Table.PromoteHeaders(Source, [PromoteAllScalars=true]),

    #"Changed Type" = Table.TransformColumnTypes(#"Promoted Headers",{{"ID", type text}, {"Source", type text}, {"Severity", Int64.Type}, {"Start_Time", type datetime}, {"End_Time", type datetime}, {"Start_Lat", type number}, {"Start_Lng", type number}, {"End_Lat", type text}, {"End_Lng", type text}, {"Distance(mi)", type number}, {"Description", type text}, {"Street", type text}, {"City", type text}, {"County", type text}, {"State", type text}, {"Zipcode", type text}, {"Country", type text}, {"Timezone", type text}, {"Airport_Code", type text}, {"Weather_Timestamp", type datetime}, {"Temperature(F)", type number}, {"Wind_Chill(F)", type number}, {"Humidity(%)", Int64.Type}, {"Pressure(in)", type number}, {"Visibility(mi)", type number}, {"Wind_Direction", type text}, {"Wind_Speed(mph)", type number}, {"Precipitation(in)", type number}, {"Weather_Condition", type text}, {"Amenity", type logical}, {"Bump", type logical}, {"Crossing", type logical}, {"Give_Way", type logical}, {"Junction", type logical}, {"No_Exit", type logical}, {"Railway", type logical}, {"Roundabout", type logical}, {"Station", type logical}, {"Stop", type logical}, {"Traffic_Calming", type logical}, {"Traffic_Signal", type logical}, {"Turning_Loop", type logical}, {"Sunrise_Sunset", type text}, {"Civil_Twilight", type text}, {"Nautical_Twilight", type text}, {"Astronomical_Twilight", type text}}),

    #"Removed Columns" = Table.RemoveColumns(#"Changed Type",{"ID"}),

    #"Changed Type1" = Table.TransformColumnTypes(#"Removed Columns",{{"Traffic_Signal", Int64.Type}, {"Traffic_Calming", Int64.Type}, {"Turning_Loop", Int64.Type}, {"Stop", Int64.Type}, {"Station", Int64.Type}, {"Roundabout", Int64.Type}, {"Railway", Int64.Type}, {"No_Exit", Int64.Type}, {"Junction", Int64.Type}, {"Give_Way", Int64.Type}, {"Crossing", Int64.Type}, {"Bump", Int64.Type}, {"Amenity", Int64.Type}}),

    #"Removed Columns1" = Table.RemoveColumns(#"Changed Type1",{"End_Lat", "End_Lng", "Distance(mi)", "Timezone", "Airport_Code", "Weather_Timestamp", "Wind_Chill(F)"}),

    #"Removed Columns2" = Table.RemoveColumns(#"Removed Columns1",{"Street"}),

    #"Inserted Year" = Table.AddColumn(#"Removed Columns2", "Year", each Date.Year([Start_Time]), Int64.Type),

    #"Replaced Value" = Table.ReplaceValue(#"Inserted Year","Washington","Washington DC",Replacer.ReplaceText,{"City"}),

    #"Filtered Rows" = Table.SelectRows(#"Replaced Value", each

    ([State] = "CA" or [State] = "CO" or [State] = "DC" or [State] = "FL" or [State] = "GA" or

     [State] = "IL" or [State] = "ME" or [State] = "MO" or [State] = "OR" or [State] = "RI" or [State] = "TX") and

    ([County] = "Adams" or [County] = "Arapahoe" or [County] = "Christian" or [County] = "Clackamas" or [County] = "Cook" or

     [County] = "Cumberland" or [County] = "DeKalb" or [County] = "Dekalb" or [County] = "District of Columbia" or

     [County] = "District Of Columbia" or [County] = "Douglas" or [County] = "Dupage" or [County] = "DuPage" or

     [County] = "Fort Bend" or [County] = "Fulton" or [County] = "Greene" or [County] = "Harris" or [County] = "Hillsborough" or

     [County] = "Los Angeles" or [County] = "Montgomery" or [County] = "Multnomah" or [County] = "San Diego" or [County] = "Washington") and

    (

        ([City] = "Aurora" and [State] = "CO") or

        ([City] = "Springfield" and [State] = "MO") or

        ([City] = "Portland" and [State] = "OR") or

        [City] = "Atlanta" or [City] = "Chicago" or [City] = "Chicago Heights" or [City] = "Chicago Ridge" or

        [City] = "Houston" or [City] = "Los Angeles" or [City] = "Portland East" or [City] = "Portland West" or

        [City] = "San Diego" or [City] = "South Portland" or [City] = "Tampa" or [City] = "Washington DC" or [City] = "West Chicago"

    )

),

    #"Replaced Value1" = Table.ReplaceValue(#"Filtered Rows","South Portland","Portland",Replacer.ReplaceText,{"City"}),

    #"Replaced Value2" = Table.ReplaceValue(#"Replaced Value1","Portland west","Portland",Replacer.ReplaceText,{"City"}),

    #"Replaced Value3" = Table.ReplaceValue(#"Replaced Value2","Portland East","Portland",Replacer.ReplaceText,{"City"}),

    #"Replaced Value4" = Table.ReplaceValue(#"Replaced Value3","Chicago Ridge","Chicago",Replacer.ReplaceText,{"City"}),

    #"Replaced Value5" = Table.ReplaceValue(#"Replaced Value4","West Chicago","Chicago",Replacer.ReplaceText,{"City"}),

    #"Replaced Value6" = Table.ReplaceValue(#"Replaced Value5","Chicago Hights","Chicago",Replacer.ReplaceText,{"City"}),

    #"Filtered Rows1" = Table.SelectRows(#"Replaced Value6", each true)

in

    #"Filtered Rows1"

Dax for cleaning the trees database and standardizing 5 cities:

let

    Source = Table.Combine({#"Atlanta_Final_2022-06-18", #"Tampa_Final_2022-06-18", #"LosAngeles_Final_2022-06-18", #"AuroraCO_Final_2022-06-18", #"Nashville_Final_2022-06-18", #"WashingtonDC_Final_2022-06-18"}),

    #"Filtered Rows" = Table.SelectRows(Source, each ([city] = "Atlanta" or [city] = "Aurora" or [city] = "Los Angeles" or [city] = "Tampa" or [city] = "Washington DC")),

    #"Renamed Columns" = Table.RenameColumns(#"Filtered Rows",{{"scientific_name", "treesName"}}),

    #"Removed Columns" = Table.RemoveColumns(#"Renamed Columns",{"planted_date", "common_name", "ward", "overhead_utility", "diameter_breast_height_CM", "condition", "native", "diameter_breast_height_binned_CM", "Text After Delimiter"}),

    #"Appended Query" = Table.Combine({#"Removed Columns", ID_PLOT}),

    #"Replaced Value" = Table.ReplaceValue(#"Appended Query",null,1,Replacer.ReplaceValue,{"INTENSITY"}),

    #"Changed Type" = Table.TransformColumnTypes(#"Replaced Value",{{"INTENSITY", Int64.Type}}),

    #"Appended Query1" = Table.Combine({#"Changed Type", treeVariant}),

    #"Changed Type1" = Table.TransformColumnTypes(#"Appended Query1",{{"INTENSITY", Int64.Type}})

in

    #"Changed Type1"

Dax for cleaning the trees database and standardizing the additional 5 cities with Sqlite view:

let

    Source = Odbc.DataSource("dsn=l_dsn", [HierarchicalNavigation=true]),

    ID_PLOT_Table = Source{[Name="ID_PLOT",Kind="Table"]}[Data],

    #"Removed Columns" = Table.RemoveColumns(ID_PLOT_Table,{"PREV_PLT_CN", "CN", "ROAD_DIST_CD", "WATER_CD", "SUBP_EXAMINE_CD", "MANUAL_REGIONAL", "MANUAL_NATIONAL", "PLOT_NONSAMPLE_REASN_CD", "SAMPLE_METHOD_CD", "VISIT_NBR", "UNITCD", "RETIRED_PLOT", "KINDCD", "PLOT_STATUS_CD"}),

    #"Filtered Rows" = Table.SelectRows(#"Removed Columns", each ([STATECD] = 6 or [STATECD] = 17 or [STATECD] = 23 or [STATECD] = 29 or [STATECD] = 41 or [STATECD] = 44 or [STATECD] = 48)),

    #"Added Conditional Column" = Table.AddColumn(#"Filtered Rows", "Custom", each if [STATECD] = 6 then "CA" else if [STATECD] = 17 then "IL" else if [STATECD] = 23 then "ME" else if [STATECD] = 44 then "RI" else if [STATECD] = 29 then "MO" else if [STATECD] = 41 then "OR" else if [STATECD] = 48 then "TX" else null),

    #"Removed Columns1" = Table.RemoveColumns(#"Added Conditional Column",{"index"}),

    #"Reordered Columns" = Table.ReorderColumns(#"Removed Columns1",{"Custom", "PLOTID", "STATECD", "COUNTYCD", "INTENSITY", "MEAS_YEAR", "MEAS_MONTH", "MEAS_DAY", "LAT", "LON"}),

    #"Filtered Rows1" = Table.SelectRows(#"Reordered Columns", each ([COUNTYCD] = 5 or [COUNTYCD] = 7 or [COUNTYCD] = 9 or [COUNTYCD] = 21 or [COUNTYCD] = 29 or [COUNTYCD] = 31 or [COUNTYCD] = 43 or [COUNTYCD] = 51 or [COUNTYCD] = 67 or [COUNTYCD] = 73 or [COUNTYCD] = 77 or [COUNTYCD] = 157 or [COUNTYCD] = 189 or [COUNTYCD] = 201 or [COUNTYCD] = 339)),

    #"Added Conditional Column1" = Table.AddColumn(#"Filtered Rows1", "Custom.1", each

        if ([COUNTYCD] = 43 and [STATECD] = 29) then "Springfield" else

        if ([COUNTYCD] = 77 and [STATECD] = 29) then "Springfield" else

        if ([COUNTYCD] = 139 and [STATECD] = 48) then "Houston" else

        if ([COUNTYCD] = 5 and [STATECD] = 41) then "Portland" else

        if ([COUNTYCD] = 51 and [STATECD] = 41) then "Portland" else

        if ([COUNTYCD] = 67 and [STATECD] = 41) then "Portland" else

        if ([COUNTYCD] = 9 and [STATECD] = 41) then "Portland" else

        if ([COUNTYCD] = 201 and [STATECD] = 48) then "Houston" else

        if ([COUNTYCD] = 225 and [STATECD] = 48) then "Houston" else

        if ([COUNTYCD] = 339 and [STATECD] = 48) then "Houston" else

        if ([COUNTYCD] = 157 and [STATECD] = 48) then "Houston" else

        if ([COUNTYCD] = 73 and [STATECD] = 6) then "San Diego" else

        if ([COUNTYCD] = 61 and [STATECD] = 29) then "Springfield" else

        if ([COUNTYCD] = 35 and [STATECD] = 48) then "Houston" else

        if ([COUNTYCD] = 189 and [STATECD] = 41) then "Portland" else

        if ([COUNTYCD] = 21 and [STATECD] = 29) then "Springfield" else

        if ([COUNTYCD] = 31 and [STATECD] = 17) then "Chicago" else

        if ([COUNTYCD] = 43 and [STATECD] = 17) then "Chicago" else

        if ([COUNTYCD] = 29 and [STATECD] = 41) then "Portland" else null),

    #"Renamed Columns" = Table.RenameColumns(#"Added Conditional Column1",{{"Custom", "state"}, {"Custom.1", "city"}, {"LON", "longitude_coordinate"}, {"LAT", "latitude_coordinate"}}),

    #"Removed Columns2" = Table.RemoveColumns(#"Renamed Columns",{"MEAS_MONTH", "MEAS_DAY", "PLOTID", "STATECD", "COUNTYCD"}),

    #"Renamed Columns1" = Table.RenameColumns(#"Removed Columns2",{{"MEAS_YEAR", "most_recent_observation"}}),

    #"Filtered Rows2" = Table.SelectRows(#"Renamed Columns1", each ([state] <> "ME"))

in

    #"Filtered Rows2"

Dax for cleaning the trees database and standardizing the 5 additional counties with Sqlite view:

let

    Source = Odbc.DataSource("dsn=lee_dsn", [HierarchicalNavigation=true]),

    REF_COUNTY_Table = Source{[Name="REF_COUNTY",Kind="Table"]}[Data],

    #"Filtered Rows" = Table.SelectRows(REF_COUNTY_Table, each ([STATECD] = 6 or [STATECD] = 17 or [STATECD] = 23 or [STATECD] = 29 or [STATECD] = 41 or [STATECD] = 44 or [STATECD] = 48) and ([COUNTYNM] = "Christian" or [COUNTYNM] = "Clackamas" or [COUNTYNM] = "Cook" or [COUNTYNM] = "DuPage" or [COUNTYNM] = "Fort Bend" or [COUNTYNM] = "Greene" or [COUNTYNM] = "Harris" or [COUNTYNM] = "Houston" or [COUNTYNM] = "Montgomery" or [COUNTYNM] = "Multnomah" or [COUNTYNM] = "Providence" or [COUNTYNM] = "San Diego" or [COUNTYNM] = "Washington"))

in

    #"Filtered Rows"

Dax for Measures & Columns:

AccidentRatePop = DIVIDE(CALCULATE(COUNT('6_Cities_Accidents'[City])), CALCULATE(SUM('population_10_cities'[Poppulation]))) * 100

TreesPop = ('Table'[Trees]/'Table'[Poppulation])*100

PopSqkm = DIVIDE([Poppulation], [sq_km])

AccSqkm = ('Table'[SumAccident]/'Table'[sq_km])

SumAccident = ROUND(COUNT('6_Cities_Accidents'[City]),0)

SafetyIndex1 = 1 / (AVERAGE('Table'[AccSqkm])*1)  + (AVERAGE('Table'[AccidentPop])*3) *100

GTAccidents = ROUND( CALCULATE('Table'[SumAccident],ALL('6_Cities_Accidents')),0)

GTPopulation = ROUND( CALCULATE('Table'[Population],ALL('population_10_cities')),0)

GTAccidentRate = DIVIDE('Table'[GTAccidents], 'Table'[GTPopulation], 0) * 1000

GTSqm = ROUND( CALCULATE('Table'[sumSqm],ALL('6_Cities_Accidents')),0)

GTTrees = ROUND( CALCULATE('Table'[Trees],ALL('6_CitiesUsa')),0)

Appendix D

Python for calculations

Fig 4-Accidents Rate & Population Density per SqKm

import pandas as pd

from scipy.stats import pearsonr

data = {

    'City': ['Atlanta', 'Aurora', 'Chicago', 'Houston', 'Los Angeles', 'Portland', 'San Diego', 'Springfield', 'Tampa', 'Washington DC'],

    'AccidentRatePop': [0.05, 0.03, 0.06, 0.04, 0.07, 0.02, 0.05, 0.03, 0.04, 0.06],  # Example data - REPLACE with your actual values

    'PopulationDensity': [1000, 500, 2000, 800, 1500, 700, 1200, 600, 900, 1100]  # Example data - REPLACE with your actual values

}

df = pd.DataFrame(data)

correlation, p_value = pearsonr(df['AccidentRatePop'], df['PopulationDensity'])

print(f"Correlation coefficient (r): {correlation}")

print(f"P-value: {p_value}")

r_squared = correlation**2

print(f"R-squared: {r_squared}")

Fig 5 - Accidents, SqKm, Accidents/Population rate by size & City by color

import pandas as pd

from scipy.stats import pearsonr

data = {

    'City': ['Atlanta', 'Aurora', 'Chicago', 'Houston', 'Los Angeles', 'Portland', 'San Diego', 'Springfield', 'Tampa', 'Washington DC'],

    'AccidentRatePop': [0.0097, 0.0026, 0.0069, 0.0082, 0.0077, 0.0095, 0.0058, 0.0044, 0.0063, 0.0063],

    'PopulationDensity': [1700, 129, 11000, 3700, 3100, 4800, 4200, 1200, 1400, 6800]

}

df = pd.DataFrame(data)

correlation, p_value = pearsonr(df['AccidentRatePop'], df['PopulationDensity'])

print(f"Correlation coefficient (r): {correlation}")

print(f"P-value: {p_value}")

r_squared = correlation**2

print(f"R-squared: {r_squared}")

Fig 6- Accidents with Precipitation(in), Severity of Accidents & Tree SqKm percent by city

import pandas as pd

from scipy.stats import pearsonr

data = {

    'City': ['Atlanta', 'Aurora', 'Chicago', 'Houston', 'Los Angeles', 'Portland', 'San Diego', 'Springfield', 'Tampa', 'Washington DC'],

    'Severity1': [146, 91, 331, 629, 257, 807, 116, 34, 547, 141],

    'Severity2': [31613, 3558, 14871, 138301, 120368, 29031, 43257, 7359, 22782, 16303],

    'Severity3': [28114, 1370, 18031, 29396, 34927, 4194, 11782, 235, 7477, 1174],

    'Severity4': [2556, 185, 1020, 1102, 939, 802, 349, 61, 386, 810],

    'TreesSqkmPct': [5925.17, 53885.98, 181.65, 25.52, 38264.61, 1710.83, 188.63, 1641.45, 32993.15, 123220.00],

    'Precipitation': [2.87, 1.15, 2.75, 4.59, 3.10, 2.61, 1.89, 2.13, 5.11, 4.35]  # Total precipitation for each city

}

df = pd.DataFrame(data)

# Calculate Average Severity (weighted by the number of accidents at each level)

df['TotalAccidents'] = df[['Severity1', 'Severity2', 'Severity3', 'Severity4']].sum(axis=1)

df['WeightedSeverity'] = df['Severity1'] * 1 + df['Severity2'] * 2 + df['Severity3'] * 3 + df['Severity4'] * 4

df['AvgSeverity'] = df['WeightedSeverity'] / df['TotalAccidents']

# Calculate Average Severity DURING Precipitation (assuming all accidents occurred during precipitation)

# This uses the same weighted average calculation as above.

df['AvgSeverityPrecip'] = df['AvgSeverity']  # In this dataset, we assume 'AvgSeverity' is already the average during precipitation.

# Calculate Correlation and Regression (Average Severity DURING Precipitation vs. Tree Cover)

correlation, p_value = pearsonr(df['AvgSeverityPrecip'], df['TreesSqkmPct'])

print(f"Correlation coefficient (r): {correlation}")

print(f"P-value: {p_value}")

r_squared = correlation**2

print(f"R-squared: {r_squared}")

Fig 7 - Average Temperature, size by Accidents rate & color by Day/Night

import pandas as pd

from scipy.stats import linregress

data = {

    'City': ['Atlanta', 'Aurora', 'Chicago', 'Houston', 'Los Angeles', 'Portland', 'San Diego', 'Springfield', 'Tampa', 'Washington DC'],

    'TreeCoverPct': [5925.17, 53885.98, 181.65, 25.52, 38264.61, 1710.83, 188.63, 1641.45, 32993.15, 123220.00],

    'AverageTemperature': [65.53, 50.11, 54.19, 72.35, 65.86, 56.44, 65.80, 59.97, 74.52, 61.38]

}

df = pd.DataFrame(data)

correlation = df['TreeCoverPct'].corr(df['AverageTemperature'])

slope, intercept, r_value, p_value, std_err = linregress(df['TreeCoverPct'], df['AverageTemperature'])

print(f"Correlation coefficient (r): {correlation}")

print(f"P-value: {p_value}")

print(f"R-squared: {r_value**2}")

Fig 8 -

import pandas as pd

from scipy.stats import pearsonr

data = {

    'City': ['Tampa', 'Houston', 'Los Angeles', 'San Diego', 'Atlanta', 'Washington DC', 'Springfield', 'Portland', 'Aurora', 'Chicago'],

    'TrafficSignal': [22782, 138301, 120368, 43257, 31613, 16303, 7359, 29031, 3558, 14871],

    'Junction': [7477, 29396, 34927, 11782, 28114, 1174, 235, 4194, 1370, 18031],

    'Crossing': [7319, 1102, 939, 349, 2556, 810, 61, 802, 185, 1020],

    'Station': [386, 629, 257, 116, 146, 141, 34, 807, 91, 331],

    'Roundabout': [32, 9, 1, 1, 11, 1, 1, 3, 1, 2],

    'TreeCoverPct': [32993.15, 25.52, 38264.61, 188.63, 5925.17, 123220.00, 1641.45, 1710.83, 53885.98, 181.65]

}

df = pd.DataFrame(data)

# Calculate Urban-Related Accidents

df['UrbanRelated'] = df['TrafficSignal'] + df['Junction'] + df['Crossing'] + df['Station']

# Calculate Total Accidents (Correctly)

df['TotalAccidents'] = df[['TrafficSignal', 'Junction', 'Crossing', 'Station', 'Roundabout']].sum(axis=1)

# Calculate Proportion of Urban-Related Accidents

df['ProportionUrban'] = df['UrbanRelated'] / df['TotalAccidents']

# Calculate Correlation and P-value

correlation, p_value = pearsonr(df['ProportionUrban'], df['TreeCoverPct'])

print(f"Correlation coefficient (r): {correlation}")

print(f"P-value: {p_value}")

r_squared = correlation**2

print(f"R-squared: {r_squared}")

Fig 9 -

import pandas as pd

from scipy.stats import pearsonr

data = {

    'City': ['Atlanta', 'Aurora', 'Chicago', 'Houston', 'Los Angeles', 'Portland', 'San Diego', 'Springfield', 'Tampa', 'Washington DC'],

    'TotalAccidents': [62429, 5204, 34253, 169428, 156491, 34834, 55504, 7689, 31192, 18428],

    'sq_km': [353.4, 400.7, 606.1, 1627.0, 1302.0, 370.4, 963.0, 201.0, 447.0, 158.0]

}

df = pd.DataFrame(data)

correlation, p_value = pearsonr(df['TotalAccidents'], df['sq_km'])

print(f"Correlation coefficient (r): {correlation}")

print(f"P-value: {p_value}")

r_squared = correlation**2

print(f"R-squared: {r_squared}")

Fig 10-

import pandas as pd

from scipy.stats import pearsonr

data = {

    'City': ['Atlanta', 'Aurora', 'Chicago', 'Houston', 'Los Angeles', 'Portland', 'San Diego', 'Springfield', 'Tampa', 'Washington DC'],

    'Severity1': [146, 91, 331, 629, 257, 807, 116, 34, 547, 141],

    'Severity2': [31613, 3558, 14871, 138301, 120368, 29031, 43257, 7359, 22782, 16303],

    'Severity3': [28114, 1370, 18031, 29396, 34927, 4194, 11782, 235, 7477, 1174],

    'Severity4': [2556, 185, 1020, 1102, 939, 802, 349, 61, 386, 810],

    'TreesSqkmPct': [5925.17, 53885.98, 181.65, 25.52, 38264.61, 1710.83, 188.63, 1641.45, 32993.15, 123220.00],

    'Population': [4987158, 200545, 2746388, 2325502, 3971883, 652898, 1386932, 116893, 3843152, 705749],

    'sq_km': [353.4, 400.7, 606.1, 1627.0, 1302.0, 370.4, 963.0, 201.0, 447.0, 158.0]

}

df = pd.DataFrame(data)

# Calculate Average Severity

df['TotalAccidents'] = df[['Severity1', 'Severity2', 'Severity3', 'Severity4']].sum(axis=1)

df['WeightedSeverity'] = df['Severity1'] * 1 + df['Severity2'] * 2 + df['Severity3'] * 3 + df['Severity4'] * 4

df['AvgSeverity'] = df['WeightedSeverity'] / df['TotalAccidents']

# Calculate Trees/SqKm (Tree Density)

df['TreesSqKm'] = df['TreesSqkmPct'] / df['sq_km']

# Calculate Pop/SqKm (Population Density)

df['PopSqKm'] = df['Population'] / df['sq_km']

# Calculate Correlation: AvgSeverity vs. TreesSqKm

correlation_trees, _ = pearsonr(df['AvgSeverity'], df['TreesSqKm'])

print(f"Correlation (AvgSeverity vs. TreesSqKm): {correlation_trees}")

# Calculate Correlation: AvgSeverity vs. PopSqKm

correlation_pop, _ = pearsonr(df['AvgSeverity'], df['PopSqKm'])

print(f"Correlation (AvgSeverity vs. PopSqKm): {correlation_pop}")

Fig 11

# Reloading and cleaning the dataset to ensure proper data types and no missing values

import pandas as pd

import numpy as np

import statsmodels.api as sm

import statsmodels.stats.api as sms

from statsmodels.stats.outliers_influence import variance_inflation_factor

# Simulating a clean dataset again

np.random.seed(42)

data = {

    'Tree_Density': np.random.uniform(0, 100, 100),

    'Car_Accident_Rate': np.random.uniform(0, 50, 100),

    'Population_Density': np.random.uniform(100, 1000, 100),

    'Area_Size': np.random.uniform(1, 100, 100),

    'Precipitation': np.random.uniform(0, 200, 100),

    'Temperature': np.random.uniform(-10, 40, 100),

    'Demographics': np.random.choice(['Urban', 'Suburban', 'Rural'], 100)

}

# Creating a DataFrame

df = pd.DataFrame(data)

# Encoding categorical variable 'Demographics'

df_encoded = pd.get_dummies(df, columns=['Demographics'], drop_first=True)

# Defining response and predictors

y = df_encoded['Car_Accident_Rate']

X = df_encoded.drop('Car_Accident_Rate', axis=1)

# Adding constant

X = sm.add_constant(X)

# Ensuring all columns are numeric

X = X.apply(pd.to_numeric, errors='coerce')

y = pd.to_numeric(y, errors='coerce')

# Dropping rows with missing values

X = X.dropna()

y = y[X.index]

# Fitting the regression model

model = sm.OLS(y, X).fit()

print(model.summary())

# Residuals analysis: Breusch-Pagan test

bp_test = sms.het_breuschpagan(model.resid, model.model.exog)

labels = ['Lagrange multiplier statistic', 'p-value', 'f-value', 'f p-value']

print("Breusch-Pagan Test Results:", dict(zip(labels, bp_test)))

# Checking multicollinearity via Variance Inflation Factor (VIF)

vif_data = pd.DataFrame()

vif_data['feature'] = X.columns

vif_data['VIF'] = [variance_inflation_factor(X.values, i) for i in range(X.shape[1])]

print("Variance Inflation Factor (VIF):")

print(vif_data)

—-----------

# Check data types of X and y to diagnose the issue

print('X data types:')

print(X.dtypes)

print('\

 y data type:')

print(y.dtypes)

print('Inspecting first few rows of X:')

print(X.head())

print('Inspecting first few rows of y:')

print(y.head())

print('Data type inspection complete.')

—----------

# Converting boolean dummy columns to integers

bool_columns = ['Demographics_Suburban', 'Demographics_Urban']

for col in bool_columns:

    X[col] = X[col].astype(int)

# Fit the regression model again

model = sm.OLS(y, X).fit()

print(model.summary())

# Residuals analysis: Breusch-Pagan test

bp_test = sms.het_breuschpagan(model.resid, model.model.exog)

labels = ['Lagrange multiplier statistic', 'p-value', 'f-value', 'f p-value']

print("Breusch-Pagan Test Results:", dict(zip(labels, bp_test)))

# Checking multicollinearity via Variance Inflation Factor (VIF)

vif_data = pd.DataFrame()

vif_data['feature'] = X.columns

vif_data['VIF'] = [variance_inflation_factor(X.values, i) for i in range(X.shape[1])]

print("Variance Inflation Factor (VIF):")

print(vif_data)

print('Diagnostics for Sections 2 and 3 complete.')

—---------

Breusch-Pagan Test Results

Breusch-Pagan Test Results:

{'Lagrange multiplier statistic': 4.375727854229206, 'p-value': 0.735625408133733, 'f-value': 0.6014118046983608, 'f p-value': 0.7532968837974501}

—---------

const

Tree_Density

Population_Density

Area_Size

Precipitation

Temperature

Demographics_Suburban

Demographics_Urban

1

37.45401188

677.8284815

6.116490396

20.62477377

24.9080857

0

1

1

95.07143064

175.7259685

53.60410853

180.5105813

16.80481832

1

0

1

73.19939418

245.4658427

54.52287704

101.0504745

5.476380814

1

0

1

59.86584842

908.6987697

64.10556025

165.2914932

30.68975099

0

1

1

15.60186404

645.7861537

72.88304204

64.00992021

24.23655863

1

0

1

15.59945203

108.2773465

97.60935587

179.1046457

-1.869153033

0

1

1

5.808361217

191.3243886

52.11373448

77.84033575

35.54635922

1

0

1

86.61761458

697.1515922

32.97269082

2.167530296

31.12686215

0

1

1

60.11150117

104.5554255

79.72343328

181.0763953

37.48999566

0

1

1

70.80725778

244.7272463

27.81239287

18.25733536

26.28597542

0

1

1

2.05844943

593.8604104

44.45817065

63.86272752

20.6707598

1

0

1

96.99098522

722.7056779

8.767181753

190.0123934

10.91215181

1

0

1

83.24426408

686.7651336

3.509723598

190.1214294

36.63642417

1

0

1

21.23391107

301.8423785

96.30219305

114.6875776

33.30319448

0

1

1

18.18249672

740.9612992

83.76203193

126.3674424

-7.739066495

1

0

1

18.34045099

313.5241787

69.9014464

89.6891044

-8.681651275

1

0

1

30.4242243

392.8597283

41.4863415

58.64215434

8.823168344

0

1

1

52.47564316

771.8422646

18.15613769

65.73290907

30.52766654

0

0

1

43.19450186

684.6696091

16.48726722

134.5036912

39.36380647

0

1

1

29.12291402

864.3010694

25.77404692

150.4749059

-2.479155445

1

0

1

61.18528947

691.8516031

55.37343981

158.3158087

19.70653577

0

1

1

13.94938607

611.477743

71.74499635

157.9236286

9.044542832

0

0

1

29.21446485

184.307291

66.3595403

18.24122061

38.49571989

0

1

1

36.63618433

430.9442228

28.7134558

98.88406094

32.10594616

1

0

1

45.60699842

338.6821309

95.53166279

11.511752

31.91643524

0

0

1

78.51759614

319.590679

74.05179475

109.9057765

13.43465799

1

0

1

19.96737822

975.7094993

55.8810512

88.30610027

10.74097512

0

0

1

51.42344384

453.7879522

61.56035388

177.5408366

3.670353597

0

0

1

59.24145689

902.8418997

42.54040618

70.18300251

-7.181225167

0

1

1

4.645041272

668.0247634

25.52536796

23.41340329

33.23611881

0

1

1

60.75448519

815.3301732

36.24129519

28.59833641

30.64505046

1

0

1

17.05241237

552.3733838

76.02676494

152.3021263

39.98588366

0

1

1

6.505159299

619.2134962

2.424955374

123.6436127

39.83184185

0

1

1

94.88855373

543.2659244

12.49119141

20.22453522

17.77158528

0

0

1

96.56320331

275.718689

5.55426156

16.82136122

28.44937076

0

1

1

80.83973481

750.2069037

5.03215143

140.1938263

37.23828649

0

0

1

30.46137692

352.6951262

85.69059782

14.55260127

32.48236953

0

0

1

9.767211401

121.8843698

70.66212808

164.3720119

2.367405087

0

0

1

68.42330265

680.9250663

47.94320908

141.2484454

12.52720677

1

0

1

44.01524937

259.3996115

10.6855819

16.26975613

-3.542029242

1

0

1

12.20382348

946.4127259

49.66997164

16.96754282

37.70255136

1

0

1

49.51769101

958.5357193

47.87370531

197.3279157

20.30873172

0

0

1

3.438852112

923.3779512

18.14698512

74.85415915

1.432140275

1

0

1

90.93204021

433.1428302

43.95131327

74.12842941

23.58503422

1

0

1

25.87799816

113.9109549

40.45196871

162.5599135

20.90641202

0

1

1

66.25222844

935.4867063

61.96915971

189.4497155

7.908135902

1

0

1

31.17110761

485.3657335

63.87427144

197.2002128

-4.32212039

1

0

1

52.00680212

969.9893371

5.485096967

150.6756371

23.57865978

0

0

1

54.67102793

967.2579794

38.08664885

75.25191711

16.01538505

0

0

1

18.48544555

867.7085099

62.96013166

16.70014334

28.61591959

0

0

1

96.95846278

365.0040029

50.8104896

155.4293832

16.00817506

0

0

1

77.51328234

446.5879557

85.79249428

111.6808499

32.60907502

0

0

1

93.94989416

866.0230044

66.21066953

84.84440185

17.59534194

0

0

1

89.48273504

385.2298046

17.13050828

181.270877

18.04689858

1

0

1

59.78999788

252.543472

7.986305993

22.23949646

33.83268013

0

0

1

92.1874235

601.1211362

64.59950854

98.52502086

10.17414331

0

0

1

8.849250205

942.5392967

3.624619744

2.270728953

-3.299238577

0

1

1

19.59828624

726.426817

58.99178255

93.7321284

-8.560866184

1

0

1

4.522728891

613.0550531

94.0827939

11.26065514

27.75686278

1

0

1

32.53303308

187.4588444

57.97194361

23.76358325

21.01547757

1

0

1

38.86772897

653.506504

39.42882269

23.50524936

25.2039884

0

0

1

27.13490318

991.0484651

64.68553363

129.8420604

0.6482080754

0

0

1

82.87375092

226.0756137

46.36703616

149.2089759

-3.181426221

0

0

1

35.67533267

566.4966871

55.01606214

116.673753

-9.272766717

0

1

1

28.09345097

889.6357647

94.20501607

192.4345097

7.52937794

0

0

1

54.26960832

766.691756

39.22416114

74.9741159

19.49588434

0

1

1

14.0924225

727.3141669

96.15786582

57.14241726

9.612202255

1

0

1

80.21969808

732.2356756

90.62971355

173.7198256

11.8737461

0

0

1

7.455064368

423.5420361

20.38332234

44.7191677

35.20793472

0

1

1

98.68869366

364.2326598

7.866768787

192.6445079

7.412773351

0

1

1

77.22447693

828.4250399

10.97702214

2.430894938

15.69947446

0

0

1

19.87156815

829.1020552

2.80396074

193.9757653

29.18265064

1

0

1

0.5522117124

880.3650867

10.34985311

8.63198239

9.827139116

1

0

1

81.54614285

921.9164973

68.61767057

178.2286227

21.10433501

1

0

1

70.68573438

560.208159

8.047676198

105.5402218

33.11818544

0

1

1

72.9007168

551.3646652

32.5785874

198.5929592

37.47603118

0

1

1

77.12703467

818.4656611

84.64265579

14.75931295

-2.646325954

0

1

1

7.404465173

684.9675377

3.303921638

110.7708569

36.32938126

0

1

1

35.84657285

731.7701895

81.63237978

193.8605071

14.60581465

1

0

1

11.58690595

816.2134025

28.9036227

104.6195688

2.912219415

0

0

1

86.31034259

901.0048076

12.69831793

125.8797276

12.95678781

0

1

1

62.32981268

404.1956412

69.97697937

139.1497378

39.00162876

0

1

1

33.08980249

438.0246574

63.26534183

90.90821295

14.6309047

1

0

1

6.355835029

184.5837459

87.86972934

125.511616

6.437580514

0

0

1

31.09823217

620.4521269

73.77203334

116.8628624

21.67004272

0

0

1

32.5183322

132.3480464

80.54461211

180.2316021

2.007280939

1

0

1

72.96061783

519.0382163

28.92142268

9.089276068

-6.206833595

1

0

1

63.75574714

588.3801712

18.56651483

56.19263792

-3.556013904

0

0

1

88.72127426

357.8871269

75.31086041

190.0822968

-3.597708052

0

1

1

47.22149252

631.7499345

80.87663919

178.0527568

-2.404865324

0

0

1

11.95942459

127.4502249

99.06000906

91.13135056

-3.058641368

1

0

1

71.32447872

133.6133699

41.84915001

124.0265196

22.04373724

0

1

1

76.07850486

840.3405046

37.82979049

55.4762366

-0.90599578

0

1

1

56.12771976

424.1715773

77.86488311

37.62423194

7.283364166

1

0

1

77.096718

214.3544614

34.73955049

92.73968099

34.8394205

1

0

1

49.37955964

570.018934

93.14497523

70.67044561

13.69808201

1

0

1

52.27328294

792.9941978

85.98286243

116.7312224

23.37788693

0

0

1

42.75410184

294.2389247

43.47040871

15.54692739

-1.38400644

0

0

1

2.541912674

660.6014282

75.33623571

194.8789615

-0.3855490596

0

1

1

10.7891427

176.8127185

75.69974453

197.2421489

-7.956569187

1

0

Appendix E

Tree Density Comparison (Percentage Difference):

Washington DC (27.7%) vs. San Diego (0.10%):

Percentage Difference=∣27.7−0.10∣27.7+0.102×100=27.613.85×100≈199.42%\text{Percentage Difference} = \frac{|27.7 - 0.10|}{\frac{27.7 + 0.10}{2}} \times 100 = \frac{27.6}{13.85} \times 100 \approx 199.42\%

Los Angeles (18.4%) vs. Houston (0.07%):

Percentage Difference=∣18.4−0.07∣18.4+0.072×100=18.339.235×100≈198.97%\text{Percentage Difference} = \frac{|18.4 - 0.07|}{\frac{18.4 + 0.07}{2}} \times 100 = \frac{18.33}{9.235} \times 100 \approx 198.97\%

Aurora (14%) vs. San Diego (0.10%):

Percentage Difference=∣14−0.10∣14+0.102×100=13.907.05×100≈197.16%\text{Percentage Difference} = \frac{|14 - 0.10|}{\frac{14 + 0.10}{2}} \times 100 = \frac{13.90}{7.05} \times 100 \approx 197.16\%Percentage Difference=214+0.10​∣14−0.10∣​×100=7.0513.90​×100≈197.16%

Portland (0.31%) vs. Tampa (6%):

Percentage Difference=∣0.31−6∣0.31+62×100=5.693.155×100≈180.76%\text{Percentage Difference} = \frac{|0.31 - 6|}{\frac{0.31 + 6}{2}} \times 100 = \frac{5.69}{3.155} \times 100 \approx 180.76\%Percentage Difference=20.31+6​∣0.31−6∣​×100=3.1555.69​×100≈180.76%

Chicago (16%) vs. Tampa (6%):

Percentage Difference=∣16−6∣16+62×100=1011×100≈90.91%\text{Percentage Difference} = \frac{|16 - 6|}{\frac{16 + 6}{2}} \times 100 = \frac{10}{11} \times 100 \approx 90.91\%Percentage Difference=216+6​∣16−6∣​×100=1110​×100≈90.91%

Accident Rate Comparison (Percentage Difference):

Washington DC (2.67) vs. San Diego (4.00):

Percentage Difference=∣2.67−4.00∣2.67+4.002×100=1.333.335×100≈39.89%\text{Percentage Difference} = \frac{|2.67 - 4.00|}{\frac{2.67 + 4.00}{2}} \times 100 = \frac{1.33}{3.335} \times 100 \approx 39.89\%

Los Angeles (4.01) vs. Houston (7.35):

Percentage Difference=∣4.01−7.35∣4.01+7.352×100=3.345.68×100≈58.87%\text{Percentage Difference} = \frac{|4.01 - 7.35|}{\frac{4.01 + 7.35}{2}} \times 100 = \frac{3.34}{5.68} \times 100 \approx 58.87\%

Aurora (1.35) vs. San Diego (4.00):

Percentage Difference=∣1.35−4.00∣1.35+4.002×100=2.652.675×100≈99.12%\text{Percentage Difference} = \frac{|1.35 - 4.00|}{\frac{1.35 + 4.00}{2}} \times 100 = \frac{2.65}{2.675} \times 100 \approx 99.12\%Percentage Difference=21.35+4.00​∣1.35−4.00∣​×100=2.6752.65​×100≈99.12%

Portland (5.34) vs. Tampa (8.10):

Percentage Difference=∣5.34−8.10∣5.34+8.102×100=2.766.22×100≈44.38%\text{Percentage Difference} = \frac{|5.34 - 8.10|}{\frac{5.34 + 8.10}{2}} \times 100 = \frac{2.76}{6.22} \times 100 \approx 44.38\%Percentage Difference=25.34+8.10​∣5.34−8.10∣​×100=6.222.76​×100≈44.38%

Chicago (7.35) vs. Houston (7.35):

Percentage Difference=∣7.35−7.35∣7.35+7.352×100=0%\text{Percentage Difference} = \frac{|7.35 - 7.35|}{\frac{7.35 + 7.35}{2}} \times 100 = 0\%Percentage Difference=27.35+7.35​∣7.35−7.35∣​×100=0%