The study applies the recommended methodology from Part C, Chapter 11 of the HSM 1st edition. The methodology includes calculating the network screening predicted frequency (Npredicted) using the crash modification factors (CMFs) and the observed crash data. A calibration factor (Cx) is determined by comparing the observed crashes (Nobserved) to the predicted crashes (Npredicted).
4.2. The Goodness of Fit (GOF) Measures
The goodness-of-fit measures, including mean absolute deviation (MAD), mean absolute percentage error (MAPE), and root mean square error (RMSE), are presented in
Table 9. The smaller values of these measures indicate a better model fit. The results suggest that the total crash model performs better than the FI crash model, indicating the variability in calibrated predicted values. Moreover, the comparison with previous studies shows a good methodology performance regarding these measures.
As anticipated, applying the Empirical Bayes (EB) method yielded estimated values closely aligned with the observed values. The MAD, MAPE, and RMSE metrics indicate that the total crash model outperforms the FI model in the final step after implementing the EB method. Conversely, the MAD and RMSE values suggest that the calibrated predicted values exhibit more substantial variation when including PDO crashes, which is reasonable given that FI crashes represent only a smaller portion (approximately 30%) of the total crash data. The MAD for FI crashes accounted for 43% of the MAD for total crashes, while the RMSE was 39%. Moreover, a study by Waihrich and Andrade reported a MAD value of 5.54 for total crashes in the Minas Gerais state, which is 24% higher than the MAD value found for total crashes in the São Paulo state (MAD = 4.44) [
37].
Figure 2 displays the results for the entire study period, comparing the calibrated N
predicted values with the observed total and FI crashes to the centerline. The proximity of the model’s output to the centerline signifies a closer alignment between the predicted and observed data. The graph underscores the dispersion of total crashes in comparison to FI crashes. Furthermore, points below the centerline indicate underestimation by the model, while points above the centerline indicate an overestimation of the observed data. Approximately 56% of the predicted values fall below the centerline trend, indicating that the model has predominantly underestimated the data.
Figure 3 also compares the calibrated N
predicted values for total and FI crashes to the centerline for each study year.
To further analyze the difference between N
predicted and N
expected (results obtained after applying the EB method),
Figure 4 and
Figure 5 display the estimated data for total crashes, while
Figure 6 and
Figure 7 illustrate the estimated data for FI crashes. These graphs facilitate the estimation of the R
2 values for each severity type (total or FI) and year, as presented in
Table 10. Upon applying the EB method, the performance of N
expected aligns with previous literature studies, as indicated by the proximity of the points to the centerline. This close alignment suggests that N
expected closely resembles N
observed. In contrast, N
predicted exhibits a moderate dispersion. The graphs for each study year exhibit a similar pattern to the one depicted in
Figure 4. Notably, 2019 displays a denser distribution of N
predicted values, indicating a closer prediction of the actual number of crashes. In contrast, the estimated values for 2016 appear more dispersed, suggesting a less accurate prediction for that particular year.
By adjusting the graph scale to accommodate the smaller sample represented by FI crashes (
Figure 6), a clearer understanding of the performance of N
predicted can be achieved. Consistent with previous observations, the predicted values are denser above the centerline, indicating a tendency for underprediction by the model. However, in the case of N
expected for FI crashes, most values fall below the centerline, indicating that most expected values underestimated the observed FI crashes. This suggests that the EB method did not perform as effectively for all types of crashes. In this context, the expected crashes are more scattered, and the results indicate a general trend of underprediction.
Figure 7 demonstrates a similar performance of N
expected compared to
Figure 6, indicating that the EB method consistently underpredicted the observed crash counts. The graphs reveal a pattern where the model underestimates FI crashes in segments where more than five crashes are observed annually. Lastly,
Table 10 provides the R
2 estimates obtained from the developed graphs.
Table 10 presents the estimated R
2 values for N
predicted and N
expected, categorized by severity type and year. The R
2 value represents the goodness of fit between observed and estimated graphs, providing a correlation measure. A higher R
2 value indicates a better fit. As expected, the R
2 values for N
expected, which accounts for the observed crash counts, are significantly higher than those for N
predicted after applying the EB method. Notably, the FI crashes show lower R
2 values compared to all crashes.
Table 11 presents the results of various goodness of fit (GOF) tests for calibrated predicted crashes. These tests include MAD, MAPE, RMSE, and R
2. It is observed that using FI crashes yields lower MAD and RMSE values, indicating better accuracy, while MAPE and R
2 show better performance for all crashes.
On the other hand,
Table 12 compares the GOF parameters for expected crashes. Here, improved results are observed for all crash types. This suggests that the prediction of crashes using the HSM model performed better across all crash types.
4.3. Crash Data Analysis for 2020
Due to the COVID-19 pandemic, 2020 witnessed significant disruptions in global traffic patterns. Ongoing studies are exploring the impact of the pandemic on various aspects of human health, including traffic-related fatalities and injuries [
55]. In
Figure 8, the number of Property Damage Only (PDO) crashes is depicted, while
Figure 9 displays the reported traffic-related fatalities on state highways in the years 2019, 2020, and 2021, as documented by the São Paulo State government [
56].
Notably, there is a substantial difference in PDO crashes between April 2019 and April 2020, with a 40% reduction in PDO crashes. Similarly, the highest reduction in fatal crashes was observed during the same period, with a decline of approximately 27%. Despite the lockdown measures commencing on 22 March 2020 in São Paulo State, the impact of these measures became more pronounced in April 2020. The graphs also present the moving average of crashes over a twelve-month interval, demonstrating the reduction in crashes during that period.
Table 13 and
Figure 10 illustrate the variation in fatal crashes and average AADT concerning the average and counts of the previous year. The data suggest that the decrease in fatal crashes is associated with the reduction in AADT, which can be attributed to the implementation of disease control measures during that period.
To assess the impact of COVID-19 on the analyzed segments,
Table 14 presents the crash data specifically from 2020. A comparison is made between the crash counts in 2020 and the average counts from the previous four years. The data indicate a significant reduction in crashes during 2020, with a decrease of approximately 20% for all types of crashes and 11% for FI crashes compared to the average counts. This reduction in crash numbers suggests a notable influence of the COVID-19 pandemic on traffic safety, potentially due to factors such as reduced traffic volume, changes in driver behavior, and altered travel patterns resulting from pandemic-related restrictions and guidelines.
The 2020 AADT data played a crucial role in predicting the crash counts for that year using the HSM prediction model. This model relies on AADT values and segment length to estimate crash counts. In this study, the calibration factors obtained in
Section 4.1 (C
x,
TOTAL = 2.62 and C
x,
FI = 2.35) were utilized to calculate the calibrated N predicted for 2020, as shown in
Table 15. These calibration factors were derived from the baseline data of the four previous years. The EB method was also applied using the obtained parameters to calculate the Nexpected, which represents the expected crash counts considering the observed crash data. By comparing the calibrated Npredicted and Nexpected, it becomes possible to evaluate the prediction model’s performance and assess its accuracy in estimating the crash counts for 2020.
The COVID-19 pandemic has affected crash counts and changed the Average Annual Daily Traffic (AADT) values. Consequently, the Nspf (predicted crash counts) reflects the impact of the pandemic on crash frequencies. Despite this, the Nobserved (actual observed crash counts) remains approximately 10% lower than the calibrated Npredicted (predicted crash counts considering calibration factors) for all types of crashes. However, the calibrated prediction of FI crashes aligns closely with the observed counts, indicating high accuracy in predicting FI crashes.
Moreover, applying the EB method, which incorporates the observed number of crashes, brings Nexpected (expected crash counts) closer to Nobserved. This suggests that the prediction model performs well in estimating unseen data.
In
Table 16, the evaluation of model performance based on mean absolute deviation (MAD) and root mean square error (RMSE) reveals that FI crashes demonstrate a better model adjustment to the actual inputs compared to all types of crashes. Conversely, the mean absolute percentage error (MAPE) and R
2 (coefficient of determination) indicate that using all types of crashes yields successful model adjustment. The high MAPE value for 2020 can be attributed to the sudden reduction in crashes, which the model was not explicitly trained to anticipate as it was based on previous years’ data.
After applying the EB method and considering the 2020 data, the GOF (Goodness of Fit) tests demonstrate that the model performs better when using all types of crashes, as shown in
Table 17. However, further investigation is required to fully understand the influence of infection prevention and control procedures on road safety in Brazil.