Synopsis
We plan to finalize 2023 Crash Data in May of 2024With the goal of providing finalized 2023 data sets by the end of May, we have scheduled 2 finalization dates, a primary for 2024-05-15 and a fallback for 2024-05-22.
Overall Numbers (Preliminary)
These results for 2022 and earlier reflect finalized data. The 2023 data are production as of 2024-04-24 end of day. The statistics are the same as or similar to those provided in Final year-end crash statistics. Some of the statistics from that source are omitted because their calculation may have required significantly more work to interpret and reproduce.
...
Highlight Color | Meaning |
---|---|
Fatality Count Differences. The official statistics have one additional fatality in 2018 and 2020. It is on off in each year. | |
Idiosyncratic; no obvious reason for the difference. | |
Differences caused by how individuals are identified as Motorcyclists. Mine is anyone whose Unit Type is Motorcycle. Something more elaborate may be being used for the official calculations. If the identifying motorcyclists is non-trivial, we might want to consider adding a person level motorcyclist flag improve analysis reproducibility. | |
Something strange with Injured Pedestrians | |
There appears to be a difference in how large truck crashes were identified in 2020 and earlier. |
TLDR = Showing Work Because of Disagreeing Numbers
This is a lot of lines, but it is not complex. Create a flags for each of the person characteristics of interest. For example, person.is_bike_a
, was the person a bicyclist who suffered an A injury? Sum those over crashes to get the counts of each sort of person in each crash, and joint those totals to the crash level data. Finally sum those over the years to get year totals for each types of person and/or type of crash.
...