Road traffic accidents represent a global public safety crisis, necessitating advanced computational tools for accurate injury severity prediction and effective decision support. This study evaluates high-performing ensemble machine learning models, including AdaBoost, XGBoost, LightGBM, HistGBRT, CatBoost, Gradient Boosting, NGBoost, and Random Forest, using a comprehensive National Highway Traffic Safety Administration (NHTSA) dataset from 2018 to 2022. While all models demonstrated exceptional predictive accuracy, with HistGBRT achieving the highest overall accuracy of 92.26%, a defining achievement of this work is the perfect classification (100% precision and recall) of fatal injuries across all ensemble architectures. To bridge the gap between predictive performance and actionable intelligence, this research integrates SHapley Additive exPlanations (SHAP) to provide both global insights into dataset-wide risk factors and local, instance-specific rationales for individual crash events. The global analysis identified ethnicity, airbag deployment, and harmful event type as primary drivers of injury severity, while local force and waterfall plots revealed the precise “push and pull” of variables for specific incidents. The results offer a robust, interpretable framework for stakeholders tasked with improving traffic safety and mitigating crash-related harm.
Competing Interest StatementThe authors have declared no competing interest.
Funding StatementThis research received no external funding.
Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Comments (0)