Limitation of classification tree models in investigating road accident severity

Maria Lígia Chuerubim, Alan Valejo, Barbara Stolte Bezerra, Irineu da Silva

Abstract


The objective of this study is to discuss the main limitations encountered in the classification process of traffic accident severity, based on decision tree models (CART). With this purpose, CART was used in the mining of an unbalanced database of road accidents, considering injury severity, categorized as accidents without victims and with victims (fatal and non-fatal), as the dependent variable. The variables associated with accident characteristics, road infrastructure and environmental conditions were used to identify the influence of these factors on accident severity. Although the classification by CART resulted in a high accuracy overall, it resulted in a low rate of accuracy in the classification of accidents with victims, which correspond to the rarest observations in the database. In addition, it resulted in a high number of decision rules, considering the number of categories of independent variables in the prediction process of the target variable. The results indicated that CART is not efficient in the study of multiple-effect phenomena, such as road accidents, since it does not have the potential to associate a large number of parameters, which restricts the analysis and interpretation of the results to the binary structure of the tree. Thus, an exploratory analysis of the database is suggested, when it is desired to analyze the influence of a specific category of a database variable in the occurrence of traffic accidents.


Keywords


Road accidents. Severity. Data Mining. Classification. Decision Tree.

Full Text:

PDF

References


ABDEL-ATY, M. A.; RADWAN, A.E. Modeling traffic accident occurrence and involvement. Accident Analysis & Prevention, v. 32, n. 5, p. 633-642, 2000.

ABELLÁN, J et al. Analysis of traffic accident severity using Decision Rules via Decision Trees. Expert Systems with Applications, v. 40, n. 15, p. 6047-6054, 2013.

CHANG, L.; CHIEN, J. Analysis of driver injury severity in truck-involved accidents using a non-parametric classification tree model. Safety Science, v. 51, n. 1, p. 17-22, 2013.

CHANG, L.; WANG, H. Analysis of traffic injury severity: An application of non-parametric classification tree techniques. Accident Analysis and Prevention, v. 38, p. 1019–1027, 2006.

DE OÑA et al. Analysis of traffic accidents on rural highways using Latent Class Clustering and Bayesian Networks. Accident Analysis and Prevention, v. 51, p. 1–10, 2013.

DE OÑA et al. Analysing the relationship among accident severity, drivers’ behaviour and their socio-economic characteristics in different territorial contexts. Procedia - Social and Behavioral Sciences, v. 160, n. Cit, p. 74–83, 2014.

GRISELDA, L. et al. Using Decision Trees to extract Decision Rules from Police Reports on Road Accidents. Procedia - Social and Behavioral Sciences, v. 53, n. SIIV-5th International Congress-Sustainability of Road Infrastructures, p. 106–114, 2012.

HAUER, E. Safety Models for Urban Four-lane Undivided Road Segments. Transportation Research Record: Journal of the Transportation Research Board, n. 96–105, p. 1–22, 2007.

HAUER, E. Observational Before-After Studies in Road Safety. Pergamon Press, Elsevier Science Ltd., Oxford, England. 1997.

KASHANI, A. T. et al. A data mining approach to identify key factors of traffic injury severity. Traffic&Transportation, v. 23, p. 11–17, 2011.

KOHAVI, R. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. Appears in the International Joint Conference on Articial Intelligence, v. 5, p. 1–7, 1995.

LÓPEZ, G. et al. Analysis of traffic accidents on rural highways using Latent Class clustering and Bayesian Networks. Accident Analysis and Prevention, v. 51, p. 1–10, 2013.

LÓPEZ, G. et al. Influence of deficiences in traficc control devices in crashes on two-lane rural roads. Accident Analysis and Prevention, v. 96, p. 130-139, 2016.

LÜ, L.; ZHOU, T. Link prediction in weighted networks: The role of weak ties. Europhysics Letters, v. 89, n. 1, p. 18001, 2010.

MIAOU, S. P. The relationship between truck accidents and geometric design of road sections: Poisson versus negative binomial regressions. Accident Analysis and Prevention, v. 26, n. 4, p. 471–482, 1994.

MIAOU, S. P.; LUM, H. Modeling vehicle accidents and highway geometric design relationships. Accident Analysis and Prevention, v. 25, n. 6, p. 689–709, 1993.

MONTELLA, A. et al. Data-Mining Techniques for Exploratory Analysis of Pedestrian Crashes Data-Mining Techniques for Exploratory Analysis of Pedestrian Crashes. Transportation Research Record, n. 2237, p. 107–116, 2011.

MONTELLA, A. et al. Analysis of powered two-wheeler crashes in Italy by classification trees and rules discovery. Accident Analysis and Prevention, v. 49, p. 58–72, 2012.

MUJALLI, R. O.; DE OÑA, J. Injury severity models for motor vehicle accidents : a review. Proceedings of the Institution of Civil Engineering – Transport., v. 166, p. 255–270, 2013. DOI: http://dx.doi.org/10.1680/tran.11.00026

ONSV. National Road Safety Observatory. Statistics. 2014. Available at: http://iris.onsv.org.br/iris-beta/#/stats/maps. Access: 22 fev. 2018. In Portuguese.

PAKGOHAR, A. et al. The role of human factor in incidence and severity of road crashes based on the CART and LR regression : a data mining approach. Procedia Computer Science, v. 3, p. 764–769, 2011.

PANDE, A.; ABDEL-ATY, M. Assessment of freeway traffic parameters leading to lane-change related collisions. Accident Analysis & Prevention, v. 38, p. 936–948, 2006.

RUSSO, F. et al. Safety performance functions for crash severity on undivided rural roads. Accident Analysis and Prevention, v. 93, p. 75–91, 2016.

SAVOLAINEN, P. T. et al. The statistical analysis of highway crash-injury severities: A review and assessment of methodological alternatives. Accident Analysis and Prevention, v. 43, n. 5, p. 1666–1676, 2011.

SINGH, G. et al. M5 model tree based predictive modeling of road accidents on non-urban sections of highways in India. Accident Analysis and Prevention, v. 96, p. 108–117, 2016.

WEI, X. et al. Analyzing traffic crash severity in work zones under different light conditions. Journal of Advanced Transportation, v. 2017.




DOI: https://doi.org/10.18256/2358-6508.2019.v6i2.2927

Refbacks

  • There are currently no refbacks.




Copyright (c) 2020

Licença Creative Commons
Esta obra da Revista de Engenharia Civil IMED está licenciada com uma Licença Creative Commons Atribuição 4.0 Internacional.

ISSN 2358-6508

 Indexadores

DOAJ.jpg   logos_DOI_CrossRef_CrossChek.png 
SHERPA-RoMEO-long-logo.gif   
 
 latindex.jpg