Identification and Prediction of Highway Accident-Prone Spots Based on Improved XGBoost Model
In order to accurately and quickly predict the accident-prone sections of highways,obtain the characteristic samples of accident spatio-temporal data,clarify the spatio-temporal evolution pat-terns and correlation mechanisms of accidents,and identify the location and the spatio-temporal evolu-tion patterns of accident-prone points based on spatio-temporal hotspot analysis results,this paper con-structed GA-XGBoost accident-prone point prediction model.Firstly,based on the sample data,the spatio-temporal cube was constructed under annual scale and daily scale respectively,and hotspot analysis was carried out.According to the spatio-temporal hotspot analysis results,the locations of ac-cident-prone point of the sample highway and their spatio-temporal evolution pattern were obtained.After comparative analysis and relevance test,seven characteristics were selected to predict whether the accident was located in the accident-prone location,including accident occurrence time,mileage,event type,processing time,number of affected lanes,whether it was in the vicinity of the confluence,and whether it was a holiday.Then,four algorithms,including CNN-LSTM,CNN-LSTM-ATT,Ran-dom Forest,and XGBoost model,were used to predict the accident-prone points respectively,and the results showed that the XGBoost model had the highest prediction accuracy compared to the other three algorithms.Subsequently,the XGBoost model was optimized with GA(Genetic Algorithm),and a GA-XGBoost combination model was constructed,which improved the prediction accuracy by 0.06 and F1 score by 0.07,and the precision by 0.08.This indicated that compared to existing algorithms,the GA-XGBoost model could more accurately predict whether a road section was located in an acci-dent-prone area,and clarify the spatio-temporal feature of accidents in accident-prone areas.Finally,the prediction results were interpreted by SHAP value analysis,and it was found that the samples lo-cated near the confluence,with incident types of rollover,breakdown,during National Day holidays and with 2 affected lanes were more likely to be at an accident-prone point compared to those not lo-cated near the confluence and with other accident types.Based on this,preventive measures could be taken in traffic safety and emergency management to improve the efficiency and emergency response capabilities of traffic management,so as to creat a safe and efficient traffic environment.
accident-prone spotspatio-temporal featureaccident identification and predictionXGBoostGA(Genetic Algorithm)spatio-temporal cube modelSHAP interpretation