Spatio-Temporal Feature Encoding for Traffic Accident Detection in VANET Environment

Zhili Zhou*, Xiaohua Dong, Zhetao Li, Keping Yu, Chun Ding, Yimin Yang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

34 Citations (Scopus)

Abstract

In the Vehicular Ad hoc Network (VANET) environment, recognizing traffic accident events in driving videos captured by vehicle-mounted cameras is an essential task. Traffic accidents generally occupy only a short span of a driving video, and the backgrounds of driving videos are dynamic and complex, which makes traffic accident detection challenging. To detect accidents from driving videos effectively and efficiently, we propose an accident detection approach based on spatio-temporal feature encoding with a multilayer neural network. Specifically, the multilayer neural network encodes the temporal features of the video so that the video frames can be clustered. From the resulting frame clusters, we detect the border frames as potential accident frames. We then capture and encode the spatial relationships of the objects detected in these potential accident frames to confirm whether they are accident frames. Extensive experiments demonstrate that the proposed approach achieves promising detection accuracy and efficiency, and meets the real-time detection requirement of the VANET environment.
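
The two-stage idea in the abstract (border frames of frame clusters as candidates, then a spatial check on detected objects) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the per-frame cluster labels are already available (the paper derives them from temporal features encoded by a multilayer neural network, which is not reproduced here), and it substitutes a simple pairwise bounding-box overlap (IoU) for the paper's spatial-relationship encoding. The function names and the `overlap_threshold` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple

# Axis-aligned bounding box as (x1, y1, x2, y2). Assumed detector output format.
Box = Tuple[float, float, float, float]


def border_frames(labels: Sequence[int]) -> List[int]:
    """Indices of frames whose cluster label differs from the previous frame."""
    return [i for i in range(1, len(labels)) if labels[i] != labels[i - 1]]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes (0.0 when they do not overlap)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def max_pairwise_overlap(boxes: Sequence[Box]) -> float:
    """Largest IoU over all object pairs in one frame (stand-in spatial cue)."""
    best = 0.0
    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            best = max(best, iou(boxes[i], boxes[j]))
    return best


def confirm_candidates(
    labels: Sequence[int],
    boxes_per_frame: Sequence[Sequence[Box]],
    overlap_threshold: float = 0.1,  # hypothetical value, not from the paper
) -> List[int]:
    """Keep border frames in which at least two detected objects overlap."""
    return [
        i
        for i in border_frames(labels)
        if max_pairwise_overlap(boxes_per_frame[i]) >= overlap_threshold
    ]


# Toy input: six frames in three clusters, with detections only on border frames.
example_labels = [0, 0, 0, 1, 1, 2]
example_boxes = [
    [], [], [],
    [(0.0, 0.0, 2.0, 2.0), (1.0, 1.0, 3.0, 3.0)],  # overlapping objects
    [],
    [(0.0, 0.0, 1.0, 1.0), (5.0, 5.0, 6.0, 6.0)],  # well-separated objects
]
confirmed = confirm_candidates(example_labels, example_boxes)
```

In this toy input, frames 3 and 5 are border frames, and only frame 3 passes the overlap check. Restricting the spatial check to border frames reflects the efficiency argument in the abstract: the more expensive per-object analysis runs on a small set of candidates rather than on every frame.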

Original language: English
Pages (from-to): 19772-19781
Number of pages: 10
Journal: IEEE Transactions on Intelligent Transportation Systems
Volume: 23
Issue number: 10
DOIs
Publication status: Published - 2022 Oct 1
Externally published: Yes

Keywords

  • Neural network
  • VANETs
  • security communication
  • traffic accident detection
  • traffic safety

ASJC Scopus subject areas

  • Automotive Engineering
  • Mechanical Engineering
  • Computer Science Applications
