Integrating visual large language model and reasoning chain for driver behavior analysis and risk assessment
•A custom visual LLM for Distracted Driving Classification.•DDLM enhances precision by integrating whole-body pose estimation for keypoint analysis.•Reasoning chain in DDLM boosts logical analysis and inferential abilities.•DDLM uses cost-effective zero-shot/few-shot learning, reducing training reso...
Gespeichert in:
Veröffentlicht in: | Accident analysis and prevention 2024-04, Vol.198, p.107497-107497, Article 107497 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •A custom visual LLM for Distracted Driving Classification.•DDLM enhances precision by integrating whole-body pose estimation for keypoint analysis.•Reasoning chain in DDLM boosts logical analysis and inferential abilities.•DDLM uses cost-effective zero-shot/few-shot learning, reducing training resources.•Fine-tuning LLM with LoRA using specific data crucial for optimal performance.
Driver behavior is a critical factor in driving safety, making the development of sophisticated distraction classification methods essential. Our study presents a Distracted Driving Classification (DDC) approach utilizing a visual Large Language Model (LLM), named the Distracted Driving Language Model (DDLM). The DDLM introduces whole-body human pose estimation to isolate and analyze key postural features—head, right hand, and left hand—for precise behavior classification and better interpretability. Recognizing the inherent limitations of LLMs, particularly their lack of logical reasoning abilities, we have integrated a reasoning chain framework within the DDLM, allowing it to generate clear, reasoned explanations for its assessments. Tailored specifically with relevant data, the DDLM demonstrates enhanced performance, providing detailed, context-aware evaluations of driver behaviors and corresponding risk levels. Notably outperforming standard models in both zero-shot and few-shot learning scenarios, as evidenced by tests on the 100-Driver dataset, the DDLM stands out as an advanced tool that promises significant contributions to driving safety by accurately detecting and analyzing driving distractions. |
---|---|
ISSN: | 0001-4575 1879-2057 |
DOI: | 10.1016/j.aap.2024.107497 |