Simulator Pre-Screening of Underprepared Drivers Prior to Licensing On-Road Examination: Clustering of Virtual Driving Test Time Series Data.

Grethlein, David; Winston, Flaura Koplin; Walshe, Elizabeth; Tanner, Sean; Kandadai, Venk; Ontañón, Santiago

Grethlein, David; Winston, Flaura Koplin; Walshe, Elizabeth; Tanner, Sean; Kandadai, Venk; Ontañón, Santiago.

Afiliação

Grethlein D; Diagnostic Driving, Inc, Philadelphia, PA, United States.
Winston FK; Computer Science Department, Drexel University, Philadelphia, PA, United States.
Walshe E; Diagnostic Driving, Inc, Philadelphia, PA, United States.
Tanner S; Center for Injury Research and Prevention, Children's Hospital of Philadelphia, Philadelphia, PA, United States.
Kandadai V; Perelmen School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.
Ontañón S; Center for Injury Research and Prevention, Children's Hospital of Philadelphia, Philadelphia, PA, United States.

J Med Internet Res ; 22(6): e13995, 2020 06 18.

Article em En | MEDLINE | ID: mdl-32554384

ABSTRACT

ABSTRACT

BACKGROUND:

A large Midwestern state commissioned a virtual driving test (VDT) to assess driving skills preparedness before the on-road examination (ORE). Since July 2017, a pilot deployment of the VDT in state licensing centers (VDT pilot) has collected both VDT and ORE data from new license applicants with the aim of creating a scoring algorithm that could predict those who were underprepared.

OBJECTIVE:

Leveraging data collected from the VDT pilot, this study aimed to develop and conduct an initial evaluation of a novel machine learning (ML)-based classifier using limited domain knowledge and minimal feature engineering to reliably predict applicant pass/fail on the ORE. Such methods, if proven useful, could be applicable to the classification of other time series data collected within medical and other settings.

METHODS:

We analyzed an initial dataset that comprised 4308 drivers who completed both the VDT and the ORE, in which 1096 (25.4%) drivers went on to fail the ORE. We studied 2 different approaches to constructing feature sets to use as input to ML algorithms the standard method of reducing the time series data to a set of manually defined variables that summarize driving behavior and a novel approach using time series clustering. We then fed these representations into different ML algorithms to compare their ability to predict a driver's ORE outcome (pass/fail).

RESULTS:

The new method using time series clustering performed similarly compared with the standard method in terms of overall accuracy for predicting pass or fail outcome (76.1% vs 76.2%) and area under the curve (0.656 vs 0.682). However, the time series clustering slightly outperformed the standard method in differentially predicting failure on the ORE. The novel clustering method yielded a risk ratio for failure of 3.07 (95% CI 2.75-3.43), whereas the standard variables method yielded a risk ratio for failure of 2.68 (95% CI 2.41-2.99). In addition, the time series clustering method with logistic regression produced the lowest ratio of false alarms (those who were predicted to fail but went on to pass the ORE; 27.2%).

CONCLUSIONS:

Our results provide initial evidence that the clustering method is useful for feature construction in classification tasks involving time series data when resources are limited to create multiple, domain-relevant variables.

Assuntos

Acidentes de Trânsito/prevenção & controle; Condução de Veículo/normas; Adolescente; Adulto; Análise por Conglomerados; Feminino; Humanos; Masculino; Programas de Rastreamento; Adulto Jovem

Palavras-chave

accidents, traffic; adolescent; automobile driving; cause of death; child; humans; licensure; machine learning; motor vehicle; motor vehicles; on-road exam; simulated driving assessment; support vector machines

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Condução de Veículo / Acidentes de Trânsito Tipo de estudo: Diagnostic_studies / Prognostic_studies / Screening_studies Limite: Adolescent / Adult / Female / Humans / Male Idioma: En Revista: J Med Internet Res Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google