25
Towards a Learning Incident Detection System ICML 06 Workshop on Machine Learning for Surveillance and Event Detection June 29, 2006 Tomas Singliar Joint work with Dr. Milos Hauskrecht

Towards a Learning Incident Detection System

  • Upload
    zorion

  • View
    39

  • Download
    3

Embed Size (px)

DESCRIPTION

Towards a Learning Incident Detection System. ICML 06 Workshop on Machine Learning for Surveillance and Event Detection June 29, 2006 Tomas Singliar Joint work with Dr. Milos Hauskrecht. Outline. Replace traffic engineers with ML algorithms for incident detection - PowerPoint PPT Presentation

Citation preview

Page 1: Towards a Learning Incident Detection System

Towards a Learning Incident Detection

System

ICML 06 Workshop on Machine Learning for Surveillance and Event Detection

June 29, 2006Tomas Singliar

Joint work with Dr. Milos Hauskrecht

Page 2: Towards a Learning Incident Detection System

Outline Replace traffic engineers with ML

algorithms for incident detection Traffic data collection and quality

Why, who and for what purposes Incident detection algorithms Evaluation metrics Individual feature performance Sensor fusion with SVM Noisy data problems

Attempts to model accident evolution with DBN Conclusions and future work

Noisy data: Poor onset tagging and “bootstrap”

Page 3: Towards a Learning Incident Detection System

Traffic data collection Sensor network

Volumes Speeds Occupancy

Data aggregated over 5 minutes

Incidents police camera system

Page 4: Towards a Learning Incident Detection System

Incident Annotationincident

incident no incident

Page 5: Towards a Learning Incident Detection System

Incident annotation Incident labels not necessarily correct or timely

Do not correct timing (opportunity for more ML )

Page 6: Towards a Learning Incident Detection System

Incident detection algorithms, intuition Incidents detected indirectly through caused congestion Baseline: “California 2” algorithm:

If OCC(up) – OCC(down) > T1, next step If [OCC(up) – OCC(down)]/ OCC(up) > T2, next step If [OCC(up) – OCC(down)]/ OCC(down) > T3, possible accident If previous condition persists for another time step, sound alarm

Hand-calibrated T1-T3 – very labor intensive Why so few ML applications?

nontraditional data, anomaly detection – rare positives, common sense works well

Occupancy spikes Occupancy falls

Page 7: Towards a Learning Incident Detection System

Evaluation metrics AMOC curve

Time to detection (TTD) vs False positive rate (FPR)

Don’t know when exactly incident happened

Maximal TTD (120min) AU interesting region of C

Performance envelope Detection rate (DR) vs FPR Random gets over diagonal Report ROC as a check

Sensitivity vs specificity

Low false positive region 1 false alarm/day * 150

sensors

Page 8: Towards a Learning Incident Detection System

Features

Sensor measurements Temporal derivative Spatial differences

Page 9: Towards a Learning Incident Detection System

Features Simple measurements: 3 per sensor, 6 total

Occupancy < threshold

Page 10: Towards a Learning Incident Detection System

Temporal features Capture abrupt changes

Occupancy spike – now minus previous time slice

Page 11: Towards a Learning Incident Detection System

Spatial differences “Discontinuities” in flow between sensor positions

Difference in speeds downstream - upstream

Page 12: Towards a Learning Incident Detection System

Sensor fusion

Information in all simple detectors How to combine their outputs? Linear combination – SVM

Page 13: Towards a Learning Incident Detection System

Baseline: California 2 Hand-calibrated (+brute force) Good low FAR performance, but poor detection rate

Page 14: Towards a Learning Incident Detection System

SVM Combines sensor measurements via

a linear combination

Page 15: Towards a Learning Incident Detection System

SVM Spatial relations

Sensor measurements plus ratios and differences from the neighboring sensor

Page 16: Towards a Learning Incident Detection System

SVM Temporal derivatives

Sensor measurements plus differences and ratios to previous step

Page 17: Towards a Learning Incident Detection System

Focus on low FAR California better – persistency check

Page 18: Towards a Learning Incident Detection System

A dynamic Naïve Bayes network Problem: Incidents are recorded later than they occur

True state of highway is unobservable by sensors Picture of incidents evolves in time

About 30 features: 3 readings up/down stream, differences, ratios to neighboring sensor, previous time point

speed

Occupancy(t-5)

Incident observed

True hidden stateH HH

I

On

O1

I

On

O1

I

On

O1

………

Page 19: Towards a Learning Incident Detection System

A dynamic Naïve Bayes network Evolution of an accident:

Normal traffic steady state Accident happens, effects build up Constricted steady state Recovery

Model has 4 hidden states Anchor hidden states to desired semantics: clamp p(I|H) Raise alarm if p(H=acc_state|O) > threshold

Learned hidden state transition matrix:

0.9536 0.0332 0.0000 0.01330.0050 0.9577 0.0339 0.00340.0000 0.0882 0.9033 0.00840.0957 0.0000 0.0753 0.8290

H1 H2

H4 H3

Page 20: Towards a Learning Incident Detection System

DNB Performance Poor job at low FAR

Fairly insensitive to threshold

Page 21: Towards a Learning Incident Detection System

Summary Challenges to ML in traffic incident detection

Rare class – data sparsity, unequal misclassif cost Incident annotations are noisy

Machine learning methods competitive though SVM outperforms current practice No manual tuning, readapts to data after changes

Lessons and surprises: Richer feature sets do not help much Neither does removing diurnal trends (?) SVM has very stable performance Dynamic Naïve Bayes weak

Page 22: Towards a Learning Incident Detection System

Future work Discriminate incident and benign congestion

Improve discriminative classification SVM with nonlinearities (?) Unequal misclassification cost models

Improve dynamical models SVM handles time awkwardly – Dynamic Bayes Nets Conditional random fields – discriminative + time

Improve Data Bootstrap – use even a strawman to label incident start,

learn from relabeled data (, iterate)

Supplemental materials available http://www.cs.pitt.edu/~tomas/papers/icml06w/ (AMOC curves that did not fit into the paper)

Page 23: Towards a Learning Incident Detection System

Thank you

Questions?

Suggestions?

Page 24: Towards a Learning Incident Detection System

SVM California 2 measurements

Current and past occupancies

Page 25: Towards a Learning Incident Detection System

DNB Performance