20
Anomaly Detection in GPS Data Based on Visual Analytics Kyung Min Su - Zicheng Liao, Yizhou Yu, and Baoquan Chen, Anomaly Detection in GPS Data Based on Visual Analytics. IEEE Conference on Visual Analytics Science and Technology, 2010

Anomaly Detection in GPS Data Based on Visual Analytics

  • Upload
    jules

  • View
    87

  • Download
    0

Embed Size (px)

DESCRIPTION

Anomaly Detection in GPS Data Based on Visual Analytics. Kyung Min Su. - Zicheng Liao, Yizhou Yu, and Baoquan Chen, Anomaly Detection in GPS Data Based on Visual Analytics. IEEE Conference on Visual Analytics Science and Technology, 2010. Overview. Data analysis on GPS traces of taxi s - PowerPoint PPT Presentation

Citation preview

Page 1: Anomaly Detection in GPS Data  Based on Visual Analytics

Anomaly Detection in GPS Data Based on Visual AnalyticsKyung Min Su

- Zicheng Liao, Yizhou Yu, and Baoquan Chen, Anomaly Detection in GPS Data Based on Visual Analytics. IEEE Conference on Visual Analytics Science and Technology, 2010

Page 2: Anomaly Detection in GPS Data  Based on Visual Analytics

Overview

Data analysis on GPS traces of taxis For traffic monitoring To detect abnormal situations

Visual analytics approach collaboration between machines and human

analysts

Page 3: Anomaly Detection in GPS Data  Based on Visual Analytics
Page 4: Anomaly Detection in GPS Data  Based on Visual Analytics

System architecture

Page 5: Anomaly Detection in GPS Data  Based on Visual Analytics

Feature Set

Page 6: Anomaly Detection in GPS Data  Based on Visual Analytics

Feature Extraction

Page 7: Anomaly Detection in GPS Data  Based on Visual Analytics

Probabilistic Models

Conditional Random Fields (CRF)

Page 8: Anomaly Detection in GPS Data  Based on Visual Analytics

Conditional Random Fields (CRF) Hidden state sequence y

Z(x): normalization item

Page 9: Anomaly Detection in GPS Data  Based on Visual Analytics

CRF - Training

Training: computes the model parameters (the weight vector) according to labeled training data pairs {y, x}

Page 10: Anomaly Detection in GPS Data  Based on Visual Analytics

CRF - Inference

Inference: tries to find the most likely hidden state

assignment y, the label sequence for the unlabeled input sequence x

Page 11: Anomaly Detection in GPS Data  Based on Visual Analytics

Active Learning

Active learning: learner selectively chooses the examples to reduced amount of training data to improve the generalization performance on a

fixed-size training set

Criteria Uncertainty Representativeness Diversity

Page 12: Anomaly Detection in GPS Data  Based on Visual Analytics

Uncertainty

High model uncertainty Help enrich the classifier

Confidence

Uncertainty

Page 13: Anomaly Detection in GPS Data  Based on Visual Analytics

Representativeness

High representativeness sample sequence is not similar to any other

Page 14: Anomaly Detection in GPS Data  Based on Visual Analytics

Diversity

Diversity: To remove items that are redundant with respect

to data items that are already in the training set from the previous iteration.

Similarity score is not greater than the average pairwise similarity among all sequences currently in the training set.

Page 15: Anomaly Detection in GPS Data  Based on Visual Analytics

Visualization and Interaction

Page 16: Anomaly Detection in GPS Data  Based on Visual Analytics

Interaction Interface

Basic mode Raw GPS traces without any labels

Monitoring mode Anomaly tags are shown. Show the internal CRF states of the tagged data items.

Tagging mode Active learning module is activated. Highly uncertain labels from the CRF model are

highlighted, requesting for user input.

Page 17: Anomaly Detection in GPS Data  Based on Visual Analytics

Visualizing CRF Features

CRF internal states visualization

Features and their Weights Red: + Negative: -

Page 18: Anomaly Detection in GPS Data  Based on Visual Analytics

Visualizing CRF Features

Page 19: Anomaly Detection in GPS Data  Based on Visual Analytics

Summarization

Anomaly detection system Conditional Random Fields

Active Learning

Visualization and Interaction

Page 20: Anomaly Detection in GPS Data  Based on Visual Analytics

References [1] Zicheng Liao, Yizhou Yu, and Baoquan Chen.

Anomaly Detection in GPS Data Based on Visual Analytics. IEEE Conference on Visual Analytics Science and Technology (VAST 2010), 2010.

[2] J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the International Conference on Machine Learning (ICML-2001), 2001.

[3] C. T. Symons, N. F. Samatova, R. Krishnamurthy, B. H. Park, T. Umar, D. Buttler, T. Critchlow, and D. Hysom. Multi-criterion active learning in conditional random fields. In Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence, 2006.