51
CS 6965 Fall 2019 Prof. Bei Wang Phillips University of Utah Advanced Data Visualization Lecture 05

Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

CS 6965Fall 2019

Prof. Bei Wang PhillipsUniversity of Utah

Advanced Data Visualization

Lecture 05

Page 2: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Brainstorming

BrainstormingA spontaneous group discussion to produce ideas and ways of solving problems.

Page 3: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Brainstorming

Question: how to enhance an existing PCA-based visual analytic system?

+ Improve model interpretability + Improve data interpretability

Page 4: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Enhance PCA systemAdd DR techniquesOutlier detectionDistortion Dynamic projection, animationCapture user analytic history and user intensionsCompare across multiple parameter settingsParameter tuning, parameter suggestionUser study: guidance in exploratory analysisAdd SVD information: visualize singular values and eigenvalues More…

Page 5: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

More on DR, Cluster ing, and Vis

HD

Page 6: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

A few more words on mapper algorithm

A tool for high-dimensional data analysis and visualization

Page 7: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Clustering algorithm

Let X be the original high-dimensional point cloud.Clustering algorithm applies to

The inverse image of the interval, which are points in the domain: a subset of X (the classic algorithm). Alternatives, clustering can be applied to a transformed version of X, referred to as Y. For example, Y can be the result of DR of X.

Page 8: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Input: Point cloud data X + distance metric on the point cloud Filter functions f on X

Output:A graph or a simplicial complex representation

Parameters:Filter functionsNumber of intervalsAmount of interval overlapColor functions, etc.

[SinghMemoliCarlsson2007]

Mapper I/O and Parameters

Page 9: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

KepperMapper

A Demo

Page 10: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

One Circle

Lens: x-valuescircle-demo.py

Page 11: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Two Circles

Lens: x-values

Color function: labels

double_circle_demo.py

Page 12: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Digits

digits-demo.py

Applying clustering to projected data

Page 13: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Control point based projectionsDistance metricDR precision measure

More discussions on DR

Page 14: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Dealing with large data: control points

Page 15: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Control point based DR

Improve efficiency of traditional linear/nonlinear DR2 phase approach

Project a set of control points (anchor points)Project the rest of the points based on the location of control points and preservation of local features

Scalable systemAllow users manipulate and modify the outcome of the DR

Page 16: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Part Linear Multi-dim. Proj. (PLMP)

[PaulovichSilvaNonato2010]

Page 17: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

PLMP

[PaulovichSilvaNonato2010]

Preserving distances between data instances as much as possibleApproximate the above linear transformation using anchor pointsSample selection: random vs clustering (cluster centers of k-means)If sampling rate increases, random and clustering produces similar results

Page 18: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

PLMP: steering projection

[PaulovichSilvaNonato2010]

Page 19: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Distance metric

Page 20: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Learning distance interactively

A suitable distance metric is essential for DRHow to learn a distance function from dataDistance function learning

A new distance function is calculated based on point layout manipulation by an expert user

[BrownLiuBrodley2012]

Page 21: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

1. 2D scatter plot visualization of the data2. Find inconsistencies in data based on prior knowledge; drag/drop

and selection to manipulate the data3. Calculate a new distance function based on feedbacks from Step 2

[BrownLiuBrodley2012]

Learning distance interactively

Page 22: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

[BrownLiuBrodley2012]

Page 23: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

[BrownLiuBrodley2012]http://www.cs.tufts.edu/~remco/publications/2012/videos/brown2012disfunction.mp4

Page 24: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

DR + Precision Measure

Page 25: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

DR Quality Measures

DR-dependent distortion measures

DR-independent distortion measures

[LeeVerleysen2009] [LiuWangBremer2014]

Page 26: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

DR-dependent distortion measuresDR: optimizing a cost function ff incorporates a natural quality measure that assesses how much structure, in terms of relations among data points in high dimensions, stays consistent with the one inferred by the low-dimensional embeddingAlternatively, how much cost is needed in transforming one to another.Global distortion measure: overall quality of DR, Local distortion measure: point-wise derivation of the global measure

[LeeVerleysen2009] [LiuWangBremer2014]

Page 27: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

PCA distortion measure

[LiuWangBremer2014]

Page 28: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Locally linear embedding (LLE)

[LiuWangBremer2014]

LLE represents each point as a weighted linear c o m b i n a t i o n o f i t s neighbors and tries to p rese rve th i s l i nea r r e l a t i o n s h i p i n t h e reduced dimension.Wij: weight matrix that stores linear relationships

Page 29: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

DR-ind. distortion measures

Kernel density estimate (KDE) distortionStressRobust distance distortionCo-ranking distortion

[LeeVerleysen2009] [LiuWangBremer2014]

Page 30: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

KDE distortion

[LeeVerleysen2009] [LiuWangBremer2014]

Global KDE distortion

Local KDE distortion

KDE

KDE w. Gaussian kernel

Page 31: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Stress

[LeeVerleysen2009] [LiuWangBremer2014]

Global stress

Local stress

Page 32: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Co-ranking distortion

dij: distance between xi and xj [LeeVerleysen2009] [LiuWangBremer2014]

⇢ij = |{k | dik dij or (dik = dij and 1 k < j N)}|Rank of xj w.r.t. xi

�ij = |{k | d̂ik d̂ij or (d̂ik = d̂ij and 1 k < j N)}|Rank of yj w.r.t. yi

Rank Error

Co-rank matrix: a histogram of all rank errors:

Page 33: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Co-ranking: global & local distortion

[LeeVerleysen2009] [LiuWangBremer2014]

K: number of neighbors under consideration

Point-wise contribution A larger Qi correspond to less distortion

Page 34: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Visualizing the quality of DR

[MokbelLueksGirbrecht2013]

t-SNE point-wise distortion: notice the tearing

Page 35: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Distortion-guided, structure-driveninteractive exploration of high-dim data [LiuWangBremer2014]

Page 36: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Clustering & Vis

Page 37: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

https://www.naftaliharris.com/blog/visualizing-k-means-clustering/

Visualizing Clustering ProcessVisualizing the algorithmic process for clustering (especially iterative ones)

Page 38: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Visualizing DBSCAN

https://www.naftaliharris.com/blog/visualizing-dbscan-clustering/

Page 39: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Subspace clustering (SC) & Vis

Page 40: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Subspace clustering vs DR

Clustering: widely used data-driven analysis methodsDR: compute one single embedding that best describes the structure of dataSubspace clustering

Identify multiple embeddings, each capturing a different aspect of the dataClustering either the dimensions or the data points

Page 41: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Subspace clustering & Vis

Explore dimension space (this lecture)Explore subsets of dimensions (next lecture)Non-Axis-Aligned subspaces (next lecture)

Page 42: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

SC: Dimension Space Exploration

Page 43: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Dimension space exploration

Guided by the userInteractively group relevant dimensions into subsets

Page 44: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Dual Analysis Model

https://fmfatore.files.wordpress.com/2012/09/pres.pdf [TurkayFilzmoserHauser2011]

MVA: Multivariate analysis

Page 45: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Representative factor generation

[TurkayLundervoldLundervold2012]Grouping a collection of dimensions as a factor

Page 46: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Dimension projection matrix/tree

[YuanRenWang2013]

Page 47: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Dimension projection matrix/tree

[YuanRenWang2013]Video: http://vis.pku.edu.cn/wiki/doku.php?id=publication:start

Page 48: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Data Context Map

https://www.youtube.com/watch?v=nnjkHA8xvbI&feature=youtu.be[ChengMuller2016]

Observing the data points in the context of the attributes

Combining two similarity matrices typically used in isolation – the matrix encoding the similarity of the attributes and the matrix encoding the similarity of the data points.

http://www3.cs.stonybrook.edu/~mueller/research/pages/dataContextMap/

Page 49: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Thanks!You can find me at: [email protected]

Any questions?

Page 50: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Vector Icons by Matthew Skiles

Presentation template designed by Slidesmash

Photographs by unsplash.com and pexels.com

CREDITSSpecial thanks to all people who made and share these awesome resources for free:

Page 51: Advanced Data Visualizationbeiwang/teaching/cs6965-fall-2019/Lecture05-DR... · t-SNE point-wise distortion: notice the tearing. Distortion-guided, structure-driven interactive exploration

Presentation DesignThis presentation uses the following typographies and colors:

Co lo rs used

Free Fonts used:http://www.1001fonts.com/oswald-font.html

https://www.fontsquirrel.com/fonts/open-sans