Upload
vijaya-natarajan
View
218
Download
0
Embed Size (px)
Citation preview
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 1/99
Visualization and Data Mining
Lecture delivered
byDr S.Natarajan, Professor ISE, PESIT
forFDPAt
SJBIT Bangaloreon19-7-2010
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 2/99
What is Visualization?
• Dictionary (Merriam-Webster Online Dictionary)
• Wikipedia
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 3/99
March 30, 2012 Data Mining: Concepts and Techniques 3
Purpose of Visualization
Gain insight into an information space by mapping data
onto graphical primitives
Provide qualitative overview of large data sets
Search for patterns, trends, structure, irregularities,
relationships among data
Help find interesting regions and suitable parameters for
further quantitative analysis Provide a visual proof of computer representations
derived
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 4/99
What is Visualization?
• NSF (1987)
“Visualization is a method of computing. Ittransforms the symbolic into the geometric,enabling researchers to observe theirsimulation and computations. Visualization
offers a method to see the unseen.”
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 5/99
Function of Visualization
• Support reasoning (process, calculate, reason)
– For analysing, interpreting the data
• Convey information (share, persuade,
collaborate, emphasize) – Communicating between individuals and groups
– Present, Disseminate Information, Results
• Archive record – Visually memorize through photography,
drawings, etc.
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 6/99
Rationale To Understand Visualization
• Computer and Computer graphics
digitalinformation
interpretationmeasurements
digitalrepresentation
Transform(Visualization)
Knowledge
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 7/99
Jacques Bertin -- Semiology
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 8/99
Rationale To Understand Visualization
• More Data, More Information
• More Complex, Large Volume of Data
digital
information
interpretationmeasurement
digital
representation
transform
digitalinformation
digitalInformation
digitalrepresentation
digital
representation
digitalRepresentation
measurement
simulation
Knowledge
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 9/99
Different Types of Visualization
• Scientific Visualization (SciViz)
– Measured Data
– Simulated Data
Example: weather, CFD
• Information Visualization (InfoViz)
– Abstract Data
Example: population, files on your PC
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 10/99March 30, 2012 Data Mining: Concepts and Techniques 10
Visual Data Mining: An Overview
What is Visual Data Mining?
Survey of techniques
Data Visualization
Visualizing Data Mining Results
Visual Data Mining
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 11/99March 30, 2012 Data Mining: Concepts and Techniques 11
What Is Visual Data Mining?
Visual data mining “discovers implicit and usefulknowledge from large data sets using data
and/or knowledge visualization techniques”
Data visualization + Data mining techniques
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 12/99March 30, 2012 Data Mining: Concepts and Techniques 12
Why Visual Data Mining?
Advantages of human visual system Highly parallel processor
Sophisticated reasoning engine
Large knowledge base
Can be used to comprehend data distributions, patterns,clusters, and outliers
Data Mining Algorithms Visualization
Actionable + –
Evaluation + –
Flexibility – +
User Interaction – +
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 13/99
Visualization Model
Visualization pipeline (Data and Process)
Data Oriented Process
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 14/99
Visualization Model
Visualization pipeline: human in the loop
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 15/99
Visualization Research Field
The Visualization Field emerged as computers gotfaster and had better graphics
Need for processing large amount of data
Change in computing from symbol based to graphics based
1990 First Conference (Eurographics Viz, IEEE Viz)
1995 First Journal (Transaction on Visualization andComputer Graphics)
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 16/99March 30, 2012 Data Mining: Concepts and Techniques 18
Why Not Only Visual Data Mining?
Disadvantages of human visual system Needs training
Not automated
Intrinsic bias
Limit of about 106 or 107 observations
(Wegman 1995)
Power of integration with analytical methods
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 17/99March 30, 2012 Data Mining: Concepts and Techniques 19
Scope of Visual Data Mining
Visualization: Use of computer graphics to create visualimages which aid in the understanding of complex, oftenmassive representations of data
Visual Data Mining: The process of discovering implicit butuseful knowledge from large data sets using visualization
techniques
ComputerGraphics
HighPerformanceComputing
PatternRecognition
HumanComputer
Interfaces
MultimediaSystems
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 18/99March 30, 2012 Data Mining: Concepts and Techniques 20
Visual Data Mining & Data Visualization
Integration of visualization and data mining data visualization
data mining result visualization
data mining process visualization
interactive visual data mining
Data visualization
Data in a database or data warehouse can be viewed
at different levels of abstraction as different combinations of attributes or
dimensions
Data can be presented in various visual forms
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 19/99March 30, 2012 Data Mining: Concepts and Techniques 21
Visual Mining vs. Scientific Vis. & Graphics
Scientific Visualization Often visualize physical model, low
dimensionality
Graphics More concerned with how to render (draw)
rather than what to render
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 20/99March 30, 2012 Data Mining: Concepts and Techniques 22
Data Visualization
View data in database or data warehouse
User may control
Different levels of details
Subset of attributes
Drawn using boxplots, histograms, polylines, etc.
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 21/99March 30, 2012 Data Mining: Concepts and Techniques 23
Classification of Data Visualization Techniques
Geometric Techniques:
Scatterplots, Landscapes, Projection Pursuit, Prosection Views,Hyperslice, ParallelCoordinates ...
Icon-based Techniques:
Chernoff Faces, Stick Figures , Shape-Coding, Color Icons, TileBars,...
Pixel-oriented Techniques:
Recursive Pattern Technique, Circle Segments Technique, Spiral- & Axes- Techniques ,...
Hierarchical Techniques:
Dimensional Stacking, Worlds-within-Worlds,Treemap , Cone Trees,InfoCube,...
Graph-Based Techniques: Basic Graphs (Straight-Line, Polyline, Curved-Line,...)
Specific Graphs (e.g., DAG, Symmetric, Cluster,...)
Systems (e.g., Tom Sawyer, Hy+, SeeNet , Narcissus,...)
Hybrid Techniques: arbitrary combinations from above
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 22/99
Motivation for Visualization
Find the number of E‟s in the following text:
TYUKSAWHZWESC
GHEKVXAISXBLSE JUIDVRTHOXEWPX
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 23/99
Motivation for Visualization
Find the number of E‟s:
oA IE
5
4
3
2
1
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 24/99
3 D Wireframe lattice
March 30, 2012 Data Mining: Concepts and Techniques 26
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 25/99
19901998 2000 2002
Expectations
Performance
The Hype Curve for
Data Mining and Knowledge Discovery
Over-inflatedexpectations
Disappointment
Growing acceptanceand mainstreaming
risingexpectations
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 26/99
March 30, 2012 Data Mining: Concepts and Techniques 28
Geometric
Icon-based
Pixel-oriented
Hierarchical
Graph-based
Mapping Projection Filtering Link & Brush Zooming
Simple
Complex
Data Visualization Techniques
Distortion Techniques
Interaction Techniques
Dimensions of Exploratory Data Visualization
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 27/99
March 30, 2012 Data Mining: Concepts and Techniques 29
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 28/99
30
Chernoff Faces
Encode different variables‟ values in characteristics of human face. The user decides its own coding.
http://www.cs.uchicago.edu/~wiseman/chernoff/
http://hesketh.com/schampeo/projects/Faces/chernoff.html Cute applets:
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 29/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 30/99
March 30, 2012 Data Mining: Concepts and Techniques 32
Geometric Techniques
Basic Idea Visualization of geometric transformations and
projections of the data
Methods
Landscapes [Wis 95] Projection Pursuit Techniques [Hub 85] (a
techniques for finding meaningful projections of multidimensional data)
Scatterplot-Matrices [And 72, Cle 93] Prosection Views [FB 94, STDS 95]
Hyperslice [WL 93]
Parallel Coordinates [Ins 85, ID 90]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 31/99
High Resolution Satellite data
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 32/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 33/99
Visualization Principle• From Data to Visual Representation
Visualization: Convey Information through VISUAL representations to explore, analyze, discover, explainsparse/dense, complex, multidimensional DATAIt shows a variety of activities including incoming calls,outgoing calls both answered, and unanswered. It also shows incoming and outgoing SMS messages. Theapplication takes this data and creates a visual map of the day on your phone. While this may not be the firstcell phone inherent visualization - it's the first one I've seen. I would love to see this for more phone platforms.
Currently, it's only available for Symbian based mobile phones.
101210323123213434324
SADAS
11’ 33’ 4
DATA VISUAL REPRESENTATION
TRANSFORMGENERATE
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 34/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 35/99
http://www.visual-literacy.org/periodic_table/periodic_table.html
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 36/99
Selective Dynamic Manipulation(SDM) Sample
• SDM deals with these issues
• Selection and control methods rely on objectsrather than spaces.
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 37/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 38/99
SDM Feedback Mechanisms• Clearly Identify the Selected
Set – The selected set is painted
differently (green)
– A white grid may be drawn
beneath all selected objects to
indicate positions and spread
• Maintain Scene Context
– Used to maintain context
when objects are displaced
– Object ‘body’ (green) andobject ‘shell’ (white)
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 39/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 40/99
A Brief History of Visualization
• 105-150
– Invention of Paper + Ptolemy World Map
[from http://www.math.yorku.ca/SCS/Gallery/milestone/]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 41/99
A Brief History of Visualization
• 1453-1500
– Guttenberg+ Arnovalley (Da Vinci)
[from http://www.math.yorku.ca/SCS/Gallery/milestone/]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 42/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 43/99
A Brief History of Visualization
• 1600-1850 – Printed map, weather map+ contour techniques, pie,
bar charts + first photographical technique
[from http://www.math.yorku.ca/SCS/Gallery/milestone/]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 44/99
A Brief History of Visualization• 1850-1974
– Motion, color photography, cinematographe, stereograme, psychometrics,
computer visualization, semiology
[from http://www.math.yorku.ca/SCS/Gallery/milestone/]
SYMAP, producing isoline, choropleth and proximal maps on a line printer- HowardFisher , USASemiology: study of signs and symbols
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 45/99
A Brief History of Visualization
• 1974-Present – More techniques, 3D, interactivity, animation
[from http://www.math.yorku.ca/SCS/Gallery/milestone/]
Historical Overview of Exploratory
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 46/99
March 30, 2012 Data Mining: Concepts and Techniques 49
Historical Overview of ExploratoryData Visualization Techniques (cf. [WB 95])
Pioneering works of Tufte [Tuf 83, Tuf 90] and Bertin [Ber81] focus on
Visualization of data with inherent 2D-/3D-semantics
General rules for layout, color composition, attribute
mapping, etc. Development of visualization techniques for different types
of data with an underlying physical model
Geographic data, CAD data, flow data, image data,
voxel data, etc. Development of visualization techniques for arbitrary
multidimensional data (w.o. an underlying physical model)
Applicable to databases and other information resources
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 47/99
Visualization and Fields
• Multi-disciplinary
Visualization
DigitalImaging
ParallelComputing
Perception
Cognitive
Algorithmic
ArtComputerGraphics
Electronic
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 48/99
51 J.C.Roberts
Minard’s plot
http://www.math.yorku.ca/SCS/Gallery/re-minard.html
The French engineer, Charles Minard (1781-1870), illustrated the
disastrous result of Napoleon's failed Russian campaign of 1812.
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 49/99
52 J.C.Roberts
One of the first uses of a map todisplay epidemiological data was
this dot chart (from Tufte, 1983, p.
24) by Dr. John Snow (1855)
showing deaths from cholera (dots)
in relation to the locations of publicwater pumps.
Tufte says, "Snow observed that
cholera occurred almost entirely
among those who lived near (and
drank from) the Broad Street water
pump. He had the handle of the
contaminated pump removed, ending
the neighborhood epidemic which
had taken more than 500 lives."
S ‟ Ch l
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 50/99
53
Snow‟s CholeraMap, 1855
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 51/99
54
Napoleon Invasion of Russia, 1812
Napoleon
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 52/99
55
Marley, 1885
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 53/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 54/99
Baseball data:
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 55/99
Scatterplots and histograms and bars (from Wills 95)
select high
salaries
avg career
HRs vs avg
career hits
(batting ability)
avg assists vs
avg putouts
(fielding ability)
how long
in majors
distribution
of positions
played
[www.sims.berkeley.edu/courses/is247/s02/lectures/Lecture3.ppt]
Li ki t f i t b h i
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 56/99
Linking types of assist behaviorto position played (from Wills 95)
[www.sims.berkeley.edu/courses/is247/s02/lectures/Lecture3.ppt]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 57/99
March 30, 2012 Data Mining: Concepts and Techniques 60
Landscapes [Wis 95]
• Visualization of the data as perspective landscape
• The data needs to be transformed into a (possibly artificial) 2D spatialrepresentation which preserves the characteristics of the data
news articlesvisualized as
a landscape
U s e d b y p e r m i s s i o n o f B .
W r i g h t , V i s i b l e D e c i s i o n s I n c .
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 58/99
61
Parallel Coordinates
• Encode variables along a horizontal row• Vertical line specifies values
Dataset in a Cartesian coordinates
Same dataset in parallel coordinates
Invented by Alfred Inselberg
while at IBM, 1985
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 59/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 60/99
63
The parallel coordinate plot:
• The parallel coordinate plot, described by Al Inselberg (1985), representsmultidimensional data using lines.
• Whereas in traditional Cartesian coordinates all axes are mutuallyperpendicular, in parallel coordinate plots, all axes are parallel to one
another and equally spaced.• In this approach, a point inm-dimensional space is represented as a
series of m-1 line segments in 2-dimensional space. Thus, if the originaldata observation is written as (x1, x2, … xm,), then its parallel coordinaterepresentation is them-1 line segments connecting points (1,x1), (2,x2), . .. (m,xm).
• Typically, features will be standardized before a parallel coordinate plot isdrawn.
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 61/99
Flower and some of its parts
l i li i i
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 62/99
65
Example: Visualizing Iris Data
sepallength
sepalwidth
petallength
petalwidth
5.1 3.5 1.4 0.2
4.9 3 1.4 0.2
... ... ... ...
5.9 3 5.1 1.8
Iris setosa
Iris versicolor
Iris virginica
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 63/99
66
Parallel CoordinatesSepal
Length
5.1
sepallength
sepalwidth
petallength
petalwidth
5.1 3.5 1.4 0.2
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 64/99
67
Parallel Coordinates: 2 DSepal
Length
5.1
SepalWidth
3.5
sepallength
sepalwidth
petallength
petalwidth
5.1 3.5 1.4 0.2
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 65/99
68
Parallel Coordinates: 4 DSepal
Length
5.1
SepalWidth
Petal
length
Petal
Width
3.5
sepallength
sepalwidth
petallength
petalwidth
5.1 3.5 1.4 0.2
1.40.2
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 66/99
69
5.1
3.5
1.4 0.2
Parallel Visualization of Iris data
I B d T h i
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 67/99
March 30, 2012 Data Mining: Concepts and Techniques 70
Icon-Based Techniques
Basic Idea Visualization of the data values as features of icons
Overview
Chernoff-Faces [Che 73, Tuf 83] Stick Figures [Pic 70, PG 88]
Shape Coding [Bed 90]
Color Icons [Lev 91, KK 94]
TileBars [Hea 95]
(use of small icons representing the relevance feature
vectors in document retrieval)
Stick Figures
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 68/99
March 30, 2012 Data Mining: Concepts and Techniques 71
census datashowing age,income, sex,education, etc.
Stick Figures
“two attributes
mapped to axes,
remaining attributes
mapped to angle or
length of limbs”. Look
at texture pattern
Dimensional Stacking
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 69/99
March 30, 2012 Data Mining: Concepts and Techniques 72
Dimensional Stacking [LWW 90]
partitioning of the n-dimensional attribute space in 2-
dimensional subspaces which are „stacked‟ into each other
partitioning of the attribute value ranges into classes theimportant attributes should be used on the outer levels
adequate especially for data with ordinal attributes of low
cardinality
attribute 1
attribute 2
attribute 3
attribute 4
Hierarchical Techniques
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 70/99
March 30, 2012 Data Mining: Concepts and Techniques 73
Hierarchical Techniques
Basic Idea: Visualization of the data using ahierarchical partitioning into subspaces.
Overview
Dimensional Stacking [LWW 90] Worlds-within-Worlds [FB 90a/b]
Treemap [Shn 92, Joh 93]
Cone Trees [RMC 91] InfoCube [RG 93]
Di e io l St ki
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 71/99
March 30, 2012 Data Mining: Concepts and Techniques 74
Used by permission of M. Ward, Worcester Polytechnic InstituteVisualization of oil mining data with longitude andlatitude mapped to the outer x-, y-axes and ore gradeand depth mapped to the inner x-, y-axes
Dimensional Stacking
Dimensional Stacking
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 72/99
March 30, 2012 Data Mining: Concepts and Techniques 75
Dimensional Stacking
Disadvantages: Difficult to display more than nine dimensions
Important to map dimensions appropriately
May be difficult to understand visualizations atfirst
C T
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 73/99
Cone Tree
16
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 74/99
March 30, 2012 Data Mining: Concepts and Techniques 77
Cone Trees
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 75/99
Cone Trees
research.microsoft.com/~ggr/gi97.ppt17
Treemap [JS 91 Shn 92 Joh 93]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 76/99
March 30, 2012 Data Mining: Concepts and Techniques 79
Screen-filling method which uses a hierarchical
partitioning of the screen into regions depending on theattribute values
The x- and y-dimension of the screen are partitionedalternately according to the attribute values (classes)
Treemap [JS 91, Shn 92, Joh 93]
MSR Netscan image:
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 77/99
treemaps
Chris Stolte
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 78/99
+ 2003: PhD Dissertation, Stanford
+ Extended the semiology from Bertin & Mackinlay
+ VizQL connected visualizations to databases
+ Accessible drag-and-drop interface
VizQL
Query Data Interpreter Visual Interpreter View
3D GUI for Web Browsing
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 79/99
3D GUI for Web Browsing
22
3D GUI for Web Browsing
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 80/99
3D GUI for Web Browsing
http://research.microsoft.com/ui/TaskGallery/index.htm 23
Web Forager
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 81/99
Web Forager
http://research.microsoft.com/ui/TaskGallery/index.htm 24
Web Book
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 82/99
Web Book
research.microsoft.com/~ggr/gi97.ppt 25
3D GUI for Desktop
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 83/99
3D GUI for Desktop
http://research.microsoft.com/ui/TaskGallery/index.htm 26
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 84/99
Introduction Approach Related Work PhD Work Next?
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 85/99
October 13, 2006 Nathalie HenryDoctoral Consortium
88
Social Networks
Vizster [Heer 2006]
Infovis Co-authoring Network [Börner et al. 2004]
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 86/99
Presentation at Nowcasting Symposium, Design/Media ArtsDepartment, UCLA, Los Angeles, October 16-17, 2009(original title: “Cultural Analytics annual report 6/2008-9/2009)
Dr. Lev ManovichDirector, Software Studies Initiative, Calit2 + UCSDProfessor, Visual Arts [email protected]
You can capture this lecture using any media and share it.Follow our research: softwarestudies.com
Cultural Visualization techniques
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 87/99
cultural analytics research @Software Studies Initiative atCalit2/UCSD - key people:
Dr. Jeremy Douglass | Postdoctoral ResearcherTara Zepel | PhD student, Art HistorySunsern Cheamanunkul | PhD student, Computer Science
So Yamaoka | PhD student, Computer ScienceWilliam Huber | PhD student, Art History
for the expanded list of participants, see softwarestudies.com
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 88/99
Our research is made possible by the support from:
Center for Research in Computing and the Arts (CRCA)California Institute for Telecommunication andInformation (Calit2)
NEH Office of Digital HumanitiesNational Energy Research Scientific Computing CenterSingapore Ministry of EducationBergen University (Norway)
UCHRI
o ware u es n a ve o a ora ors:Yuri Tsivian, Department of Art History, University of Chicago: cinemetrics.lv | filmanalysis
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 89/99
analysis Adele Eisenstein: Digital Formalism project (Department for Theatre, Film and MediaStudies (TFM), Vienna University; the Austrian Film Museum; Interactive MediaSystems Group, Vienna University of Technology) | film analysis
Netherlands Institute for Sound and Vision | television and motion graphicsanalysisSan Diego Museum of Contemporary Art | mapping art for an exhibition Isabel Galhano Rodrigues, University of Porto, Portugal | gesture analysis David Kirsh, Cognitive Science, UCSD | dance video analysis
Ph.D. students, Art History, UCSD | art history and visual culture
Jim Hollan, Cognitive Science, UCSD | visualization | cultural analytics softwareFalko Kuester, Structural Engineering, UCSD + Calit2 | visual analytics | culturalanalytics software Yoav Freund, Computer Science and Engineering, UCSD | machine learning andmachine vision | cultural analytics software Kay O’Halloran, Multimodal Analysis Lab, National University of Singapore | MappingAsian Cultures projectGiorgos Cheliotis: Communication and New Media, National University of Singapore |Mapping Asian Cultures projectMatthew Fuller: Goldsmiths College, University of London | software studies, gamestudies
goals of cultural analytics:- being able to better represent the complexity, diversity,
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 90/99
variability, and uniqueness of cultural processes and artifacts- develop techniques to describe the dimensions of cultural
artifacts and cultural processes which until now received littleor no attention (such as gradual temporal change)- create much more inclusive cultural histories and analysis -ideally taking into account all available cultural objects createdin particular cultural area and time period (“art history without
names”)
- democratize cultural research by creating open-source toolsfor cultural analysis and visualization- create interfaces for exploration of cultural data which
operate across multiple scales - from details of structure of aparticular individual cultural artifact/processes to massivecultural data sets/flows
cultural analytics - typical steps:
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 91/99
-1) description (i.e, “culture into data”):
a) manual: annotation, tagging
b) automatic: software analysis of media; capturing user activity
our focus: easy-to-use techniques for automatic description of visual and
interactive media
2) optional: statistical data analysis
3) data visualization (reduction, summarization) and data mapping
(expansion, outlining, layering)
our focus: new visualization + mapping techniques appropriate for
interactive exploration of large sets of visual objects
4) interpretation (humanities), or explanation (science), or correlation
(social science)
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 92/99
Cultural Analytics software running on HIPerSpace (May 2009)
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 93/99
Cultural Analytics software running on HIPerSpace (May 2009)
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 94/99
Cultural Analytics software running on HIPerSpace (May 2009)
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 95/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 96/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 97/99
8/2/2019 DM and Visualization SJBIT 19 07 10
http://slidepdf.com/reader/full/dm-and-visualization-sjbit-19-07-10 98/99
Example: Home Finder ( Map )