2
University of Groningen Visual Explanation of Multidimensional Projections Rodrigues Oliveira da Silva, Renato; Rauber, Paulo Eduardo; Messias Martins, Rafael; Telea, Alexandru IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below. Document Version Publisher's PDF, also known as Version of record Publication date: 2015 Link to publication in University of Groningen/UMCG research database Citation for published version (APA): Rodrigues Oliveira da Silva, R., Rauber, P. E., Messias Martins, R., & Telea, A. (2015). Visual Explanation of Multidimensional Projections. Poster session presented at ICT.Open, Amersfoort, Netherlands. Copyright Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons). Take-down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum. Download date: 11-02-2018

University of Groningen Visual Explanation of ...Renato Rodrigues Oliveira da Silva Johann Bernoulli Institute for Mathematics and Computer Science University of Groningen Nijenborgh

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: University of Groningen Visual Explanation of ...Renato Rodrigues Oliveira da Silva Johann Bernoulli Institute for Mathematics and Computer Science University of Groningen Nijenborgh

University of Groningen

Visual Explanation of Multidimensional ProjectionsRodrigues Oliveira da Silva, Renato; Rauber, Paulo Eduardo; Messias Martins, Rafael; Telea,Alexandru

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite fromit. Please check the document version below.

Document VersionPublisher's PDF, also known as Version of record

Publication date:2015

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):Rodrigues Oliveira da Silva, R., Rauber, P. E., Messias Martins, R., & Telea, A. (2015). Visual Explanationof Multidimensional Projections. Poster session presented at ICT.Open, Amersfoort, Netherlands.

CopyrightOther than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of theauthor(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policyIf you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediatelyand investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons thenumber of authors shown on this cover page is limited to 10 maximum.

Download date: 11-02-2018

Page 2: University of Groningen Visual Explanation of ...Renato Rodrigues Oliveira da Silva Johann Bernoulli Institute for Mathematics and Computer Science University of Groningen Nijenborgh

Visual Explanation ofMultidimensional Projections

Renato R. O. da Silva1,2, Paulo E. Rauber1, Rafael M. Martins1,2

supervisor: Alexandru C. Telea11 - Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen

2 - Institute of Mathematical and Computer Sciences, University of São Paulo

Motivation

M ultidimensional Projections (MPs) are key tools used to support theanalysis of multidimensional data. MPs can project data to a low

dimensional representation, typically visualized as a 2D scatterplot wheresimilar elements are conveniently positioned in close neighborhoods. How-ever such visualizations tell us which points are similar, but not why. Ouraim is, thus, to enrich 2D MP scatterplots with explanatory informationtelling users which key dimensions make closely-projected points similar.

Figure 1: Multidimensional Projection pipeline: A MP technique aims toproject data into a low dimensional space based on the nD distances. Theprojection is typically visualied as a 2D scatterplot, whose points positionsdepict their similarity of the original nD space.

Explanation maps

T o explain a projection we identify which dimensions are more impor-tant to define similarity among close points. First assign a rank toeach of the n dimensions for each projected point, based on increasingorder of data variance in that dimension over that neighborhood. We alsodefine the ranking confidence for each point based on how mixed arethe top-ranked dimensions on their neighborhood points.Next we display the top ranks and their confidences over the projec-tion using a dense map technique based on nearest-neighbor (Voronoi)interpolation, with top-ranks encoded by a categorical colormap andconfidences by brightness.

Figure 2: Explanation map of projected points sampled from three cubefaces. Points on the same cube-face have the same value for one dimen-sion d, so they have assigned the same color. Color brightness indicatethe confidence ranking, from hight (bright) to low (dark). A histogramin the right indicates the top-ranked dimensions frequencies and theirlabels.

Map color coding scheme

M any datasets might have more dimensions than the available colorsin a categorical colormap. In that case we only define colors to the C

more frequent top-ranked dimensions of the projected points. Dimensionshaving top ranks (for fewer points) which do not get mapped to colors,due to the colormap’s limited size C, are mapped to a reserved color.

Single dimension explanation

C lusters created by the projection layout can be explained using thetop-ranked dimension for each point. The regions defined by areas of

same color can create clusters sub-partitions, grouping elements basedon the dimension that best explains their similarities.

Figure 3: Projected clusters can be explained and further partitioned bycompact same-color areas, which are defined by the same top-rankeddimension among points neighborhoods.

Dimension set map

T o explain groups in a deeper level of detail we can utilize more thanone dimension per point. Based on an user-defined threshold τ , wecan select the dimensions whose ranks sum up to at most τ and create aset of dimensions per point. Next we assign colors to the most frequentdimension sets in the map, creating more specialized sub-partitions.

Figure 4: Based on an user-defined similarity threshold, each point willdefine a set of more important dimensions. The more frequent ones arevisualized as specialized sub-partitions of the points clusters.

Future work

O ngoing work targets better visual encodings that map dimensionnames atop of projection point-groups. We also want to explaining re-

gions by both dimensions and dimension-values, thereby leading to morerefined explanations.

Contact

Renato Rodrigues Oliveira da SilvaJohann Bernoulli Institute for Mathematics and Computer ScienceUniversity of GroningenNijenborgh 99747 AG Groningen

e-mail: [email protected]

This work was supported by FAPESP, process no 2014/01692-6