33
Graph Exploration: Taking the User into the Loop Davide Mottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany 2016/10/24 CIKM2016, Indianapolis, US

Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

Embed Size (px)

Citation preview

Page 1: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

Graph Exploration: Taking the User into the Loop

Davide Mottin,AnjaJentzsch,EmmanuelMüllerHasso Plattner Institute,Potsdam,Germany

2016/10/24CIKM2016,Indianapolis,US

Page 2: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Where we are

D. MOTTIN, A. JENTZSCH, E. MÜLLER

ExploratoryGraphAnalysis(35min)

RefinementofQueryResults(35min)

FocusedGraphMining(35min)

103

Background(5 min)Graphmodels,subgraphisomorphism,subgraphmining,graphclustering

RealWorld-UseCase(15min)LinkedDatagraphs

Challengesanddiscussion

Page 3: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

The Web of Data

• 1,019datasets• 84+billionRDFtriples• 808+millionRDFlinksbetweendatasets

D. MOTTIN, A. JENTZSCH, E. MÜLLER 104

http://lod-cloud.net

Page 4: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Vocabularies on the Web of Data• TheWebofDataisheterogeneous⁃ Manyvocabulariesareinuse(576asofOctober2016)⁃ Manydifferentwaystorepresentthesameinformation

D. MOTTIN, A. JENTZSCH, E. MÜLLER 105

https://lov.okfn.org

Page 5: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

RDF Data Model

D. MOTTIN, A. JENTZSCH, E. MÜLLER 106

ns:cikm2016 ns:Event

dbpedia:Indianapolis

ACMConferenceonInformationandKnowledgeManagement(CIKM2016)

rdf:type

rdfs:label

ns:location

Page 6: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

RDF Data Model

D. MOTTIN, A. JENTZSCH, E. MÜLLER 107

ns:cikm2016 ns:Event

dbpedia:Indianapolis

ACMConferenceonInformationandKnowledgeManagement(CIKM2016)

rdf:type

rdfs:label

ns:location

820445

dbpedia:populationTotal

JosephH.Hogsett

dbpedia:leaderName

Page 7: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Linked Data exploration use cases

• Datasetexploration• Graphmining• Queryformulationandrefinement

! ButLinkedDataismessy

D. MOTTIN, A. JENTZSCH, E. MÜLLER 108

Page 8: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Linked Data graph exploration challenges

● Nestedgraphs � Makesreasoningdifficult

● Loosestructure � Thingshavedifferentpropertysets

● Incomplete � Missingpropertydefinitions

● Poorlyformatted � Propertytypesusedinconsistently

● Inconsistent � Multiplerepresentationsclaimoppositethings

D. MOTTIN, A. JENTZSCH, E. MÜLLER 109

Page 9: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Linked Data exploration systems timeline

D. MOTTIN, A. JENTZSCH, E. MÜLLER 110

Page 10: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

DBpedia Mobile

D. MOTTIN, A. JENTZSCH, E. MÜLLER 111

C.BeckerandC.Bizer.DBpedia mobile:Alocation-enabledlinkeddatabrowser.LDOW2008.

• displaysWikipediadataonmap• aggregatesdifferentdatasources

Page 11: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

RelFinder

D. MOTTIN, A. JENTZSCH, E. MÜLLER 112

Heim,P.,Hellmann,S.,Lehmann,J.,Lohmann,S.,andStegemann,T.RelFinder:RevealingRelationshipsinRDFKnowledgeBases.SAMT2009.

• visualizationofpathsbetweenany2entities• pathidentificationoninstancelevel

Page 12: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

gFacet

D. MOTTIN, A. JENTZSCH, E. MÜLLER 113

P.Heim,T.Ertl,andJ.Ziegler.Facetgraphs:Complexsemanticqueryingmadeeasy.TheSemanticWeb:ResearchandApplications.Springer,2010.

• Schemaexploration

• combinesgraph-basedvisualizationand

facetedfilteringtechniques

Page 13: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

graphVizdb

D. MOTTIN, A. JENTZSCH, E. MÜLLER 114

Bikakis,N.,Liagouris,J.,Krommyda,M.,Papastefanatos,G.andSellis,T.graphVizdb:Ascalableplatformforinteractivelargegraphvisualization.ICDE,2016

• Graphlayoutisindexedwithaspatialdatastructure,i.e.,anR-tree,andstoredinadatabase

• Inruntime,useroperationsaretranslatedintoefficientspatialoperations(i.e.,windowqueries)inthebackend

Page 14: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

LODeX

D. MOTTIN, A. JENTZSCH, E. MÜLLER 115

Benedetti,F.,Bergamaschi,S.andPo,L.Lodex:Atoolforvisualqueryinglinkedopendata.ISWC,2015

• ExploreaLinkedDatasetusingaschemasummary

• Pickgraphicalelementsfromittocreateavisualquery

• Browsetheresults• Refinethequery

Page 15: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Aemoo

D. MOTTIN, A. JENTZSCH, E. MÜLLER 116

A.Musetti,A.G.Nuzzolese,F.Draicchio,V.Presutti,E.Blomqvist,A.Gangemi,andP.Ciancarini.Aemoo:Exploratorysearchbasedonknowledgepatternsoverthesemanticweb.SemanticWebChallenge,2012.

• ExploratorysearchsystembasedonEncyclopedicKnowledgePatterns

• EKPareknowledgepatternsthatdefinethetypicalclassesusedtodescribeentitiesofacertainclass

Page 16: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Linked Jazz

D. MOTTIN, A. JENTZSCH, E. MÜLLER 117

M.C.Pattuelli,M.Miller,L.Lange,S.Fitzell,andC.Li-Madeo.Craftinglinkedopendataforculturalheritage:Mappingandcuration toolsforthelinkedjazzproject.Code4LibJournal,2013.

• revealsthenetworkofthesocialandprofessionalrelationswithintheAmericanjazzcommunity

Page 17: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Semantic Wonder Cloud

D. MOTTIN, A. JENTZSCH, E. MÜLLER 118

http://sisinflab.poliba.it/semantic-wonder-cloud/index/

Page 18: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

inWalk

D. MOTTIN, A. JENTZSCH, E. MÜLLER 119

Castano,S.,Ferrara,A.andMontanelli,S.inWalk:InteractiveandThematicWalksinsidetheWebofData.EDBT,2014

Page 19: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

ProLOD++ Mining Graph Patterns on the Web of Data

D. MOTTIN, A. JENTZSCH, E. MÜLLER 120

Jentzsch,A.,Dullweber,C.,Troiano,P.,Naumann,F.ExploringLinkedDataGraphStructures.ISWC2015.

ProLOD++● WebframeworkforvariousdataprofilingandminingtasksonLinkedDatasets

● ExplorativeresearchonLinkedDatasetgraphstofind⁃ frequentgraphpatterns⁃ commongraphpatternsforclasses⁃ generalgraphmodelforLinkedDatasets

https://prolod.org

Page 20: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

ProLOD++Graph pattern mining

D. MOTTIN, A. JENTZSCH, E. MÜLLER 121

Jentzsch,A.,Dullweber,C.,Troiano,P.,Naumann,F.ExploringLinkedDataGraphStructures.ISWC2015.

DefinitionofcoresetoffrequentgraphpatternsinLinkedDatasetsbasedonsatellitecomponentanalysis

Page 21: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

ProLOD++Graph patterns

D. MOTTIN, A. JENTZSCH, E. MÜLLER 122

Jentzsch,A.,Dullweber,C.,Troiano,P.,Naumann,F.ExploringLinkedDataGraphStructures.ISWC2015.

● Groupclass-coloured graphsbytheirpermutationgroups[Luks82]⁃ Permutationgroup:thesetofallautomorphisms ofagraph

drugtargetunknown

Page 22: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Loupe

D. MOTTIN, A. JENTZSCH, E. MÜLLER 123

Mihindukulasooriya,N.,Poveda-Villalón,M.,García-Castro,R.and Gómez-Pérez,A.Loupe - An OnlineTool for InspectingDatasets inthe Linked DataCloud.ISWC2015.

• Frequenttriplepatterns• Graphicalontologybrowsing

Page 23: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Requirements for Linked Data exploratory search systems

D. MOTTIN, A. JENTZSCH, E. MÜLLER 124

• Thesystemprovidesefficientoverviews• Thesystemhelpstheusertounderstandtheinformationspaceandtoshapehismentalmodel

• Theusercanexploremultiple,heterogeneousresultsandbrowsingpaths

• Thesystemeasesthememorizationofrelevantresults

• Thesysteminspirestheuserandshapeshisinformationneed• Thesystemprovokesdiscoveries

Page 24: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Challenges

D. MOTTIN, A. JENTZSCH, E. MÜLLER 125

● Displayingthegraphforexploration⁃ E.g.byclusteringoftopicaldomains⁃ Allowingtheusertodrilldown

● Livegraphexploration⁃ E.g.viafederatedSPARQLqueries▪ RequiresknowledgeonendpointURIs▪ Slowinreal-time

● Guidingtheusertointerestingpartsofthegraph⁃ Usuallydonebyentityinlinks▪ Limitedinsights

Page 25: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Tutorial outline

D. MOTTIN, A. JENTZSCH, E. MÜLLER

ExploratoryGraphAnalysis(35min)

RefinementofQueryResults(35min)

FocusedGraphMining(35min)

126

Background(5 min)Graphmodels,subgraphisomorphism,subgraphmining,graphclustering

RealWorld-UseCase(15min)LinkedDatagraphs

Challengesanddiscussion

Page 26: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Summary of Exploratory Graph Analysis

D. MOTTIN, A. JENTZSCH, E. MÜLLER 127

ApproximateQueries• UserqueryisimpreciseBy-Examplemethods• Userqueryisanexampleresult

• Onlyneedapartialknowledgeonthedata

• Noneedforcomplicatequerylanguages(useexamples,partialdescriptions)

• Thequeryadaptstouserneed• Enableexploratorysearchby

usingsmallqueriesonthedata

Query(agraph)

Graph

?

Query(anexample)

Graph

Page 27: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Challenges for Exploratory Graph Analysis

D. MOTTIN, A. JENTZSCH, E. MÜLLER 128

• Unsupportedinmostofthecurrentgraphdatabases• No”universal”indextoanswermultipletypeofqueries• Partitioningonlyforexactqueryanswering

Database

• Userinteractivityintheexplorationprocess• Nosolutionsforprobabilisticgraphs• Respondtoquerieswhilethegraphchanges• Findexamplesinstreamingsettings

InformationretrievalInformation

retrieval

• Exploitingquerylogsforpersonalizedqueryanswering• Retrieveresultsinformofdocumentsconvertingthequery

structures

Datamining

Page 28: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Summary of Focused Graph Mining

D. MOTTIN, A. JENTZSCH, E. MÜLLER 129

Thefocusonindividualuserinterest…asQuery totheGraphMiningSystem…asSeedNode(s) forLocalSearch…asAttributes andWeights

• getorinferuserinterestà unexpectedresults

• interactiveexplorationà intuitiveparametrization

• adaptivegraphminingà individuallocalsearch

Page 29: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Challenges for Focused Graph Mining

D. MOTTIN, A. JENTZSCH, E. MÜLLER 130

Datamining

scale

Userinteractivityinthegraphminingprocess• unsupportedinmostofthecurrentgraphminingalgorithms• hugevarietyofuserinteractions possible• feedbackloopneedstobeunified andbecomeexchangeable

Revolutionofformalmodelsandsearchalgorithms• insufficientextensionsofexistingmodelsandalgorithms• adaptivesteering ofalgorithmsvs.fixedparametrization• evaluationofalgorithmswithuserstudies

Scalabilityofalgorithmsforreal-timeinteraction• NP-hardproblems,heuristicalgorithms,…,stillnotscalable• exploittheuserinterest forpruningthesearchspace

Page 30: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Summary of Refinement of Query Results

D. MOTTIN, A. JENTZSCH, E. MÜLLER 131

Refinement• Theuserqueryistoorestrictiveortoo

genericTop-kResults• QueriestypicallyhaveinexactmatchesSkylineQueries• Findsmallsetofinterestingitemswith

manydimensionsandincrementalupdates

• Theusermighthaveaverygenericideaofhowtodescribethestructureofinterest

• Thesystemguidestheusertowardstheanswerwithsimplesteps

• Theresultsareexplainedwithreformulations

• Thequerymatchesareinexactandinteresting

Page 31: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

Challenges for Refinement of Query Results

D. MOTTIN, A. JENTZSCH, E. MÜLLER 132

• Realtimeperformance• Profilingofqueriesforoptimizedperformance

• Personalizedreformulationsandinteractivity• Facetsearchdiscoveryingraphs

• Uncertaingraphdata

Database

InformationretrievalInformation

retrieval

Datamining

Page 32: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

CIKM 2016 TUTORIAL

The missing tiles in graph exploration

D. MOTTIN, A. JENTZSCH, E. MÜLLER 133

Interactivity Adaptivity

Personalization Scalability

Page 33: Graph Exploration: Taking the User into the Loop Exploration: Taking the User into the Loop DavideMottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany

Questions?

CIKM 2016 TUTORIAL D. MOTTIN, A. JENTZSCH, E. MÜLLER 134

Slides:https://hpi.de//mueller/tutorials/graph-exploration.html