24
Ilias Tachmazidis 1,2 , Grigoris Antoniou 1,2,3 , Giorgos Flouris 2 , Spyros Kotoulas 4 1 University of Crete 2 Foundation for Research and Technolony, Hellas (FORTH) 3 University of Huddersfield 4 Smarter Cities Technology Centre, IBM Research, Ireland

Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

  • Upload
    others

  • View
    12

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4

1 University of Crete2 Foundation for Research and Technolony, Hellas (FORTH)

3 University of Huddersfield4 Smarter Cities Technology Centre, IBM Research, Ireland

Page 2: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

MotivationBackground◦ Defeasible Logic◦ MapReduce Framework◦ RDFMulti-Argument Implementation over RDFExperimental EvaluationFuture Work

Page 3: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Huge data set coming from◦ the Web, government authorities, scientific

databases, sensors and moreDefeasible logic ◦ is suitable for encoding commonsense knowledge

and reasoning◦ avoids triviality of inference due to low-quality dataDefeasible logic has low complexity◦ The consequences of a defeasible theory D can be

computed in O(N) time, where N is the number of symbols in D

Page 4: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Reasoning is performed in the presence of defeasible rulesDefeasible logic has been implemented for in-memory reasoning, however, it was not applicable for huge data setsSolution: scalability/parallelization using the MapReduce framework

Page 5: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Facts ◦ e.g. bird(eagle)Strict Rules◦ e.g. bird(X) → animal(X)Defeasible Rules◦ e.g. bird(X) ⇒ flies(X)

Page 6: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Defeaters◦ e.g. brokenWing(X) ↝ ¬ flies(X)

Priority Relation (acyclic relation on the set of rules)◦ e.g. r: bird(X) ⇒ flies(X)

r’: brokenWing(X) ⇒ ¬ flies(X)r’ > r

Page 7: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Inspired by similar primitives in LISP and other functional languagesOperates exclusively on <key, value> pairsInput and Output types of a MapReduce job:◦ Input: <k1, v1>◦ Map(k1,v1) → list(k2,v2)◦ Reduce(k2, list (v2)) → list(k3,v3)◦ Output: list(k3,v3)

Page 8: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Provides an infrastructure that takes care of◦ distribution of data◦ management of fault tolerance◦ results collectionFor a specific problem◦ developer writes a few routines which are following

the general interface

Page 9: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Rule sets can be divided into two categories:◦ Stratified◦ Non-stratifiedPredicate Dependency Graph

Page 10: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Consider the following rule set:◦ r1: X sentApplication A, A completeFor D ⇒ X acceptedBy D.◦ r2: X hasCerticate C, C notValidFor D ⇒ X ¬acceptedBy D.◦ r3: X acceptedBy D, D subOrganizationOf U ⇒

X studentOfUniversity U.◦ r1 > r2.Both acceptedBy and ¬acceptedBy are represented by acceptedBySuperiority relation is not part of the graph

Page 11: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Initial pass: ◦ Transform facts into <fact, (+Δ, +∂)> pairsNo reasoning needs to be performed for the lowest stratum (stratum 0)For each stratum from 1 to N◦ Pass1: Calculate fired rules◦ Pass2: Perform defeasible reasoning

Page 12: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

INPUTLiterals in multiple files

File02‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐

<John hasCerticate Cert, (+Δ, +∂)><Cert notValidFor Dep, (+Δ, +∂)>

<Dep subOrganizationOf Univ, (+Δ, +∂)>

File01‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐

<John sentApplication App, (+Δ, +∂)> <App completeFor Dep, (+Δ, +∂)> 

<key, < John sentApplication App, (+Δ, +∂)>>

< key, < App completeFor Dep, (+Δ, +∂)>>

< key, < John hasCerticate Cert, (+Δ, +∂)>>

< key, < Cert notValidFor Dep, (+Δ, +∂)>>

< key, < Dep subOrganizationOf Univ, (+Δ, +∂)>>

MAP phase Input<position in file, literal and knowledge>

Page 13: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Group

ing/Sorting

<App, <(John,sentApplication,+Δ,+∂),(Dep,completeFor,+Δ,+∂)>>

<Cert,<(John,hasCerticate,+Δ,+∂),(Dep,notValidFor,+Δ,+∂)>>

Reduce phase Input<matchingArgValue, 

List(Non‐MatchingArgValue,Predicate, knowledge)>

MAP phase Output<matchingArgValue, 

(Non‐MatchingArgValue,Predicate, knowledge)>

<App,(John,sentApplication,+Δ,+∂)>

<App,(Dep,completeFor,+Δ,+∂)>

<Cert, (John,hasCerticate,+Δ,+∂)>

<Cert, (Dep,notValidFor,+Δ,+∂)>

Page 14: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

<John acceptedBy Dep, (+∂, r1)>

<John acceptedBy Dep,(¬, +∂,r2)>

Reduce phase Output (Final Output)

<literal and knowledge>

Page 15: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

INPUTLiterals in multiple files

File01‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐

<John sentApplication App, (+Δ, +∂)> <App completeFor Dep, (+Δ, +∂)> <John hasCerticate Cert, (+Δ, +∂)><Cert notValidFor Dep, (+Δ, +∂)>

<Dep subOrganizationOf Univ, (+Δ, +∂)>

File02‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐‐

<John acceptedBy Dep, (+∂, r1)><John acceptedBy Dep, (¬, +∂, r2)>

MAP phase Input<position in file, literal and knowledge>

<key, < Dep subOrganizationOf Univ, (+Δ,+∂)>>

< key, < John acceptedBy Dep, (+∂, r1)>>

< key, < John acceptedBy Dep, (¬, +∂, r2)>>

<key, < John sentApplication App, (+Δ, +∂)>>

< key, < App completeFor Dep, (+Δ, +∂)>>

< key, < John hasCerticate Cert, (+Δ, +∂)>>

< key, < Cert notValidFor Dep, (+Δ, +∂)>>

Page 16: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

< Dep subOrganizationOf Univ, (+Δ,+∂)>

< John acceptedBy Dep, (+∂, r1)>

< John acceptedBy Dep, (¬, +∂, r2)>

MAP phase Output<literal, knowledge>

Group

ing/Sorting

< Dep subOrganizationOf Univ, (+Δ,+∂)>

< John acceptedBy Dep, <(+∂, r1), (¬, +∂, r2)>>

Reduce phase Input<literal, list(knowledge)>

Page 17: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

< John acceptedBy Dep, (+∂)>

Reduce phase Output (Final Output)

<Conclusions after reasoning>

No output

Page 18: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

LUBM (up to 1B)Custom defeasible ruleset

IBM Hadoop Cluster v1.3 (Apache Hadoop0.20.2)40-core serverXIV storage SAN

Page 19: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research
Page 20: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research
Page 21: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research
Page 22: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Challenges of Non-Stratified Rule SetsAn efficient mechanism is need for –Δ and -∂◦ all the available information for the literal must be

processed by a single node causing:main memory insufficiencyskewed load balancing

Storing conclusions for +/–Δ and +/-∂ is not feasible◦ Consider the cartesian product of X, Y, Z for

X Predicate1 Y, Y Predicate2 Z.

Page 23: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research

Run extensive experiments to test the efficiency of multi-argument defeasible logicApplications on real datasets, with low-quality dataMore complex knowledge representation methods such as: ◦ Answer-Set programming◦ Ontology evolution, diagnosis and repairAI Planning

Page 24: Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos ......Ilias Tachmazidis1,2, Grigoris Antoniou1,2,3, Giorgos Flouris2, Spyros Kotoulas4 1University of Crete 2Fou ndatio for Research