
IR Models based on Predicate Logic


Norbert Fuhr


The logical view on IR

  • IR as inference
  • IR as uncertain inference
  • Propositional vs. predicate logic


IR as inference

q: query
d: document
Retrieval: search for documents that imply the query, d → q.

Example (classical IR):
  d = {t1, t2, t3}
  q = {t1, t3}
  retrieval: q ⊆ d ?

Logical view:
  d = t1 ∧ t2 ∧ t3
  q = t1 ∧ t3
  retrieval: d → q ?
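The correspondence between the set view and the logical view can be checked mechanically. The following is a minimal Python sketch, assuming that documents and queries are plain conjunctions of terms; the function names and the extra term t4 are illustrative and not taken from the slides.

```python
from itertools import product

def retrieves_set_view(d, q):
    """Classical IR: every query term occurs in the document."""
    return q <= d

def implies(d, q, vocabulary):
    """Logical view: d -> q holds under every truth assignment to the terms."""
    terms = sorted(vocabulary)
    for values in product([True, False], repeat=len(terms)):
        world = dict(zip(terms, values))
        d_true = all(world[t] for t in d)   # d is the conjunction of its terms
        q_true = all(world[t] for t in q)   # q is the conjunction of its terms
        if d_true and not q_true:
            return False                    # counterexample to d -> q
    return True

d = {"t1", "t2", "t3"}
q = {"t1", "t3"}
vocabulary = d | q | {"t4"}                 # t4 is an extra, unused term

print(retrieves_set_view(d, q), implies(d, q, vocabulary))
```

For conjunctions of positive terms, both tests agree: the document is retrieved exactly when the query terms form a subset of the document terms.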


Advantage of the inference-based approach: it is a step from term-based to knowledge-based retrieval, e.g. additional knowledge is easy to incorporate.

Example:
  d: 'squares'
  q: 'rectangles'
  thesaurus: 'squares' → 'rectangles'
  ⇒ d → q
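One way to picture how a thesaurus enters the inference is forward chaining over term implications. This is a minimal sketch under the assumption that the thesaurus is a mapping from a term to the terms it implies; `closure` and `retrieves` are hypothetical helper names.

```python
def closure(terms, thesaurus):
    """Add every term that follows from the given terms via the thesaurus."""
    known = set(terms)
    frontier = list(terms)
    while frontier:
        term = frontier.pop()
        for implied in thesaurus.get(term, ()):
            if implied not in known:
                known.add(implied)
                frontier.append(implied)
    return known

def retrieves(d, q, thesaurus):
    """d -> q holds if all query terms follow from the document terms."""
    return set(q) <= closure(d, thesaurus)

thesaurus = {"squares": ["rectangles"]}     # 'squares' -> 'rectangles'
print(retrieves({"squares"}, {"rectangles"}, thesaurus))
```

The implication is directed: a document about rectangles is not retrieved for a query about squares.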


IR as uncertain inference

  d: 'quadrangles'
  q: 'rectangles'
  ⇒ uncertain knowledge required:

  'quadrangles' →(0.3) 'rectangles'

[Rijsbergen 86]: IR as uncertain inference.
Retrieval = estimate the probability P(d → q) = P(q|d).

[Figure: document d and query q over the terms t1, ..., t6]


Propositional vs. Predicate Logic

Limitations of propositional logic:

Conventional indexing (based on propositional logic): d = {tree, house}
Query: Is there a picture with a tree to the left of the house?
⇒ The query cannot be expressed in propositional logic.

predicate logic:

d: tree(t1). house(h1). left(t1,h1).

?- tree(X) & house(Y) & left(X,Y).
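The query above is a conjunctive query over ground facts. The sketch below evaluates it by nested iteration in Python; it assumes the fact left(t1,h1), i.e. that the tree is to the left of the house, and the tuple encoding of facts is an illustrative choice.

```python
# Ground facts as (predicate, arguments) pairs.
facts = {
    ("tree", ("t1",)),
    ("house", ("h1",)),
    ("left", ("t1", "h1")),   # assumption: t1 is to the left of h1
}

def answers(facts):
    """Evaluate ?- tree(X) & house(Y) & left(X,Y)."""
    trees = [args[0] for name, args in facts if name == "tree"]
    houses = [args[0] for name, args in facts if name == "house"]
    left = {args for name, args in facts if name == "left"}
    return [(x, y) for x in trees for y in houses if (x, y) in left]

print(answers(facts))
```

Without the binary `left` fact, the term set {tree, house} alone yields no answer, which is the limitation of propositional indexing described above.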


Relational Structures: Datalog

Datalog program: a finite set of rules, each expressing a conjunctive query

  t(X1, ..., Xk) :- r1(U11, ..., U1m1), ..., rn(Un1, ..., Unmn)

where each variable Xi occurs in the body of the rule (this way, every rule is safe).

woman(X) :- person(X), sex(X,female).

path(X,Y) :- link(X,Y).

path(X,Z) :- link(X,Y), path(Y,Z).
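The two path rules can be evaluated bottom-up until no new facts are derived. The following is a minimal sketch of such a naive fixpoint iteration in Python; the link facts are hypothetical, and real Datalog engines typically use more efficient strategies such as semi-naive evaluation.

```python
def path_closure(link):
    """Naive bottom-up evaluation of the two path rules until a fixpoint."""
    path = set(link)                           # path(X,Y) :- link(X,Y).
    while True:
        # path(X,Z) :- link(X,Y), path(Y,Z).
        derived = {(x, z) for (x, y) in link for (y2, z) in path if y == y2}
        if derived <= path:                    # nothing new: fixpoint reached
            return path
        path |= derived

link = {("a", "b"), ("b", "c"), ("c", "d")}    # hypothetical link facts
print(sorted(path_closure(link)))
```

Because there are no function symbols, the set of derivable facts is finite, so the loop terminates even for cyclic link graphs.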


Datalog rules

  t(X1, ..., Xk) :- r1(U11, ..., U1m1), ..., rn(Un1, ..., Unmn)

corresponds to the logical formula

  ∀X1 ... ∀Xk ∀U11 ... ∀Unmn
    t(X1, ..., Xk) ∨ ¬r1(U11, ..., U1m1) ∨ ... ∨ ¬rn(Un1, ..., Unmn)

which is equivalent to (r1(U11, ..., U1m1) ∧ ... ∧ rn(Un1, ..., Unmn)) → t(X1, ..., Xk).

t(X1, ..., Xk) is called the head and r1(U11, ..., U1m1), ..., rn(Un1, ..., Unmn) the body.

A rule without a body is also called a fact.


Datalog Properties

  • Horn predicate logic
  • no functions
  • restricted forms of negation allowed:
      t(X1, ..., Xk) :- r1(U11, ..., U1m1), ..., ¬rn(Un1, ..., Unmn)
  • rules may be recursive (the head predicate may occur in the body):
      r(X,Y) :- l(X,Z), r(Z,Y)
  • sound and complete evaluation algorithms


IR and Databases: The Logic View

Retrieval

  • DB: given query q, find objects o with o → q
  • IR: given query q, find documents d with high values of P(d → q)

DB is a special case of IR (in a certain sense)!

This section focuses on the logic view:

  • Inference
  • Vague predicates
  • Query language expressiveness


Inference

  • IR with the Relational Model
  • The Probabilistic Relational Model
  • Interpretation of probabilistic weights
  • Extensions
      • Disjoint events
      • Relational Bayes
      • Probabilistic rules


IR with the Relational Model

Relational Model: Projection

index
DOCNO  TERM
1      ir
1      db
2      ir
3      db
3      oop
4      ir
4      ai
5      db
5      oop

topic
ir
db
oop
ai

Projection: what is the collection about?

  topic(T) :- index(D,T).


Relational Model: Selection

index: same relation as in the projection example.

aboutir
1
2
4

Selection: which documents are about IR?

  aboutir(D) :- index(D,ir).


Relational Model: Join

index: same relation as in the projection example.

author
DOCNO  NAME
1      smith
2      miller
3      johnson
4      firefly
4      bradford
5      bates

irauthor
smith
miller
firefly
bradford

Join: who writes about IR?

  irauthor(A) :- index(D,ir) & author(D,A).


Relational Model: Union

index: same relation as in the projection example.

irordb
1
2
3
4
5

Union: which documents are about IR or DB?

  irordb(D) :- index(D,ir).
  irordb(D) :- index(D,db).


Relational Model: Difference

index: same relation as in the projection example.

irnotdb
2
4

Difference: which documents are about IR, but not DB?

  irnotdb(D) :- index(D,ir) & not(index(D,db)).
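The five relational operations shown in this part (projection, selection, join, union, difference) can be mirrored with Python sets of tuples. This is an illustrative sketch only: the relation contents are copied from the slides, while the set-comprehension encoding is an assumption of this example.

```python
index = {(1, "ir"), (1, "db"), (2, "ir"), (3, "db"), (3, "oop"),
         (4, "ir"), (4, "ai"), (5, "db"), (5, "oop")}
author = {(1, "smith"), (2, "miller"), (3, "johnson"),
          (4, "firefly"), (4, "bradford"), (5, "bates")}

# Projection: topic(T) :- index(D,T).
topic = {t for (_, t) in index}

# Selection: aboutir(D) :- index(D,ir).
aboutir = {d for (d, t) in index if t == "ir"}
aboutdb = {d for (d, t) in index if t == "db"}

# Join: irauthor(A) :- index(D,ir) & author(D,A).
irauthor = {a for d in aboutir for (d2, a) in author if d2 == d}

# Union: two rules with the same head.
irordb = aboutir | aboutdb

# Difference: irnotdb(D) :- index(D,ir) & not(index(D,db)).
irnotdb = aboutir - aboutdb

print(sorted(topic), sorted(aboutir), sorted(irauthor), sorted(irordb), sorted(irnotdb))
```

Each Datalog rule maps to one set expression, which is why non-recursive Datalog with negation has the expressiveness of relational algebra.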


The Probabilistic Relational Model

[Fuhr & Roelleke 97] [Suciu et al 11]


index
β    DOCNO  TERM
0.8  1      IR
0.7  1      DB
0.6  2      IR
0.5  3      DB
0.8  3      OOP
0.9  4      IR
0.4  4      AI
0.8  5      DB
0.3  5      OOP

Which documents are about DB?

  aboutdb(D) :- index(D,db).

aboutdb
β    DOCNO
0.7  1
0.5  3
0.8  5

Which documents are about IR and DB?

  aboutirdb(D) :- index(D,ir) & index(D,db).

aboutirdb
β                 DOCNO
0.8 · 0.7 = 0.56  1
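The two probabilistic queries can be reproduced with a dictionary from tuples to weights. The sketch below assumes that the index tuples are independent events, so a conjunction multiplies the weights; `about` and `about_both` are hypothetical helper names.

```python
# Probabilistic index relation: (DOCNO, TERM) -> beta
index = {
    (1, "ir"): 0.8, (1, "db"): 0.7, (2, "ir"): 0.6,
    (3, "db"): 0.5, (3, "oop"): 0.8, (4, "ir"): 0.9,
    (4, "ai"): 0.4, (5, "db"): 0.8, (5, "oop"): 0.3,
}

def about(term):
    """Selection: keep the weight of each matching tuple."""
    return {d: p for (d, t), p in index.items() if t == term}

def about_both(term_a, term_b):
    """Conjunction of two independent tuples: multiply their weights."""
    a, b = about(term_a), about(term_b)
    return {d: a[d] * b[d] for d in a.keys() & b.keys()}

print(about("db"))
print(about_both("ir", "db"))
```

Only document 1 is indexed with both terms, so it is the only answer of the conjunctive query.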


Extensional vs. intensional semantics

docterm
β    DOC  TERM
0.9  d1   ir
0.5  d1   db

link
β    S   T
0.7  d2  d1

  about(D,T) :- docTerm(D,T).
  about(D,T) :- link(D,D1) & about(D1,T).
  q(D) :- about(D,ir) & about(D,db).

Extensional semantics: the weight of a derived fact is a function of the weights of its subgoals.

  P(q(d2)) = P(about(d2,ir)) · P(about(d2,db)) = (0.7 · 0.9) · (0.7 · 0.5) = 0.2205

Here the link weight 0.7 enters the product twice, although both subgoals depend on the same link event.

Problem: "improper treatment of correlated sources of evidence" [Pearl 88]
→ extensional semantics is only correct for tree-shaped inference structures


Intensional semantics

The weight of a derived fact is a function of the weights of the underlying ground facts.

Method: event keys and event expressions

docterm
β    κ          DOC  TERM
0.9  dT(d1,ir)  d1   ir
0.5  dT(d1,db)  d1   db

link
β    κ         S   T
0.7  l(d2,d1)  d2  d1

  ?- docTerm(D,ir) & docTerm(D,db).

gives

  d1  [dT(d1,ir) & dT(d1,db)]  0.9 · 0.5 = 0.45


Event keys and event expressions

docterm and link: same relations with event keys as in the intensional semantics example.

  about(D,T) :- docTerm(D,T).
  about(D,T) :- link(D,D1) & about(D1,T).

  ?- about(D,ir) & about(D,db).

gives

  d1  [dT(d1,ir) & dT(d1,db)]                        0.9 · 0.5 = 0.45
  d2  [l(d2,d1) & dT(d1,ir) & l(d2,d1) & dT(d1,db)]  0.7 · 0.9 · 0.5 = 0.315
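The difference between the two semantics for q(d2) can be verified numerically. The sketch below assumes independent event keys; it computes the extensional product, the intensional value where a repeated event key counts once, and a brute-force check over all possible worlds. The function names are illustrative.

```python
from itertools import product

# Probabilities of the ground events (event keys) from the tables above.
p = {"l(d2,d1)": 0.7, "dT(d1,ir)": 0.9, "dT(d1,db)": 0.5}

# Extensional: multiply the weights of the two subgoals of q(d2).
extensional = (p["l(d2,d1)"] * p["dT(d1,ir)"]) * (p["l(d2,d1)"] * p["dT(d1,db)"])

# Event expression for d2: l(d2,d1) & dT(d1,ir) & l(d2,d1) & dT(d1,db)
expression = ["l(d2,d1)", "dT(d1,ir)", "l(d2,d1)", "dT(d1,db)"]

def intensional(event_keys):
    """Probability of a conjunction; a repeated event key counts only once."""
    result = 1.0
    for key in set(event_keys):
        result *= p[key]
    return result

def by_possible_worlds(event_keys):
    """Sum the weights of all worlds in which every event key is true."""
    keys = list(p)
    total = 0.0
    for bits in product([True, False], repeat=len(keys)):
        world = dict(zip(keys, bits))
        weight = 1.0
        for k in keys:
            weight *= p[k] if world[k] else 1.0 - p[k]
        if all(world[k] for k in event_keys):
            total += weight
    return total

print(extensional, intensional(expression), by_possible_worlds(expression))
```

The possible-worlds sum agrees with the intensional value 0.315, while the extensional product 0.2205 underestimates it because the link event is counted twice.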


Recursion

about(D,T) :- docTerm(D,T).

about(D,T) :- link(D,D1) & about(D1,T).

[Figure: graph with link edges d1 → d2 (0.5), d2 → d3 (0.4), d3 → d1 (0.8) and docterm edges d1 → ir (0.9), d1 → db (0.5)]

  ?- about(D,ir)

  d1  [dT(d1,ir) | l(d1,d2) & l(d2,d3) & l(d3,d1) & dT(d1,ir) | ...]  0.900
  d3  [l(d3,d1) & dT(d1,ir)]                                          0.720
  d2  [l(d2,d3) & l(d3,d1) & dT(d1,ir)]                               0.288

  ?- about(D,ir) & about(D,db)

  d1  [dT(d1,ir) & dT(d1,db)]                        0.450
  d3  [l(d3,d1) & dT(d1,ir) & l(d3,d1) & dT(d1,db)]  0.360
  d2  [l(d2,d3) & l(d3,d1) & dT(d1,ir) & dT(d1,db)]  0.144
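The recursive results can be cross-checked by enumerating possible worlds. The sketch below assumes independent events and the edge weights l(d1,d2) = 0.5, l(d2,d3) = 0.4, l(d3,d1) = 0.8, dT(d1,ir) = 0.9, dT(d1,db) = 0.5, an assignment inferred from the result values on the slide rather than stated explicitly. In each deterministic world, the two about rules are evaluated to their least fixpoint.

```python
from itertools import product

# Assumed event probabilities (inferred from the results on the slide).
events = {
    ("l", "d1", "d2"): 0.5,
    ("l", "d2", "d3"): 0.4,
    ("l", "d3", "d1"): 0.8,
    ("dT", "d1", "ir"): 0.9,
    ("dT", "d1", "db"): 0.5,
}

def about(world):
    """Least fixpoint of the two about rules in one deterministic world."""
    facts = {(d, t) for (kind, d, t) in world if kind == "dT"}
    links = {(s, t) for (kind, s, t) in world if kind == "l"}
    changed = True
    while changed:
        changed = False
        for (s, d1) in links:
            for (d, t) in list(facts):
                if d == d1 and (s, t) not in facts:
                    facts.add((s, t))      # about(S,T) :- link(S,D1) & about(D1,T).
                    changed = True
    return facts

def probability(query):
    """Total weight of the worlds in which the query holds."""
    keys = list(events)
    total = 0.0
    for bits in product([True, False], repeat=len(keys)):
        weight = 1.0
        for k, b in zip(keys, bits):
            weight *= events[k] if b else 1.0 - events[k]
        world = {k for k, b in zip(keys, bits) if b}
        if query(about(world)):
            total += weight
    return total

p_d1_ir = probability(lambda f: ("d1", "ir") in f)
p_d2_ir = probability(lambda f: ("d2", "ir") in f)
p_d3_both = probability(lambda f: ("d3", "ir") in f and ("d3", "db") in f)
print(p_d1_ir, p_d2_ir, p_d3_both)
```

The cycle does not raise the probability for d1 above 0.9, because every path back to d1 still requires the event dT(d1,ir).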


Computation of probabilities for event expressions

1. Transformation of the expression into disjunctive normal form
2. Application of the sieve formula:

   Simple case of 2 conjuncts: P(a ∨ b) = P(a) + P(b) − P(a ∧ b)

   General case, where each c_i is a conjunct of event keys:

   $$P(c_1 \lor \ldots \lor c_n) = \sum_{i=1}^{n} (-1)^{i-1} \sum_{1 \le j_1 < \ldots < j_i \le n} P(c_{j_1} \land \ldots \land c_{j_i})$$

  • exponential complexity
  • use only when necessary for correctness
  • see [Dalvi & Suciu 07]
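The sieve (inclusion-exclusion) formula translates directly into code. The following is a minimal sketch, assuming independent event keys and conjuncts without negation, each given as a set of event keys; the keys a, b, c and their probabilities are hypothetical.

```python
from itertools import combinations

def prob_conjunction(keys, p):
    """Product of the probabilities of distinct, independent event keys."""
    result = 1.0
    for key in keys:
        result *= p[key]
    return result

def prob_dnf(conjuncts, p):
    """Inclusion-exclusion over a DNF given as a list of sets of event keys."""
    total = 0.0
    for i in range(1, len(conjuncts) + 1):
        sign = (-1) ** (i - 1)
        for chosen in combinations(conjuncts, i):
            merged = frozenset().union(*chosen)   # conjunction of the chosen conjuncts
            total += sign * prob_conjunction(merged, p)
    return total

p = {"a": 0.9, "b": 0.5, "c": 0.7}               # hypothetical event probabilities
two = prob_dnf([{"a"}, {"b"}], p)                # P(a or b)
shared = prob_dnf([{"a", "c"}, {"b", "c"}], p)   # P((a and c) or (b and c))
print(two, shared)
```

The inner loop visits every non-empty subset of the conjuncts, which is the source of the exponential complexity noted above.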


Interpretation of probabilistic weights

Possible worlds semantics

0.9 docTerm(d1,ir).

  P(W1) = 0.9: {docTerm(d1,ir)}
  P(W2) = 0.1: {}


  0.6 docTerm(d1,ir).
  0.5 docTerm(d1,db).

Possible interpretations:

I1: P(W1) = 0.3: {docTerm(d1,ir)}
    P(W2) = 0.3: {docTerm(d1,ir), docTerm(d1,db)}
    P(W3) = 0.2: {docTerm(d1,db)}
    P(W4) = 0.2: {}

I2: P(W1) = 0.5: {docTerm(d1,ir)}
    P(W2) = 0.1: {docTerm(d1,ir), docTerm(d1,db)}
    P(W3) = 0.4: {docTerm(d1,db)}

I3: P(W1) = 0.1: {docTerm(d1,ir)}
    P(W2) = 0.5: {docTerm(d1,ir), docTerm(d1,db)}
    P(W3) = 0.4: {}

Probabilistic logic: 0.1 ≤ P(docTerm(d1,ir) & docTerm(d1,db)) ≤ 0.5
Probabilistic Datalog with independence assumptions: P(docTerm(d1,ir) & docTerm(d1,db)) = 0.3
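The interval and the point value above follow from the two marginal probabilities alone. As a short sketch (the function names are illustrative), the bounds are max(0, P(a) + P(b) − 1) and min(P(a), P(b)), and the independence assumption gives the product:

```python
def conjunction_bounds(p_a, p_b):
    """Tightest bounds on P(a & b) when only the marginals are known."""
    lower = max(0.0, p_a + p_b - 1.0)
    upper = min(p_a, p_b)
    return lower, upper

def conjunction_independent(p_a, p_b):
    """P(a & b) under the independence assumption."""
    return p_a * p_b

lower, upper = conjunction_bounds(0.6, 0.5)
point = conjunction_independent(0.6, 0.5)
print(lower, upper, point)
```

Interpretations I2 and I3 realize the lower and upper bound respectively, and I1 is the independent case.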


Extensions

Disjoint events

β    City   State
0.7  Paris  France
0.2  Paris  Texas
0.1  Paris  Idaho

Interpretation:
  P(W1) = 0.7: {cityState(paris, france)}
  P(W2) = 0.2: {cityState(paris, texas)}
  P(W3) = 0.1: {cityState(paris, idaho)}


Relational Bayes

[Roelleke et al. 07]

Role of the relational Bayes: generation of a probabilistic database

[Figure: non-probabilistic database → Bayes → probabilistic database]


Relational Bayes: Example P(Nationality | City)

nationality_and_city
Nationality  City
"British"    "London"
"British"    "London"
"British"    "London"
"Scottish"   "London"
"French"     "London"
"German"     "Hamburg"
"German"     "Hamburg"
"Danish"     "Hamburg"
"British"    "Hamburg"
"German"     "Dortmund"
"German"     "Dortmund"
"Turkish"    "Dortmund"
"Scottish"   "Glasgow"

⇒ Bayes[$City]()

nationality_city
P(Nationality|City)  Nationality  City
0.600                "British"    "London"
0.200                "Scottish"   "London"
0.200                "French"     "London"
0.500                "German"     "Hamburg"
0.250                "Danish"     "Hamburg"
0.250                "British"    "Hamburg"
0.667                "German"     "Dortmund"
0.333                "Turkish"    "Dortmund"
1.000                "Scottish"   "Glasgow"

  # P(Nationality | City):
  nationality_city SUM(Nat, City) :-
    nationality_and_city(Nat, City) | (City);
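The conditional probabilities in the result table are relative frequencies within each city. The sketch below reproduces them in Python with `collections.Counter`; it illustrates the estimate only and is not the implementation of the relational Bayes operator, and `relational_bayes` is a hypothetical name.

```python
from collections import Counter

rows = (
    [("British", "London")] * 3
    + [("Scottish", "London"), ("French", "London")]
    + [("German", "Hamburg")] * 2
    + [("Danish", "Hamburg"), ("British", "Hamburg")]
    + [("German", "Dortmund")] * 2
    + [("Turkish", "Dortmund"), ("Scottish", "Glasgow")]
)

def relational_bayes(rows):
    """P(first attribute | second attribute) from tuple frequencies."""
    pair_counts = Counter(rows)                        # frequency of each (Nat, City)
    evidence_counts = Counter(city for _, city in rows)  # tuples per City
    return {
        (nat, city): n / evidence_counts[city]
        for (nat, city), n in pair_counts.items()
    }

p_nat_given_city = relational_bayes(rows)
for (nat, city), prob in sorted(p_nat_given_city.items(), key=lambda kv: kv[0][1]):
    print(f"{prob:.3f} {nat} {city}")
```

The evidence key (City) determines the normalization group, and duplicate tuples are summed, matching the SUM aggregation in the rule.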


Relational Bayes: Example P(t|d)

term
Term     DocId
sailing  doc1
boats    doc1
sailing  doc2
boats    doc2
sailing  doc2
east     doc3
coast    doc3
sailing  doc3
sailing  doc4
boats    doc5

Without aggregation (one result tuple per occurrence):

  p_t_d_space(Term, DocId) :- term(Term, DocId) | (DocId);

P(t|d)  Term     DocId
0.50    sailing  doc1
0.50    boats    doc1
0.33    sailing  doc2
0.33    boats    doc2
0.33    sailing  doc2
0.33    east     doc3
0.33    coast    doc3
0.33    sailing  doc3
1.00    sailing  doc4
1.00    boats    doc5

With SUM aggregation (duplicate tuples are summed):

  p_t_d SUM(Term, DocId) :- term(Term, DocId) | (DocId);

P(t|d)  Term     DocId
0.50    sailing  doc1
0.50    boats    doc1
0.67    sailing  doc2
0.33    boats    doc2
0.33    east     doc3
0.33    coast    doc3
0.33    sailing  doc3
1.00    sailing  doc4
1.00    boats    doc5
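The two result tables differ only in whether duplicate tuples are merged. A minimal Python sketch of both variants, assuming that each occurrence receives the weight 1 / (number of term occurrences in the document); the variable names are illustrative.

```python
from collections import Counter

term = [
    ("sailing", "doc1"), ("boats", "doc1"),
    ("sailing", "doc2"), ("boats", "doc2"), ("sailing", "doc2"),
    ("east", "doc3"), ("coast", "doc3"), ("sailing", "doc3"),
    ("sailing", "doc4"), ("boats", "doc5"),
]

doc_length = Counter(doc for _, doc in term)        # occurrences per document

# One weighted tuple per occurrence (no aggregation).
per_occurrence = [(1.0 / doc_length[doc], t, doc) for t, doc in term]

# SUM aggregation: add up the weights of duplicate (Term, DocId) tuples.
summed = {}
for weight, t, doc in per_occurrence:
    summed[(t, doc)] = summed.get((t, doc), 0.0) + weight

print(len(per_occurrence), len(summed))
print(round(summed[("sailing", "doc2")], 2))
```

With aggregation, the two occurrences of sailing in doc2 collapse into one tuple with weight 2/3, so the weights per document sum to 1 over distinct terms.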


Probabilistic rules: Rules for deterministic facts

  0.7 likes-sports(X) :- man(X).
  0.4 likes-sports(X) :- woman(X).
  man(peter).

Interpretation:
  P(W1) = 0.7: {man(peter), likes-sports(peter)}
  P(W2) = 0.3: {man(peter)}


Probabilistic rules: Rules for uncertain facts

  # gender is disjoint on the first attribute
  0.7 l-sports(X) :- gender(X,male).
  0.4 l-sports(X) :- gender(X,female).
  0.5 gender(X,male) :- human(X).
  0.5 gender(X,female) :- human(X).
  human(jo).

Interpretation:
  P(W1) = 0.35: {gender(jo,male), l-sports(jo)}
  P(W2) = 0.15: {gender(jo,male)}
  P(W3) = 0.20: {gender(jo,female), l-sports(jo)}
  P(W4) = 0.30: {gender(jo,female)}

  ?- l-sports(jo)    P(W1) + P(W3) = 0.55
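The four worlds and the query result can be generated from the rule weights. This sketch assumes, as in the comment above, that the two gender facts for jo are disjoint events; the dictionaries and the world encoding are illustrative.

```python
p_gender = {"male": 0.5, "female": 0.5}          # disjoint events for jo
p_sports_given = {"male": 0.7, "female": 0.4}    # rule weights

# Build the possible worlds: exactly one gender, with or without l-sports.
worlds = []
for g in p_gender:
    likes = p_gender[g] * p_sports_given[g]
    not_likes = p_gender[g] * (1.0 - p_sports_given[g])
    worlds.append((likes, {f"gender(jo,{g})", "l-sports(jo)"}))
    worlds.append((not_likes, {f"gender(jo,{g})"}))

# ?- l-sports(jo): sum the weights of the worlds containing the fact.
p_l_sports = sum(weight for weight, facts in worlds if "l-sports(jo)" in facts)
print(round(p_l_sports, 2))
```

Because the gender events are disjoint, the result is the sum 0.5 · 0.7 + 0.5 · 0.4 rather than a disjunction of independent events.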


Probabilistic rules: Rules for independent events

  sameauthor(D1,D2) :- author(D1,X) & author(D2,X).
  0.5 link(D1,D2) :- refer(D1,D2).
  0.2 link(D1,D2) :- sameauthor(D1,D2).
  ?? link(D1,D2) :- refer(D1,D2) & sameauthor(D1,D2).

  P(l|r), P(l|s) → P(l|r ∧ s)?


Rules for independent events: Modeling probabilistic inference networks

  0.7 link(D1,D2) :- refer(D1,D2) & sameauthor(D1,D2).
  0.5 link(D1,D2) :- refer(D1,D2) & not(sameauthor(D1,D2)).
  0.2 link(D1,D2) :- sameauthor(D1,D2) & not(refer(D1,D2)).

Probabilistic inference networks: the rules define the link matrix.

[Figure: network with the nodes refer, sameauthor, and link]
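The three rules cover mutually exclusive combinations of refer and sameauthor, so they can be read as rows of a link matrix. The sketch below computes P(link) from that matrix; it assumes that refer and sameauthor are independent events and that link has probability 0 when neither holds, which the rules do not state explicitly.

```python
# Link matrix: (refer, sameauthor) -> P(link | refer, sameauthor)
LINK_MATRIX = {
    (True, True): 0.7,
    (True, False): 0.5,
    (False, True): 0.2,
    (False, False): 0.0,   # assumption: no rule fires, so no link
}

def p_link(p_refer, p_sameauthor):
    """Marginal P(link), assuming refer and sameauthor are independent."""
    total = 0.0
    for (r, s), p_l in LINK_MATRIX.items():
        p_r = p_refer if r else 1.0 - p_refer
        p_s = p_sameauthor if s else 1.0 - p_sameauthor
        total += p_r * p_s * p_l
    return total

print(p_link(1.0, 1.0), p_link(1.0, 0.0), p_link(0.5, 0.5))
```

Writing the negated conditions into the rule bodies is what makes the rule events disjoint, so their contributions can simply be added.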


Vague Predicates

  • The Logical View on Vague Predicates
  • Vague Predicates in IR and Databases
  • Probabilistic Modeling of Vague Predicates


Vague Predicates: Motivating Example


The Logical View on Vague Predicates

Propositional vs. Predicate Logic

  • Current IR systems are based on propositional logic (query term present/absent in document)
  • Similarity of values is not considered
  • but multimedia IR already deals with similarity
  • ⇒ a transition from propositional to predicate logic is necessary


Vague Predicates in Probabilistic Datalog

[Fuhr & Roelleke 97] [Fuhr 00]

Example: shopping for a 45-inch LCD TV

Vague predicates as built-in predicates: X ≈ Y

query(D) :- category(D,tv) & type(D,lcd) & size(D,X) & ≈(X,45).

X ≈ Y

β    X    Y
0.7  42   45
0.8  43   45
0.9  44   45
1.0  45   45
0.9  46   45
0.8  47   45
...  ...  ...
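The TV query can be sketched in Python by treating the built-in predicate X ≈ Y as a weight lookup. The product facts below are hypothetical, and giving weight 0 to sizes that are not listed in the table is an assumption of this sketch; the table on the slide is open-ended.

```python
# Weights of the vague predicate X ≈ 45, copied from the rows shown in the table.
APPROX_45 = {42: 0.7, 43: 0.8, 44: 0.9, 45: 1.0, 46: 0.9, 47: 0.8}

# Hypothetical product facts: (id, category, type, size in inches).
PRODUCTS = [
    ("p1", "tv", "lcd", 45),
    ("p2", "tv", "lcd", 43),
    ("p3", "tv", "plasma", 45),
    ("p4", "tv", "lcd", 60),
]


def query(products, approx_weights):
    """Rank products for category(D,tv) & type(D,lcd) & size(D,X) & ≈(X,45)."""
    answers = []
    for pid, category, kind, size in products:
        if category != "tv" or kind != "lcd":
            continue  # deterministic predicates act as plain filters
        weight = approx_weights.get(size, 0.0)  # unlisted sizes: assumed weight 0
        if weight > 0.0:
            answers.append((pid, weight))
    return sorted(answers, key=lambda answer: answer[1], reverse=True)


RANKING = query(PRODUCTS, APPROX_45)
```

The deterministic conditions only filter, so the weight of each answer comes entirely from the vague predicate.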


Vague Predicates in IR and Databases

Data types and vague predicates in IR

Data type: domain + (vague) predicates

Language (multilingual documents) / language-specific stemming

Person names / “his name sounds like Jones”

Dates / “about a month ago”

Amounts / “orders exceeding $1 million”

Technical measurements / “at room temperature”

Chemical formulas


Vague Criteria in Fact Databases

“I am looking for a 45-inch LCD TV with

wide viewing angle

high contrast

low price

high user rating”

→ vague criteria are very frequent in end-user querying of fact databases

→ but no appropriate support in SQL

vague conditions → similar to fuzzy predicates


Probabilistic Modeling of Vague Predicates

[Fuhr 90]

learn vague predicates from feedback data

construct feature vector $\vec{x}(q_i, d_i)$ from query value $q_i$ and document value $d_i$ (e.g. relative difference)

apply logistic regression
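The three steps can be sketched as follows. This is a minimal illustration, not the estimation procedure of [Fuhr 90]: the feedback judgements are invented, the feature vector is reduced to the single feature "relative difference", and the fit uses plain gradient ascent on the log-likelihood.

```python
import math


def relative_difference(q: float, d: float) -> float:
    """Single feature x(q, d): relative difference between query and document value."""
    return abs(q - d) / abs(q)


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(samples, epochs=5000, learning_rate=0.5):
    """Fit P(match | x) = sigmoid(b0 + b1 * x) by batch gradient ascent."""
    b0, b1 = 0.0, 0.0
    n = len(samples)
    for _ in range(epochs):
        g0 = g1 = 0.0
        for x, y in samples:
            error = y - sigmoid(b0 + b1 * x)
            g0 += error
            g1 += error * x
        b0 += learning_rate * g0 / n
        b1 += learning_rate * g1 / n
    return b0, b1


def predict(b0: float, b1: float, q: float, d: float) -> float:
    """Estimated probability that document value d satisfies the vague condition q."""
    return sigmoid(b0 + b1 * relative_difference(q, d))


# Hypothetical feedback data: (query value, document value, judged as matching?).
FEEDBACK = [(45, 45, 1), (45, 44, 1), (45, 46, 1), (45, 43, 1), (45, 47, 0),
            (45, 42, 1), (45, 40, 0), (45, 50, 0), (45, 55, 0), (45, 38, 0)]
SAMPLES = [(relative_difference(q, d), y) for q, d, y in FEEDBACK]
B0, B1 = fit_logistic(SAMPLES)
```

With this data the fitted slope is negative, so the estimated match probability falls as the document value moves away from the query value.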


Expressiveness

Retrieval Rules, Joins, Aggregations and Restructuring
Expressiveness in XML Retrieval


Retrieval Rules, Joins, Aggregations and Restructuring

Expressiveness: Formulating Retrieval Rules

about(D,T) :- docTerm(D,T).

Consider document linking / anchor text:
about(D,T) :- link(D1,D) & about(D1,T).

Consider term hierarchy:
about(D,T) :- subconcept(T,T1) & about(D,T1).

Field-specific term weighting:
0.9 docTerm(D,T) :- occurs(D,T,title).
0.5 docTerm(D,T) :- occurs(D,T,body).
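To make the field-specific weighting concrete, the sketch below computes the weight of docTerm(D,T) when a term occurs in one or both fields. Combining the two rules as a disjunction of independent events is an assumption of this sketch; the slide gives only the two rule weights.

```python
# Rule weights from the two docTerm rules.
FIELD_WEIGHTS = {"title": 0.9, "body": 0.5}


def doc_term_prob(fields):
    """P(docTerm(D,T)) given the fields of D in which T occurs.

    Assumes the rule events are independent, so the results of several
    applicable rules combine as 1 - prod(1 - weight).
    """
    p_none = 1.0
    for field in fields:
        p_none *= 1.0 - FIELD_WEIGHTS[field]
    return 1.0 - p_none


TITLE_ONLY = doc_term_prob(["title"])
BODY_ONLY = doc_term_prob(["body"])
BOTH = doc_term_prob(["title", "body"])
```

Under this assumption a term that occurs in both title and body gets a higher weight than a term that occurs in the title alone.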


Expressiveness: Joins

IR authors:

irauthor(N) :- about(D,ir) & author(D,N).

Smith’s IR papers cited by Miller:

?- author(D,smith) & about(D,ir) & author(D1,miller) & cites(D,D1).


Expressiveness: Aggregation (1)

Who are the major IR authors?

index
β    DNO  TERM
0.9  1    ir
0.8  1    db
0.6  2    ir
0.8  3    ir
0.7  3    ai

author
DNO  NAME
1    smith
2    miller
3    smith

irauthor
β     NAME
0.98  smith
0.6   miller

irauthor(A) :- index(D,ir) & author(D,A).

Aggregation through projection!
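The weights in the irauthor relation can be reproduced with a short Python sketch. It assumes that the index tuples are independent events, so projecting away the document number combines the weights of all IR documents of an author as 1 - prod(1 - β). The data are copied from the index and author relations above.

```python
# (weight, document number, term) and (document number, author name).
INDEX = [(0.9, 1, "ir"), (0.8, 1, "db"), (0.6, 2, "ir"), (0.8, 3, "ir"), (0.7, 3, "ai")]
AUTHOR = [(1, "smith"), (2, "miller"), (3, "smith")]


def irauthor(index, author, term="ir"):
    """Evaluate irauthor(A) :- index(D,term) & author(D,A) with projection on A."""
    p_none = {}  # per author: probability that none of the joined tuples holds
    for beta, dno, indexed_term in index:
        if indexed_term != term:
            continue
        for author_dno, name in author:
            if author_dno == dno:
                p_none[name] = p_none.get(name, 1.0) * (1.0 - beta)
    return {name: 1.0 - p for name, p in p_none.items()}


IR_AUTHORS = irauthor(INDEX, AUTHOR)
```

For smith, documents 1 and 3 contribute 1 - (1 - 0.9)(1 - 0.8) = 0.98, which matches the value shown in the irauthor relation.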


Expressiveness: Aggregation (2)

Who are the major IR authors?

index
β    DNO  TERM
0.9  1    ir
0.8  1    db
0.6  2    ir
0.8  3    ir
0.7  3    ai

author
DNO  NAME
1    smith
2    miller
3    smith

irauths
β    NAME
1.7  smith
0.6  miller

Aggregation through summing:

irauth(D,A) :- index(D,ir) & author(D,A).

irauths SUM(Name) :- irauth(Doc,Name) | (Name)
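The summing variant can be sketched the same way. Here the weights of an author's IR documents are added instead of being combined as probabilities of independent events, so the result is a score that may exceed 1. The function name irauths mirrors the rule head; the data are copied from the relations above.

```python
# (weight, document number, term) and (document number, author name).
INDEX = [(0.9, 1, "ir"), (0.8, 1, "db"), (0.6, 2, "ir"), (0.8, 3, "ir"), (0.7, 3, "ai")]
AUTHOR = [(1, "smith"), (2, "miller"), (3, "smith")]


def irauths(index, author, term="ir"):
    """Sum the weights of irauth(D,A) tuples, grouped by author name."""
    totals = {}
    for beta, dno, indexed_term in index:
        if indexed_term != term:
            continue
        for author_dno, name in author:
            if author_dno == dno:
                totals[name] = totals.get(name, 0.0) + beta
    return totals


IR_AUTHOR_SUMS = irauths(INDEX, AUTHOR)
```

For smith this gives 0.9 + 0.8 = 1.7, as in the irauths relation.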


Expressiveness in XML Retrieval

[Fuhr & Lalmas 07]


XML structure: 1. Nested Structure

XML document as hierarchical structure

Retrieval of elements (subtrees)

Typical query language does not allow for specification of structural constraints

Relevance-oriented selection of answer elements: return the most specific relevant elements


XML structure: 2. Named Fields

Reference to elements through field names only

Context of elements is ignored (e.g. author of article vs. author of referenced paper)

Post-coordination may lead to false hits (e.g. author name – author affiliation)

Example: Dublin Core

<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Generic Algebras ... </dc:title>
  <dc:creator>A. Smith (ESI), B. Miller (CMU)</dc:creator>
  <dc:subject>Orthogonal group, Symplectic group</dc:subject>
  <dc:date>2001-02-27</dc:date>
  <dc:format>application/postscript</dc:format>
  <dc:identifier>ftp://ftp.esi.ac.at/pub/esi1001.ps</dc:identifier>
  <dc:source>ESI preprints</dc:source>
  <dc:language>en</dc:language>
</oai_dc:dc>


XML structure: 3. XPath

/document/chapter[about(./heading, XML) AND about(./section//*, syntax)]


XML structure: 3. XPath (cont’d)

Full expressiveness for navigation through document tree (+ links)

Parent/child, ancestor/descendant
Following/preceding, following-sibling, preceding-sibling
Attribute, namespace

Selection of arbitrary elements/subtrees (but answer can be only a single element of the originating document)


XML structure: 4. XQuery

Higher expressiveness, especially for database-like applications:

Joins (trees → graphs)

Aggregations

Constructors for restructuring results

Example: List each publisher and the average price of its books

FOR $p IN distinct(document("bib.xml")//publisher)
LET $a := avg(document("bib.xml")//book[publisher = $p]/price)
RETURN
  <publisher>
    <name> $p/text() </name>
    <avgprice> $a </avgprice>
  </publisher>
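For readers who do not have an XQuery processor at hand, the same aggregation and restructuring can be sketched with Python's standard xml.etree.ElementTree module. The inline document stands in for bib.xml; its publishers, titles and prices are invented for the example, and the wrapping result element is an addition of this sketch.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical stand-in for bib.xml.
BIB_XML = """<bib>
  <book><title>Book A</title><publisher>pub-a</publisher><price>40</price></book>
  <book><title>Book B</title><publisher>pub-a</publisher><price>60</price></book>
  <book><title>Book C</title><publisher>pub-b</publisher><price>30</price></book>
</bib>"""


def average_prices(xml_text: str) -> ET.Element:
    """Group books by publisher and build one <publisher> element per group."""
    root = ET.fromstring(xml_text)
    prices = defaultdict(list)
    for book in root.iter("book"):
        publisher = book.findtext("publisher")
        price = book.findtext("price")
        if publisher is not None and price is not None:
            prices[publisher].append(float(price))

    result = ET.Element("result")  # wrapper element, not part of the XQuery example
    for name in sorted(prices):
        element = ET.SubElement(result, "publisher")
        ET.SubElement(element, "name").text = name
        average = sum(prices[name]) / len(prices[name])
        ET.SubElement(element, "avgprice").text = str(average)
    return result


RESULT = average_prices(BIB_XML)
```

The grouping dictionary plays the role of the join between publishers and books, and the SubElement calls correspond to the element constructors in the RETURN clause.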


XML content typing


XML content typing: 1. Text

<book>
  <author>John Smith</author>
  <title>XML Retrieval</title>
  <chapter>
    <heading>Introduction</heading>
    This text explains all about XML and IR.
  </chapter>
  <chapter>
    <heading>XML Query Language XQL</heading>
    <section>
      <heading>Examples</heading>
    </section>
    <section>
      <heading>Syntax</heading>
      Now we describe the XQL syntax.
    </section>
  </chapter>
</book>

Example query:

//chapter[about(., XML query language)]


XML content typing: 2. Data Types

Data type: domain + (vague) predicates (see above)

Close relationship to XML Schema, but

XML Schema supports syntactic type checking only
No support for vague predicates


XML content typing: 3. Object Types

Based on tagging / named entity recognition

Object types: persons, locations, dates, ...

Pablo Picasso (October 25, 1881 - April 8, 1973) was a Spanish painter and sculptor. ... In Paris, Picasso entertained a distinguished coterie of friends in the Montmartre and Montparnasse quarters, including Andre Breton, Guillaume Apollinaire, and writer Gertrude Stein.

To which other artists did Picasso have close relationships? Did he ever visit the USA?

Named entity recognition methods allow for automatic markup of object types

Object types support increased precision


XML content typing: tag semantics modelled as hierarchies


XML content typing: tag semantics modelled in OWL


Description Logic/Ontologies

Thesaurus
Introduction into OWL
SPARQL


Thesaurus

[Figure: concept hierarchy over polygon, triangle, quadrangle, regular polygon, regular triangle, rectangle and square]

Thesaurus knowledge can be expressed in propositional logic:
square = quadrangle ∧ regular-polygon

Description logic:

based on semantic networks
more expressive than thesauri

instances of concepts
roles between (instances of) concepts
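The propositional reading of thesaurus knowledge can be sketched in Python as forward chaining over concept definitions. Only the definition of square comes from the slide; the two entries linking quadrangle and regular-polygon to polygon are an assumed reading of the hierarchy figure, and the sketch uses only the "implies" direction of the equivalence.

```python
# Concept definitions as conjunctions: square = quadrangle ∧ regular-polygon.
DEFINITIONS = {
    "square": {"quadrangle", "regular-polygon"},
    # The next two entries are assumptions added for the example.
    "quadrangle": {"polygon"},
    "regular-polygon": {"polygon"},
}


def closure(terms):
    """All propositions implied by a conjunction of terms under DEFINITIONS."""
    known = set(terms)
    frontier = list(known)
    while frontier:
        term = frontier.pop()
        for implied in DEFINITIONS.get(term, ()):
            if implied not in known:
                known.add(implied)
                frontier.append(implied)
    return known


def implies(doc_terms, query_terms):
    """Retrieval as inference: does the document d imply the query q?"""
    return set(query_terms) <= closure(doc_terms)


SQUARE_IS_QUADRANGLE = implies({"square"}, {"quadrangle"})
QUADRANGLE_IS_SQUARE = implies({"quadrangle"}, {"square"})
```

A document indexed with square is retrieved for the query quadrangle, but not the other way round, which mirrors d → q.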


Introduction into OWL

Semantic Web (ontology) languages

RDF: “Resource Description Framework”, semantic markup language, only resources and their properties, serialisation in XML

RDFS: “RDF Schema”, schema definition language for RDF

OWL: extends RDF/RDFS by richer modelling primitives, OWL Lite/DL/Full

OWL Lite contains simple primitives
OWL DL corresponds to expressive description logic
OWL Full is OWL DL + RDF

knowledge base can be modelled as collection of RDF triples (RDF/XML serialisation)
alternative encoding: abstract syntax (easier to read)


Objects, classes, literals and datatypes

Two distinct domains:

Classes: for objects
Data types: for literals

[Figure: the object Peter is an instance (rdf:type) of Person, which is an owl:Class; the literal 1,89 is an instance (rdf:type) of xsd:decimal, which is an owl:Datatype]


Classes (1)

[Figure: class graph over Animal, Person, Male, Female and Man, with rdfs:subClassOf and owl:disjointWith edges; the classes are instances (rdf:type) of owl:Class]

Class(Female partial Animal)

<owl:Class rdf:ID="Female">

<rdfs:subClassOf rdf:resource="#Animal"/>

</owl:Class>


Classes (2)


Class(Male partial Animal)

DisjointClasses(Male Female)

<owl:Class rdf:ID="Male">

<rdfs:subClassOf rdf:resource="#Animal"/>

<owl:disjointWith rdf:resource="#Female"/>

</owl:Class>
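What a reasoner derives from such class axioms can be sketched in a few lines of Python. The subclass axioms for Female and Male and the disjointness axiom come from the slides; the entries for Person and Man are an assumed reading of the class figure and are labelled as such in the code.

```python
# rdfs:subClassOf axioms.
SUBCLASS_OF = {
    "Female": {"Animal"},
    "Male": {"Animal"},
    "Person": {"Animal"},       # assumption, read off the figure
    "Man": {"Male", "Person"},  # assumption, read off the figure
}

# owl:disjointWith axioms.
DISJOINT = [("Male", "Female")]


def superclasses(cls):
    """The class itself plus all classes reachable via rdfs:subClassOf."""
    seen = {cls}
    stack = [cls]
    while stack:
        for parent in SUBCLASS_OF.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def is_consistent(asserted_types):
    """False if the inferred types of one individual include two disjoint classes."""
    inferred = set()
    for cls in asserted_types:
        inferred |= superclasses(cls)
    return not any(a in inferred and b in inferred for a, b in DISJOINT)


MAN_SUPERCLASSES = superclasses("Man")
```

An individual typed as both Man and Female is flagged as inconsistent because Man inherits Male, which is declared disjoint with Female.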


Object properties (1)

[Figure: hasParent and hasFather are instances (rdf:type) of owl:ObjectProperty; hasParent has domain Animal and range Animal; hasFather is a subproperty of hasParent with range Male]

ObjectProperty(hasParent domain(Animal) range(Animal))

<owl:ObjectProperty rdf:ID="hasParent">

<rdfs:domain rdf:resource="#Animal"/>

<rdfs:range rdf:resource="#Animal"/>

</owl:ObjectProperty>


Object properties (2)


ObjectProperty(hasFather super(hasParent) range(Male))

<owl:ObjectProperty rdf:ID="hasFather">

<rdfs:subPropertyOf rdf:resource="#hasParent"/>

<rdfs:range rdf:resource="#Male"/>

</owl:ObjectProperty>


Datatype properties

[Figure: shoesize is an instance (rdf:type) of owl:DatatypeProperty and of owl:FunctionalProperty, with rdfs:domain Person and rdfs:range xsd:decimal]

DatatypeProperty(shoesize Functional domain(Animal) range(xsd:decimal))

<owl:DatatypeProperty rdf:ID="shoesize">

<rdfs:domain rdf:resource="#Animal"/>

<rdfs:range rdf:resource="xsd:decimal"/>

<rdf:type rdf:resource="owl:FunctionalProperty"/>

</owl:DatatypeProperty>


Property restrictions

Class(Person partial Animal restriction(hasParent allValuesFrom(Person))

restriction(hasParent cardinality(2)))

<owl:Class rdf:ID="Person">

<rdfs:subClassOf rdf:resource="#Animal"/>

<rdfs:subClassOf>

<owl:Restriction>

<owl:onProperty rdf:resource="#hasParent"/>

<owl:allValuesFrom rdf:resource="#Person"/>

</owl:Restriction>

</rdfs:subClassOf>

<rdfs:subClassOf>

<owl:Restriction>

<owl:onProperty rdf:resource="#hasParent"/>

<owl:cardinality>2</owl:cardinality>

</owl:Restriction>

</rdfs:subClassOf>

</owl:Class>


Instances

Individual(Kain type(Male) value(hasFather Adam)

value(hasMother Eve)

value(shoesize 10))

<Male rdf:ID="Kain">

<hasFather rdf:resource="#Adam"/>

<hasMother rdf:resource="#Eve"/>

<shoesize>10</shoesize>

</Male>


Further modelling primitives

owl:inverseOf: inverse property: p(a, b) ↔ r(b, a)

owl:TransitiveProperty: p(a, b), p(b, c) → p(a, c)

owl:SymmetricProperty: p(a, b) → p(b, a)

owl:InverseFunctionalProperty: the inverse property is functional

owl:hasValue: at least one property value equals the given object or datatype value

owl:someValuesFrom: at least one property value is an instance of the given class, class expression, or datatype

owl:intersectionOf, owl:unionOf, owl:complementOf: Boolean combinations of class expressions

owl:oneOf: define a class by enumerating its instances
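
The property characteristics above can be read as small inference rules over sets of pairs. A minimal sketch, assuming a property is represented as a Python set of (subject, object) tuples; the function names are illustrative and not part of any OWL library:

```python
def transitive_closure(pairs):
    """Apply p(a, b), p(b, c) -> p(a, c) until no new pair is derived."""
    closure = set(pairs)
    while True:
        derived = {(a, d) for (a, b) in closure for (c, d) in closure if b == c}
        if derived <= closure:
            return closure
        closure |= derived


def symmetric_closure(pairs):
    """Apply p(a, b) -> p(b, a)."""
    return set(pairs) | {(b, a) for (a, b) in pairs}


def inverse(pairs):
    """Derive r from p for owl:inverseOf: p(a, b) <-> r(b, a)."""
    return {(b, a) for (a, b) in pairs}


def is_functional(pairs):
    """True if every subject has at most one value."""
    subjects = [a for (a, _) in set(pairs)]
    return len(subjects) == len(set(subjects))


def is_inverse_functional(pairs):
    """owl:InverseFunctionalProperty: the inverse property is functional."""
    return is_functional(inverse(pairs))


# Hypothetical ancestor chain a -> b -> c -> d
print(sorted(transitive_closure({("a", "b"), ("b", "c"), ("c", "d")})))
```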

Limitations of OWL

OWL lacks support for

uncertainty: only deterministic relationships are possible, no weighting or probabilistic facts
⇒ “Pr(hasFather(lisa,thomas))=0.9” cannot be expressed

rules: no general rules, only specific rules like subClassOf, TransitiveProperty, ...
⇒ “if hasParent(A,B) and hasParent(C,D) and hasSibling(B,D), then hasCousin(A,C)” cannot be expressed

n-ary datatype predicates: OWL datatypes are based on XML Schema datatypes, thus providing only unary datatype predicates
⇒ “sameDomain([email protected],[email protected])” cannot be expressed

⇒ IR queries cannot be expressed directly in OWL
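
To make the first two gaps concrete, the sketch below evaluates the cousin rule over weighted facts. The facts, the names tim and anna, and the sibling probability are hypothetical; multiplying the probabilities assumes that the facts are independent events, which is an assumption of this sketch and not something OWL provides.

```python
from itertools import product

# Hypothetical weighted facts: (subject, object) -> probability.
has_parent = {
    ("lisa", "thomas"): 0.9,
    ("tim", "anna"): 1.0,
}
has_sibling = {
    ("thomas", "anna"): 0.8,
}


def cousin_derivations(has_parent, has_sibling):
    """Apply hasParent(A,B), hasParent(C,D), hasSibling(B,D) -> hasCousin(A,C).

    Each derivation is weighted by the product of its fact probabilities,
    which is only valid if the facts are independent events (an assumption).
    Several derivations of the same pair are not merged here.
    """
    results = []
    for ((a, b), p_ab), ((c, d), p_cd) in product(has_parent.items(), repeat=2):
        p_bd = has_sibling.get((b, d))
        if p_bd is not None and a != c:
            results.append((a, c, p_ab * p_cd * p_bd))
    return results


for a, c, prob in cousin_derivations(has_parent, has_sibling):
    print(f"hasCousin({a},{c}) with probability {prob:.2f}")
```

Merging several derivations of the same fact would require handling their disjunction correctly, which is deliberately left out of this sketch.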

OWL: Conclusion

OWL extends RDF(S) by additional modelling primitives

well-defined semantics, based on description logics

does not support all RDF features (no reification, only three levels: owl:Class, classes, and objects)

lacks important features:

only deterministic features, no probabilistic relationships
no rules (but available in SWRL)
restricted datatype predicates (due to XML Schema)

OWL and associated languages are becoming standard in the Semantic Web

Semantic Web Layers

SPARQL

query language for getting information from RDF (OWL) graphs

Facilities for

extracting information in the form of URIs, blank nodes, plain and typed literals

extracting RDF subgraphs

constructing new RDF graphs based on information in the queried graphs

Features:

matching graph patterns

variables: global scope; indicated by '?' or '$'

SPARQL: Basic Graph Pattern

Set of Triple Patterns

Triple Pattern: similar to an RDF triple (subject, predicate, object), but any component can be a query variable; literal subjects are allowed
?book dc:title ?title

Matching a triple pattern to a graph: bindings between variables and RDF terms

Matching of Basic Graph Patterns

A Pattern Solution of Graph Pattern GP on graph G is any substitution S such that S(GP) is a subgraph of G.
SELECT ?x ?v WHERE { ?x ?v ?x }
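
Matching a single triple pattern can be sketched as follows. This is a minimal illustration, not how a SPARQL engine is implemented: triples are plain Python tuples of strings, and the graph contents are hypothetical. A variable that occurs twice, as in ?x ?v ?x, must be bound to the same term in both positions.

```python
def is_variable(term):
    """Query variables are written with a leading '?' or '$'."""
    return isinstance(term, str) and term[:1] in ("?", "$")


def match_triple(pattern, triple):
    """Return a binding (variable name -> term) if the pattern matches, else None."""
    binding = {}
    for p_term, t_term in zip(pattern, triple):
        if is_variable(p_term):
            name = p_term[1:]
            if binding.setdefault(name, t_term) != t_term:
                return None  # variable already bound to a different term
        elif p_term != t_term:
            return None  # constant in the pattern differs from the graph term
    return binding


def solutions(pattern, graph):
    """All substitutions S such that S(pattern) is a triple of the graph."""
    result = []
    for triple in graph:
        binding = match_triple(pattern, triple)
        if binding is not None:
            result.append(binding)
    return result


# Hypothetical example graph
graph = [
    ("book1", "dc:title", "SPARQL Tutorial"),
    ("node1", "ex:linksTo", "node1"),
    ("node1", "ex:linksTo", "node2"),
]

print(solutions(("?book", "dc:title", "?title"), graph))
print(solutions(("?x", "?v", "?x"), graph))
```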

SPARQL: Group Patterns + Value Constraints

Group Pattern: a set of graph patterns which must all match

Value Constraints: restrict RDF terms in a solution

SELECT ?n WHERE { ?n profession "Physicist" . ?n isa "Politician" }
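
A group pattern amounts to a join: each pattern extends the bindings produced by the previous ones. The sketch below assumes triples as Python tuples, hypothetical individuals alice, bob, and carol, and an optional constraint callback standing in for a value constraint; none of these names come from the SPARQL specification.

```python
def match(pattern, triple, binding):
    """Extend binding so that pattern equals triple, or return None."""
    extended = dict(binding)
    for p_term, t_term in zip(pattern, triple):
        if p_term.startswith("?"):
            if extended.setdefault(p_term, t_term) != t_term:
                return None
        elif p_term != t_term:
            return None
    return extended


def match_group(patterns, graph, constraint=lambda binding: True):
    """Return all bindings under which every pattern of the group matches."""
    bindings = [{}]
    for pattern in patterns:
        next_bindings = []
        for binding in bindings:
            for triple in graph:
                extended = match(pattern, triple, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    # A value constraint filters complete solutions.
    return [b for b in bindings if constraint(b)]


# Hypothetical example graph
graph = [
    ("alice", "profession", "Physicist"),
    ("alice", "isa", "Politician"),
    ("bob", "profession", "Physicist"),
    ("carol", "isa", "Politician"),
]

group = [("?n", "profession", "Physicist"), ("?n", "isa", "Politician")]
print(match_group(group, graph))
```

Only alice satisfies both patterns; bob and carol each match one pattern and are dropped by the join.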

SPARQL: Query forms

SELECT returns all, or a subset of, the variables bound in a query pattern match
formats: XML or RDF/XML

CONSTRUCT returns an RDF graph constructed by substituting variables in a set of triple templates

DESCRIBE returns an RDF graph that describes the resources found.

ASK returns whether a query pattern matches or not.
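
Given the solutions of a query pattern, three of the query forms can be sketched as simple post-processing steps. This is an illustration under assumptions: solutions are Python dicts from variable names to terms, the data is hypothetical, and DESCRIBE is omitted because its result is left to the query processor.

```python
def select(bindings, variables):
    """SELECT: project each solution onto the requested variables."""
    return [{v: b[v] for v in variables if v in b} for b in bindings]


def ask(bindings):
    """ASK: report whether the query pattern has at least one solution."""
    return len(bindings) > 0


def construct(bindings, templates):
    """CONSTRUCT: instantiate the triple templates once per solution."""
    graph = set()
    for binding in bindings:
        for template in templates:
            triple = tuple(binding.get(term, term) for term in template)
            # Triples that still contain an unbound variable are skipped.
            if not any(term.startswith("?") for term in triple):
                graph.add(triple)
    return graph


# Hypothetical solutions of some query pattern
bindings = [
    {"?n": "alice", "?p": "Physicist"},
    {"?n": "bob", "?p": "Chemist"},
]

print(select(bindings, ["?n"]))
print(ask(bindings))
print(sorted(construct(bindings, [("?n", "rdf:type", "Scientist")])))
```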

Conclusion and Outlook

Conclusion

Inference

Probabilistic relational model supports integration of IR+DB

Probabilistic Datalog as powerful inference mechanism

Allows for formulating retrieval strategies as logical rules

Vague predicates

Natural extension of IR methods to attribute values

Vague predicates can be learned from feedback data

Transition from propositional to predicate logic

Expressive query language

Joins

Aggregations

(Re)structuring of results

http://www.eecs.qmul.ac.uk/~thor/

http://www.spinque.com/

Outlook: IR Systems vs. DBMS

Separation between IRS and IR application?

Towards an IRMS
