Upload
suzuki
View
66
Download
0
Embed Size (px)
DESCRIPTION
Auto-Epistemic Logic. Proposed by Moore (1985) Contemplates reflection on self knowledge (auto-epistemic) Allows for representing knowledge not just about the external world, but also about the knowledge I have of it. Syntax of AEL. - PowerPoint PPT Presentation
Citation preview
Auto-Epistemic Logic
• Proposed by Moore (1985)
• Contemplates reflection on self knowledge (auto-epistemic)
• Allows for representing knowledge not just about the external world, but also about the knowledge I have of it
Syntax of AEL
• 1st Order Logic, plus the operator L (applied to formulas)
• L means “I know ”• Examples:
MScOnSW → L MScOnSW
(or L MScOnSW → MScOnSW)
young (X) L studies (X) → studies (X)
Meaning of AEL
• What do I know?– What I can derive (in all models)
• And what do I not know?– What I cannot derive
• But what can be derived depends on what I know– Add knowledge, then test
Semantics of AEL
• T* is an expansion of theory T iff
T* = Th(T{L : T* |= } {L : T* |≠ })
• Assuming the inference rule /L :
T* = CnAEL(T {L : T* |≠ })
• An AEL theory is always two-valued in L, that is, for every expansion:
| L T* L T*
Knowledge vs. Belief
• Belief is a weaker concept– For every formula, I know it or know it not– There may be formulas I do not believe in,
neither their contrary
• The Auto-Epistemic Logic of knowledge and Belief (AELB), introduces also operator B – I believe in
AELB Example
• I rent a film if I believe I’m neither going to baseball nor football games
Bbaseball Bfootball → rent_filme• I don’t buy tickets if I don’t know I’m going to
baseball nor know I’m going to football L baseball L football → buy_tickets
• I’m going to football or baseballbaseball football
• I should not conclude that I rent a film, but do conclude I should not buy tickets
Axioms about beliefs
• Consistency Axiom
B• Normality Axiom
B(F → G) → (B F → B G)
• Necessitation rule
F
B F
Some consequences of the Axioms
• B(F /\ G) ≡ B F /\ B G
• BF → B F
• B(F \/ G) → B F \/ B G
Minimal models
• In what do I believe?– In that which belongs to all preferred models
• Which are the preferred models?– Those that, for one same set of beliefs, have a minimal
number of true things
• A model M is minimal iff there does not exist a smaller model N, coincident with M on B e Latoms
• When is true in all minimal models of T, we write T |=min
AELB expansions
• T* is a static (autoepistemic) expansion of T iff
T* = CnAELB(T {L : T* |≠ }
{B : T* |=min })
where CnAELB denotes closure using the
axioms of AELB plus necessitation for L
Some properties of static autoepistemic expansions
• T* |= L iff T* |=
• T* |= B if T* |=min
• The other direction of the last implication only holds for particular cases (e.g. rational theories – those without positive occurrences of belief atoms).
The special case of AEB
• Because of its properties, the case of theories without the knowledge operator is especially interesting
• The definition of expansion becomes:
T* = (T*)
where (T*) = CnAEB(T {B : T* |=min })
and CnAEB denotes closure using the axioms of AEB
Least expansion
• Theorem: Operator is monotonic, i.e.
T T1 T2 → (T1) (T2)
• Hence, there always exists a minimal expansion of T, obtainable by transfinite induction:– T0 = Cn(T)
– Ti+1 = (Ti)
– T = U T (for limit ordinals )
Consequences
• Every AEB theory has at least one expansion
• If a theory is affirmative (i.e. all clauses have at least a positive literal) then it has at least a consistent expansion
• There is a procedure to compute the semantics
Example
• BEarthquake /\ BFires Calm
• B(Earthquake \/ Fires) Worried
• BEarthquake /\ BFires Panicked
• BCalm CallHome
• Earthquake \/ Fires
Computation of Static Completion
• T0 = CnAEB(T)
– T0 |= Earthquake \/ Fires
– T0 |=min Earthquake \/ Fires
• T1 = (T0)= CnAEB(T0 {B : T0 |=min })
– T1 = CnAEB(T0 {B(Eq \/ Fi), B(Eq \/ Fi),…})
– T1 |= Worried
– T1 |= BEarthquake \/ BFires
– T1 |= B Earthquake \/ B Fires
– T1 |=min Calm and T1 |=min Panicked
Static Completion (cont.)
• T2 = (T1)= CnAEB(T1 {B : T1 |=min })
– T2 = CnAEB(T1 {B(Eq \/ Fi), B(Eq \/ Fi), BCalm, BPanicked, BWorried,… })
– T2 |= CallHome
• T3 = (T2)= CnAEB(T2 {B : T2 |=min })
– T2 = CnAEB(T1 {B(Eq \/ Fi), B(Eq \/ Fi), BCalm, BPanicked, BWorried, BCallHome, … })
LP forKnowledge Representation
• Due to its declarative nature, LP has become a prime candidate for Knowledge Representation and Reasoning
• This has been more noticeable since its relations to other NMR formalisms were established
• For this usage of LP, a precise declarative semantics was in order
Language• A Normal Logic Programs P is a set of rules:
H A1, …, An, not B1, … not Bm (n,m 0)
where H, Ai and Bj are atoms
• Literal not Bj are called default literals
• When no rule in P has default literal, P is called definite
• The Herbrand base HP is the set of all instantiated atoms from program P.
• We will consider programs as possibly infinite sets of instantiated rules.
Declarative Programming
• A logic program can be an executable specification of a problem
member(X,[X|Y]).
member(X,[Y|L]) member(X,L).
• Easier to program, compact code• Adequate for building prototypes• Given efficient implementations, why not use it to
“program” directly?
LP and Deductive Databases
• In a database, tables are viewed as sets of facts:
• Other relations are represented with rules:
),(
).,(
londonlisbonflight
adamlisbonflight
LondonLisbon
AdamLisbon
tofromflight
).,(),(
).,(),,(),(
).,(),(
BAconnectionnotBAherchooseAnot
BCconnectionCAflightBAconnection
BAflightBAconnection
LP and Deductive DBs (cont)
• LP allows to store, besides relations, rules for deducing other relations
• Note that default negation cannot be classical negation in:
• A form of Closed World Assumption (CWA) is needed for inferring non-availability of connections
).,(),(
).,(),,(),(
).,(),(
BAconnectionnotBAherchooseAnot
BCconnectionCAflightBAconnection
BAflightBAconnection
Default Rules
• The representation of default rules, such as
“All birds fly”can be done via the non-monotonic operator not
).(
).(
).()(
).()(
.)(),()(
ppenguin
abird
PpenguinPabnormal
PpenguinPbird
AabnormalnotAbirdAflies
The need for a semantics
• In all the previous examples, classical logic is not an appropriate semantics– In the 1st, it does not derive not member(3,[1,2])
– In the 2nd, it never concludes choosing another company
– In the 3rd, all abnormalities must be expressed
• The precise definition of a declarative semantics for LPs is recognized as an important issue for its use in KRR.
2-valued Interpretations
• A 2-valued interpretation I of P is a subset of HP
– A is true in I (ie. I(A) = 1) iff A I– Otherwise, A is false in I (ie. I(A) = 0)
• Interpretations can be viewed as representing possible states of knowledge.
• If knowledge is incomplete, there might be in some states atoms that are neither true nor false
3-valued Interpretations
• A 3-valued interpretation I of P is a set
I = T U not F
where T and F are disjoint subsets of HP
– A is true in I iff A T– A is false in I iff A F– Otherwise, A is undefined (I(A) = 1/2)
• 2-valued interpretations are a special case, where:
HP = T U F
Lattice-valued interpretations
• We can generalize the previous definition to an arbitrary lattice of truth-values
• Let L be a complete lattice then an interpretation of a program P is a mapping I:HP → L
• Notice that any complete lattice has a least element () and a top element (T) , so a “true” proposition is mapped into T while a “false” proposition is mapped into .
• Some interesting useful complete lattices:– {0,1} with 0 < 1.– {0,1/2,1} with 0 < 1/2 < 1– [0,1]– Belnap’s four valued logic with with 0 <unknown | contradictory < 1
Intermezzo: lattices
• A partially ordered set (poset) is a set equipped with a reflexive, antisymmetric and transitive binary relation ≤:– Reflexivity: a ≤ a– Antisymmetry: if a ≤ b and b ≤ a then a=b– Transitivity: if a ≤ b and b ≤ c then a ≤ c
• A lattice is a poset such that for any two elements x and y the set {x,y} has both a least upper bound (join or supremum - \/) and a greatest lower bound (meet or infimum - /\). The join and meet obey to the following properties:– Commutative laws: a \/ b = b \/ a, and a /\ b = b /\ a– Associative laws: a \/ (b \/ c)= (a \/ b) \/ c, and a /\ (b /\ c)= (a /\ b) /\ c– Absorption laws: a \/ (a /\ b)= a, and a /\ (a \/ b)= a
• Notice that x ≤ y iff x = x /\ y, or equivalently y = x \/ y. • A complete lattice is a lattice where all subsets have a join and a meet.
Models
• Models can be defined via an evaluation function Î:– For an atom A, Î(A) = I(A)– For a formula F, Î(not F) = T - Î(F) (for lattices with
complement)– For formulas F and G:
• Î((F,G)) = glb(Î(F), Î(G))• Î((F;G)) = lub(Î(F), Î(G))• Î(F G)= T iff Î(F) ≥ Î(G).
• I is a model of P iff, for all rule H B of P:
Î(H B) = T
Minimal Models Semantics• The idea of this semantics is to minimize positive
information. What is implied as true by the program is true; everything else is false.
• {pr(c),pr(e),ph(s),ph(e),aM(c),aM(e)} is a model• Lack of information that cavaco is a physicist, should indicate that he isn’t• The minimal model is: {pr(c),ph(e),aM(e)}
)(
)(
)()(
cavacopresident
einsteinphysicist
XphysicistXaticianableMathem
Minimal Models Semantics
D[Truth ordering] For interpretations I and J, I J iff for all atom A, I(A) I(J), i.e. for the case of 2-valued interpretations TI TJ and FI FJ
T Every definite logic program has a least (truth ordering) model.
D[minimal models semantics] An atom A is true in (definite) P iff A belongs to its least model. Otherwise, A is false in P.
TP operator (2-valued case)• The minimal models of a definite P can be
computed (bottom-up) via operator TP
D [TP] Let I be an interpretation of definite P.
TP(I) = {H: (H Body) P and Body I}
T If P is definite, TP is monotone and continuous. Its minimal fixpoint can be built by:
I0 = {} and In = TP(In-1) with n > 0
T The least model of definite P is TP({})
TP operator (L-valued case)
D[TP] Let I be an interpretation of definite P.
TP(I)(H) = lub{Î(Body): (H Body) P}
T If P is definite, TP is monotone. Its minimal fixpoint can be built by iterating the TP operator:
I0 = TP={}
Iα = TPα = TP(Iα -1) = TP (TP
α-1), where α is a successor ordinal
Iβ = TP = |_| α < β TP
α = |_| α < β Iα where β is a limit ordinal
TP operator (L-valued case)
T There is a successor ordinal such that TP = TP
-
1 , i.e. there is a least fixpoint of TP.
Furthermore, the least model of definite program P coincides with the least fixpoint of TP.
In general, more than iterations might be needed to reach the least fixpoint. However, if lattice L is finite then at most iterations are enough.
Computation of minimal models
• For the 2-valued case there is a complete method: SLD resolution (Linear resolution with a selection function for definite sentences).
• A SLD-goal of the form← A1, …, Am, L, C1, …, Cn
has a successor ← (A1, …, Am, B1, …, Bk, C1, …, Cn ) θ
for each rule H :- B1, …, Bk, belonging to the program such that L and H unify with mgu θ.
• A SLD-derivation is a sequence of applications of SLD-resolution, and a SLD-refutation is a SLD-derivation which ends in the empty clause, i.e. no goals after ←.
• For the lattice-valued case, there are proof procedures based on tabulation methods, which we will not present.
On Minimal Models
• SLD can be used as a proof procedure for the minimal models semantics:– If the is a SLD-derivation for A, then A is true– Otherwise, A is false
• The semantics does not apply to normal programs:– p not q has two minimal models:
{p} and {q}
There is no least model !