24
STRUCTURING PROBABILISTIC DATA BY GALOIS LATTICES PAULA BRITO FAC. ECONOMIA, UNIV. PORTO, PORTUGAL [email protected] GERALDINE POLAILLON SUPELEC, FRANCE [email protected]

STRUCTURING PROBABILISTIC DATA BY GALOIS LATTICES

  • Upload
    kolina

  • View
    40

  • Download
    1

Embed Size (px)

DESCRIPTION

STRUCTURING PROBABILISTIC DATA BY GALOIS LATTICES. PAULA BRITO FAC. ECONOMIA, UNIV. PORTO, PORTUGAL [email protected] GERALDINE POLAILLON SUPELEC, FRANCE [email protected]. GROUP. INSTRUCTION. FOOTBALL. 1. Students. Pr(0.08), Sec(0.88), Sup(0.03). - PowerPoint PPT Presentation

Citation preview

Page 1: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

STRUCTURING PROBABILISTIC DATA BY GALOIS LATTICES

PAULA BRITOFAC. ECONOMIA, UNIV. PORTO, PORTUGAL

[email protected]

GERALDINE POLAILLONSUPELEC, FRANCE

[email protected]

Page 2: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

GROUP INSTRUCTION FOOTBALL

Students Pr(0.08), Sec(0.88), Sup(0.03) yes (0.51), no(0.49), no_ans (0.01)

Retired Pr(0.92), Sec(0.05), Sup(0.03) yes (0.12), no(0.88)

Employed Pr(0.59), Sec(0.39), Sup(0.02) yes (0.22), no(0.78)

Small independents Pr(0.62), Sec(0.31), Sup(0.07) yes (0.32), no(0.67), no_ans (0.01)

Housewives Pr(0.93), Sec(0.07) yes (0.10), no(0.90)

Intermediate executives Pr(0.17), Sec(0.50), Sup(0.33) yes (0.26), no(0.73), no_ans (0.01)

Industrial workers Pr(0.73), Sec(0.27), Sup(0.01) yes (0.40), no(0.60)

Intellectual/ScientificExecutives

Sec(0.01), Sup(0.99) yes (0.22), no(0.78)

Other Pr(0.72), Sec(0.229), Sup(0.07)

yes (0.19), no(0.80), no_ans (0.01)

Manager /Independent Profession

Sec(0.02), Sup(0.98) yes (0.28), no(0.70), no_ans (0.02)

Businessmen Pr(0.08), Sec(0.33), Sup(0.58) yes (0.42), no(0.58)

Page 3: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Outline

• Probabilistic data

• Galois connections

• Lattice construction

• Application

• Conclusion and perspectives

Page 4: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Probabilistic data

A modal variable Y on a set E ={1, 2,…} with

domain O = {y1, y2, …, yk} is a mapping

Y : E M(O)

from E into the family M(O) of all distributions on O,

with values Y() = .

))(p(y,)),(p(y kk11

Page 5: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Probabilistic data

DEFINITIONA modal event is a statement of the form

e = [Y() R {y1(p1), y2(p2), …, yk(pk)}]

where O = {y1, y2, …, yk} is the domain of Y,

and pj is the probability, frequency or weight of yj.

It is not imposed that p1+ p2+…+ pk= 1.

R is a relation on the set of distributions on O.

Page 6: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

We shall consider the following relations:

ii)     “” such that [Y() {y1(p1),…, yk(pk)}] is true iff

iii)    “” such that [Y() {y1(p1),…, yk(pk)}] is true iff

i)      “~” such that [Y() ~ {y1(p1),…, yk(pk)}] is true iff

k,,1,p)(p

k,,1,p)(p

k,,1,p)(p

Page 7: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

A modal object is a conjunction of modal events.

Each individual E is described by a modal probabilistic object:

))(i

ikp(iiky,)),(i

1p(i1y)(iY

p

1i)(s

Let S be the set of all modal objects defined on variables Y1, ..., Yp

Page 8: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Galois Connections THEOREM 1The couple of mappings

fu : S P(E)

s extE s = { E : s() s } 

gu : P(E) S

{ 1, … , k }  

with tji = Max {pj

i(h), h=1,...,k}, j=1,…,ki, i=1,..., p 

form a Galois connection between (P(E) , ) and (S , ). 

)t(y,),t(yYs ik

ik

i1

i1i

p

1i ii

Page 9: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Example

Let A = {G1, G2}

gu (A) =

fu (gu (A)) = {G1, G2}

  Sex Instruction

G1 Masc.(0.4), Fem.(0.6) Prim.(0.3), Sec.(0.4), Sup.(0.3)

G2 Masc.(0.1), Fem.(0.9) Prim.(0.1), Sec.(0.2), Sup.(0.7)

G3 Masc.(0.8), Fem.(0.2) Prim.(0.2), Sec.(0.3), Sup.(0.5)

G4 Masc.(0.5), Fem.(0.5) Prim.(0.3), Sec.(0.2), Sup.(0.5)

Sup.(0.7)(0.4),Sec.,Prim.(0.3)nInstructio

.(0.9)Fem(0.4),Masc.Sex

Page 10: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

THEOREM 2The couple of mappings

fi : S P(E)

s extE s ={ E : s() s }

 

gi : P(E) S

{ 1, … , k }

with tji = Min {pj

i(h), h=1,...,k}, j=1,…,ki, i=1,..., p

 

form a Galois connection between (P(E) , ) and (S, ).

)(ty,),(tyYs ik

ik

i1

i1i

p

1i ii

Page 11: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Example

Let B = {G2, G3}.

gi (B) =

fi (gi (B)) = {G2, G3, G4}

  Sex Instruction

G1 Masc.(0.4), Fem.(0.6) Prim.(0.3), Sec.(0.4), Sup.(0.3)

G2 Masc.(0.1), Fem.(0.9) Prim.(0.1), Sec.(0.2), Sup.(0.7)

G3 Masc.(0.8), Fem.(0.2) Prim.(0.2), Sec.(0.3), Sup.(0.5)

G4 Masc.(0.5), Fem.(0.5) Prim.(0.3), Sec.(0.2), Sup.(0.5)

Sup.(0.5)(0.2),Sec.,Prim.(0.1)nInstructio

.(0.2)Fem(0.1),Masc.Sex

Page 12: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

THEOREM 3

Let (fu,gu) be the Galois connection in theorem 1.

If and

we define s1 s2 =

with tji = Max {rj

i, qji}, j=1,…,ki, i=1,..., p

and s1 s2 =

with zji = Min {rj

i, qji}, j=1,…,ki, i=1,..., p.

 

The set of concepts, ordered by (A1 , s1) (A2 , s2) A1 A2 is a lattice where inf ( ( A1, s1 ), ( A2 , s2 ) ) = ( A1 A2 , (gu o fu) ( s1 s2 ))

sup (( A1, s1 ), ( A2 , s2 ) ) = ( (fu o gu) (A1 A2) , s1 s2 )

This lattice will be called “union lattice”.

)(,),( ik

ik

i1

i1i

p

1i1 ii

ryry~Ys

)(,),( ik

ik

i1

i1i

p

1i2 ii

qyqy~Ys

)t(y,),t(yY ik

ik

i1

i1i

p

1i ii

)z(y,),z(yY ik

ik

i1

i1i

p

1i ii

Page 13: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

The set of concepts, ordered by (A1 , s1) (A2 , s2) A1 A2

is a lattice where

inf ( ( A1, s1 ), ( A2 , s2 ) ) = ( A1 A2 , (gu o fu) ( s1 s2 ))

sup (( A1, s1 ), ( A2 , s2 ) ) = ( (fu o gu) (A1 A2) , s1 s2 )

This lattice will be called “union lattice”.

Page 14: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

THEOREM 4

Let (fi,gi) be the Galois connection in theorem 2.

If and

we define s1 s2 =

with tji = Min {rj

i, qji}, j=1,…,ki, i=1,..., p and

s1 s2 =

with zji = Max {rj

i, qji}, j=1,…,ki, i=1,..., p.

 

)(,),( ik

ik

i1

i1i

p

1i1 ii

ryry~Ys

)(,),( ik

ik

i1

i1i

p

1i2 ii

qyqy~Ys

)t(y,),t(yY ik

ik

i1

i1i

p

1i ii

)z(y,),z(yY ik

ik

i1

i1i

p

1i ii

Page 15: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

The set of concepts, ordered by (A1 , s1) (A2 , s2) A1 A2

is a lattice where

inf ( ( A1, s1 ), ( A2 , s2 ) ) = ( A1 A2 , (gi o fi) ( s1 s2 ))

sup (( A1, s1 ), ( A2 , s2 ) ) = ( (fi o gi) (A1 A2) , s1 s2 )

This lattice will be called “intersection lattice”.

Page 16: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Lattice construction

First step : find the set of concepts

Ganter's algorithm (Ganter & Wille, 1999) : searches for closed sets by exploring the closure space in a certain order.

To reduce computational complexity, we look for the concepts exploring the set of individuals.

Page 17: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Second Step : Graphical representation

Definition of a drawing diagram algorithm

Principle :

Use graph theory

When we insert a new concept in the graph, we explore all nodes of the existing graph only once.

The complexity of this algorithm is o(N²).

Page 18: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Comparison (Herrman & Hölldobler (1996) )

c : {{Ann, Bob, Chris}{Tinnitus (often), Blood Pressure (high)}, =0.85, =0.11}

From the intersection concept lattice:a7 i = [Tinnitus {often0.8), seldom0.0)}] ^ [Blood Pressure {high0.6), normal (0.1), low0.0)}]ext (a7 i) = {Ann, Bob, Chris}

From the union concept lattice:a7u = [Tinnitus {often1.0), seldom0.2)}] ^ [Blood Pressure {high0.9), normal (0.4), low0.0)}]ext (a7 u) = {Ann, Bob, Chris}

  Tinnitus Headache Blood PressureName Often Seldom Often Seldom High Normal Low

Ann 0.8 0.2 0.9 0.1 0.8 0.2 0.0

Bob 1.0 0.0 0.0 1.0 0.6 0.4 0.0

Chris 1.0 0.0 0.1 0.9 0.9 0.1 0.0

Doug 0.3 0.7 0.7 0.3 0.0 0.6 0.4

Eve 0.6 0.4 0.7 0.3 0.0 0.8 0.2

Retrouver la signification du alpha et du omega
Page 19: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Employment data Variables :• Marital status• Education• Economic activity (CEA)• Profession• Searching employment • Full/part time.

12 Groups : Gender Age group

Page 20: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES
Page 21: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

a19 =[Status {widow0.01) divorced0.14) married0.94) single0.71)}] ^

[CEA {agriculture, cattle, hunt, forestry & fishing0.10), construction0.26), other services0.34), real estate, renting & business activities0.06), wholesale and retail trade, repairs0.18), public administration0.09), manufacturing0.38), transport, storage & communication0.08), hotels & restaurants0.10), electricity, gas & water0.02), financial intermediation0.02), mining & quarrying0.01)}] ^

[Profession {skilled agriculture and fishery workers0.08), elementary occupations0.19), plant and machine operators and assemblers0.14), craft and related trade workers0.42), professionals0.11), clerks0.14), service, shop & market sales workers0.28), technicians and associate professionals0.10), legislators, senior officers and managers0.13),armed forces0.02)}] ^

[Education {without education0.05), primary0.69), secondary0.42), superior0.16)}] ^

[Searching {search_no0.99), search_yes0.04)}] ^

[Part/Full {part time0.11), full time0.98)}]

Ext a19 = {Men 15-24, Men 35-44, Men 45-54, Women 15-24, Women 25-34, Women 35-44}

Page 22: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Lattice simplificationDEFINITION (Leclerc, 1994)

Let (P, ) be an ordered set.

An element x P is said to be sup-irreducible if it is not the supremum of a finite part of P not containing it.

Dually, we define an inf-irreducible element.

 

The notion of irreducible elements may be used to simplify the lattice graphical representation (Duquenne, 1987), (Godin et al, 1995).

Page 23: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES
Page 24: STRUCTURING PROBABILISTIC DATA  BY GALOIS LATTICES

Conclusion and Perspectives

• Extension of lattice theory to deal directly with probabilistic data – no prior transformation

• Two kinds of Galois connections• Algorithm to construct the Galois lattices on

these data• Limitation : size of the resulting lattice

Take order into account Simplification of the lattice