22
BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal likelihood integral for general Markov models Piotr Zwiernik IPAM -→ TU Eindhoven (With special thanks to Shaowei Lin.) Singular Learning Theory, AIM, Palo Alto, 13 Dec 2011 1 / 22 Asymptotic behaviour of the marginal likelihood integral for general Markov models

Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Asymptotic behaviour of the marginal likelihood integralfor general Markov models

Piotr ZwiernikIPAM −→ TU Eindhoven

(With special thanks to Shaowei Lin.)

Singular Learning Theory,AIM, Palo Alto, 13 Dec 2011

1 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 2: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Outline of the talk

The asymptotic behavior of the marginal likelihood integral andthe real log-canonical threshold.

The real log-canonical threshold for general Markov models.

q-fibers for (binary) general Markov models.

Polyhedral geometry for the deepest singularity.

2 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 3: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Discrete algebraic statistical model

random variable: X ∈ X , |X | <∞, p = (px)x∈X

probability simplex: ∆|X |−1 = p ∈ RX :∑

x∈X px = 1, px ≥ 0

model: p : Θ→ ∆|X |−1,M = p(Θ), p polynomial map

the true distribution of X: q ∈ ∆|X |−1

3 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 4: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Asymptotics of the marginal likelihood

random sample: X(N) = (X1, . . . ,XN)

marginal likelihood: ZN =∫

ΘL(θ; X(N))ψ(θ)dθ

stochastic complexity: FN = − log ZN ; entropy:S = −∑x∈X qx log qx

Bayesian Information Criterion (BIC)

If p−1(q) = θ lies in the interior of Θ and the Jacobian of p has fullrank then, as N →∞,

EFN = NS +d2

log N + O(1).

4 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 5: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

The general case (S. Watanabe)

Kullback-Leibler distance: K(θ) =∑m

i=1 qi log qipi(θ)

K(θ) ≥ 0 on ΘK(θ) = 0 only if p(θ) = q

zeta function on C: ζ(z) =∫

ΘK(θ)−zψ(θ)dθ

real log-canonical threshold: rlctΘ(K;ψ) is the smallest pole of ζits multiplicity: multΘ(K;ψ)

Theorem

With some compactness assumptions, as N →∞ thenEFN = NS + rlctΘ(K;ψ) log N + (multΘ(K;ψ)− 1) log log N + O(1)

5 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 6: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Asymptotics and q-fibers

RLCTΘ(K) = minθ∈Θ0 RLCTΘ0(K)

important distinction: if W0 neighbourhood of θ0 in Rd thenRLCTW0 (K) = RLCTθ0 (K)

RLCTΘ(K(θ)) = RLCTΘ(∑

x∈X (px(θ)− qx)2)

q-fiber: Θ = p−1(q)

θ0 /∈ Θ =⇒ rlctθ0(∑

x∈X (px(θ)− qx)2) =∞

6 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 7: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

The binary tripod tree model

b

bc

b

b

1

2

3h

1

X1 ⊥⊥ X2 ⊥⊥ X3|H or

a Bayesian network a tripod tree

seven free parameters

the codimension is zero

7 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 8: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Central moment parametrization

MT :

µ12 = 14(1−s2) η1η2,

µ13 = 14(1−s2) η1η3,

µ23 = 14(1−s2) η2η3,

µ123 = 14(1−s2)s η1η2η3

8 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 9: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Finite q-fibers

µ2123 + 4µ12µ13µ23 = ( 1

4 (1− s2)sη1η2η3)2 + 4( 14 (1− s2))3(η1η2η3)2

= ( 14 (1− s2)η1η2η3)2(s2 + 1− s2) = ( 1

4 (1− s2)η1η2η3)2

µ2123

µ2123+4µ12µ13µ23

=( 1

4 (1−s2)sη1η2η3)2

( 14 (1−s2)η1η2η3)2 = s2

µ2123+4µ12µ13µ23

µ223

=( 1

4 (1−s2)η1η2η3)2

( 14 (1−s2)η2η3)2 = η2

1 , and so on

well defined for q ∈MT such that µ12µ13µ23 6= 0

9 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 10: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Submodels and singularities

(draw four models,show that the fiber for X1 ⊥⊥ X2 ⊥⊥ X3 is a union of affine spaces)

10 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 11: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Sketch picture

b

b

b

b

b

µ12 = µ13 = 0

µ12 = µ23 = 0

µ13 = µ23 = 0

δ2 = 1η1 = 0η2 = 0

η3 = 0

ΩT

MT

11 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 12: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

This generalizes

(X1, . . . ,Xn) ∈ 0, 1n represented by leaves of T

12 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 13: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

The general case

let q ∈MT and Σ = [µij]i,j∈[n] the covariance matrix

the asymptotics of EFN determined by zeros in Σ (marginalindependencies).

b

bc b

b

b

b

b

b

b

b

b

bc

bcbc

bc

bc

bc

bc

T1

T2

T3

S1

S2

S3

1

13 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 14: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

General formula

Theorem [Z.]

if p−1(q) is a manifold with corners (there are no degree zeronodes) then, as N →∞,

EFN = N S +1 + 2|E| − 2l2

2log N + O(1).

for trivalent trees

EFN = N S +

(1 + 2|E| − 2l2

2− 5l0

4

)log N − c log log N + O(1),

where c is a nonnegative integer. Moreover c = 0 always if eitherboth r is nondegenerate or if r and all its neighbors aredegenerate.

14 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 15: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Step 1: Reparametrization

tree cumulants: κ = (κI)I⊆[n], I 6=∅, ω = ((sv)v∈V , (ηe)e∈E)

ΘT

fθω

p// ∆2n−1

fpκ

ΩT

fωθ

OO

ψT // KT

fκp

OO

κI(ω) = 14 (1− s2

r(I))∏

v∈N(I) sdeg(v)−2v

∏e∈E(I) ηe, for I ⊆ [n], |I| ≥ 2.

RLCTΘT (∑

x

(px(θ)− qx)2) = (

n2, 0) + RLCTΩT (

∑|I|≥2

(κI(ω)− κI)2)

15 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 16: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Step 2: The main reduction

b

bc b

b

b

b

b

b

b

b

b

bc

bcbc

bc

bc

bc

bc

T1

T2

T3

S1

S2

S3

1

note: if T is trivalent then T1, . . . ,Tk are trivalent

16 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 17: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

The deepest singularity

let κij = 0 for all i, j ∈ [n] (then κI = 0 for all I ⊆ [n])

RLCTω0(∑

I(κI(ω)− κI)2) = RLCTω0(

∑i,j∈[n] κ

2ij(ω)) for ω0 ∈ Ω.

the q-fiber is given by a union of affine subspaces withnon-empty common intersection denoted by Ωdeep

RLCTω0(K) ≤ RLCTω(K) for every ω ∈ Ω and ω0 ∈ Ωdeep

17 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 18: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

The monomial case

if ω0 ∈ Ωdeep then

RLCTω0(∑

i,j∈[n] κ2ij(ω)) = RLCT0(

∑i,j∈[n] m2

ij(ω)),

where mij(ω) = sr(ij)∏

e∈E(ij) ηe

e.g. (quartet)

m12 = saηa1ηa2, m13 = saηa1ηabηb3 and m34 = sbηb3ηb4

18 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 19: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

The Newton diagram method

Theorem

Let f (x) =∑α cαx2α and Γ+ = conv(2α : cα6=0) + Rn

≥0.

Then RLCT0(f ) = ( 1t , c), where and t be the smallest such that

(t, . . . , t) ∈ Γ+ and c is the codimension of the face hit by(t, . . . , t).

19 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 20: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

rlct from the Newton diagram

coordinates of the ambient space: (xe), (yv)

distinguised facet:∑

e∈Etermxe ≥ 4

show rlct0(∑

ij m2ij) = n

4 by:

note t < 4n then (t, . . . , t) /∈ Γ+

constructing a point P ∈ Γ+ such that P ≤ 4n 1

idea: for n = 41

2

4

5

3

a b

1

2

4

5

3

a b

2 · (2, 0; 2, 2, 0, 0, 0) and 2 · (0, 2; 0, 0, 0, 2, 2) givesP = 1

4 (4, 4; 4, 4, 0, 4, 4) ≤ (1, 1; 1, 1, 1, 1, 1).

20 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 21: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

rlct from the Newton diagram 2

for n = 5

1

2

4

5

3

a b

v

6

7

1

2

4

5

3

a b

v

6

7

gives 15 (4, 4, 2; 4, 4, 0, 4, 4, 4, 4) ≤ 4

5 1.

21 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models

Page 22: Asymptotic behaviour of the marginal likelihood integral ... · BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances Asymptotic behaviour of the marginal

BIC and RLCT q-fibers of GMMs Asymptotics for GMMs The case of zero covariances

Conclusions and remarks

Understanding of q-fibers is essential

In our case the degenerate cases corresponded to graphicalsubmodels

Tree remains important even for the Newton diagram method.

generalization to Gaussian models

22 / 22Asymptotic behaviour of the marginal likelihood integral for general Markov models