156
Natural Image Statistics Siwei Lyu SUNY Albany CIFAR NCAP Summer School 2009 August 7, 2009 Thanks to Eero Simoncelli for sharing some of the slides 1 Monday, August 10, 2009

Siwei lyu: natural image statistics

  • Upload
    zukun

  • View
    632

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Siwei lyu: natural image statistics

Natural Image Statistics

Siwei Lyu

SUNY Albany

CIFAR NCAP Summer School 2009August 7, 2009

Thanks to Eero Simoncelli for sharing some of the slides

1Monday, August 10, 2009

Page 2: Siwei lyu: natural image statistics

big numbers

2Monday, August 10, 2009

Page 3: Siwei lyu: natural image statistics

big numbers• seconds since big bang: ~ 1017

2Monday, August 10, 2009

Page 4: Siwei lyu: natural image statistics

big numbers• seconds since big bang: ~ 1017

• atoms in the universe: ∼1080

2Monday, August 10, 2009

Page 5: Siwei lyu: natural image statistics

big numbers• seconds since big bang: ~ 1017

• atoms in the universe: ∼1080

• 65×65 8-bit gray-scale images: ∼1010000

2Monday, August 10, 2009

Page 6: Siwei lyu: natural image statistics

... ...

3Monday, August 10, 2009

Page 7: Siwei lyu: natural image statistics

... ...

“The distribution of natural images is complicated. Perhaps it is something like beer foam, which is mostly empty but contains a thin mesh-work of fluid which fills the space and occupies almost no volume. The fluid region represents those images which are natural in character.”

[Ruderman 1996]

images are not random3Monday, August 10, 2009

Page 8: Siwei lyu: natural image statistics

natural image statistics• natural images are rare in image space• they distinguish by

nonrandom structures• common statistical

properties of naturalimages is the focalelement in the studyof natural imagestatistics

4Monday, August 10, 2009

Page 9: Siwei lyu: natural image statistics

Retina

LGN

Visual!

cortex

why are we interested innatural image statistics?

5Monday, August 10, 2009

Page 10: Siwei lyu: natural image statistics

Retina

LGN

Visual!

cortex

why are we interested innatural image statistics?

5Monday, August 10, 2009

Page 11: Siwei lyu: natural image statistics

Retina

LGN

Visual!

cortex

why are we interested innatural image statistics?

5Monday, August 10, 2009

Page 12: Siwei lyu: natural image statistics

Retina

LGN

Visual!

cortex

why are we interested innatural image statistics?

5Monday, August 10, 2009

Page 13: Siwei lyu: natural image statistics

= ? + noise

(c)

(a)

(b)

!

6Monday, August 10, 2009

Page 14: Siwei lyu: natural image statistics

= ? + noise

(c)

(a)

(b)

!

6Monday, August 10, 2009

Page 15: Siwei lyu: natural image statistics

computer vision applications• image restoration

- de-noising, de-blurring and de-mosaicing, super-resolution and in-painting

• image compression

• texture synthesis

• image segmentation

• features for object detection and classification (SIFT, gist, “primal sketch”, saliency, etc)

• many others

7Monday, August 10, 2009

Page 16: Siwei lyu: natural image statistics

natural!

image!

statistics

math

statisticsbiology

computer!

science

image!

processing

machine!

learning

computer!

vision

optimization

perception

neuro-!

science

signal!

processing

8Monday, August 10, 2009

Page 17: Siwei lyu: natural image statistics

scope of this tutorial• important developments following a

general theme• focusing on concepts

- light on math or specific applications• gray-scale intensity image, do not cover

- color- time (video)- multi-image information (stereo)

9Monday, August 10, 2009

Page 18: Siwei lyu: natural image statistics

representation

main components

10Monday, August 10, 2009

Page 19: Siwei lyu: natural image statistics

representationNeural characterization

Ingredients:

• stimuli• response model• estimation method

Transform Representation

11Monday, August 10, 2009

Page 20: Siwei lyu: natural image statistics

why representation matters? • example (from David Marr)• representation for numbers

• Arabic: 123• Roman: MCXXIII• binary: 1111011• English: one hundred and twenty

three

12Monday, August 10, 2009

Page 21: Siwei lyu: natural image statistics

why representation matters? • example (from David Marr)• representation for numbers

• Arabic: 123 × 10• Roman: MCXXIII × X• binary: 1111011 × 110• English: one hundred and twenty

three × ten

13Monday, August 10, 2009

Page 22: Siwei lyu: natural image statistics

why representation matters? • example (from David Marr)• representation for numbers

• Arabic: 123 × 4• Roman: MCXXIII × IV• binary: 1111011 × 100• English: one hundred and twenty

three × four

14Monday, August 10, 2009

Page 23: Siwei lyu: natural image statistics

linear representations• pixel

• Fourier

• wavelet- localized- oriented

15Monday, August 10, 2009

Page 24: Siwei lyu: natural image statistics

representation

observations

main components

16Monday, August 10, 2009

Page 25: Siwei lyu: natural image statistics

• calibrated - linearized response

• relatively large number

image data

[van Hateren & van der Schaaf, 1998]17Monday, August 10, 2009

Page 26: Siwei lyu: natural image statistics

observations• second-order pixel correlations

• 1/f power law of frequency domain energy

• importance of phases

• heavy-tail non-Gaussian marginals in wavelet domain

• near elliptical shape of joint densities in wavelet domain

• decay of dependency in wavelet domain

• ………….

18Monday, August 10, 2009

Page 27: Siwei lyu: natural image statistics

representation

observations

model

main components

19Monday, August 10, 2009

Page 28: Siwei lyu: natural image statistics

models• physical imaging process (e.g., occlusion)

• nonlinear manifold of natural images

• non-parametric implicit model based on large set of images

• matching statistics of natural image signalswith density models <-- our focus

F -11/f2

P(c)

Gaussian model is weak

a. b.

!2F F!1

-20 20

-20

20

4-4

4

-420-20

20

-20

a. b. c.

P(x)

F!1!!2naturalimages

all possible images

20Monday, August 10, 2009

Page 29: Siwei lyu: natural image statistics

representation

observations

applications

model

main components

Bayesian framework

21Monday, August 10, 2009

Page 30: Siwei lyu: natural image statistics

Bayesian framework

• x’(y): estimator

• L(x,x’(y)): loss functional

• p(x): prior model for natural images

• p(y|x): likelihood -- from corruption process

image (x) obs. (y)corruption??

est. (x’)

minx!

!

xL(x, x!(y))p(x|y)dx

! minx!

!

xL(x, x!(y))p(y|x)p(x)dx

22Monday, August 10, 2009

Page 31: Siwei lyu: natural image statistics

application: Bayesian denoising• additive Gaussian noise

• maximum a posterior (MAP)

• minimum mean squares error (MMSE)

xMAP = argmaxx

p(x|y) = argmaxx

p(y|x)p(x)

xMMES = argminx!

!

x!x" x!!2p(x|y)dx

="

x xp(y|x)p(x)dx"x p(y|x)p(x)dx

= E(x|y)

y = x + w

p(y|x) ! exp["(y " x)2/2!2w]

23Monday, August 10, 2009

Page 32: Siwei lyu: natural image statistics

representation

observations

applications

model

main components

efficient coding

24Monday, August 10, 2009

Page 33: Siwei lyu: natural image statistics

representation

Neural characterization

Ingredients:

• stimuli• response model• estimation method

Transform Representation

• unsupervised learning

• specify desired properties of the transform outputs

what are such properties?

25Monday, August 10, 2009

Page 34: Siwei lyu: natural image statistics

what makes a good representation?• intuitively, transformed signal should be

“simpler”- reduced dimensionality

!"#$%&'(#)%"*#%*+,##-$"*.",/01)1

2)(,$&%*3'#)-$$-4561'",

7-8,1*.9:*;")<-$1)#0

=>

+?.*-8,@A/-*BCD

! !"#$%&'()*+&%$",-$%(%*.,/&01#,2%,3%(%1".&&45#)&2$#,2(+%5(1(%$&1%#21,%1-,%5#)&2$#,2$" !"#$#%&&'()*+,*-$)./0)$1*)*&/"2%$#/")/.)$1*),&/34()$1*0*)#5)"/)%--%0*"$)5$03,$30*)#")$1*)5*$)/.)-/#"$5

" 61//5#"2)%")%--0/-0#%$*)0/$%$#/")%&&/75)35)$/)3"8*#&)$1*)3"4*0&'#"2)5$03,$30*9):;/3),%")$1#"<)/.)$1#5)0/$%$#/")%5)=7%&<#"2)%0/3"4=)$1*)$10**>4#?*"5#/"%&)5*$()&//<#"2)./0)$1*)@*5$)8#*7-/#"$A

! 678%0(2%"&+*%3#25%$90"%925&.+:#2;%$1.9019.&<%=1%$&+&01$%(%.,1(1#,2%$90"%1"(1%),$1%,3%1"&%>(.#(?#+#1:%-#1"#2%1"&%5(1(%$&1%#$%.&*.&$&21&5%#2%1"&%3#.$1%3&-%5#)&2$#,2$%,3%1"&%.,1(1&5%5(1(

" !")/30)$10**>4#?*"5#/"%&),%5*()$1#5)?%')5**?)/.)&#$$&*)35*

" B/7*8*0()71*")$1*)4%$%)#5)1#21&')?3&$#4#?*"5#/"%&):CDE5)/.)4#?*"5#/"5A()$1#5)%"%&'5#5)#5)F3#$*)-/7*0.3&

26Monday, August 10, 2009

Page 35: Siwei lyu: natural image statistics

what makes a good representation?• intuitively, transformed signal should be

“simpler”- reduced dependency

r has less dependency than x

- optimum: r is independent,

• reducing dependency is a general approach to relieve the curse of dimensionality

• are there dependency in natural images?

x r

p(r) =d!

i=1

p(ri)

27Monday, August 10, 2009

Page 36: Siwei lyu: natural image statistics

redundancy in natural images• structure = predictability = redundancy

[Kersten, 1987]

28Monday, August 10, 2009

Page 37: Siwei lyu: natural image statistics

measure of statistical dependency

multi-information (MI):

[Studeny and Vejnarova, 1998]

I(!x) = DKL

!p(!x)

"""""#

k

p(xk)

$

=%

!xp(!x) log

p(!x)&k p(xk)

d!x

=d'

i=1

H(xk)!H(!x)

29Monday, August 10, 2009

Page 38: Siwei lyu: natural image statistics

efficient coding [Attneave ’54; Barlow ’61; Laughlin ’81; Atick ’90; Bialek etal ‘91]

• maximize mutual information of stimulus & response, subject to constraints (e.g. metabolic)

• noiseless case => redundancy reduction:

- independent components- efficient (maxEnt) marginals

I(r, x) = H(r)!H(r|x)x r

H(r|x) = 0! I(r, x) = H(r) =d!

i=1

H(ri)" I(r)

30Monday, August 10, 2009

Page 39: Siwei lyu: natural image statistics

representation

observations

applications

model

main components

31Monday, August 10, 2009

Page 40: Siwei lyu: natural image statistics

closed loop

representation

observations

applications

model

32Monday, August 10, 2009

Page 41: Siwei lyu: natural image statistics

pixel domain

33Monday, August 10, 2009

Page 42: Siwei lyu: natural image statistics

I(x,y)

I(x+1

,y)

I(x,y)

I(x+2

,y)

I(x,y)

I(x+4

,y)

10 20 30 400

1

Spatial separation (pixels)

Corre

lation

a. b.

I(x,y)

I(x+1

,y)

I(x,y)

I(x+2

,y)

I(x,y)

I(x+4

,y)

10 20 30 400

1

Spatial separation (pixels)

Corre

lation

a. b.

observation

34Monday, August 10, 2009

Page 43: Siwei lyu: natural image statistics

model• maximum entropy density [Jaynes 54]

- assume zero mean- : consistent w/ second order statistics- find with maximum entropy- solution:

! = E(!x!xT )

p(!x)

p(!x) ! exp!"1

2!xT !!1!x

"

35Monday, August 10, 2009

Page 44: Siwei lyu: natural image statistics

Gaussian model for Bayesian denoising

• additive Gaussian noise

• Gaussian model

• posterior density (another Gaussian)

• inference (Wiener filter)

p(!x) ! exp!"1

2!xT !!1!x

"

p(!x|!x) ! exp!"1

2!xT !!1!x" #!x" !y#2

2"2w

"

!xMAP = !xMMSE = !(! + "2wI)!1!y

!y = !x + !w

p(!y|!x) ! exp["#!y " !x#2/2"2w]

36Monday, August 10, 2009

Page 45: Siwei lyu: natural image statistics

efficient coding transform• for Gaussian p(x)

• minimum (independent) when Σ is diagonal

• a transform that diagonalizes Σ can eliminate all dependencies (second-order)

I(!x) !d!

i=1

log(!)ii " log det(!)

37Monday, August 10, 2009

Page 46: Siwei lyu: natural image statistics

PCA• eigen-decomposition of Σ:

- U: orthonormal matrix (rotation) UTU = UUT = I- Λ: diagonal matrix, Λii ≥ 0 -- eigenvalue

• s = UTx, or x = Us, s is independent Gaussian

• principal component analysis (PCA) - Karhunen Loeve transform

! = U"UT

E{UT !x(UT !x)T } = UT E{!x!xT }U= UT U!UT U = !

38Monday, August 10, 2009

Page 47: Siwei lyu: natural image statistics

PCA

!x

39Monday, August 10, 2009

Page 48: Siwei lyu: natural image statistics

PCA

!x

!xPCA = UT !x

40Monday, August 10, 2009

Page 49: Siwei lyu: natural image statistics

PCA bases learned from natural images (U)

41Monday, August 10, 2009

Page 50: Siwei lyu: natural image statistics

representation• PCA is for local patches

- data dependent- expensive for large images

• assume translation invariance cyclic boundary handling - image lattice on a torus - covariance matrix is block circulant - eigenvectors are complex exponential - diagonalized (decorrelated) with DFT - PCA => Fourier representation

42Monday, August 10, 2009

Page 51: Siwei lyu: natural image statistics

observations• spectral power

[Field, 1994]

figure from [Simoncelli 05]

Spectral power

Structural:

F (s!) = spF (!)

F (!) ! 1!p

[Ritterman 52; DeRiugin 56; Field 87; Tolhurst 92; Ruderman/Bialek 94; ...]

Assume scale-invariance:

then:

0 1 2 30

1

2

3

4

5

6

Log10

spatialfrequency (cycles/image)

Lo

g 10 p

ow

er

Empirical:

43Monday, August 10, 2009

Page 52: Siwei lyu: natural image statistics

model• power law

- scale invariance• denoising (Wiener filter in frequency

domain)

X̂(!) =A/!!

A/!! + "2· Y (!)

F (s!) = spF (!)

F (!) =A

!!

44Monday, August 10, 2009

Page 53: Siwei lyu: natural image statistics

[Torralba and Oliva, 2003]

further observations

45Monday, August 10, 2009

Page 54: Siwei lyu: natural image statistics

PCA

whitening

!x

UT !x

! = U"UT

!!12 UT !x

V !!12 UT !xnot unique!

46Monday, August 10, 2009

Page 55: Siwei lyu: natural image statistics

zero-phase (symmetric) whitening (ZCA)

minimum wiring lengthreceptive fields of retina neurons [Atick & Redlich, 92]

A!1 = U!!12 UT

47Monday, August 10, 2009

Page 56: Siwei lyu: natural image statistics

second-order constraints are weak

F -11/f2

P(c)

Gaussian model is weak

a. b.

!2F F!1

P(x)

F!1!!2

figure courtesy of Eero Simoncelli

1/ϖγ Gaussian sample

whitened natural image

48Monday, August 10, 2009

Page 57: Siwei lyu: natural image statistics

pixel

summary

49Monday, August 10, 2009

Page 58: Siwei lyu: natural image statistics

pixel

second order

summary

49Monday, August 10, 2009

Page 59: Siwei lyu: natural image statistics

pixel

second order

Gaussian

summary

49Monday, August 10, 2009

Page 60: Siwei lyu: natural image statistics

pixel

second order

Gaussian

summary

PCA/Fourier

49Monday, August 10, 2009

Page 61: Siwei lyu: natural image statistics

pixel

second order

Gaussian

summary

PCA/Fourier

power spectrum

49Monday, August 10, 2009

Page 62: Siwei lyu: natural image statistics

pixel

second order

Gaussian

summary

PCA/Fourier

power spectrum

power law model

49Monday, August 10, 2009

Page 63: Siwei lyu: natural image statistics

pixel

second order

Gaussian

summary

Not enough!

PCA/Fourier

power spectrum

power law model

49Monday, August 10, 2009

Page 64: Siwei lyu: natural image statistics

bandpass filter domain

! =

50Monday, August 10, 2009

Page 65: Siwei lyu: natural image statistics

• sparseness

[Burt&Adelson 82; Field 87; Mallat 89; Daugman 89, ...]

observation

log

p(x)

0

51Monday, August 10, 2009

Page 66: Siwei lyu: natural image statistics

model• if we only enforce consistency on 1D

marginal densities, i.e., p(xi) = qi(xi)- maximum entropic density is the factorial density

- multi-information is non-negative, and achieves minimum (zero) when xis are independent

• there are second order dependencies, so derived model is a linearly transformed factorial (LTF) model

p(!x) =!d

i=1 qi(xi)

H(!x) =!

i H(xi)! I(!x)

52Monday, August 10, 2009

Page 67: Siwei lyu: natural image statistics

model• linearly transformed factorial (LTF)

- independent sources:- A: invertible linear transform (basis)

- A-1: filters for analysis

p(!s) =!d

i=1 p(si)

!x = A!s =

!

"| · · · |

!a1 · · · !ad

| · · · |

#

$

!

%"s1...sd

#

&$

= s1!a1 + · · · + sd!ad

!s = A!1!x

53Monday, August 10, 2009

Page 68: Siwei lyu: natural image statistics

LTF model• SVD of matrix A:

- U,V: orthonormal matrices (rotation) UTU = UUT = I and VTV = VVT = I - Λ: diagonal matrix (Λii)1/2 ≥ 0 -- singular value

s x = U Λ1/2VTs

rotation scale rotation

A = U!1/2V T

VTs Λ1/2VTs

54Monday, August 10, 2009

Page 69: Siwei lyu: natural image statistics

marginal model• well fit with generalized Gaussian

Marginal densities

P (x) ! exp"|x/s|p

[Mallat 89; Simoncelli&Adelson 96; Moulin&Liu 99; ...]

Well-fit by a generalized Gaussian:

Wavelet coefficient value

log

(Pro

ba

bili

ty)

p = 0.46

!H/H = 0.0031

Wavelet coefficient value

log

(Pro

ba

bili

ty)

p = 0.58

!H/H = 0.0011

Wavelet coefficient valuelo

g(P

rob

ab

ility

)

p = 0.48

!H/H = 0.0014

Wavelet coefficient value

log

(Pro

ba

bili

ty)

p = 0.59

!H/H = 0.0012

Fig. 4. Log histograms of a single wavelet subband of four example images (see Fig. 1 for image description). For eachhistogram, tails are truncated so as to show 99.8% of the distribution. Also shown (dashed lines) are fitted model densitiescorresponding to equation (3). Text indicates the maximum-likelihood value of p used for the fitted model density, andthe relative entropy (Kullback-Leibler divergence) of the model and histogram, as a fraction of the total entropy of thehistogram.

non-Gaussian than others. By the mid 1990s, a numberof authors had developed methods of optimizing a ba-sis of filters in order to to maximize the non-Gaussianityof the responses [e.g., 36, 4]. Often these methods oper-ate by optimizing a higher-order statistic such as kurto-sis (the fourth moment divided by the squared variance).The resulting basis sets contain oriented filters of differentsizes with frequency bandwidths of roughly one octave.Figure 5 shows an example basis set, obtained by opti-mizing kurtosis of the marginal responses to an ensembleof 12 ! 12 pixel blocks drawn from a large ensemble ofnatural images. In parallel with these statistical develop-ments, authors from a variety of communities were devel-oping multi-scale orthonormal bases for signal and imageanalysis, now generically known as “wavelets” (see chap-ter 4.2 in this volume). These provide a good approxima-tion to optimized bases such as that shown in Fig. 5.

Once we’ve transformed the image to a multi-scalewavelet representation, what statistical model can we useto characterize the the coefficients? The statistical moti-vation for the choice of basis came from the shape of themarginals, and thus it would seem natural to assume thatthe coefficients within a subband are independent andidentically distributed. With this assumption, the modelis completely determined by the marginal statistics of thecoefficients, which can be examined empirically as in theexamples of Fig. 4. For natural images, these histogramsare surprisingly well described by a two-parameter gen-eralized Gaussian (also known as a stretched, or generalizedexponential) distribution [e.g., 31, 47, 34]:

Pc(c; s, p) =exp("|c/s|p)

Z(s, p), (3)

where the normalization constant is Z(s, p) = 2 sp!( 1

p ).An exponent of p = 2 corresponds to a Gaussian den-sity, and p = 1 corresponds to the Laplacian density. In

Fig. 5. Example basis functions derived by optimizing amarginal kurtosis criterion [see 35].

5

[Mallat 89; Simoncelli&Adelson 96; Moulin&Liu 99; …]

p(s) ! exp!" |s|p

!

"

55Monday, August 10, 2009

Page 70: Siwei lyu: natural image statistics

Bayesian denoisingII. BLS for non-Gaussian prior

• Assume marginal distribution [Mallat ‘89]:

• Then Bayes estimator is generally nonlinear:

P (x) ! exp"|x/s|p

p = 2.0 p = 1.0 p = 0.5

[Simoncelli & Adelson, ‘96]

56Monday, August 10, 2009

Page 71: Siwei lyu: natural image statistics

scale mixture of Gaussians (GSM)•

- u: zero mean Gaussian with unit variance- z: positive random variable- special cases (different p(z)) generalized Gaussian, Student’s t, Bessel’s K, Cauchy, α-stable, etc

p(x)

x

p(x)

x

=

[Andrews & Mallows 74, Wainwright & Simoncelli, 99]x = u

!z

57Monday, August 10, 2009

Page 72: Siwei lyu: natural image statistics

efficient coding transform• LTF model => independent component

analysis (ICA) [Comon 94; Cardoso 96; Bell/Sejnowski 97; …]

- many different implementations (JADE, InfoMax, FastICA, etc.)- interpretation using SVD

- where to get U!s = A!1!x = V !!1/2UT !x

E{!x!xT } = AE{!s!sT }AT

= U!1/2V T IV !1/2UT

= U!UT

58Monday, August 10, 2009

Page 73: Siwei lyu: natural image statistics

PCAICA

!x

59Monday, August 10, 2009

Page 74: Siwei lyu: natural image statistics

PCAICA

!x

!xPCA = UT !x

59Monday, August 10, 2009

Page 75: Siwei lyu: natural image statistics

PCAICA

!x

!xwht = !!12 UT !x

!xPCA = UT !x

59Monday, August 10, 2009

Page 76: Siwei lyu: natural image statistics

PCAICA

!x

!xwht = !!12 UT !x

!xPCA = UT !x

!xICA = V !!12 UT !x

59Monday, August 10, 2009

Page 77: Siwei lyu: natural image statistics

• find final rotation that maximizes non-Gaussianity - linear mixing makes more Gaussian (CLT)- equivalent to maximize sparseness

Computational Vision & Neuroscience Group

/73

Higher-order redundancy reduction:Independent Component Analysis (ICA)

Find the most non-Gaussian directions:

48finding V

figure from [Bethge 2008]60Monday, August 10, 2009

Page 78: Siwei lyu: natural image statistics

ICA bases (squared columns of A) learned from natural images- similar shape to receptive field of V1 simple cells [Olshausen & Field 1996, Bell & Sejnowski 1997]

61Monday, August 10, 2009

Page 79: Siwei lyu: natural image statistics

break

62Monday, August 10, 2009

Page 80: Siwei lyu: natural image statistics

representation• ICA basis resemble wavelet and other

multi-scale oriented linear representations- localized in spatial location, frequency band and local orientation

• ICA basis are learned from data, while wavelet basis are fixed

Gabor wavelet basis

63Monday, August 10, 2009

Page 81: Siwei lyu: natural image statistics

band-pass

summary

64Monday, August 10, 2009

Page 82: Siwei lyu: natural image statistics

band-pass

sparsity

summary

64Monday, August 10, 2009

Page 83: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

64Monday, August 10, 2009

Page 84: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

ICA/wavelet

64Monday, August 10, 2009

Page 85: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

ICA/wavelet

higher-orderdependency

64Monday, August 10, 2009

Page 86: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

Not enough!

ICA/wavelet

higher-orderdependency

64Monday, August 10, 2009

Page 87: Siwei lyu: natural image statistics

problems with LTF• any band-pass or high-pass filter will

lead to heavy tail marginals (even random ones)

• if natural images are truly linear mixture of independent non-Gaussian sources, random projection (filtering) should look like Gaussian- central limit theorem

65Monday, August 10, 2009

Page 88: Siwei lyu: natural image statistics

problems with LTF

• Large-magnitude subband coefficients are found at

neighboring positions, orientations, and scales.

[Simoncelli ’97; Buccigrossi &Simoncelli ’99]

66Monday, August 10, 2009

Page 89: Siwei lyu: natural image statistics

ICA achieves very little improvementover PCA in terms of dependency reduction [Bethge 06, Lyu & Simoncelli 08]

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits

/co

eff)

Separation

raw

pca/ica

67Monday, August 10, 2009

Page 90: Siwei lyu: natural image statistics

LTF also a weak model...

Sample Gaussianized

Sample ICA-transformed

and Gaussianized

figure courtesy of Eero Simoncelli

sample from LTFnatural images after

ICA filtering

68Monday, August 10, 2009

Page 91: Siwei lyu: natural image statistics

remedy• assumptions in LTF model and ICA

- factorial marginals for filter outputs- linear combination- invertible

69Monday, August 10, 2009

Page 92: Siwei lyu: natural image statistics

remedy• assumptions in LTF model and ICA

- factorial marginals for filter outputs- linear combination- invertible

• model => [Zhu, Wu & Mumford 1997; Portilla & Simoncelli 2000]MaxEnt joint density with constraints on filter output

• representation => sparse coding [Olshausen & Field 1996]- find filters giving optimum sparsity- compressed sensing [Candes & Donoho 2003]

70Monday, August 10, 2009

Page 93: Siwei lyu: natural image statistics

remedy• assumptions in LTF model and ICA

- factorial marginals for filter outputs- linear combination nonlinear - invertible

71Monday, August 10, 2009

Page 94: Siwei lyu: natural image statistics

joint density of natural image band-pass filter responses with separation of 2 pixels

72Monday, August 10, 2009

Page 95: Siwei lyu: natural image statistics

elliptically symmetric density

pesd(!x) =1

"|!| 12f

!!1

2!xT !!1!x

"pssd(!x) =

1"

f

!!1

2!xT !x

"

spherically symmetric density

whitening

(Fang et.al. 1990)

73Monday, August 10, 2009

Page 96: Siwei lyu: natural image statistics

74Monday, August 10, 2009

Page 97: Siwei lyu: natural image statistics

75Monday, August 10, 2009

Page 98: Siwei lyu: natural image statistics

3 6 9 12 15 18 200

0.05

0.1

0.15

0.2

kurtosis

blk size = 3x3

blksphericalfactorial

Spherical vs LTF

3x3 7x7 15x15

3 6 9 12 15 18 200

0.05

0.1

0.15

0.2

kurtosis

blk size = 7x7

blksphericalfactorial

3 6 9 12 15 18 200

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

kurtosis

blk size = 11x11

blksphericalfactorial

data (ICA’d): factorialized:sphericalized:

• Histograms, kurtosis of projections of image blocks onto random

unit-norm basis functions.

• These imply data are closer to spherical than factorial

[Lyu & Simoncelli 08]

76Monday, August 10, 2009

Page 99: Siwei lyu: natural image statistics

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

77Monday, August 10, 2009

Page 100: Siwei lyu: natural image statistics

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

78Monday, August 10, 2009

Page 101: Siwei lyu: natural image statistics

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

79Monday, August 10, 2009

Page 102: Siwei lyu: natural image statistics

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

80Monday, August 10, 2009

Page 103: Siwei lyu: natural image statistics

PCAICA

81Monday, August 10, 2009

Page 104: Siwei lyu: natural image statistics

PCAICA

82Monday, August 10, 2009

Page 105: Siwei lyu: natural image statistics

[Fang et.al. 1990]

- Simoncelli, 1997;- Zetzsche and Krieger, 1999;- Huang and Mumford, 1999; - Wainwright and Simoncelli, 2000; - Hyvärinen et al., 2000; - Parra et al., 2001; - Srivastava et al., 2002; - Sendur and Selesnick, 2002; - Teh et al., 2003; - Gehler and Welling, 2006- etc.

elliptical models of natural images

83Monday, August 10, 2009

Page 106: Siwei lyu: natural image statistics

p(x)

x

joint GSM model

!x = !u!

z

GSM simulation

!!" " !"#"

"

#"!

!!" " !"#"

"

#"!

Image data GSM simulation

[Wainwright & Simoncelli, NIPS*99]

84Monday, August 10, 2009

Page 107: Siwei lyu: natural image statistics

PCA/whitening

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

85Monday, August 10, 2009

Page 108: Siwei lyu: natural image statistics

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

PCA/whitening

ICA

86Monday, August 10, 2009

Page 109: Siwei lyu: natural image statistics

EllipticalLinearly

transformed factorial

Factorial Gaussian Spherical

PCA/whitening

???ICA

87Monday, August 10, 2009

Page 110: Siwei lyu: natural image statistics

nonlinear representations• complex wavelet phase-based [Ates &

Orchid, 2003]

• orientation-based [Hammand & Simoncelli 2006]

• nonlinear whitening [Gluckman 2005]

• local divisive normalization [Malo et.al. 2004]

• global divisive normalization [Lyu & Simoncelli 2007,2008]

88Monday, August 10, 2009

Page 111: Siwei lyu: natural image statistics

Gaussian is the only density that can be both factorial and spherically symmetric [Nash and Klamkin 1976]

p(!x) =1!

(2")dexp

"!!xT !x

2

#

=1!

(2")dexp

$!1

2

d%

i=1

x2i

&

=d'

i=1

1"2"

exp"!1

2x2

i

#

=d'

i=1

p(xi)

89Monday, August 10, 2009

Page 112: Siwei lyu: natural image statistics

PCAICA

?

90Monday, August 10, 2009

Page 113: Siwei lyu: natural image statistics

radial Gaussianization (RG)

[Lyu & Simoncelli, 2008,2009]91Monday, August 10, 2009

Page 114: Siwei lyu: natural image statistics

radial Gaussianization (RG)

[Lyu & Simoncelli, 2008,2009]92Monday, August 10, 2009

Page 115: Siwei lyu: natural image statistics

radial Gaussianization (RG)

p!(r) ! r exp("r2/2)

pr(r) ! rf("r2/2)

[Lyu & Simoncelli, 2008,2009]93Monday, August 10, 2009

Page 116: Siwei lyu: natural image statistics

radial Gaussianization (RG)

g(r) = F!1! Fr(r)

p!(r) ! r exp("r2/2)

pr(r) ! rf("r2/2)

[Lyu & Simoncelli, 2008,2009]94Monday, August 10, 2009

Page 117: Siwei lyu: natural image statistics

radial Gaussianization (RG)

!xrg =g(!!xwht!)!!xwht!

!xwht

g(r) = F!1! Fr(r)

p!(r) ! r exp("r2/2)

pr(r) ! rf("r2/2)

[Lyu & Simoncelli, 2008,2009]95Monday, August 10, 2009

Page 118: Siwei lyu: natural image statistics

96Monday, August 10, 2009

Page 119: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 120: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 121: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 122: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 123: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 124: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 125: Siwei lyu: natural image statistics

1 2 4 8 16 320

0.1

0.2

0.3

0.4

0.5

MI

(bits/c

oe

ff)

Separation

!

raw

pca/ica

rg

97Monday, August 10, 2009

Page 126: Siwei lyu: natural image statistics

0.2 0.3 0.4 0.5 0.6

0.2

0.3

0.4

0.5

0.6

0.7blk size = 3x3

0.6 0.7 0.8 0.9 1 1.1

0.6

0.7

0.8

0.9

1

1.1

1.2

1.3 blk size = 15x15

IPCA ! IRAW IPCA ! IRAW

blocks of local mean removed pixel blocks of natural images

(+)IRG ! IRAW

(!)IICA " IRAW

(Lyu & Simoncelli, Neural Computation, to appear)98Monday, August 10, 2009

Page 127: Siwei lyu: natural image statistics

PCAICA RG

unification asGaussianization?

99Monday, August 10, 2009

Page 128: Siwei lyu: natural image statistics

marginal Gaussianization

p(x)

x

p(y)

y

100Monday, August 10, 2009

Page 129: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

ICA/wavelet

101Monday, August 10, 2009

Page 130: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

ICA/wavelet

higher-orderdependency

101Monday, August 10, 2009

Page 131: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

ICA/wavelet

higher-orderdependency

elliptical model

101Monday, August 10, 2009

Page 132: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

ICA/wavelet

higher-orderdependency

elliptical model

RG

101Monday, August 10, 2009

Page 133: Siwei lyu: natural image statistics

band-pass

sparsity

LTF

summary

Not enough!

ICA/wavelet

higher-orderdependency

elliptical model

RG

101Monday, August 10, 2009

Page 134: Siwei lyu: natural image statistics

Joint densitiesadjacent near far other scale other ori

!100 0 100

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

!500 0 500

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

!500 0 500

!150

!100

!50

0

50

100

150

!100 0 100

!150

!100

!50

0

50

100

150

Fig. 8. Empirical joint distributions of wavelet coefficients associated with different pairs of basis functions, for a singleimage of a New York City street scene (see Fig. 1 for image description). The top row shows joint distributions as contourplots, with lines drawn at equal intervals of log probability. The three leftmost examples correspond to pairs of basis func-tions at the same scale and orientation, but separated by different spatial offsets. The next corresponds to a pair at adjacentscales (but the same orientation, and nearly the same position), and the rightmost corresponds to a pair at orthogonal orien-tations (but the same scale and nearly the same position). The bottom row shows corresponding conditional distributions:brightness corresponds to frequency of occurance, except that each column has been independently rescaled to fill the fullrange of intensities.

remain. First, although the normalized coefficients arecertainly closer to a homogeneous field, the signs of thecoefficients still exhibit important structure. Second, thevariance field itself is far from homogeneous, with mostof the significant values concentrated on one-dimensionalcontours.

4 Discussion

After nearly 50 years of Fourier/Gaussian modeling, thelate 1980s and 1990s saw sudden and remarkable shift inviewpoint, arising from the confluence of (a) multi-scaleimage decompositions, (b) non-Gaussian statistical obser-vations and descriptions, and (c) variance-adaptive sta-tistical models based on hidden variables. The improve-ments in image processing applications arising from theseideas have been steady and substantial. But the completesynthesis of these ideas, and development of further re-finements are still underway.

Variants of the GSMmodel described in the previous sec-tion seem to represent the current state-of-the-art, both interms of characterizing the density of coefficients, and interms of the quality of results in image processing appli-

cations. There are several issues that seem to be of pri-mary importance in trying to extend such models. First,a number of authors have examined different methods ofdescribing regularities in the local variance field. Theseinclude spatial random fields [23, 26, 24], and multiscaletree-structured models [40, 55]. Much of the structure inthe variance field may be attributed to discontinuous fea-tures such as edges, lines, or corners. There is a substan-tial literature in computer vision describing such struc-tures [e.g., 57, 32, 17, 27, 56], but it has proven difficultto establish models that are both explicit and flexible. Fi-nally, there have been several recent studies investigat-ing geometric regularities that arise from the continuityof contours and boundaries [45, 16, 19, 21, 60]. These andother image structures will undoubtedly be incorporatedinto future statistical models, leading to further improve-ments in image processing applications.

References

[1] F. Abramovich, T. Besbeas, and T. Sapatinas. Em-pirical Bayes approach to block wavelet function es-timation. Computational Statistics and Data Analysis,39:435–451, 2002.

9

[Simoncelli, ‘97; Wainwright&Simoncelli, ‘99]

• Nearby: densities are approximately circular/elliptical

• Distant: densities are approximately factorial

102Monday, August 10, 2009

Page 135: Siwei lyu: natural image statistics

extended models• independent subspace and topographical ICA

[Hoyer & Hyvarinen, 2001,2003; Karklin & Lewicki 2005]

• adaptive covariance structures [Hammond & Simoncelli, 2006; Guerrero-Colon et.al. 2008; Karklin & Lewicki 2009]

• product of t experts [Osindero et.al. 2003]

• fields of experts [Roth & Black, 2005]

• tree and fields of GSMs [Wainwright & Simoncelli, 2003; Lyu & Simoncelli, 2008]

• implicit MRF model [Lyu 2009]

103Monday, August 10, 2009

Page 136: Siwei lyu: natural image statistics

x field

z field

FoGSM: !xd= !u!

"!z

• !u : zero mean homogeneous Gauss MRF

• !z : exponentiated homogeneous Gauss MRF

• !x|!z : inhomogeneous Gauss MRF

• !x#"

!z : homogeneous Gauss MRF

• marginal distribution is GSM

• generative model: e!cient sampling

104Monday, August 10, 2009

Page 137: Siwei lyu: natural image statistics

x u log z

105Monday, August 10, 2009

Page 138: Siwei lyu: natural image statistics

! = 1 ! = 8 ! = 32 orientation scale

subband

FoGSM

simulation: subband pairwise joint density

Barbara boat house

subband, sample, · · · · · · Gaussian

simulation: marginal densities

marginal

joint

106Monday, August 10, 2009

Page 139: Siwei lyu: natural image statistics

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. X, NO. X, XX 200X 9

Barbara Lena Boats

! "#"! $! !# %! "##

!&

!'

!$

!"

#

"

!

"()*+,

! "#"! $! !# %! "##

!&

!'

!$

!"

#

"

!

"()*+,

! "#"! $! !# %! "##

!&

!'

!$

!"

#

"

!

"()*+,

Fig. 6. Performance comparison of denoising methods for three di!erent images. Plotted are di!erences in PSNR for di!erent input noise levels (!) betweenFoGSM and four other methods (! BM3D [37], " BLS-GSM [17], # kSVD [39] and $ FoE [27]). The PSNR values for these methods were taken fromcorresponding publications.

original image noisy image (! = 50) (PSNR = 14.15dB)

local GSM [17] (PSNR = 25.45dB) FoGSM (PSNR = 26.40dB)

Fig. 7. Denoising results using local GSM [17] and FoGSM.

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.

FoE

kSVD

GSM

BM3DFoGSM

[Lyu&Simoncelli, PAMI 08]

State-of-the-art denoising

20 ! log10255!"

i,j (Ioriginal(i, j)" Idenoised(i, j))2

peak-signal-to-noise-ratio (PSNR)

107Monday, August 10, 2009

Page 140: Siwei lyu: natural image statistics

original image noisy image (! = 25) matlab wiener2 FoGSM

(14.15dB) (27.19dB) (30.02dB)

108Monday, August 10, 2009

Page 141: Siwei lyu: natural image statistics

original image noisy image (! = 100) (8.13dB)

matlab wiener2(29.32dB) (18.38dB) FoGSM (23.01dB)

109Monday, August 10, 2009

Page 142: Siwei lyu: natural image statistics

pairwise conditional density

x1

x 2

p(x2|x1)

E(x2|x1)

E(x2|x1)+std (x2|x1)

E(x2|x1)-std (x2|x1)

“bow-tie”[Buccigrossi & Simoncelli, 97]

110Monday, August 10, 2009

Page 143: Siwei lyu: natural image statistics

pairwise conditional density

x1

x 2

E(x2|x1)

E(x2|x1)+std (x2|x1)

E(x2|x1)-std (x2|x1)

E(x2|x1) ! ax1

var(x2|x1) ! b + cx21

111Monday, August 10, 2009

Page 144: Siwei lyu: natural image statistics

conditional density

• maxEnt conditional density

- singleton conditionals- joint MRF density can be determined by all singletons (Brook’s lemma)

µi = E(xi|xj,j!N (i)) =!

j!N(i)

ajxj

!2i = var(xi|xj,j!N (i)) = b +

!

j!N(i)

cjx2j

p(xi|xj,j!N (i)) =1!2!"2

i

exp"! (xi ! µi)2

2"2i

#

112Monday, August 10, 2009

Page 145: Siwei lyu: natural image statistics

implicit MRF• defined by all singletons• joint density (and clique potential) is

implicit• learning: maximum pseudo-likelihood

113Monday, August 10, 2009

Page 146: Siwei lyu: natural image statistics

ICM-MAP denoising

argmax!x

p(!x|!y) = argmax!x

p(!y|!x)p(!x) = argmax!x

log p(!y|!x) + log p(!x)

- set initial value for !x(0), and t = 1

- repeat until convergence

- repeat for all i

- compute the current estimation for xi, as

x(t)i = argmax

xi

log p(x(t)1 , · · · , x(t)

i!1,

xi, x(t!1)i+1 , · · · , x(t!1)

d |!y).

- t! t + 1

114Monday, August 10, 2009

Page 147: Siwei lyu: natural image statistics

ICM-MAP denoisingargmax

xi

log p(!y|!x) + log p(!x)

= argmaxxi

log p(!y|!x) + log p(x1, · · · , xi!1, xi, xi+1, · · · , xn)

= argmaxxi

log p(!y|!x)! "# $can be further simplified

+ log p(xi|xj,j"N(i))! "# $singleton conditional

+ !!!!!!!!!!!!!!log p(xj,j"N(i))! "# $constant w.r.t xi

.

local adaptive and iterative Wiener filtering

xi =!2

w!2i

!2w + !2

i

!

" yi

!2w

+µi

!2i

!#

i!=j

wij(xj ! yj)

$

%

.

115Monday, August 10, 2009

Page 148: Siwei lyu: natural image statistics

summary

representation

observations

applications

model

116Monday, August 10, 2009

Page 149: Siwei lyu: natural image statistics

what need to be done• inhomogeneous structures

- structural (edge, contour, etc.)- textual (grass, leaves, etc.)- smooth (fog, sky, etc.)

• local orientations and relative phases

holy grail: comprehensive model & representations to capture all these variations

117Monday, August 10, 2009

Page 150: Siwei lyu: natural image statistics

big question marks• what are natural images, anyway?

• ironically, white noises are “natural” as they are the result of cosmic radiations

• naturalness is subjective

118Monday, August 10, 2009

Page 151: Siwei lyu: natural image statistics

peeling the onion

all possible images

naturalimages

119Monday, August 10, 2009

Page 152: Siwei lyu: natural image statistics

peeling the onion

images of same marginal stats

all possible images

naturalimages

119Monday, August 10, 2009

Page 153: Siwei lyu: natural image statistics

peeling the onion

images of same marginal stats

images of same second order stats

all possible images

naturalimages

119Monday, August 10, 2009

Page 154: Siwei lyu: natural image statistics

peeling the onion

images of same marginal stats

images of same second order stats

images of same higher order stats

all possible images

naturalimages

119Monday, August 10, 2009

Page 155: Siwei lyu: natural image statistics

resources• D. L. Ruderman. The statistics of natural images. Network:

Computation in Neural Systems, 5:517–548, 1996.• E. P. Simoncelli and B. Olshausen. Natural image statistics and

neural representation. Annual Review of Neuroscience, 24:1193–1216, 2001.

• S.-C. Zhu. Statistical modeling and conceptualization of visual patterns. IEEE Trans PAMI, 25(6), 2003

• A. Srivastava, A. B. Lee, E. P. Simoncelli, and S.-C. Zhu. On advances in statistical modeling of natural images. J. Math. Imaging and Vision, 18(1):17–33, 2003.

• E. P. Simoncelli. Statistical modeling of photographic images. In Handbook of Image and Video Processing, 431–441. Academic Press, 2005.

• A. Hyvärinen, J. Hurri, and P. O. Hoyer. Natural Image Statistics: A probabilistic approach to early computational vision. Springer, 2009.

120Monday, August 10, 2009

Page 156: Siwei lyu: natural image statistics

thank you

121Monday, August 10, 2009