17
2/12/2008 1 Math review Lecture 3 Overview Projects Removing JPEG artifcats: “The Duck Caper” Hot or Not Projects Projects David Hockney’s Joiners Projects 2D Panorama creation

3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

1

Math review

Lecture 3

Overview

Projects• Removing JPEG artifcats: “The Duck Caper”

• Hot or Not

Projects

Projects

• David Hockney’s Joiners

Projects

• 2‐D Panorama creation

Page 2: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

2

Projects

• Image Analogies

A A' B B'

Projects

• Seam carving

Projects

• Spectralresponsecurveestimtationestimtation

Overview of today

• Linear Algebra review

• Least Squares Optimization

• Linear Systems

• Fourier domain

Whiteboard PCA ‐ Faces

http://www.cs.princeton.edu/~cdecoro/eigenfaces/

Page 3: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

3

PCA ‐ Faces

http://www.cs.princeton.edu/~cdecoro/eigenfaces/

PCA ‐ Faces

http://www.cs.princeton.edu/~cdecoro/eigenfaces/

Overview of today

• Linear Algebra review

• Least Squares Optimization

• Linear Systems

• Fourier domain

Whiteboard

Convolution

∑ −−=⊗=lk

lkglnkmIgInmf,

],[],[],[

I

Slide credit: Bill Freeman

Linear filtering (warm-up slide)

cien

t

1 0 ?original

0Pixel offset

coef

fic 1.0 ?

Slide credit: Bill Freeman

Page 4: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

4

Linear filtering (warm-up slide)

cien

t

1 0

original0

Pixel offset

coef

fic 1.0

Filtered(no change)

Slide credit: Bill Freeman

Linear filtering

ffici

ent

1.0

?0

Pixel offset

coef

f

original

?

Slide credit: Bill Freeman

shift

ffici

ent

1.0

0Pixel offset

coef

f

original shifted

Slide credit: Bill Freeman

Linear filtering

ffici

ent

?0

Pixel offset

coef

f

original

0.3 ?

Slide credit: Bill Freeman

Blurring

ffici

ent

0Pixel offset

coef

f

original

0.3

Blurred (filterapplied in both dimensions).

Slide credit: Bill Freeman

Blur examples

0Pixel offset

coef

ficie

nt

0.3

original

8

filtered

2.4

impulse

Pixel offsetoriginal filtered

Slide credit: Bill Freeman

Page 5: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

5

Blur examples

impulse

0Pixel offset

coef

ficie

nt0.3

original

8

filtered

2.4

0Pixel offset

coef

ficie

nt

0.3

original

8

filtered

48

4edge

Pixel offsetoriginal filtered

Slide credit: Bill Freeman

Linear filtering (warm-up slide)

2.0

?1.0

original

0

?0

Slide credit: Bill Freeman

Linear filtering (no change)

2.0

1.0

original

0 0

Filtered(no change)

Slide credit: Bill Freeman

Linear filtering

2.0

0 33 ?original

0 0

0.33 ?

Slide credit: Bill Freeman

(remember blurring)

ffici

ent

0Pixel offset

coef

f

original

0.3

Blurred (filterapplied in both dimensions).

Slide credit: Bill Freeman

Sharpening

2.0

0 33

original

0 0

0.33

Sharpened original

Slide credit: Bill Freeman

Page 6: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

6

Sharpening example

cien

t8

11.21.7

8co

effi

-0.3original Sharpened

(differences areaccentuated; constant

areas are left untouched).

-0.25

Slide credit: Bill Freeman

Sharpening

before after

Slide credit: Bill Freeman

Spatial resolution and color

R

G

Boriginal

Slide credit: Bill Freeman

Blurring the G component

R

G

B

original processed

Slide credit: Bill Freeman

Blurring the R component

R

original processed

G

B

Slide credit: Bill Freeman

Blurring the B component

R

original

G

Bprocessed

Slide credit: Bill Freeman

Page 7: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

7

From W. E. Glenn, in Digital I dImages and Human Vision, MIT Press, edited by Watson, 1993

Slide credit: Bill Freeman

Lab color components

L

a

A rotation of the color coordinates into directions that are morea

b

are more perceptually meaningful: L: luminance, a: red-green, b: blue-yellow

Slide credit: Bill Freeman

Blurring the L Lab component

L

aa

boriginal processed

Slide credit: Bill Freeman

Blurring the a Lab component

L

a

original

a

bprocessed

Slide credit: Bill Freeman

Blurring the b Lab component

L

a

original

a

bprocessed

Slide credit: Bill Freeman

Overview of today

• Linear Algebra review• Least Squares Optimization• Linear Systems• Fourier domain

Page 8: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

8

Linear image transformations

• In analyzing images, it’s often useful to make a change of basis.

transformed image

Fourier transform, orWavelet transform, or

Steerable pyramid transform

fUFrr

= Vectorized image

transformed image

Slide credit: Bill Freeman

Self-inverting transforms

FUfrr

−= 1

Same basis functions are used for the inverse transform

FU

FUfr

+=

=

U transpose and complex conjugate

Slide credit: Bill Freeman

An example of such a transform: the Fourier transform

discrete domain

∑∑−

=

=

⎟⎠⎞

⎜⎝⎛ +−

=1

0

1

0

ln

],[],[M

k

N

l

NMkmi

elkfnmFπ

Forward transform

∑∑−

=

=

⎟⎠⎞

⎜⎝⎛ ++

=1

0

1

0

ln

],[1],[M

k

N

l

NMkmi

enmFMN

lkfπ

Inverse transform

Slide credit: Bill Freeman

To get some sense of what basis elements look like, we plot a basis element --- or rather, its real part ---as a function of x,y for some fixed u, v. We get a function that is constant when (ux+vy) is constant. The magnitude of the vector (u, v) gives a frequency, and its direction gives an orientation. The gfunction is a sinusoid with this frequency along the direction, and constant perpendicular to the direction.

u

v( )vyuxie +−π

( )vyuxie +π

Slide credit: Bill Freeman

Here u and v are larger than in the previous slide.

u

v( )vyuxie +−π

( )vyuxie +π

Slide credit: Bill Freeman

And larger still...

u

v( )vyuxie +−π

( )vyuxie +π

Slide credit: Bill Freeman

Page 9: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

9

Phase and Magnitude

• Fourier transform of a real function is complex– difficult to plot, visualize– instead, we can think of the

phase and magnitude of the transform

• Curious fact– all natural images have

about the same magnitude transform

– hence, phase seems to matter, but magnitude the transform

• Phase is the phase of the complex transform

• Magnitude is the magnitude of the complex transform

, glargely doesn’t

• Demonstration– Take two pictures, swap

the phase transforms, compute the inverse - what does the result look like?

Slide credit: Bill Freeman Slide credit: Bill Freeman

This is the magnitude transform of the cheetah pic

Slide credit: Bill Freeman

This is the phase transform of the cheetah pic

Slide credit: Bill Freeman

Slide credit: Bill Freeman

This is the magnitude transform of the zebra pic

Slide credit: Bill Freeman

Page 10: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

10

This is the phase transform of the zebra pic

Slide credit: Bill Freeman

Reconstruction with zebra phase, cheetah magnitude

Slide credit: Bill Freeman

Reconstruction with cheetah phase, zebra magnitude

Slide credit: Bill Freeman

Example image synthesis with Fourier basis.

• Following are 16 images showing the reconstruction of an image from a random selection of Fourier basis functions.

• Note, the selection of basis functions to include was not made according to basis magnitude. Doing that would have given us an approximate version of the image much sooner.

Slide credit: Bill Freeman

2

Slide credit: Bill Freeman

6

Slide credit: Bill Freeman

Page 11: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

11

18

Slide credit: Bill Freeman

50

Slide credit: Bill Freeman

82

Slide credit: Bill Freeman

136

Slide credit: Bill Freeman

282

Slide credit: Bill Freeman

538

Slide credit: Bill Freeman

Page 12: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

12

1088

Slide credit: Bill Freeman

2094

Slide credit: Bill Freeman

4052.

4052

Slide credit: Bill Freeman

8056.

Slide credit: Bill Freeman

15366

Slide credit: Bill Freeman

28743

Slide credit: Bill Freeman

Page 13: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

13

49190.

Slide credit: Bill Freeman

65536.

Slide credit: Bill Freeman

Fourier transform magnitude

Slide credit: Bill Freeman

Masking out the fundamental and harmonics from periodic pillars

Slide credit: Bill Freeman

Forsyth&Ponce

Discrete-time, continuous frequency Fourier transform

Oppenheim, Schafer and Buck,Discrete-time signal processing,Prentice Hall, 1999

Page 14: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

14

Why is the Fourier domain particularly useful?

• It tells us the effect of linear convolutions.• There is a fast algorithm for performing the

DFT, allowing for efficient signal filtering.Th F i d i ff lt ti• The Fourier domain offers an alternative domain for understanding and manipulating the image.

Slide credit: Bill Freeman

Analysis of our simple filters

original0Pixel offset

coef

ficie

nt

1.0

Filtered(no change)(no change)

1

][][1

0

=

= ∑−

=

⎟⎠⎞

⎜⎝⎛−M

k

Mkmi

ekfmFπ

0

1.0 constant

Slide credit: Bill Freeman

Analysis of our simple filters

0Pixel offset

coef

ficie

nt

original

1.0

shifted

δ

Mmi

M

k

Mkmi

e

ekfmF

δπ

π

=

⎟⎠⎞

⎜⎝⎛−

=

= ∑

][][1

0

0

1.0

Constant magnitude, linearly shifted phase

Slide credit: Bill Freeman

Analysis of our simple filters

0Pixel offsetco

effic

ient

original

0.3

blurred

⎟⎟⎠

⎞⎜⎜⎝

⎛⎟⎠⎞

⎜⎝⎛+=

= ∑−

=

⎟⎠⎞

⎜⎝⎛−

Mm

ekfmFM

k

Mkmi

π

π

cos2131

][][1

0 Low-pass filter

0

1.0

Slide credit: Bill Freeman

Analysis of our simple filters

original0

2.0

0

0.33

sharpened

⎟⎟⎠

⎞⎜⎜⎝

⎛⎟⎠⎞

⎜⎝⎛+−=

= ∑−

=

⎟⎠⎞

⎜⎝⎛−

Mm

ekfmFM

k

Mkmi

π

π

cos21312

][][1

0 high-pass filter

0

1.0

2.3

Slide credit: Bill Freeman

Convolution versus FFT

• 1-d FFT: O(NlogN) computation time, where N is number of samples.

• 2-d FFT: 2N(NlogN), where N is number of pixels on a sidepixels on a side

• Convolution: K N2, where K is number of samples in kernel

• Say N=210, K=100. 2-d FFT: 20 220, while convolution gives 100 220

Slide credit: Bill Freeman

Page 15: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

15

Sampling and Reconstruction Sampling and Reconstruction

• Simple example: a sign wave

© 2006 Steve Marschner • 86

Undersampling

• What if we “missed” things between the samples?

• Simple example: undersampling a sine wavei i lt i f ti i l t

© 2006 Steve Marschner • 87

– unsurprising result: information is lost

Undersampling

• What if we “missed” things between the samples?

• Simple example: undersampling a sine wavei i lt i f ti i l t

© 2006 Steve Marschner • 88

– unsurprising result: information is lost

– surprising result: indistinguishable from lower frequency

Undersampling

• What if we “missed” things between the samples?

• Simple example: undersampling a sine wave– unsurprising result: information is lost

© 2006 Steve Marschner • 89

– surprising result: indistinguishable from lower frequency

– also was always indistinguishable from higher frequencies

– aliasing: signals “traveling in disguise” as other frequencies

Aliasing in images

Page 16: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

16

The Fourier transform of a sampled signal

F Sample2D f (x,y)( )( )= F f (x, y) δ(x − i,y − j)i=−∞

∑i=−∞

∑⎛ ⎝ ⎜

⎞ ⎠ ⎟

= F f (x,y)( )**F δ(x − i, y − j)i=−∞

∑i=−∞

∑⎛ ⎝ ⎜

⎞ ⎠ ⎟

= F u − i,v − j( )j=−∞

∑i=−∞

Slide credit: Bill Freeman

Image half-sizing

This image is too big tofit on the screen. Howcan we reduce it?

How to generate a halfHow to generate a half-sized version?

Image sub-sampling

Throw away every other row and column to create a 1/2 size image

- called image sub-sampling

1/4

1/8

Slide by Steve Seitz

Image sub-sampling

1/4 (2x zoom) 1/8 (4x zoom)

Aliasing! What do we do?

1/2

Slide by Steve Seitz

Page 17: 3 math overview - NYU Computer ScienceLecture 3 Overview Projects ... 3 1 2 [ ] [ ] 1 0 high-pass filter 0 1.0 2.3 Slide credit: Bill Freeman Convolution versus FFT • 1-d FFT: O(NlogN)

2/12/2008

17

Gaussian (lowpass) pre-filtering

G 1/4

G 1/8

Gaussian 1/2

Solution: filter the image, then subsample• Filter size should double for each ½ size reduction. Why?

Slide by Steve Seitz

Subsampling with Gaussian pre-filtering

G 1/4 G 1/8Gaussian 1/2

Slide by Steve Seitz

Compare with...

1/4 (2x zoom) 1/8 (4x zoom)1/2

Slide by Steve Seitz