58
1 Amihood Amir Bar-Ilan University and Georgia Tech UWSL 2006

Theoretical Issues of Searching Aerial Photographs: a bird's eye view

  • Upload
    enya

  • View
    61

  • Download
    2

Embed Size (px)

DESCRIPTION

Theoretical Issues of Searching Aerial Photographs: a bird's eye view. Amihood Amir. Bar-Ilan University and Georgia Tech. UWSL 2006. Issues of Concern:. Local Errors: - Occlusion - Transmission and resolution - Details Scaling Rotation Integration of all above issues. - PowerPoint PPT Presentation

Citation preview

Page 1: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

1

Amihood Amir

Bar-Ilan UniversityandGeorgia Tech UWSL 2006

Page 2: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

2

Page 3: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

3

Issues of Concern: Local Errors:

- Occlusion - Transmission and resolution

- Details Scaling Rotation Integration of all above issues

Page 4: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

4

It seems daunting, but…

Page 5: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

5

CPM 2003: Morelia, Mexico

Page 6: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

6

Some History… String Matching –

motivated by text editing.

over alphabet

ntttT 10

mpppP 10

Page 7: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

7

Historic Two Dimensional Model:

nnnn

n

n

ttt

tttttt

T

,1,0,

,11,10,1

,01,00,0

mmmm

m

m

ppp

pppppp

P

,1,0,

,11,10,1

,01,00,0

.,...,0,;,...,0,;, ,, mlknjipt lkji

Page 8: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

8

Bird-Baker Algorithm (1976) Time: for bounded fixed alphabets.

for infinite alphabets. Technique: linearization.

)( 2nO

)log( 2 mnO

Page 9: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

9

Linearization Concatenate rows of Text (or

pattern) and use string matching tools.

In this case – The Aho and Corasick algorithm.

Page 10: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

10

Find all pattern rows…then align them.

Page 11: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

11

Another linearization-pad with “don’t cares”

n-mmTime: Fischer-Paterson (1972))log( 2 mnO

Page 12: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

12

Advantages and Disadvantages of Model Pros: Can use known techniques.

Cons: - Complexity degradation (e.g. extra

log factor in exact matching). - Inherent difficulties in definitions

(will be addressed later).

Page 13: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

13

First Truly 2d Algorithm – The Dueling Method Idea: Assume the situation is: All potential pattern “starts” agree on overlap. A i.e. all want to see

the same symbol in every text location.

(A-Benson-Farach 1991)

Page 14: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

14

Dueling Method … Time for checking every text element’s

correctness: linear.

Every candidate with incorrect element in its range is eliminated.

Method: The “wave”.

Total Time: )( 2nO

Page 15: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

15

Dueling Method…How do we arrange for candidates to agree on overlap? – duel!

A A A A A A AA A A A A A AA A A A A A AA A A A V A AA A A A A A AA A A A A A A

A A A A A A AA A A A A A AA A A A A A AA A A A V A AA A A A A A AA A A A A A A

When there is conflict between two candidates, a single text check eliminates at least one candidate.

The text location can be pre-computed because of transitivity.

The dueling phase is thus linear time.

Page 16: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

16

Discrete Scaling (A-Landau- Vishkin 1990)

In our limited model, the meaning of scaling is“blowing up” a symbol.

Example: scaling a symbol A by 3, means a 3x3 matrix

X X X X X XX X X X X XX X O O X XX X O O X XX X X X X XX X X X X X

X X XX O XX X X

A A AA A AA A A

Scaling the matrix by 2 gives:

Page 17: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

17

X X XX O XX X X

Scaled Occurrences of Pattern in Text:

X X X X X X X X X X X X X X X X

X X X X X X X X X X X X X X X X X XX X X X X X X X X O X X X X X X X XX X O O X X X X X X X X X X X X X XX X O O X X X X X X X X X X X X X XX X X X X X X X X X X X X X X X X XX X X X X X X X X X X X X X X X X XX X X X X X X X X X X X X X X X X XX X X X X X X O O O X X X X X X X XX X X X X X X O O O X X X X X X X XX X X X X X X O O O X X X X X X X XX X X X X X X X X X X X X X X X X XX X X X X X X X X X X X X X X X X XX X X X X X X X X X X X X X X X X XX X X X X X X X X X X X X X X X X X

Scale 1Scale 2

X X X X X XX X X X X XX X O O X XX X O O X XX X X X X XX X X X X X

Scale 3

X X X X X X X X XX X X X X X X X XX X X X X X X X XX X X O O O X X XX X X O O O X X XX X X O O O X X XX X X X X X X X XX X X X X X X X XX X X X X X X X X

Page 18: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

18

Discrete Scaling Algorithms A-Landau-Vishkin 90: Can find all

discrete scales of pattern in linear time (alphabet dependent).

A-Calinescu 94: Alphabet independent and dictionary linear-time discrete scaling algorithm.

Page 19: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

19

Tools used: For comparing substrings in constant time: Suffix trees and LCA or Weiner 1973, Harel-Tarjan 1984

Suffix arrays and LCP. Kärkkäinen-Sanders 2003 For computing number of sub-row

repetitions in constant time: Range-Minimum queries. Gabow-Bentley-Tarjan 1984

Page 20: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

20

How is it used?Do LCA query to find out that the orange line occurs here

How many times does this line repeat?

How is this done?

Page 21: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

21

Construct an array of numbers where every location is the length of the LCP of this row and the next

k0kkkkkkkkk00

To make sure that the orange line appears in this range, the minimum number in this range has to be greater than k.

Page 22: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

22

How do we know what scale the orange line has? Run-length compression.

Find the symbol part, then the repetition factor.

This idea led to the compressed matching paradigm…

AAABBCCCCDAAAABBBBBBCA

A B C DA B CA 3 2 4 1 4 6 1 1

Page 23: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

23

Compressed Matching Suppose the text (and pattern?) are

compressed. Examples: run-length of rows (fax). LZ78 of rows (gif).

Find pattern in text without decompressing.

A-Benson 92, A-Benson-Farach 94, A-Landau-Sokol 03(x2)

This led to a decade of work in the stringology and data compression community.

Page 24: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

24

Compressed Matching (very partial list from citeseer…) Pattern Matching in Compressed Raster Images -

Pajarola,Widmayer (1996) Direct Pattern Matching on Compressed Text - de Moura

, Navarro, Ziviani (1998) A General Practical Approach to Pattern Matching over.. - Navarro

, Raffinot (1998)

Randomized Efficient Algorithms for Compressed Strings: the.. - Gasieniec, al. (1996)

Approximate String Matching over Ziv-Lempel Compressed Text - Kärkkäinen, Navarro, Ukkonen (2000)

Pattern Matching Machine for Text Compressed Using Finite State.. - Takeda (1997)

…  

Page 25: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

25

Model Deficiencies. How do we scale to non-discrete

sizes? (e.g. 1.35)

How do we model rotations?

Page 26: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

26

A Model of Digitization (Landau-Vishkin 1994)

“Real-Life” resolution is fine enough to be assumed continuous.

This is dealt with by a discrete sampling of space done by, e.g. the camera.

Digitized sample

“Real life”

Page 27: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

27

Rotation (Fredriksson-Ukkonen 1998) Consider the text as a grid of pixels, each

having a color. Consider the pattern as an m x m grid of

pixels with colors. Assume the center of every pattern pixel

has a “hole”. Lay the pattern grid on the text, with the

center declared the “rotation pivot”.

Page 28: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

28

7 6 5 4 3 2 1 0

7 6 5 4 3 2 1 0

T[1,1] T[1,2] T[1,3]

T[2,1] T[2,2] T[2,3]

T[3,1] T[3,2] T[3,3]

T[5,4]

T[7,7]

7x7 text

Page 29: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

29

4 3 2 1 0

4 3 2 1 0

The rotation pivot

4x4 pattern

Page 30: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

30

7 6 5 4 3 2 1 0 8

7 6 5 4 3 2 1 0 8

45O

4x4 pattern over 8x8 text in location

)45),4,3(( 0

Page 31: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

31

7 6 5 4 3 2 1 0 8

7 6 5 4 3 2 1 0 8

Page 32: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

32

7 6 5 4 3 2 1 0 8

7 6 5 4 3 2 1 0 8

Page 33: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

33

7 6 5 4 3 2 1 0 8

7 6 5 4 3 2 1 0 8

Page 34: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

34

7 6 5 4 3 2 1 0 8

7 6 5 4 3 2 1 0 8

Page 35: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

35

7 6 5 4 3 2 1 0 8

7 6 5 4 3 2 1 0 8

Page 36: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

36

Rotated Matching Algorithms Fredrikkson-Ukkonen 1998: Filter. Good

expected time. worst case. Fredrikkson-Navarro-Ukkonen 2000: A-Butman-Crochemore-Landau-Schaps

2004: Proved that output size is

A-Kapah-Tsur 2004:

)( 52mnO

)( 32mnO

)( 22mnO)( 32mnO

Page 37: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

37

A Taste of handling Rotations

16151413121110987654321

16,15141211109,13

8,476532,1

12161514811101347693215

Naïve Idea: Try all possible rotated patterns. Examples:

161215148111013

4769325

1

Original 19 rotation 21 rotation 26 rotationo o o

Page 38: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

38

Proposed Solution Every rotated pattern can be found in

the text using FFT in time

If there are N rotated patterns the total time is

N What is N?

)log( 2 mnO

)log( 2 mnO

Page 39: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

39

Upper Bound There are pixels.

Each pixel center crosses at most grid lines.

Therefore there are different rotated patterns.

2m

m4

)( 3mO

Page 40: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

40-8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8-8

-7

-6

-5

-4

-3

-2

-1

0

1

2

3

4

5

6

7

8

O

Page 41: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

41-8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8-8

-7

-6

-5

-4

-3

-2

-1

0

1

2

3

4

5

6

7

8

O

Page 42: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

42-8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8-8

-7

-6

-5

-4

-3

-2

-1

0

1

2

3

4

5

6

7

8

O

Page 43: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

43-8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8-8

-7

-6

-5

-4

-3

-2

-1

0

1

2

3

4

5

6

7

8

O

Page 44: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

44

Lower BoundCould many points cross a gridline together?

We will show: Lower Bound:

Restriction:

We consider only points in set P defined as follows.

)( 3m

Page 45: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

45

Our Subset of Consideration:P is a subset of pattern coordinates Such that:1) The coordinates are in quadrant I

I2) The coordinates are only the points (x,y) where x and y are co-prime

Page 46: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

46

Key Lemma (A-Butman-Crochemore-Landau-Schaps 2003)

PXX 21,

2X

it is impossible that

and cross a grid line at

the same rotation angle.

1X

Page 47: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

47-8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8-8

-7

-6

-5

-4

-3

-2

-1

0

1

2

3

4

5

6

7

8

X1Y1

X2

Y2

O Z

Page 48: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

48

How does it help?

)log(6|||| 2

2

mmomP

Theorem (Geometry):

i.e. )(|||| 2mP

Page 49: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

49

4,0,),(|),(4

myxPyxyxP m

P

4mPP

Consider

Schematically: shaded area.

In shaded area there are points.

So in there are at least

points, i.e.

points.

16

2m

166 2

2

2 mm

)(16

96 222

2

mm

Page 50: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

50

Each of the points in

(the yellow area) crosses

the grid times and no two

of them cross together.

Conclude: There are

different rotated patterns.

)( 2m

4mPP

)(m

)( 3m

Page 51: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

51

Real Scaled Matching (A-Butman-Lewentein-Porat 2003, A-Chencinsky 2006) Assume the text and pattern grids

are the unit scale. A scale up of the pattern increases

the grid. The center of the underlying unit

grid takes the color of the scaled pattern pixel under it.

Page 52: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

52

Pattern

Pattern scaled continuously to 1.6

Pattern scaled continuously to 1.6 with superimposed unit grid

Pattern discretely scaled to 1.6

Page 53: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

53

Does This work?

We tried it on “Lenna”…

Page 54: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

54

Scale 1.3 Lenna

Original Lenna

Scale 2 Lenna

Page 55: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

55

Lenna Today

Page 56: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

56

Algorithm’s running time For text size n x n

and

pattern size m x m:

)( 22mnO

Page 57: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

57

The Future? Faster rotation: did not utilize

pattern, did not utilize neighboring information.

Faster scaling. The holy grail – INTEGRATION. Compressed Matching: lossy

compressions.

Page 58: Theoretical Issues of Searching  Aerial Photographs:  a bird's eye view

58

THANK YOU