39
VisualRank: Applying PageRank to Large- Scale Image Search Yushi Jing, Member, IEEE Shumeet Baluja, Member, IEEE IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, NOVEMBER 2008 [24] Y. Jing, S. Baluja, and H. Rowley, “Canonical Image Selection from the Web,” Proc. Sixth Int’l Conf. Image and Video Retrieval, pp. 280-287, 2007.

VisualRank : Applying PageRank to Large-Scale Image Search

  • Upload
    katy

  • View
    22

  • Download
    0

Embed Size (px)

DESCRIPTION

VisualRank : Applying PageRank to Large-Scale Image Search. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, NOVEMBER 2008. Yushi Jing, Member, IEEE Shumeet Baluja , Member, IEEE. - PowerPoint PPT Presentation

Citation preview

Page 1: VisualRank : Applying  PageRank  to Large-Scale Image Search

VisualRank: Applying PageRank to Large-Scale Image Search

Yushi Jing, Member, IEEEShumeet Baluja, Member, IEEE

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, NOVEMBER 2008

[24] Y. Jing, S. Baluja, and H. Rowley, “Canonical Image Selection from the Web,” Proc. Sixth Int’l Conf. Image and Video Retrieval, pp. 280-287, 2007.

Page 2: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction• Similarity graph[24]• PageRank & VisualRank• Hashing• Experiments• Conclusion

Page 3: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction• Similarity graph[24]• PageRank & VisualRank• Hashing• Experiments• Conclusion

Page 4: VisualRank : Applying  PageRank  to Large-Scale Image Search

Search for “d80” & “coca cola” by traditional search engine

Page 5: VisualRank : Applying  PageRank  to Large-Scale Image Search

Introduction

• Visual theme, ex: “coca cola” logo• CBIR: content-based image retrieval– Pure– Composite• “Visual-filter” via Probabilistic Graphical Models(PGMs)

[7]

• Compare:– Object category learner– image search engine

[7] R. Fergus, P. Perona, and A. Zisserman, “A Visual Category Filter for Google Images,” Proc. Eighth European Conf. Computer Vision, pp. 242-256, 2004.

Page 6: VisualRank : Applying  PageRank  to Large-Scale Image Search

Introduction

Page 7: VisualRank : Applying  PageRank  to Large-Scale Image Search

Introduction

• Combine[24]– pairwise visual similarity among images– nonvisual signals

• VisualRank– Based on PageRank– Large number of queries & images

• Goal– More accurate search ranking

Page 8: VisualRank : Applying  PageRank  to Large-Scale Image Search

introducton

Page 9: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction

• Similarity graph[24]• PageRank & VisualRank• Hashing• Experiments• Conclusion

Page 10: VisualRank : Applying  PageRank  to Large-Scale Image Search

Features generation

• Local descriptor– SIFT & compare[29]

[29] K. Mikolajczyk and C. Schmid, “A Performance Evaluation of Local Descriptors,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, Oct. 2005.

Page 11: VisualRank : Applying  PageRank  to Large-Scale Image Search

Similarity graph

• pairwise

Page 12: VisualRank : Applying  PageRank  to Large-Scale Image Search

Similarity graph

• Top 1000 results of “Mona-lisa”

Page 13: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction• Similarity graph[24]

• PageRank & VisualRank• Hashing• Experiments• Conclusion

Page 14: VisualRank : Applying  PageRank  to Large-Scale Image Search

PageRank

• Conception– Vote– eigenvector centrality A

B D C

PR(A) = PR(B) + PR(C) + PR(D)

Page 15: VisualRank : Applying  PageRank  to Large-Scale Image Search

PageRank

A

B

D

C

Page 16: VisualRank : Applying  PageRank  to Large-Scale Image Search

PageRank

q=0.15Random walk

Page 17: VisualRank : Applying  PageRank  to Large-Scale Image Search

PageRank

• Markov matrix

Page 18: VisualRank : Applying  PageRank  to Large-Scale Image Search

VisualRank

usually d>0.8

Page 19: VisualRank : Applying  PageRank  to Large-Scale Image Search

Link spam

• Well connected image V.S. VisualRank, “Nemo”

Page 20: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction• Similarity graph[24]• PageRank & VisualRank

• Hashing• Experiments• Conclusion

Page 21: VisualRank : Applying  PageRank  to Large-Scale Image Search

Matching

• Precluster– “Paris”, “Eiffel Tower”, and “Arc de Triomphe”

• Top-N, and compute VisualRank• Hashing– Locality Sensitive Hashing (LSH)– Feature descriptor as the key

Page 22: VisualRank : Applying  PageRank  to Large-Scale Image Search

Locality Sensitive Hashing (LSH)

• An approximate k-NN technique• Hash function:

– a is d-dimensional random vector– b is real number from range– W defines the quantization of the features– V is the original feature vector

Page 23: VisualRank : Applying  PageRank  to Large-Scale Image Search

Flow(1/3)

1. Resize 500*500 pix, 1000 web images 3000,000 to 700,000 feature vectores

2. L hash table H=H1, H2,…,HL, each with K hash functions, L=40, W=100, K=3

Page 24: VisualRank : Applying  PageRank  to Large-Scale Image Search

Flow(2/3)

3. Matched descriptor– Have same key more than C=3 hash table

4. Hough Transform

Page 25: VisualRank : Applying  PageRank  to Large-Scale Image Search

Flow(3/3)

5. Similarity– Matched images• More than 3 features

– no. of matches divide by their avg. number of local features

6. Given similarity matrix S, and use VisualRank

Page 26: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction• Similarity graph[24]• PageRank & VisualRank• Hashing

• Experiments• Conclusion

Page 27: VisualRank : Applying  PageRank  to Large-Scale Image Search

Experiments

• 2,000 most popular product queries on Google, ex: “ipod”, “Xbox”

• the top 1,000 search results each query in July 2007 Google

• Filter– Fewer than 5% images at least one connection– Remaining 1,000 queries

Page 28: VisualRank : Applying  PageRank  to Large-Scale Image Search

Experiment 1

• Evaluate– “irrelevancy” of our ranking

• Mixed Top 10 VisualRank & top 10 google Remove duplicates and ask “which are least relevant?”

• Ask 150 evaluators, randomly 50 queries

Page 29: VisualRank : Applying  PageRank  to Large-Scale Image Search
Page 30: VisualRank : Applying  PageRank  to Large-Scale Image Search
Page 31: VisualRank : Applying  PageRank  to Large-Scale Image Search
Page 32: VisualRank : Applying  PageRank  to Large-Scale Image Search
Page 33: VisualRank : Applying  PageRank  to Large-Scale Image Search
Page 34: VisualRank : Applying  PageRank  to Large-Scale Image Search

Experiment 2

• VisualRankbias,

• pT=VjT=[1/m, …, 1/m, 0, …, 0]

• HeuristicRank – a pure CBIR system

j

Page 35: VisualRank : Applying  PageRank  to Large-Scale Image Search
Page 36: VisualRank : Applying  PageRank  to Large-Scale Image Search

Experiment 3

• Collected 40 top images each click numbers from google

• Compare – Sum of VisualRank top 20 click numbers– Sum of default ranking top 20 click numbers

• VisualRank exceeds 17.5% than default Google ranking

Page 37: VisualRank : Applying  PageRank  to Large-Scale Image Search

Landmarks

• 80 common landmark, ex: “Eiffel Tower,”“Big Ben,” “Coliseum,” and “Lincoln Memorial.”

Page 38: VisualRank : Applying  PageRank  to Large-Scale Image Search

Outline

• Introduction• Similarity graph[24]• PageRank & VisualRank• Hashing• Experiments

• Conclusion

Page 39: VisualRank : Applying  PageRank  to Large-Scale Image Search

Conclusion

• VisualRank applying PageRank conception and combined – Default Google ranking– similarity graph between images

• VisualRank can outperform the default Google on the vast majority of queries

• Reduce the number of irrelevant images efficiently