34
Richer Human-Machine Communication in Attributes-based Visual Recognition Devi Parikh TTIC

Richer Human-Machine Communication in Attributes-based Visual Recognition

  • Upload
    season

  • View
    31

  • Download
    0

Embed Size (px)

DESCRIPTION

Richer Human-Machine Communication in Attributes-based Visual Recognition. Devi Parikh TTIC. Traditional Recognition. Dog. Chimpanzee. Tiger. ???. Attributes-based Recognition. Furry White. Black Big. Stripped Yellow. Stripped Black White Big. Dog. Chimpanzee. Tiger. - PowerPoint PPT Presentation

Citation preview

Page 1: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Richer Human-Machine Communication in Attributes-based Visual Recognition

Devi ParikhTTIC

Page 2: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Traditional Recognition

Dog Chimpanzee Tiger ???

Page 3: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attributes-based Recognition

FurryWhite

BlackBig

StrippedYellow

StrippedBlackWhite

BigTigerChimpanzeeDog

Page 4: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Applications

Zebra

A Zebra is…WhiteBlack

Stripped

Zero-shot learning

Image description

StrippedBlackWhite

Big

Attributes provide a mode of

communication between humans and

machines!

Page 5: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Agenda

Enriching the mode of communication

• Nameable and Discriminative Attributes(to appear CVPR 2011)

• Relative Attributes(under review)

Kristen Grauman

Page 6: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attributes

Attributes are most useful if they are• Discriminative• Nameable

Approaches Discriminative Nameable

Page 7: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attributes

Attributes are most useful if they are• Discriminative• Nameable

Approaches Discriminative NameableHand-

generatedMaybe not Yes

Page 8: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attributes

Attributes are most useful if they are• Discriminative• Nameable

Approaches Discriminative NameableHand-

generatedMaybe not Yes

Mining the web Maybe not Yes

Page 9: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attributes

Attributes are most useful if they are• Discriminative• Nameable

Approaches Discriminative NameableHand-

generatedMaybe not Yes

Mining the web Maybe not YesAutomatic splits Yes Maybe

not

Page 10: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attributes

Attributes are most useful if they are• Discriminative• Nameable

Approaches Discriminative NameableHand-

generatedMaybe not Yes

Mining the web Maybe not YesAutomatic splits Yes Maybe

notProposed Yes Yes

Page 11: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Interactive system1. Name: Fluffy2. Name: x3. Name: Metal…

How do we show the user a candidate-attribute?How do we ensure proposals are discriminative?

How do we ensure proposals are nameable?

Page 12: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attribute visualization

Page 13: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Attribute Visualization

Page 14: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Ensure Discriminability

Normalized cuts

Max Margin Clustering

Page 15: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Ensure Nameability1. Name: Fluffy2. Name: x3. Name: Metal…

Page 16: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Ensure Nameability1. Name: Fluffy2. Name: x3. Name: Metal…

Mixture of Probabilistic PCA

Page 17: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Interactive System

Page 18: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Outdoor Scenes • Animals with Attributes• Public Figures Face

• Gist and Color features (LDA)

Page 19: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Interactive System

Page 20: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Annotate all candidates off-line

“Black”

… ~25000 responses

Page 21: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Annotate all candidates off-line

“Spotted”

… ~25000 responses

Page 22: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Annotate all candidates off-line

Unnameable

… ~25000 responses

Page 23: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Annotate all candidates off-line

“Green”

… ~25000 responses

Page 24: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Annotate all candidates off-line

“Congested”

… ~25000 responses

Page 25: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Evaluation

• Annotate all candidates off-line

“Smiling”

… ~25000 responses

Page 26: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Results

Our active approach discovers more discriminative splits than baselines

Structure exists in nameability space allowing for prediction

Page 27: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Results

Comparing to discriminative-only baseline

Page 28: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Results

Comparing to descriptive-only baseline

Page 29: Richer Human-Machine Communication  in Attributes-based Visual Recognition

ResultsAutomatically generated descriptions

Page 30: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Summary

• Machines need to understand us– Attributes need to be detectable & discriminative

• We need to understand machines– Attributes need to be nameable

• Interactive system for discovering attributes

• Relative Attributes• More precise communication– Helps machines (zero-shot learning)– Helps humans (image descriptions)

Page 31: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Relative Attributes

Page 32: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Summary

• Machines need to understand us– Attributes need to be detectable & discriminative

• We need to understand machines– Attributes need to be nameable

• Interactive system for discovering attributes

• Relative Attributes• More precise communication– Helps machines (zero-shot learning)– Helps humans (image descriptions)

Page 33: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Human-Debugging

Larry Zitnick

(CVPR 2008, 2010, 2011, under review, in progress)

Page 34: Richer Human-Machine Communication  in Attributes-based Visual Recognition

Thank you.