43
Learning object affordances based on structural object representation Kadir F. Uyanik Asil Kaan Bozcuoglu EE 583 Pattern Recognition Jan 4, 2011

Learning Object Affordances Based on Structural Object Representation

Embed Size (px)

Citation preview

Page 1: Learning Object Affordances Based on Structural Object Representation

Learning object affordances based on structural object representation

Kadir F. UyanikAsil Kaan Bozcuoglu

EE 583 Pattern RecognitionJan 4, 2011

Page 2: Learning Object Affordances Based on Structural Object Representation

Content• Goal• Inspirations• Potential Difficulties• Problem Definition• Proposed Method• References• Appendix

Page 3: Learning Object Affordances Based on Structural Object Representation

Goal

Page 4: Learning Object Affordances Based on Structural Object Representation

Goal

Page 5: Learning Object Affordances Based on Structural Object Representation

Goal

Page 6: Learning Object Affordances Based on Structural Object Representation

Goal

Page 7: Learning Object Affordances Based on Structural Object Representation

InspirationsEcological Psychologist James Jerome Gibson

1904 -1979

Cognitive PsychologistIrving Biederman

1939 -

Page 8: Learning Object Affordances Based on Structural Object Representation

Inspirations:Affordances[1]

[1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN 0-470-99014-7.[2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur , G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior , 2007 pp: 447-472

“… an affordance is neither an objective property nor a subjective property; or both if you like. An affordance cuts across the dichotomy of subjective-objective and helps us to understand its inadequacy. It is equally a fact of the environment and a fact of behavior. It is both physical and psychical, yetneither. An affordance points both ways, to the environment and to the observer.”

Page 9: Learning Object Affordances Based on Structural Object Representation

Inspirations:Affordances[1]

[1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN 0-470-99014-7.[2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur , G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior , 2007 pp: 447-472

“… an affordance is neither an objective property nor a subjective property; or both if you like. An affordance cuts across the dichotomy of subjective-objective and helps us to understand its inadequacy. It is equally a fact of the environment and a fact of behavior. It is both physical and psychical, yetneither. An affordance points both ways, to the environment and to the observer.”

Page 10: Learning Object Affordances Based on Structural Object Representation

Inspirations:Affordances[1]

[1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN 0-470-99014-7.

<entity> <behavior>

<effect>

environment agent

(<effect>, <(entity, behavior)>)Revised Definition: An affordance is an acquired relation between a <(entity, behavior)> tuple of an agent such that the application of the <behavior> on the <entity> generates a certain <effect>[2].

[2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur , G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior , 2007 pp: 447-472

“… an affordance is neither an objective property nor a subjective property; or both if you like. An affordance cuts across the dichotomy of subjective-objective and helps us to understand its inadequacy. It is equally a fact of the environment and a fact of behavior. It is both physical and psychical, yetneither. An affordance points both ways, to the environment and to the observer.”

Page 11: Learning Object Affordances Based on Structural Object Representation

[3] Recognition-by-components: A theory of Human Image Understanding, Psychological Review, Vol. 94 (1987), pp. 115-148

“There are small number of geometric components that constitute the primitive elements of the object recognition system (like letters to form words)”

Inspirations:Human Image Understanding[3]

Page 12: Learning Object Affordances Based on Structural Object Representation

[3] Recognition-by-components: A theory of Human Image Understanding, Psychological Review, Vol. 94 (1987), pp. 115-148

“There are small number of geometric components that constitute the primitive elements of the object recognition system (like letters to form words)”

Inspirations:Human Image Understanding[3]

Page 13: Learning Object Affordances Based on Structural Object Representation

Potential Difficulties[4]

• Structural description not enough, also need metric info

[4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Page 14: Learning Object Affordances Based on Structural Object Representation

Potential Difficulties[4]

• Structural description not enough, also need metric info

• Difficult to extract geons from real images

[4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Page 15: Learning Object Affordances Based on Structural Object Representation

Potential Difficulties[4]

• Structural description not enough, also need metric info

• Difficult to extract geons from real images

• Ambiguity in the structural description: most often we have several candidates

[4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Page 16: Learning Object Affordances Based on Structural Object Representation

Potential Difficulties[4]

• Structural description not enough, also need metric info

• Difficult to extract geons from real images

• Ambiguity in the structural description: most often we have several candidates

• For some objects, deriving a structural representation can be difficult

[4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Page 17: Learning Object Affordances Based on Structural Object Representation

Problem Definition

HOW TO• decompose objects into parts/components ?• find relations between components ?• find a generic graph representation of an

<action-entity-effect> three tuple ?

Page 18: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 19: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 20: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 21: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 22: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 23: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 24: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 25: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 26: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 27: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 28: Learning Object Affordances Based on Structural Object Representation

Object Decomposition

Proposed Algorithm

Page 29: Learning Object Affordances Based on Structural Object Representation

Object DecompositionWhat is missing?

•Use/try different clustering algorithms

•Triangulate 3D surfaces, Delaunay

• Compute gaussian curvature on each vertex

• Detect region boundaries, curvature thresholding

•Perform iterative region growing, flood fill

Page 30: Learning Object Affordances Based on Structural Object Representation

Graphical Representation

• We represent each objects in non-directed graphs as follows:– Each node has the info of geometric

shape of the part– Each edge has the information of

direction of edge for three axises, i.e from node1 to node2, x axis increases.

Page 31: Learning Object Affordances Based on Structural Object Representation

Graphical Representation

Similarity Checking

[isIsomorphic, label_list]= check_Isomorphism(G1, G2)If isIsomorphic

Check geometric shapes of same labeled nodes in two graphsCheck direction of equivalent edges in both graphsIf both are matched, return trueElse return false

Else return false

Page 32: Learning Object Affordances Based on Structural Object Representation

Isomorphism check: Two candidates: - n1 = n6, n2 = n4, n3 = n5 (Attributes matched!) - n1 = n4, n2 = n6, n3 = n5 (Attributes isn’t matched)

Graphical Representation

Similarity Checking

Page 33: Learning Object Affordances Based on Structural Object Representation
Page 34: Learning Object Affordances Based on Structural Object Representation

Current System• 80% is successful • Assumes no occlusion.

– For the cup case, handles should always be visible

• Needs metric info to distinguish bigger objects from small ones

Page 35: Learning Object Affordances Based on Structural Object Representation

One way to go…• Learning a generic graph for each affordance type.• Checking the maximal- cliques of the match graph while comparing graph

of an object and a generic graph.• Mahalanobis distance metric for generic graphs and use MLE

Page 36: Learning Object Affordances Based on Structural Object Representation

Tools

Page 37: Learning Object Affordances Based on Structural Object Representation

References

[1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN 0-470-99014-7.[2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur , G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior , 2007 pp: 447-472[3] Recognition-by-components: A theory of Human Image Understanding, Psychological Review, Vol. 94 (1987), pp. 115-148[4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Page 38: Learning Object Affordances Based on Structural Object Representation

Thanks for listening

Page 39: Learning Object Affordances Based on Structural Object Representation

Appendix

Page 40: Learning Object Affordances Based on Structural Object Representation

Human Image Understanding• Hypothesis: small number of geometric components that constitute

the primitive elements of the object recognition system (like letters to form words)

• Geons are directly recognized from edges, based on their nonaccidental properties (i.e., 3D features that are usually preserved by the projective imaging process).

– edges are straight or curved– pairs of edges are parallel or non-parallel– vertices will always appear to be vertices

• Non-accidental properties allows geons to be recognized from any perspective.

• The information in the geons are redundant so that they can be recognized even when partially occluded.

Page 41: Learning Object Affordances Based on Structural Object Representation

AppendixThe Importance of spatial arrangement

Page 42: Learning Object Affordances Based on Structural Object Representation

AppendixThe Principal of non-accidentalness

Examples:

• Colinearity

• Smoothness

• Symmetry

• Parallelism

• Cotermination

Page 43: Learning Object Affordances Based on Structural Object Representation

AppendixSome non-accidental differences