11
1 人間の動作のモデル化とロボットへの応用 Part 2 中村 仁彦 機械情報工学科(機械B) 東京大学 情報技術論 2011.04.25 index symbol icon Evolution / Development C.S. Peirce (1839-1914) index Proto Symbol Space (a set of HMM) Natural Language Processing Dic?onary Corpus Statistical System Technological Realization association through luxurious linguistic knowledge Linguistics Phonology 音韻論 Morphology 形態論 Syntax 統語論 Semantics 意味論 Pragmatics 語用論 Machine Translation Natural Language Processing Machine Translation of Languages (MIT Press 1955 translation) Warren Weaver 1949 applying statistical and cryptanalytic techniques ‘’The Mathematics of Statistical Machine Translation: Parameter Estimation. ’’ P.F. Brown, S.S Della Pietra, V.J.Della Pietra, R.L. Mercer (IBM T.J. Watson Research Center ) Computer Linguistics, Volume 19, No.2, 263-311, 1993. The IBM Translation Model

人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

Embed Size (px)

Citation preview

Page 1: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

1

人間の動作のモデル化とロボットへの応用  Part  2

中村 仁彦

機械情報工学科(機械B) 東京大学

情報技術論 2011.04.25

index  

symbol

icon

Evolution / Development

C.S. Peirce (1839-1914)

index

Proto  Symbol  Space  (a  set  of  HMM)

Natural  Language  Processing

Dic?onary  Corpus

Statistical System

Technological Realization

association through

luxurious linguistic

knowledge

Linguistics

Phonology 音韻論 Morphology 形態論

Syntax 統語論 Semantics 意味論 Pragmatics 語用論

Machine Translation

Natural Language Processing

“Machine Translation of Languages “ (MIT Press 1955 translation) Warren Weaver 1949 applying statistical and cryptanalytic techniques

‘’The Mathematics of Statistical Machine Translation: Parameter Estimation. ’’ P.F. Brown, S.S Della Pietra, V.J.Della Pietra, R.L. Mercer (IBM T.J. Watson Research Center ) Computer Linguistics, Volume 19, No.2, 263-311, 1993.

The IBM Translation Model

Page 2: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

2

Landauer, T. K. & Dumais, S. T. (1997). A solution to Plato's problem: The Latent Semantic Analysis theory of acquisition, induction and representation of knowledge. Psychological Review, 104, 211-240.

"Latent Semantic Analysis (LSA)” a high dimensional linear associative model to analyze a large corpus of natural text and generate a representation that captures the similarity of words and text passages.

"Plato's Problem" the term given by Noam Chomsky. It presents the question of how we account for our knowledge when environmental conditions seem to be an insufficient source of information. Noam Chomsky, "Barriers", MIT Press, 1986.

The Proto-Symbol Space

Motion recognition / abstract

Motion G

eneration 6

4

1

Discrete HMM 3

Proto-symbol

Motion elements

joint angles

Motion

2 5

Mathematical Model of Mirror Neuron

Continuous HMM

Walk Run

[Inamura et al. 02] Hidden Markov Model

1q 2q q NqN−1

11a

12a

22a

NNa 1−

}{ 21 ΜoooO =

)(1 ob )(2 ob

)}(,{ oba iij=λParameter of HMM :state transition probabilities

),; ()(1

jjij

M

jiji ocob Σ=∑

=

µN

)(obi :output probabilities

ijaProto-symbol

Continuous

Page 3: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

3

Relationship between proto-symbols

Kullback-Leibler information as pseudo-distance

Definition of distance between proto-symbols

Similarity measure between probability density functions

( ) ( )[ ]∑ −=i

TT

n

ii ypypTn

D 211121 |log|log11),( λλλλ

Kullback-Leibler information

{ }),(),(21

1221 λλλλ DDDs +=

Run Walk Transition in the proto-symbol space

Experiment 1 : Behavior Recognition

walk kick

Deforming Proto-Symbol Space

W. Takano, H. Tanie, and Y. Nakamura, “Key Feature Extraction for Probabilistic Categorization of Human Motion Patterns,” Proc. of IEEE International Conference on Advanced Robotics, pp.424-430, 2005.

16 Motion Categories

walk, raise, wave, kick, bow, sit, crawl1,crawl2 stand, primitive walk, rise, run, applause, lie,

open and close legs, standing up right

Each motion category includes 100 motion patterns

1600 motion patterns are used and proto symbols of the samenumber can be acquired.

Motion patterns applied with proto symbol space

Page 4: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

4

Proto Symbol Space Based on Joint Angles

Sit,Lie Standing

4 DOF in each arm, 6 DOF in each leg, position (x,y,z) of a body, attitude matrix Sequence of 32 dimensional data

Proto Symbol Space Based on Cartesian Coordinates

x,y,z coordinates of 3 joints in each arm, 3 joints in each leg, and body

Sequence of 39 dimensional data

Clusters of proto symbols seem to be located distributed

Proto Symbol Space in Joint-Angle and Cartesian Coordinates

Cluster analysis is implemented using probability density functions that approximate the distribution of proto symbols

Joint Angles

Cartesian Coordinates

Optimization of Weights

page 16

central point of the cluster is defined as the target point for each proto symbol corresponding to the cluster

Optimization of weights by the steepest descent method such that each proto symbol is placed at the target point in the space

Page 5: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

5

Result of Cluster Analysis

Initial State Optimized state

Clusters are distributed by optimizing the weights

Extraction of Key Features

Raising right hand

Waving left hand

Kicking with right leg

Applauding

A body B left shoulder C left elbow D left hand E right shoulder F right elbow G right hand H left hip I left knee J left ankle K right hip L right knee M right ankle The weights indicate the parts

that characterize the motion patterns

Extracting motion modifier of whole body motion patterns based on information geometric representation Hideharu Suzuki MS thesis, March 2008 Department of Mechano-Informatics University of Tokyo Advisor: Y. Nakamura

Tangential Property of Proto-Symbol Space

Information Geometry

abstracted (punch)

modifier (large pumch)

Tangent vector

geodedic

Parametric space

(kick)

(walk)

Proto-symbol space

Mapping to tangent spaces

large(punch) large(kick)

・modifier = local tangent ・conceptual modifier = common properties from many local tangents

Page 6: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

6

Conceptual Modifier (1)

Basic Motion

Modified by modifiers

Conceptual Modifier (2)

Basic Motion

Modified by modifiers

Conceptual Modifier (3)

Basic Motion

Modified by modifiers

W. Takano, K. Yamane, and Y. Nakamura, “Capture Database through Symbolization, Recognition and Generation of Motion Patterns,” Proc. of IEEE International Conference on Robotics and Automation, pp.3092-3097, 2007. W. Takano, D. Kulic, and Y. Nakamura, “Interactive Topology Formation of Linguistic Space and Motion Space,” Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 1416 -- 1422, 2007.

Deformation of Proto-Symbol Space by Word Labeling

Page 7: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

7

The hierarchical structure of motion symbols and words

Machine Translation

Natural Language Processing

“Machine Translation of Languages “ (MIT Press 1955 translation) Warren Weaver 1949 applying statistical and cryptanalytic techniques

‘’The Mathematics of Statistical Machine Translation: Parameter Estimation. ’’ P.F. Brown, S.S Della Pietra, V.J.Della Pietra, R.L. Mercer (IBM T.J. Watson Research Center ) Computer Linguistics, Volume 19, No.2, 263-311, 1993.

The IBM Translation Model

The number of word labels : 64

Motion capture corpus related to baseball

Capture sampling : 33ms The number of motion data:537

The number of markers : 34

The number of symbols : 50 Total length of data : 4088s

Motion Database Specification

“diving stand_up right_throw_pose standing”

Constructed word space

Page 8: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

8

Reconstructed symbol space

Distances among symbols become larger. This means that discrimination can be improved.

“left_swing run”

Experimental result (motion search)

W. Takano and Y. Nakamura, “Integrating Whole Body Motion Primitives and Natural language for Humanoid Robots”, Proc. of IEEE-RAS International Conference on Humanoid Robots, pp.708-713, 2008. W. Takano and Y. Nakamura, “Statistically Integrated Semiotics that Enables Mutual Inference Between Linguistic and Behavioral Symbols for Humanoid Robots,” Proc. of IEEE International Conference on Robotics and Automation, 2009. W.Takano and Y. Nakamura, “Incremental Learning of Integrated Semiotics Based on Linguistic and Behavioral Symbols,” Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.2545-2550, 2009.

Where motion meets language

Research Purpose

Natural language model for Semantics and syntaxs based on text corpus

Robot can make inferences by using rich linguistic knowledge

Page 9: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

9

Research Purpose Linguistic processing by integrating behavioral symbolization model and natural language models for humanoid robots.

・Behavioral symbolization based on HMMs is stochastic graphical framework. ・Natural language model for morphological analysis is also similar stochastic structured. ex. ・ CHASEN [K. Takeuchi, Y. Matsumoto 1995] :HMM. ・ MECAB [T. Kudo, K. Yamamoto Y. Matsumoto 2004] :Conditional Random Field.

High-Integrity between behavioral symbolization model and natural language model

Integration of Behavioral Symbol and Natural Language Model

play

0.1

run throw

player ball

0.2 0.05

0.0 0.0

player throw a

player throw a ball a

player run a

0.5

0.45

0.3

evaluation

Morphological Analysis

"Rosette Morphological Analysis System" google, amazon, MSN, Rakuten Dictionary for Japanese Morphological Analysis "UniDic" National Institute of Japanese Language "Juman" Morphological Analysis System for Japanese Language Language Media Laboratory Kyoto University (based on HMM) "ChaSen" Computational Linguistics Lab Nara Institute of Science and Technology Yuji Matsumoto IPA word-class system "MeCab" Taku Kudou Google, Mac OS X, iPhone Open Source IPA word-class system IPADIC , 3-4 times faster than ChaSen

Natural Language Processing Experimental Result: Interpretation of motion primitive as sentences

The number of latent state in motion language model : 50 The number of proto symbol : 10 The number of words : 15 The number of word class : 5

Page 10: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

10

Experimental Result: Interpretation of motion primitive as sentences

Experimental Result: Motion Generation from Sentences

A player runs. A pitcher crouches on the mound..

Pedestrian scramble Shibuya, Tokyo

Humanoid Robotics = Science of Anthropomorphism

Page 11: 人間の動作のモデル化とロボットへの応用 Part2chik/InfoTech11/03 Nakamura2.pdf · Pragmatics 語用論 Machine Translation ... Extraction of Key Features ... “Statistically

11

many thanks to my dear colleagues

W. Takano, K. Yamane, T. Inamura, T. Sugihara, D. Kulic K. Yamamoto, H. Tanie, H. Suzuki

and many more at YNL @UT

Category S of Grant-in-Aid for Scientific Research (20220001) FY2008-2012 Japan Society for the Promotion of Science. "Establishing Man-Machine Communication in Unifying Body Motion and Language"

Category S of Grant-in-Aid for Scientific Research (15100002) FY2003-2007 Japan Society for the Promotion of Science. "Development of Dynamic Information Processing Model of Intelligence"

Acknowledgements