12
Challenges in Perception for Learning, Cognition and Control Approaches Darius Burschka Machine Vision Perception Group (MVP) Technische Universität München, Germany

Challenges in Perception for Learning, Cognition and Control Approaches

Embed Size (px)

Citation preview

Page 1: Challenges in Perception for Learning, Cognition and Control Approaches

Challenges in Perception for Learning, Cognition and Control

Approaches

DariusBurschkaMachineVisionPerceptionGroup(MVP)TechnischeUniversitätMünchen,Germany

Page 2: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Structure of a CNN for Robotics Applications

Rawsensorinformation(depthimagefromanRGB-Dsensor)istakenasinputtoconnectwiththerobotcontrolcommanddirectly

Idealcase:learnthecouplingbetweenperceptionandcontrolcommandsfromdemonstrations

Page 3: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Curse of Complexity in Direct ProcessingIntheory,onecouldusealltheextractedfeatureswithaclassifiersuchasasoftmaxclassifier,butthiscanbecomputationallychallenging.Forinstanceimagesofsize96x96pixelswith400learnedfeaturesover8x8inputs.Eachconvolutionresultsinanoutputofsize(96−8+1)∗(96−8+1)=7921,andsincewehave400features,thisresultsinavectorof892∗400=3,168,400featuresperexample.Learningaclassifierwithinputshaving3+millionfeaturescanbeunwieldy,andcanalsobepronetoover-fitting.

Page 4: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Deep-Learning for Perception in Robotics

Whatdoallthesetaskshaveincommon?

Source:L.Tai,“Deep-LearninginMobileRobotics…”

Page 5: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Learning Approaches vs. Conventional Tools

Imagesource:Berkley

Grouping/Segmentation,Labeling,Identification

becausedifferencesinthepropertiesofpixelstoentireimageareconsideredduetothestructureofCNNs

Problem:notclear,whattheprocessingisbasedon!

?

Hand-designedfeatureshelptoestimatemetricdistancesbetweenthemusingcalibrationparameters

Page 6: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Deep-Learning for Control in Robotics

Source:L.Tai,“Deep-LearninginMobileRobotics…”

Page 7: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Can we avoid Metric Information in Representation of Environments?

Page 8: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Fast Uncalibrated Monocular Estimation of Independent Motion Components

Page 9: Challenges in Perception for Learning, Cognition and Control Approaches

Representation of the Environment in Collision Space

Page 10: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Examples of non-metric Control (RoboMobil)

Page 11: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Examples of non-metric Control (RoboMobil)

Page 12: Challenges in Perception for Learning, Cognition and Control Approaches

DariusBurschka–MachineVisionandPerceptionGroup(TUM)

Conclusions

• Imagelabelingandimageretrieval(indexing)representalargeapplicationfieldforperceptionandimageprocessing(Google,Apple,etc.),butroboticrequiresoftenametricmappingfromthesensorontocontrolvalues• Workonalternativesforcouplingbetweenthesensorandtheactuatorshouldbeoneoftheperceptiongoals• Currentlyuseddatarepresentationsneedtobere-definedforlearningapplications.