F. Kaplan, P. Oudeyer, E. Kubinyi and A. Miklosi Mart van de Sanden

F. Kaplan, P. Oudeyer, E. Kubinyi and A. Miklosi

Mart van de Sanden

AIBO As a Digital Creaturen Animal-like entertainment robot A companion How to teach it to do new things? Train like real pets? Through

interaction!

How Does it Work For Real Pets? How to teach a dolphin to

jump? Show it to him? Explain it to him? It needs to discover on its

own! But what if the action is rare

or complex? We need to guide it!

The same goes for robots!

How Not To Do It

Chanting while pushing the dog to sit Split attention between

learning a new move and listening to the trainer.

Which part of the behavior is sit?

Often the command is given while the dog is still standing.

Then How?

First teach the behavior. Then add the command!

Modelling (or molding)

Physically manipulating the animal into the desired position.

Then give positive feedback. Never used by professional trainers. The dog is not actively involved. Learning performance is poor. Used for teaching industrial robots! Not convenient for autonomous robots. Not good for teaching complex

movements.

Luring (or “magnet method”) Same as modeling, but with the use

of a lure. Gives satisfactory results for real

dogs. Can only teach positions or simple

movements. Not really used with robots.

Capturing

Exploits behavior that the animal performs spontaneously.

Wait for the correct behavior and give a positive reinforcement.

Takes to much time when multiple commands need to be learned.

The use of imitation?

Animal anatomy mostly does not resemble ours.

Only higher animals (e.g. primates) are able to imitate.

Has been done with robotics. It can handle the learning of sequences

of actions and rare behaviors. Requires elaborate vision techniques.

Shaping

Breaks behaviors down into small steps.

Which can be trained used any of the mentioned techniques.

Clicker training!

Clicker Training

B.F. Skinner: Operant conditioning.

A Clicker emits a brief sharp sound.

Which is associated with a primary reinforcer. Foods, toys, etc. It becomes a secondary

reinforcer. It will act as a positive cue.

Clicker Training

The clicker can be used to guide animals in the right direction.

By only giving the clicker sound when the animal performs the desired behavior.

Clicker Training

Four steps: Charging the clicker. Getting the behavior. Adding the command word. Testing the behavior.

It can be used to learn rare behaviors.

It can be used to learn sequences of behaviors.

Discussion!

Do you want to train your robot using this way or do you rather use a computer to program it?

Or build in another way of training? Because clicker training does not exactly come natural.

Robotic Clicker Training

Robot: Hierarchical schemata based behavior

model. Behavior selection according to:

Opportunities in the environment Natural instincts Emotion of the robot User expectation model

(associative memory)

Charging the Clicker

Primary reinforcer -> event within 5 seconds.

After 30 times it becomes a secondary reinforcer.

TRAINER scratches the robot’s head and says “Good”.ROBOT learns association in user’s expectation module.TRAINER scratches the robot’s head and says “Good”.ROBOT learns association in user’s expectation module.Etc.

Guiding the Robot

The robot starts out just doing what it wants to do.

When the trainers says “good”, the training module reinforces the current top-level schemata.

This means that the robot does the underlying behaviors more often.

Adding the Command Word

When a word is heared, the expection modules associates it with all the reinforced actions in the training session.

It creates a new schema for them. A new schema has a confidence

level. After reaching a certain level it

becomes permanent.

F. Kaplan, P. Oudeyer, E. Kubinyi and A. Miklosi Mart van de Sanden

Documents

2012 - Colorado General Assemblyleg.colorado.gov/sites/default/files/2012_minutes.pdf · 2016. 8. 1. · January 17, 2012, Page 2 7:02 a.m. Representative Miklosi arrived at the hearing

Changing Paradigms in Drug Discovery - Kubinyi · Changing Paradigms in Drug Discovery Hugo Kubinyi University of Heidelberg Germany E-Mail kubinyi@t-online.de ... Open Questions

Miklosi vs. Miklos

Chemical Biology and Chemogenomics in Drug Discovery and...Hugo Kubinyi Chemical Biology and Chemogenomics in Drug Discovery Hugo Kubinyi Weisenheim am Sand, D E-Mail kubinyi@t-online.de

Hugo Kubinyi

Intelligent Adaptive Curiosity: a source of Self-Developmentcogprints.org/4144/1/oudeyer.pdf · 2018. 1. 17. · Self-Development Pierre-Yves Oudeyer Frederic Kaplan Sony Computer

Drug Discovery - Introduction - Hugo Kubinyi, Home Page · · 2005-12-25Drug Discovery - Introduction Hugo Kubinyi Germany ... ein sehr nützlich und heilsamb Kraut ist“ (Johannes

POLLACK EPO 2014 http:pollackepoDittrich Ernő Hidro Consulting Kft. ... Kubinyi Antal Air Trade Centre Hungary Kft. 1130 – 1200 EasyBus – Tűzvédelmi- és légtechnikai vezérlő

Pierre-Yves Oudeyer Email : pierre-yves.oudeyer@inria · Keynote speeches at international conferences: Highlight : 2019 keynote at International Conference on Learning Representations

Miskolci Egyetem, Történettudományi Intézettortenelemszak.uni-miskolc.hu/gesta/gesta200662/200662... · 2011. 4. 5. · zak Portré: Kubinyi András Ola volt, a tizenéveseknek

Hugo Kubinyi, Drug Design - Problems in

Kubinyi András Miért lettem a középkor kutatója?epa.oszk.hu/00400/00414/00015/pdf/t_12kubinyi.pdf · Kubinyi András Miért lettem a középkor kutatója?0* Ez a kérdés csak

Hugo Kubinyi,

Serendipity and Rational Design - Kubinyi · Hugo Kubinyi, Serendipity and Rational Design Hugo Kubinyi Germany E-Mail kubinyi@t-online.de HomePage …

Kubinyi Ágoston - Magyarországi Mérges Növények

Chemogenomics in Drug Discoverydownload.e-bookshelf.de/...G-0000602719-0002363983.pdf · Chemogenomics in Drug Discovery A Medicinal Chemistry Perspective Edited by Hugo Kubinyi and

Kubinyi Andrásnak a középkori magyarországi várakra ... · PDF file14‒16. század Kubinyi András ... Magyarország történelmi földrajza a Hunyadiak ... A Hunt-Páznán nemzetségbeli

From hardware and software to kernels and envelopes: …pyoudeyer.com/kaplan-oudeyer-Neuromorphic10.pdf · 4 kernels and envelopes of this chapter discusses in more details the epistemological

Miklosi 2017 trend report

Drug Metabolism Pharmacognosy 491... · bioavailability Hugo Kubinyi, Drug Metabolism For the elimination of xenobiotics, mainly in the liver. - oxidations, reductions and hydrolyses