14
Online reinforcement learning by a spiking network model of the basal ganglia Hideyuki Yoshimura, Tadashi Yamazaki The University of Electro-Communications 2019/03/26 NICE 2019 @Albany

Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Online reinforcement learningby a spiking network model of

the basal gangliaHideyuki Yoshimura, Tadashi Yamazaki

The University of Electro-Communications

2019/03/26 NICE 2019 @Albany

Page 2: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Name Compute Products Killer app

AI accelerator Tensor calculation TPU2, Tensor core, Movidius, MN-2, ... Deep learning

Annealer Simulated annealingD-Wave, Hitachi CMOS, Fujitsu

digital, ...

Combinatorial optimization

Neuromorphic Spiking neural network

Loihi, TrueNorth, SpiNNaker,

BrainScaleS, ...?

What is a killer app for neuromorphic chips?

Page 3: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

(Deep) Reinforcement learning

Page 4: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Basal ganglia

Motor cortexPrefrontal cortex

Thalamus

Basal ganglia is a site for RL in the brain

Cerebellum

Page 5: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Model anatomical structure

900 spike response model (SRM0) neurons

(Based on Kim et al. 2017)

Page 6: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Learning rule

Spike timing-based learning rule (TD-LTP) (Frémaux et al. 2013)

Page 7: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Learning rule

Spike timing-based learning rule (TD-LTP) (Frémaux et al. 2013)

Page 8: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Implementation

・C++ and CUDA

・Explicit Euler method with dt = 1 ms

・Realtime simulation

→ Allowing online RL

Experiment Environment: NVIDIA GeForce GTX TITAN Z, Intel Core i7-4960X, 64GB RAM

1 s simualtion completes in 0.61 s

Page 9: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Example 1: Water maze task

Page 10: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Example 1: Water maze task

Before learning

After learning

Page 11: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Example 1: Water maze task

Successfully learned to find the goal

Page 12: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Example 2: Reaching task

Page 13: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Example 2: Reaching task

Successfully learned to reach to the given target

Page 14: Online reinforcement learning by a spiking network model ...€¦ · Basal ganglia. Prefrontal cortex. Motor cortex. Thalamus: Basal ganglia is a site for RL in the brain. Cerebellum

Summary

・Built a spiking network model of the basal ganglia

・The model can perform RL

・Realtime simulation → Online RL

・We have a poster!