
Leaflet for the talk on AlphaGo




AlphaGo: Mastering the Game of Go with Deep Neural Networks and Tree Search

Presentation of the paper by David Silver, Aja Huang, Demis Hassabis et al. (from Google DeepMind)

(http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html)

Who: Karel Ha ([email protected])
When: 20th April 2016 (Wednesday), 9:00 - 10:30
Where: S9
Why: Optimization Seminar (NOPT053)

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence, owing to its enormous search space and the difficulty of evaluating board positions and moves.

A new approach to computer Go introduces value networks to evaluate board positions and policy networks to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games and reinforcement learning from games of self-play.
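To make the two roles concrete, here is a toy sketch of what a "policy" and a "value" function compute: the policy maps a board position to a probability distribution over moves, the value maps it to a scalar win estimate. The linear models, shapes, and names below are illustrative only, not AlphaGo's actual convolutional architecture.

```python
import numpy as np

rng = np.random.default_rng(0)
BOARD = 19 * 19  # a Go board flattened to 361 features (illustrative encoding)

def policy_network(board, W):
    """Map a board position to a probability distribution over all moves."""
    logits = W @ board
    e = np.exp(logits - logits.max())  # numerically stable softmax
    return e / e.sum()

def value_network(board, w):
    """Map a board position to a scalar win estimate in (-1, 1)."""
    return np.tanh(w @ board)

# Random stand-ins for a position and (untrained) weights:
board = rng.standard_normal(BOARD)
W = rng.standard_normal((BOARD, BOARD)) * 0.01
w = rng.standard_normal(BOARD) * 0.01

p = policy_network(board, W)   # sums to 1: one probability per move
v = value_network(board, w)    # single number: how good is this position?
```

In the paper, the policy network is first trained by supervised learning to predict expert moves, then improved by reinforcement learning through self-play; the value network is trained to predict the winner of those self-play games.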

Furthermore, a new search algorithm is introduced: it combines Monte Carlo simulation with value and policy networks. Using this search algorithm, the computer program AlphaGo developed by Google DeepMind achieved a 99.8% winning rate against other Go programs.
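The key idea of the combination is that during tree search each move is scored by its estimated action value Q plus an exploration bonus u that is proportional to the policy network's prior P and decays with the visit count N. A minimal sketch of that selection rule (constants and data layout are assumptions for illustration):

```python
import math

def select_move(stats, c_puct=5.0):
    """Pick the child move maximizing Q(s, a) + u(s, a).

    `stats` maps each move to a tuple (P, N, Q): the policy network's
    prior probability P, the visit count N, and the mean action value Q.
    The bonus u = c_puct * P * sqrt(total_visits) / (1 + N) encourages
    trying moves the policy network likes but the search has not yet explored.
    """
    total_visits = sum(n for _, n, _ in stats.values())
    best_move, best_score = None, -math.inf
    for move, (P, N, Q) in stats.items():
        u = c_puct * P * math.sqrt(total_visits) / (1 + N)
        if Q + u > best_score:
            best_move, best_score = move, Q + u
    return best_move

# An unvisited move with a strong prior can outrank a well-explored one:
stats = {"a": (0.6, 0, 0.0), "b": (0.4, 10, 0.5)}
chosen = select_move(stats)  # "a": its exploration bonus dominates
```

At the leaves, AlphaGo evaluates positions by mixing the value network's estimate with the outcome of a fast Monte Carlo rollout, rather than relying on rollouts alone.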

Theorem 1. The (distributed version of) AlphaGo plays Go at the super-human level.

Proof. The proof is left as an exercise to the reader. The exercise is to come to the talk. :-)