1 An Application of Reinforcement Learning to Aerobatic Helicopter Greg McChesney Texas Tech...

An Application of Reinforcement Learning to Aerobatic Helicopter

Greg McChesneyTexas Tech University

Greg.mcchesney@ttu.edu

Apr 08, 2009CS5331: Autonomous Mobile

Robots

Overview

Creating a robot that can fly autonomously

Software developed at Stanford as part of their AI lab

This paper is slightly outdated as many new maneuvers have been created.

Robots 2

Learning Approach

Apprenticeship Collect data from human trying

maneuver (multiple times) Learn a model from the data Find controller than can simulate based

on model Test on helicopter (pray it doesn’t

crash)

Robots 3

Helicopters State

Position Velocity Angular Velocity Controlled with 4 dimensions

Cyclic pitch Tail rotor

Take gravity out when calculating the model

Robots 4

Controller Design

Use a Markov decision process Sextuple (S,A,T,H,s(0),R)

S-set of states A-set of actions (inputs) T-dynamic model-set of probability

distributions for the next state H-horizon or number of time steps of

interest s(0)-initial state R-reward function

Robots 5

Differential Dynamic Programming(DDP)

Compute the linear approximation Compute the optimal solution to the

linear quadratic regulator Must take into account error state Cost for change in input-needed in real

testing

Robots 6

DDP-Continued

2 phases DDP to find open loop input sequence Use DDP again refining the inputs as a

deviation from the nominal open-loop input sequence

Integral control-take into account wind and errors in the model

Robots 7

Rewards

24 features Used inverse reinforcement learning Rewards from inverse reinforcement

usually did not produce correct result

Took inverse results and manually tuned them to get good results

Robots 8

Helicopter

Xcell Tempest 54” long 19” high 13 lbs Two-stroke engine Orientation sensors GPS-doesn’t work during flips

Robots 9

Robots 10

Robots 11

Robots 12

Tail-In Funnel

Robots 13

Nose-In Funnel

Robots 14

Questions

Motivations/Who pays for it I can see applications in the defense

sector DARPA

Could more maneuvers be done just by changing some parameters? Probably not because the filter is

learned based on a model so you would need to create a new model

Robots 15

1 An Application of Reinforcement Learning to Aerobatic Helicopter Greg McChesney Texas Tech...

Documents

The Vietnam Center and Archive Stephen Maxner, Ph.D. DirectorSteve.maxner@ttu.edu

High-Throughput Field Phenotyping of Plants Sri Harsha Atluri Sriharsha.atluri@ttu.edu

Engineering Graphics Welcome to E GR 1207. Engineering Graphics Coordinator Lee Reynolds Office: ME 224A Email: Howard.L.Reynolds@ttu.edu Howard.L.Reynolds@ttu.edu

Table of Contents · Sarai Brinker TEACH Peer Consultant 742-0133 s.brinker@ttu.edu Rebecca Densley TEACH Peer Consultant 742-0133 rebecca.densley@ttu.edu Mindi Price TEACH Peer Consultant

Greg McChesney Thesis Defense Presentation Computer Science, TTU greg.mcchesney@ttu

1 Welcome Maria McChesney Director, IS, City of Hamilton

SUPPLY CHAIN MANAGEMENT: An Organizational Competency INTRODUCTION TO THE COURSE alan.whitebread@ttu.edu ALAN L. WHITEBREAD

Dr. Darren Hudson Larry Combest Chair of Agricultural Competitiveness darren.hudson@ttu.edudarren.hudson@ttu.edu, 742.2821x272, 206 AGSCI

Proposed Storm Water and Grading Policies and Standards January 13, 2011 Don McChesney

Doodle Studies and Etudes - Bob McChesney

The Endless Crisis - John Bellamy Foster & Robert W. McChesney

SECC Coordinator Training August 2015 | 742-7025 | secc@ttu.edu

2014 State Employee Charitable Campaign | 742-7025 | secc@ttu.edu

for Handbells Unlimited - my friends David Jordan and … - Granados/McChesney - duet with keyboard - page 5 of 12

myweb.ttu.edu/bban lev.ban@ttu.edu Real Analysis · 2018. 3. 30. · myweb.ttu.edu/bban lev.ban@ttu.edu Real Analysis Byeong Ho Ban Mathematics and Statistics Texas Tech University

Martin, William McChesney, Jr Statement before the Committee on Finance. United States Senate

Greg McChesney Thesis Proposal Presentation Computer Science, TTU Greg.mcchesney@ttu.edu Service Context Management for Exertion-oriented Programming

Thomas Halverson and Bill Poirier Tom.Halverson@ttu.edu Texas Tech University Department of Physics 6-9-13

McChesney 2013 - Digital Disconnect Ch1

Greg McChesney Thesis Defense Presentation Computer Science, TTU greg.mcchesney@ttu.edu