
Off-Policy Imitation Learning from Observations


Objective of GAIL

- Requires expert actions => learn from observations only (BCO, GAIfO)

- On-policy training (low sample efficiency) =>
  - off-policy algorithms rather than TRPO (SQIL, DAC)
  - offline reward estimation (PWIL)
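For concreteness, a minimal PyTorch sketch of the GAIL discriminator update (the class and function names here are illustrative assumptions, not code from the talk): the discriminator is trained to separate expert (s, a) pairs from the policy's, and its output then serves as the imitation reward.

import torch
import torch.nn as nn

class Discriminator(nn.Module):
    # Classifies (s, a) pairs: expert (label 1) vs. policy (label 0).
    def __init__(self, obs_dim, act_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, s, a):
        return self.net(torch.cat([s, a], dim=-1))  # raw logits

def gail_disc_loss(disc, expert_s, expert_a, policy_s, policy_a):
    bce = nn.BCEWithLogitsLoss()
    e_logits = disc(expert_s, expert_a)
    p_logits = disc(policy_s, policy_a)
    return bce(e_logits, torch.ones_like(e_logits)) + \
           bce(p_logits, torch.zeros_like(p_logits))

The policy is then trained (with TRPO in the original paper) on a reward derived from the discriminator, e.g. -log(1 - sigmoid(logits)) under the labeling above; this is exactly what ties GAIL to expert actions and to on-policy updates.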


Learning from Demonstration (LfD): expert demonstrations contain state-action pairs (s, a).
Learning from Observation (LfO): expert demonstrations contain only states / state transitions (s, s'); actions are unobserved.

Connection between LfD and LfO
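As a hedged reconstruction of this connection (following the paper's lemma; the exact statement on the slide did not survive extraction): by the data-processing inequality, the state-transition divergence minimized in LfO lower-bounds the state-action divergence minimized in LfD,

\[
D_{\mathrm{KL}}\big(\rho_\pi(s, s') \,\|\, \rho_E(s, s')\big)
\;\le\;
D_{\mathrm{KL}}\big(\rho_\pi(s, a) \,\|\, \rho_E(s, a)\big),
\]

because \(\rho(s, s') = \sum_a \rho(s, a)\, T(s' \mid s, a)\) applies the same Markov kernel to both occupancy measures. Equality holds in an injective MDP, where each transition (s, s') is induced by at most one action; in a non-injective MDP a gap remains, so matching transition occupancies does not by itself guarantee matching the expert's state-action occupancy.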



surrogate expert buffer: D_S = {(s, a)_i}

perform AIRL on D_S
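One plausible realization of this step, sketched in PyTorch (an assumption for illustration; the model and function names are mine): train a BCO-style inverse action model on the agent's own transitions, where actions are known, then use it to label the expert's (s, s') pairs, yielding the surrogate buffer D_S on which AIRL can run as if actions had been observed.

import torch
import torch.nn as nn

class InverseActionModel(nn.Module):
    # Predicts the action that produced a transition: a ~ g(s, s').
    def __init__(self, obs_dim, act_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 * obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, act_dim),
        )

    def forward(self, s, s_next):
        return self.net(torch.cat([s, s_next], dim=-1))

def inverse_model_loss(inv, agent_s, agent_a, agent_s_next):
    # Supervised regression on the agent's replay data, where actions are known.
    return ((inv(agent_s, agent_s_next) - agent_a) ** 2).mean()

def build_surrogate_buffer(inv, expert_s, expert_s_next):
    # Label expert state transitions with inferred actions: D_S = {(s, a)_i}.
    with torch.no_grad():
        return expert_s, inv(expert_s, expert_s_next)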

Similar “Surrogate Distribution”


pseudo-reward: the objective is optimized as RL under an estimated reward, since no environment reward is available
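As a sketch of one standard construction (an assumption; the exact form is not shown on the slide), an AIRL-style pseudo-reward can be read off a discriminator trained on state transitions: with expert transitions labeled 1 and a sigmoid read-out, the raw logit estimates the log density ratio.

import torch
import torch.nn as nn

def make_transition_disc(obs_dim, hidden=256):
    # Discriminator over state transitions (s, s') rather than (s, a).
    return nn.Sequential(
        nn.Linear(2 * obs_dim, hidden), nn.ReLU(),
        nn.Linear(hidden, 1),
    )

def pseudo_reward(disc, s, s_next):
    # The logit equals log d - log(1 - d), i.e. an estimate of
    # log(rho_E(s, s') / rho_pi(s, s')) under the labeling above.
    with torch.no_grad():
        return disc(torch.cat([s, s_next], dim=-1))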


no on-policy expectation is left in the objective, so it can be estimated entirely from off-policy data
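One standard route to this property (a sketch under the assumption that a DICE-style dual is used, as in ValueDICE by Kostrikov et al., 2020; the talk's exact derivation may differ): write the KL in its Donsker-Varadhan form and substitute \(x = \nu - \mathcal{B}^\pi \nu\), so the on-policy term telescopes into an expectation over initial states:

\[
D_{\mathrm{KL}}\big(\rho_\pi \,\|\, \rho_E\big)
= \max_{\nu}\;
(1-\gamma)\, \mathbb{E}_{s_0 \sim p_0,\; a_0 \sim \pi(\cdot \mid s_0)}\big[\nu(s_0, a_0)\big]
\;-\;
\log \mathbb{E}_{(s,a) \sim \rho_E}\big[e^{\nu(s,a) - \mathcal{B}^\pi \nu(s,a)}\big],
\quad
\mathcal{B}^\pi \nu(s,a) = \gamma\, \mathbb{E}_{s' \sim T,\, a' \sim \pi}\big[\nu(s', a')\big].
\]

Both remaining expectations are over the initial-state distribution and the expert (or replay) data, so no rollout of the current policy is needed to estimate the objective.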


Implementation Details

- optimize the density ratio with a GAN-style discriminator
- additional regularization (see the sketch below)
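A minimal sketch of such a discriminator update; the gradient penalty stands in for the unspecified "additional regularization" (an assumption here; a gradient penalty is the choice popularized by DAC, mentioned above).

import torch
import torch.nn as nn

def disc_update(disc, opt, expert_x, policy_x, gp_coef=10.0):
    # One GAN-style step of the density-ratio estimator. Inputs x could be
    # concatenated (s, s') pairs, matching the transition discriminator above.
    bce = nn.BCEWithLogitsLoss()
    e_logits, p_logits = disc(expert_x), disc(policy_x)
    loss = bce(e_logits, torch.ones_like(e_logits)) + \
           bce(p_logits, torch.zeros_like(p_logits))

    # Gradient penalty on interpolates between expert and policy batches.
    eps = torch.rand(expert_x.size(0), 1, device=expert_x.device)
    mid = (eps * expert_x + (1 - eps) * policy_x).requires_grad_(True)
    grad = torch.autograd.grad(disc(mid).sum(), mid, create_graph=True)[0]
    loss = loss + gp_coef * ((grad.norm(2, dim=1) - 1.0) ** 2).mean()

    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()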


End