Skip to content

achao2013/Learning-To-Reinforcement-Learn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A mxnet implementation of meta-RL

This is an attempt to implement the bandits algorithm in paper Learning to reinforcement learn.

The algorithm should be mostly correct. And this work is based on the repo:https://github.com/awjuliani/Meta-RL (TensorFlow)

The Labryinth experiments will be pushed soon.

Usage

run python a3c-bandit.py --num-threads=32 --episode-len=100

About

a implement of meta-RL

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages