You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
All development of Gym has been moved to Gymnasium, a new package in the Farama Foundation that's maintained by the same team of developers who have maintained Gym for the past 18 months.
If you're already using the latest release of Gym (v0.26.2), then you can switch to v0.27.0 of Gymnasium by simply replacing import gym with import gymnasium as gym with no additional steps.
Gym will not be receiving any future updates or bug fixes, and no further changes will be made to the core API in Gymnasium.
Environments
CliffWalking-v0
State space
A $4 \times 12$ grid world
Positions represented as flattened index
For example, the starting point (3, 0) is represented as $3 \times 12 + 0 = 36$
Action space
0: Move up
1: Move right
2: Move down
3: Move left
Instructions
run pip install requirements.txt
run python main.py
the main.ipynb Jupyter Notebook file contains a brief walkthrough of the current implementation
Status
Currently, I'm actively working on modifying this project to make it more readable and modularized
Notes
The main purpose of creating this repository is to have fun playing around with RL
More specifically, I wanted to
run experiment new RL algorithms on simple game-like environments
refresh my memory of the basic RL algorithms
hopefully I can figure out some interesting variations of the existing RL algorithms and contribute to this field
share my implementation with the community to get feedback
maybe help people who are also interested in RL and just started to explore this intriguing field
Researchers from DeepMind published an article about the relationship between neuroscience and AI in early 2020: Dopamine and temporal difference learning: A fruitful relationship between neuroscience and AI