Grid World Path Planning using PPO

A simple experimental project using Proximal Policy Optimization (PPO) from OpenAI's Spinning Up library, applied to a custom Grid World environment for path planning.

This is an active work-in-progress (WIP). Currently experimenting with:

🧭 Increasing action space (more directional controls)
🎮 Integrating imitation learning for guided policy initialization
⚙️ Exploring environment variations

Overview

The goal is to train an agent to navigate a 2D grid world, reach the target efficiently, and avoid obstacles using reinforcement learning.
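To make the task concrete, here is a minimal sketch of what such an environment could look like. It is not the repository's environment: the class name `GridWorld`, the 5x5 layout, the obstacle positions, and every reward value are assumptions chosen for illustration.

```python
from dataclasses import dataclass, field

# Hypothetical 4-direction action set as (row, col) deltas: up, down, left, right.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]


@dataclass
class GridWorld:
    """Toy grid world; layout and reward values are illustrative assumptions."""
    size: int = 5
    start: tuple = (0, 0)
    goal: tuple = (4, 4)
    obstacles: frozenset = frozenset({(1, 1), (2, 3)})
    max_steps: int = 50
    pos: tuple = field(init=False, default=(0, 0))
    steps: int = field(init=False, default=0)

    def reset(self):
        """Put the agent back at the start cell and return the observation."""
        self.pos, self.steps = self.start, 0
        return self.pos

    def step(self, action):
        """Apply one action index and return (observation, reward, done, info)."""
        dr, dc = ACTIONS[action]
        r, c = self.pos[0] + dr, self.pos[1] + dc
        self.steps += 1
        inside = 0 <= r < self.size and 0 <= c < self.size
        if not inside or (r, c) in self.obstacles:
            # Blocked move: the agent stays in place and pays a penalty.
            reward = -1.0
        else:
            self.pos = (r, c)
            # A small per-step cost favours short paths; bonus on reaching the goal.
            reward = 10.0 if self.pos == self.goal else -0.1
        done = self.pos == self.goal or self.steps >= self.max_steps
        return self.pos, reward, done, {}
```

The per-step cost is what encodes "efficiently" in this sketch, and the blocked-move penalty is what encodes "avoid obstacles"; other choices (terminating on collision, for instance) would be equally reasonable.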

Current Setup:

Environment: Custom Grid World
Algorithm: PPO from OpenAI Spinning Up
Experiments:
- Action space scaling
- Imitation learning integration
- Custom reward shaping
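The action-space and reward-shaping experiments listed above could be sketched roughly as follows. The names `ACTIONS_4`, `ACTIONS_8`, and `shaped_reward` are hypothetical, and potential-based shaping with a negative Manhattan distance as the potential is only one possible choice, not necessarily the one used in this project.

```python
# Two candidate action sets as (row, col) deltas; names are illustrative.
ACTIONS_4 = [(-1, 0), (1, 0), (0, -1), (0, 1)]
# Scaling the action space: add the four diagonal moves.
ACTIONS_8 = ACTIONS_4 + [(-1, -1), (-1, 1), (1, -1), (1, 1)]


def manhattan(a, b):
    """Manhattan distance between two (row, col) cells."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def shaped_reward(base_reward, pos, next_pos, goal, gamma=0.99):
    """Potential-based shaping: r + gamma * phi(s') - phi(s).

    Assumption: phi(s) is the negative Manhattan distance to the goal,
    so moves toward the goal earn a small bonus.
    """
    phi = -manhattan(pos, goal)
    phi_next = -manhattan(next_pos, goal)
    return base_reward + gamma * phi_next - phi
```

Shaping of this form adds a dense learning signal while leaving the optimal policy unchanged, which is why it is a common starting point before trying hand-tuned reward terms.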

Quick Start

Clone the repo and install the dependencies, then run training:
```
python algorithms/ppo/ppo.py
```

Work In Progress

- PPO baseline training
- Expand action space
- Add imitation learning
- Experiment with multiple targets
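For the imitation-learning item, one possible source of demonstrations is a shortest-path planner. The sketch below uses breadth-first search on a 4-connected grid; the function name `bfs_expert`, its signature, and the move ordering are assumptions made for illustration and may differ from the expert used in this project.

```python
from collections import deque

# Assumed move ordering as (row, col) deltas: up, down, left, right.
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def bfs_expert(size, start, goal, obstacles):
    """Return a shortest list of (state, action_index) pairs, or None if unreachable."""
    parent = {start: None}  # cell -> (previous cell, action taken), None for start
    queue = deque([start])
    while queue:
        cur = queue.popleft()
        if cur == goal:
            break
        for a, (dr, dc) in enumerate(MOVES):
            nxt = (cur[0] + dr, cur[1] + dc)
            inside = 0 <= nxt[0] < size and 0 <= nxt[1] < size
            if inside and nxt not in obstacles and nxt not in parent:
                parent[nxt] = (cur, a)
                queue.append(nxt)
    if goal not in parent:
        return None
    # Walk back from the goal to the start, then reverse into forward order.
    demo, node = [], goal
    while parent[node] is not None:
        prev, a = parent[node]
        demo.append((prev, a))
        node = prev
    return demo[::-1]
```

The resulting (state, action) pairs could serve as supervised targets for pre-training a policy before PPO fine-tuning.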

Acknowledgements

OpenAI Spinning Up
PPO Algorithm: "Proximal Policy Optimization Algorithms" (Schulman et al., 2017)
