Q-Learning Grid World Solver

This project implements a Reinforcement Learning agent that learns to navigate a 6×6 grid. The agent must find the optimal path from a starting position to a goal while avoiding static obstacles.

🎮 The Environment

The world is a coordinate-based grid where:

Grid Size: 6×6
Start Position (S): (5, 0) (Bottom-Left)
Goal Position (G): (0, 5) (Top-Right)
Obstacles (X): Static blocks at fixed coordinates. Moving into one incurs a penalty.
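The layout above can be sketched as a small text rendering. This is only an illustration: coordinates are read as (row, col) with row 0 at the top, which is what makes (5, 0) the bottom-left cell, and the obstacle positions are hypothetical placeholders because the real ones live in the notebook.

```python
GRID_SIZE = 6
START = (5, 0)  # (row, col): bottom-left
GOAL = (0, 5)   # (row, col): top-right
# Hypothetical obstacle positions, for illustration only.
OBSTACLES = {(1, 1), (2, 3), (4, 2)}


def render_grid():
    """Return the grid as text: S = start, G = goal, X = obstacle, . = free."""
    lines = []
    for row in range(GRID_SIZE):
        cells = []
        for col in range(GRID_SIZE):
            if (row, col) == START:
                cells.append("S")
            elif (row, col) == GOAL:
                cells.append("G")
            elif (row, col) in OBSTACLES:
                cells.append("X")
            else:
                cells.append(".")
        lines.append(" ".join(cells))
    return "\n".join(lines)


print(render_grid())
```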

🧠 Reinforcement Learning Logic

The agent uses the Q-Learning algorithm to populate a 3D Q-Table of shape (6, 6, 4). The first two dimensions index the grid cell (row, column), and the third holds one Q-value for each of the 4 possible actions (Up, Right, Down, Left).
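A minimal sketch of that table with NumPy follows. The action index order (0 = Up, 1 = Right, 2 = Down, 3 = Left) and the (row, col) offsets are assumptions that follow the order in which the actions are listed; the notebook may encode them differently.

```python
import numpy as np

GRID_SIZE = 6
ACTION_NAMES = ["Up", "Right", "Down", "Left"]
# Assumed (row, col) offsets, in the same order as ACTION_NAMES.
ACTION_DELTAS = [(-1, 0), (0, 1), (1, 0), (0, -1)]

# One Q-value per (row, col, action), all starting at zero.
q_table = np.zeros((GRID_SIZE, GRID_SIZE, len(ACTION_NAMES)))

# Look up the greedy action for the start cell (5, 0).
row, col = 5, 0
best_action = int(np.argmax(q_table[row, col]))
print(q_table.shape, ACTION_NAMES[best_action])
```

With an all-zero table every action ties, and `np.argmax` returns the first index, so the greedy choice is only meaningful after training.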

Key Hyperparameters

Alpha (α): 0.3 (Learning Rate)
Gamma (γ): 0.95 (Discount Factor)
Epsilon (ϵ): Starts at 0.9 and decays over time to balance exploration and exploitation.
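As a sketch of how these values enter the standard Q-learning update, the snippet below applies one temporal-difference step and one epsilon decay step. The multiplicative decay factor and the epsilon floor are assumptions, since only the starting value of 0.9 is stated here; `q_update` and `decay_epsilon` are illustrative names, not functions from the notebook.

```python
import numpy as np

ALPHA = 0.3            # learning rate
GAMMA = 0.95           # discount factor
EPSILON_START = 0.9
EPSILON_MIN = 0.05     # assumed floor
EPSILON_DECAY = 0.9995  # assumed per-episode multiplier


def q_update(q_table, state, action, reward, next_state, done):
    """Standard Q-learning update: Q += alpha * (target - Q)."""
    row, col = state
    next_row, next_col = next_state
    # No future value is bootstrapped from a terminal state.
    best_next = 0.0 if done else np.max(q_table[next_row, next_col])
    td_target = reward + GAMMA * best_next
    q_table[row, col, action] += ALPHA * (td_target - q_table[row, col, action])


def decay_epsilon(epsilon):
    """Shrink epsilon geometrically, never below the floor."""
    return max(EPSILON_MIN, epsilon * EPSILON_DECAY)


q_table = np.zeros((6, 6, 4))
q_update(q_table, state=(5, 0), action=0, reward=-0.5, next_state=(4, 0), done=False)
epsilon = decay_epsilon(EPSILON_START)
```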

Reward Structure

Goal Reach: +100
Obstacle/Boundary Hit: -10
Each Step: -0.5 (Encourages the shortest path)
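The reward table can be expressed as a single function of the cell the agent tries to enter. This sketch assumes the three rewards are mutually exclusive (the step penalty is not added on top of the goal or collision reward) and uses hypothetical obstacle positions; `get_reward` is an illustrative name.

```python
GRID_SIZE = 6
GOAL = (0, 5)
# Hypothetical obstacle positions, for illustration only.
OBSTACLES = {(1, 1), (2, 3), (4, 2)}


def get_reward(target_cell):
    """Reward for attempting to move into target_cell, given as (row, col)."""
    row, col = target_cell
    out_of_bounds = not (0 <= row < GRID_SIZE and 0 <= col < GRID_SIZE)
    if out_of_bounds or target_cell in OBSTACLES:
        return -10.0   # obstacle or boundary hit
    if target_cell == GOAL:
        return 100.0   # goal reached
    return -0.5        # ordinary step


print(get_reward((0, 5)), get_reward((6, 0)), get_reward((3, 3)))
```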

🚀 How to Run

Initialize the Q-Table with zeros.
Run the train_agent() function for 20,000 episodes.
Use visualize_best_grid(q_table) to view the learned policy in the console.
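For readers who want to see what such a training run involves without opening the notebook, here is a compact sketch. It is not the notebook's code: the `train_agent` signature, the `step` helper, the obstacle positions, the step cap per episode, the decay schedule, and the choice to leave the agent in place after a collision are all assumptions.

```python
import numpy as np

GRID_SIZE = 6
START, GOAL = (5, 0), (0, 5)
OBSTACLES = {(1, 1), (2, 3), (4, 2)}  # hypothetical positions
ACTION_DELTAS = [(-1, 0), (0, 1), (1, 0), (0, -1)]  # assumed: Up, Right, Down, Left
ALPHA, GAMMA = 0.3, 0.95


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    d_row, d_col = ACTION_DELTAS[action]
    nxt = (state[0] + d_row, state[1] + d_col)
    in_bounds = 0 <= nxt[0] < GRID_SIZE and 0 <= nxt[1] < GRID_SIZE
    if not in_bounds or nxt in OBSTACLES:
        return state, -10.0, False  # assumed: the agent stays in place
    if nxt == GOAL:
        return nxt, 100.0, True
    return nxt, -0.5, False


def train_agent(q_table, episodes=20_000, max_steps=200, seed=0):
    """Fill q_table in place with tabular Q-learning and return it."""
    rng = np.random.default_rng(seed)
    epsilon = 0.9
    for _ in range(episodes):
        state = START
        for _ in range(max_steps):
            row, col = state
            # Epsilon-greedy: explore with probability epsilon, else exploit.
            if rng.random() < epsilon:
                action = int(rng.integers(len(ACTION_DELTAS)))
            else:
                action = int(np.argmax(q_table[row, col]))
            next_state, reward, done = step(state, action)
            best_next = 0.0 if done else np.max(q_table[next_state[0], next_state[1]])
            q_table[row, col, action] += ALPHA * (
                reward + GAMMA * best_next - q_table[row, col, action]
            )
            state = next_state
            if done:
                break
        epsilon = max(0.05, epsilon * 0.9995)  # assumed decay schedule
    return q_table
```

Under these assumptions, `train_agent(np.zeros((6, 6, 4)))` runs the full 20,000 episodes; passing a smaller `episodes` value is handy for a quick check.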

📍 Final Learned Path

Below is the visual representation of the agent's optimal policy after training. Arrows indicate the action with the highest Q-value for each state.
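The arrow for each cell is simply the argmax over that cell's four Q-values. A minimal sketch of that mapping is shown below; `render_policy` is a hypothetical helper rather than the notebook's `visualize_best_grid`, and the action order (Up, Right, Down, Left) is assumed to match the index order of the table's last axis.

```python
import numpy as np

ARROWS = ["↑", "→", "↓", "←"]  # assumed order: Up, Right, Down, Left


def render_policy(q_table, goal=(0, 5), obstacles=frozenset()):
    """Return one arrow per cell showing the highest-valued action."""
    lines = []
    for row in range(q_table.shape[0]):
        cells = []
        for col in range(q_table.shape[1]):
            if (row, col) == goal:
                cells.append("G")
            elif (row, col) in obstacles:
                cells.append("X")
            else:
                cells.append(ARROWS[int(np.argmax(q_table[row, col]))])
        lines.append(" ".join(cells))
    return "\n".join(lines)


# Demo table: pretend "Right" is the best action in the start cell (5, 0).
demo = np.zeros((6, 6, 4))
demo[5, 0, 1] = 1.0
print(render_policy(demo))
```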

🛠 Features

Epsilon-Greedy Policy: Ensures the agent explores the grid thoroughly before settling on a path.
Boundary Protection: The is_valid_state function prevents the agent from leaving the 6×6 area.
Detailed Visualization: Formatted console output using :7.2f for aligned and readable Q-values.
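Two of these features are small enough to sketch directly. The signature of `is_valid_state` shown here (separate row and column arguments) is an assumption, and `format_q_values` is a hypothetical helper that only demonstrates the `:7.2f` format specifier.

```python
GRID_SIZE = 6


def is_valid_state(row, col):
    """True when (row, col) lies inside the 6x6 grid."""
    return 0 <= row < GRID_SIZE and 0 <= col < GRID_SIZE


def format_q_values(values):
    """Format each value 7 characters wide with 2 decimals so columns align."""
    return " ".join(f"{value:7.2f}" for value in values)


print(is_valid_state(5, 0), is_valid_state(6, 0))
print(format_q_values([59.87, -10.0, -0.5, 100.0]))
```

The fixed width of 7 leaves room for a sign, up to three integer digits, the decimal point, and two decimals, which covers the reward range used here.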
