Watch an AI agent learn to navigate a grid world using Q-learning.
Grid World
Training Statistics
Episodes: 0
Total Steps: 0
Wins: 0
Avg Reward: 0
Q-Values Visualization
Legend: 🟣 Agent | 🟢 Goal (+100) | 🔴 Obstacles (-10) | Empty cells (-1 per step)
Actions: ↑ Up | ↓ Down | ← Left | → Right
The agent learns to find the shortest path to the goal while avoiding obstacles.
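The learning process described above can be sketched as tabular Q-learning. This is a minimal illustration, not the code behind this page: only the three reward values come from the legend. The 5x5 grid size, the start, goal, and obstacle positions, the hyperparameters (`alpha`, `gamma`, `epsilon`), and the rule that walls and obstacles block the move are all assumptions made for the example.

```python
import random

# Rewards taken from the legend; everything else below is an illustrative assumption.
GOAL_REWARD, OBSTACLE_REWARD, STEP_REWARD = 100, -10, -1
SIZE = 5                                      # assumed grid size
START, GOAL = (0, 0), (4, 4)                  # assumed start and goal cells
OBSTACLES = {(1, 1), (2, 3), (3, 1)}          # assumed obstacle cells
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right as (row, col) deltas


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    row, col = state[0] + action[0], state[1] + action[1]
    if not (0 <= row < SIZE and 0 <= col < SIZE):
        return state, STEP_REWARD, False      # assumed: walls block the move
    if (row, col) in OBSTACLES:
        return state, OBSTACLE_REWARD, False  # assumed: obstacles penalize and block
    if (row, col) == GOAL:
        return (row, col), GOAL_REWARD, True
    return (row, col), STEP_REWARD, False


def train(episodes=5000, alpha=0.1, gamma=0.9, epsilon=0.3, max_steps=200, seed=0):
    """Learn a Q-table with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(SIZE) for c in range(SIZE)}
    for _ in range(episodes):
        state = START
        for _ in range(max_steps):
            if rng.random() < epsilon:
                a = rng.randrange(4)                           # explore
            else:
                a = max(range(4), key=lambda i: q[state][i])   # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            # Q-learning target: immediate reward plus discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the highest-valued action from START until the goal or a step limit."""
    state, path = START, [START]
    for _ in range(max_steps):
        a = max(range(4), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path
```

Calling `greedy_path(train())` returns the list of cells the trained agent visits. In this assumed layout the shortest obstacle-free route from start to goal takes 8 moves, which gives a baseline for judging the learned path.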