Interactive Environment
Grid World
Q-Values
Agent
Goal (+10)
Obstacle (-5)
Path History
Learning Parameters
Learning Mode
Optimal Policy Mode
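The two modes above can be read as exploring while updating Q-values (Learning Mode) versus acting greedily on the learned Q-values (Optimal Policy Mode). That reading is an assumption, since the page does not show how the demo works. The following is a minimal tabular Q-learning sketch under that assumption. Only the +10 goal reward and the -5 obstacle penalty come from the legend; the 5x5 grid, obstacle positions, per-step cost, episode termination on obstacles, and the values of `ALPHA`, `GAMMA`, and `EPSILON` are all hypothetical.

```python
import random

# Hypothetical layout: 5x5 grid, start top-left, goal bottom-right.
SIZE = 5
START, GOAL = (0, 0), (4, 4)
OBSTACLES = {(1, 1), (2, 3), (3, 1)}          # assumed positions
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

GOAL_REWARD, OBSTACLE_REWARD = 10, -5  # taken from the legend
STEP_REWARD = -0.1                     # assumption: small cost per move
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2  # assumed learning parameters


def step(state, action):
    """Apply an action; moves off the grid are clipped to the border."""
    row = min(max(state[0] + action[0], 0), SIZE - 1)
    col = min(max(state[1] + action[1], 0), SIZE - 1)
    nxt = (row, col)
    if nxt == GOAL:
        return nxt, GOAL_REWARD, True
    if nxt in OBSTACLES:
        # Assumption: entering an obstacle ends the episode as a failure.
        return nxt, OBSTACLE_REWARD, True
    return nxt, STEP_REWARD, False


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection; epsilon=0 gives the purely greedy policy."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q[state]
    return values.index(max(values))


def run_episode(q, epsilon, rng, learn=True, max_steps=100):
    """Run one episode; returns (reached_goal, steps_taken)."""
    state, steps = START, 0
    for steps in range(1, max_steps + 1):
        a = choose_action(q, state, epsilon, rng)
        nxt, reward, done = step(state, ACTIONS[a])
        if learn:
            # Q-learning update: move Q(s, a) toward r + gamma * max Q(s', .)
            target = reward if done else reward + GAMMA * max(q[nxt])
            q[state][a] += ALPHA * (target - q[state][a])
        state = nxt
        if done:
            break
    return state == GOAL, steps


def train(episodes=2000, seed=0):
    """Learning mode: explore with epsilon-greedy and update the Q-table."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(SIZE) for c in range(SIZE)}
    for _ in range(episodes):
        run_episode(q, EPSILON, rng)
    return q


q_table = train()
# Optimal policy mode: no exploration, no updates.
success, steps = run_episode(q_table, epsilon=0.0, rng=random.Random(1), learn=False)
print("reached goal:", success, "steps:", steps)
```

Metrics such as success rate and average steps could be derived by collecting the `(reached_goal, steps_taken)` pairs that `run_episode` returns across episodes.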
Performance Metrics
Episodes: 0
Current Steps: 0
Success Rate: 0%
Avg. Steps: 0
Learning Progress
[Chart: Steps per Episode]
Leaderboard - Best Paths
| Rank | Episode | Steps | Reward |
|---|---|---|---|

No successful episodes yet.