Classic Snake with Deep Q-Learning
0
0
0
0
The AI sees the entire grid and gets distance-based rewards for approaching apples.
128-layer convolutional network processes spatial relationships efficiently.
+10 for apples, -5 for berries, +distance rewards, -10 for death.