Shortest path detectionPython(ML)

Reinforcement Learning

tomatoes

Abstact

The task here is teaching Robots how to deliver packages across a perilous environment. Our Robots are not very smart and they know very little about the environment they occupy. The task is to implement the Value Iteration (Part 1) and Q-Learning (Part 2) algorithms such that they allow our Robots to deliver packages optimally.