March 2017 – David Quail

Over the last 4 months, I’ve completed several different projects using Reinforcement learning on a dynamixel servo providing real time continuous sensorimotor data to the learning algorithms. In particular, I’ve experimented with creating, running, and measuring thousands of GVF demons making predictions in parallel, policy gradient actor critic methods, and pavlovian control.

I’ve attempted to document what I learned as best as I could. All code and experimental writeups can be found on my github page at:

David Quail

Random thoughts and rants

Month: March 2017

Real time Reinforcement learning examples