Skip to content

proxymallick/EM-Guided-Policy-Search

Repository files navigation

EM-based Optimal Control and Learning

Sample efficient model based reinforcement learning with Maximum-likelihood

Guided policy search algorithms have proved to work with incredible accuracy for not only controlling a complicated dynamical system, but also learning optimal policies for different unseen instances. In most of algorithms which have been proposed for policy search and learning, one assumes the true nature of the states of the problem. This approach deals with a stastical trajectory optimization procedure for unknown systems and extends it towards learning policies (optimal) which has less noise in them because of the lower variance in the optimal trajectories.

  • Contributions :
    • Robust numerical implementation of EM which has been extensively used in system identification and extend it towards learning and generalization of optimal control for unseen initial conditions.
    • Efficient exploitation of the highly uncertain explored state space, which is analytically quantified from the theoretical analysis of the covariance matrix.
    • Promising sample efficiency and good success of generalization from multiple testing instances

example GIF

Related Papers

  • Paper 1 - P. Mallick, Z. Chen, and M. Zamani.
  • Paper 2 - P. Mallick and Z. Chen

Releases

No releases published

Packages

No packages published

Languages