This is my solution for the "Proximal Policy Optimization" lesson, one of the lessons in the Udacity Deep Reinforcement Learning Nanodegree. The original source code is not published in the Udacity Nanodegree repository; it is disclosed only to students.
For details, please check out the notebooks.