Reinforcement learning for pole balancing task. Simulation uses OpenCV to make animation of moving pole and to plot the best action for every state.