Regulating Reward Training by Means of Certainty Prediction in a Neural Network-Implemented Pong Game
We present the first reinforcement-learning model that self-improves its reward-modulated training through a continuously improving 'intuition' neural network. An agent was trained to play the arcade video game Pong with two reward-based