Stability analysis of pRAM reinforcement learning

Paulo J. L. Adeodato, John G. Taylor

Generalisation has been a major issue in RAM-based neural networks. In pRAM networks, generalisation is produced by noisy reinforcement learning, a fully hardware-implementable (built-in) algorithm. This paper presents the first part of a modular technique for analysing the formation of basins of attraction in such systems. It proves that reinforcement learning at a single pRAM site is a globally stable system in the continuous limit of incremental learning. It also shows how the stable state depends on the penalty/reward ratio and on the learning rate. The evolution of learning in the time domain shows how the initial state and the moment at which learning is halted affect the final state. The paper ends with considerations on how noise contributes to the formation of basins of attraction in pRAM neurons.
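To make the quantities named above concrete, the following is a minimal sketch of incremental reward/penalty learning at a single pRAM site. The abstract does not state the update rule, so the form used here, Δα = ρ[(a − α)r + λ(ā − α)p], is an assumption, as are the function name `train_pram_site`, the deterministic toy environment (the output is rewarded when it equals a fixed target and penalised otherwise), and the omission of the noise term. Here α is the stored firing probability, ρ the learning rate and λ the penalty/reward ratio.

```python
import random


def train_pram_site(alpha0, rho, lam, steps, target=1, seed=0):
    """Simulate one pRAM site under an ASSUMED reward/penalty rule.

    alpha0 : initial firing probability stored at the site, in [0, 1]
    rho    : learning rate (keep rho <= 1 and rho * lam <= 1)
    lam    : penalty/reward ratio
    steps  : number of incremental updates
    target : output that the hypothetical environment rewards (0 or 1)
    Returns the list of alpha values, starting with alpha0.
    """
    rng = random.Random(seed)
    alpha = alpha0
    trajectory = [alpha]
    for _ in range(steps):
        # The site fires (a = 1) with probability alpha.
        a = 1 if rng.random() < alpha else 0
        # Hypothetical environment: reward on a match, penalty otherwise.
        r = 1 if a == target else 0
        p = 1 - r
        # Assumed rule: reward pulls alpha towards the emitted output a,
        # penalty pulls it (scaled by lam) towards the opposite output.
        alpha += rho * ((a - alpha) * r + lam * ((1 - a) - alpha) * p)
        trajectory.append(alpha)
    return trajectory


if __name__ == "__main__":
    traj = train_pram_site(alpha0=0.5, rho=0.05, lam=0.1, steps=5000)
    print(traj[0], traj[len(traj) // 2], traj[-1])
```

In this toy setting with target 1, the expected change per step is ρ(1 − α)[α + λ(1 − α)], which is non-negative on [0, 1], so α drifts towards 1 from any starting point when λ > 0. Stopping the loop early leaves α at an intermediate value that depends on `alpha0`, which illustrates, under these assumptions only, the role of the initial state and of the halting moment.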

