Control of a nonlinear non affine discrete system using neural networks and online training with reinforcement learning methods