Reinforcement Learning with RNNs
Forward model (Werbos, Jordan & Rumelhart, Nguyen & Widrow)
Train model, freeze it, use it to compute gradient for controller
Recurrent Controller & Model (Schmidhuber 1990)
Back to J. Schmidhuber's Recurrent neural network page