BPTT never reached satisfactory solution
LSTM learned perfect solution in 2 out of 10 runs (after 6,250,000 it.). In 8 runs the pole balances in both modes for hundreds or thousands of timesteps (after 8,095,000 it.).
Internal state evolution of memory cells after learning
Back to J. Schmidhuber's Recurrent neural network page