1980s: BPTT, RTRL - gradients based on “unfolding” etc. (Williams, Werbos, Robinson)
Previous slide
Next slide
Back to first slide
View graphic version
Back to
J. Schmidhuber
's
Recurrent neural network page