Gauss-Newton Based Learning for Fully Recurrent Neural Networks

Aniket Vartak
4 comments

The thesis discusses a novel off-line and on-line learning approach for Fully Recurrent Neural Networks (FRNNs). The most popular algorithm for training FRNNs, the Real Time Recurrent Learning (RTRL) algorithm, employs the gradient descent technique for finding the optimum weight vectors in the recurrent neural network. Within the framework of the research presented, a new off-line and on-line variation of RTRL is presented, that is based on the Gauss-Newton method. The method itself is an approximate Newton’s method tailored to the specific optimization problem, (non-linear least squares), which aims to speed up the process of FRNN training. The new approach stands as a robust and effective compromise between the original gradient-based RTRL (low computational complexity, slow convergence) and Newton-based variants of RTRL (high computational complexity, fast convergence). By gathering information over time in order to form Gauss-Newton search vectors, the new learning algorithm, GN-RTRL, is capable of converging faster to a better quality solution than the original algorithm. Experimental results reflect these qualities of GN-RTRL, as well as the fact that GN-RTRL may have in practice lower computational cost in comparison, again, to the original RTRL.



Comments:

ankaa13
Said:
great
4 years ago
0
Reply
Sorry, you need javascript enabled to post any comments.
Manjeet_s1
Said:
Wow
4 years ago
0
Reply
Sorry, you need javascript enabled to post any comments.
sidekick
Said:
I haven't gone through the document yet, and I've a quick question; The document was published in 2001, How relevant it is today, specially for ARM Cortex series SOCs?
3 years ago
+1
Reply
Sorry, you need javascript enabled to post any comments.
dskokic
Said:
thx
3 years ago
0
Reply
Sorry, you need javascript enabled to post any comments.
Sorry, you need javascript enabled to post any comments.