Recurrent Neural Network

Feb 21, 2025

tags: Neural Network

See other extensions: LSTM, GRU, (Chandar et al. 2019), (Goudreau et al. 1994), (Sutskever, Martens, and Hinton 2011), (Cho et al. 2014)

Getting Started

Some basic RNN resources. Here is some stuff to get you started: Really early RNN Work:

LSTMs:

Some other useful things:

(review on more modern learning techniques) https://ieeexplore.ieee.org/abstract/document/6639349
(On the difficulty of training) http://proceedings.mlr.press/v28/pascanu13.pdf
(overview on some basic RNN algorithms) https://pdfs.semanticscholar.org/cccd/3fd7a45e7643f26391bd539ffbede0690f36.pdf
(Colah’s blog) https://colah.github.io/posts/2015-08-Understanding-LSTMs/

Some ideas that I believe could use some more research:

(Echo State Netwoks) http://www.scholarpedia.org/article/Echo_state_network
(Bi Directional LSTMs/RNNs) https://ieeexplore.ieee.org/document/4288069

Chandar, Sarath, Chinnadhurai Sankar, Eugene Vorontsov, Samira Ebrahimi Kahou, and Yoshua Bengio. 2019. “Towards Non-Saturating Recurrent Units for Modelling Long-Term Dependencies.” In Proceedings of the AAAI Conference on Artificial Intelligence, 33:3280–87.

Cho, Kyunghyun, Bart van Merriënboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014. “On the Properties of Neural Machine Translation: Encoder–Decoder Approaches.” In Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, 103–11. Association for Computational Linguistics. doi:10.3115/v1/W14-4012.

Goudreau, M.W., C.L. Giles, S.T. Chakradhar, and D. Chen. 1994. “First-Order versus Second-Order Single-Layer Recurrent Neural Networks.” IEEE Transactions on Neural Networks 5 (3): 511–13. doi:10.1109/72.286928.

Sutskever, Ilya, James Martens, and Geoffrey Hinton. 2011. “Generating Text with Recurrent Neural Networks.” In Proceedings of the 28th International Conference on Machine Learning, 28:1017–24. PMLR.

Matthew Schlegel

Recurrent Neural Network

Getting Started

Links to this note: