So far we have encountered two types of data: generic vectors and images. For the latter we designed specialized layers to take advantage of their regularity. In other words, if we were to permute the pixels in an image, it would be much more difficult to reason about its content; the result would look much like the background of a test pattern from the days of analog TV.
Most importantly, so far we have tacitly assumed that our data is generated i.i.d., i.e., independently and identically drawn from some distribution. Unfortunately, this is not true for most data. For instance, the words in this paragraph are written in sequence, and it would be quite difficult to decipher its meaning if they were permuted randomly. Likewise, image frames in a video, the audio signal in a conversation, and the browsing behavior on a website all follow a sequential order. It is thus only reasonable to assume that specialized models for such data will do better at describing it and at solving estimation problems.
Another issue arises from the fact that we might not only receive a sequence as an input but also be expected to continue the sequence. For instance, the task could be to continue the series 2, 4, 6, 8, 10, … This is quite common in time series analysis: predicting the stock market, the fever curve of a patient, or the acceleration needed for a race car. Again, we want models that can handle such data.
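To make the "continue the series" task concrete, here is a minimal sketch (not the chapter's method) that frames it as autoregressive prediction: each value is predicted from the previous `tau` observations via a least-squares fit. The window size, the linear model, and the toy series are illustrative assumptions.

```python
import numpy as np

# Toy series 2, 4, 6, ..., 20; predict the next element from lagged values.
series = np.arange(2, 22, 2).astype(float)
tau = 2  # number of lagged inputs (illustrative choice)

# Build (features, label) pairs: (x_{t-2}, x_{t-1}) -> x_t
X = np.stack([series[i:i + tau] for i in range(len(series) - tau)])
y = series[tau:]

# Least-squares fit of y ≈ X w + b (bias absorbed via a column of ones)
A = np.hstack([X, np.ones((len(X), 1))])
w = np.linalg.lstsq(A, y, rcond=None)[0]

# Continue the series: predict the element after the last observed one
last = np.append(series[-tau:], 1.0)
print(last @ w)  # approximately 22.0
```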
In short, while convolutional neural networks can efficiently process spatial information, recurrent neural networks are designed to better handle sequential information. Recurrent neural networks introduce state variables to store past information which, together with the current input, determines the current output.
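The following sketch illustrates the state update just described, using plain NumPy with illustrative shapes and randomly initialized weights: the hidden state $\mathbf{h}_t$ summarizes past inputs and is combined with the current input $\mathbf{x}_t$ to produce the output $\mathbf{o}_t$. The variable names and dimensions are assumptions for exposition, not the implementations used later in the chapter.

```python
import numpy as np

# One recurrent step:
#   h_t = tanh(x_t @ W_xh + h_{t-1} @ W_hh + b_h)
#   o_t = h_t @ W_hq + b_q
num_inputs, num_hiddens, num_outputs = 4, 8, 3
rng = np.random.default_rng(0)
W_xh = rng.normal(scale=0.1, size=(num_inputs, num_hiddens))
W_hh = rng.normal(scale=0.1, size=(num_hiddens, num_hiddens))
b_h = np.zeros(num_hiddens)
W_hq = rng.normal(scale=0.1, size=(num_hiddens, num_outputs))
b_q = np.zeros(num_outputs)

def rnn_step(x_t, h_prev):
    """Compute the new hidden state and output for a single time step."""
    h_t = np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)
    o_t = h_t @ W_hq + b_q
    return h_t, o_t

# Process a sequence of 5 time steps, carrying the state forward.
h = np.zeros(num_hiddens)
for t in range(5):
    x_t = rng.normal(size=num_inputs)
    h, o = rnn_step(x_t, h)
print(o.shape)  # (3,)
```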
Many of the examples for using recurrent networks are based on text data. Hence, we will emphasize language models in this chapter. After a more formal review of sequence data, we discuss basic concepts of language models and use this discussion as the inspiration for the design of recurrent neural networks. Next, we describe the gradient calculation method in recurrent neural networks to explore problems that may be encountered during their training. Some of these problems can be addressed by gated recurrent neural networks, such as LSTMs and GRUs, described later in this chapter.
- Sequence Models
- Language Models
- Recurrent Neural Networks
- Text Preprocessing
- Implementation of Recurrent Neural Networks from Scratch
- Concise Implementation of Recurrent Neural Networks
- Backpropagation Through Time
- Gated Recurrent Units (GRU)
- Deep Recurrent Neural Networks
- Bidirectional Recurrent Neural Networks
- Machine Translation and Datasets
- Encoder-Decoder Architecture
- Sequence to Sequence
- Beam Search