Skip to content

TheanoLM is a recurrent neural network language modeling tool implemented using Theano

License

Notifications You must be signed in to change notification settings

sgangireddy/theanolm

 
 

Repository files navigation

TheanoLM

Introduction

TheanoLM is a recurrent neural network language modeling tool implemented using the Python library Theano. Theano allows the user to customize and extend the neural network very conveniently, still generating highly efficient code that can utilize multiple GPUs or CPUs for parallel computation. TheanoLM allows the user to specify an arbitrary network architecture. New layer types and optimization methods can be easily implemented.

Implementations of some of the currently popular layer types such as long short-term memory (LSTM), gated recurrent units (GRU), and highway networks are provided, as well as Stochastic Gradient Descent (SGD), RMSProp, AdaGrad, ADADELTA, and Adam optimizers. In addition to the standard cross-entropy cost, one can use sampling based noise-contrastive estimation or BlackOut. TheanoLM can be used for rescoring n-best lists, decoding word lattices, or generating text.

About the project

TheanoLM is open source and licensed under the Apache License, Version 2.0. The source code is available on GitHub. Documentation can be read online on Read the Docs.

Author

About

TheanoLM is a recurrent neural network language modeling tool implemented using Theano

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 93.0%
  • Shell 7.0%