This repository is used for a language modelling Pareto competition at TTIC. The two objectives are (time ratio, perplexity), where time ratio = training time / base model training time. (The base model is trained with default parameters on a single CPU and takes roughly 1 hour.)
Note: your training time must be measured on a single CPU.
I implemented a sampled softmax output layer on top of the original RNN language model. In addition, main.py includes support for pre-trained GloVe word embeddings of size 200 and 300. The model is also trained with the Adagrad optimizer plus L2 weight decay.
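For illustration, here is a minimal sketch of the sampled-softmax idea (not the exact code in main.py; the sampling-correction terms are omitted): each target word is scored only against a small set of randomly sampled negative words instead of the whole vocabulary.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SampledSoftmaxLoss(nn.Module):
    """Simplified sampled softmax: compare each target word against a small
    set of uniformly sampled negatives instead of the full vocabulary."""

    def __init__(self, hidden_size, vocab_size, n_sampled=100):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(vocab_size, hidden_size) * 0.05)
        self.bias = nn.Parameter(torch.zeros(vocab_size))
        self.vocab_size = vocab_size
        self.n_sampled = n_sampled

    def forward(self, hidden, targets):
        # hidden: (batch, hidden_size), targets: (batch,)
        negatives = torch.randint(0, self.vocab_size, (self.n_sampled,),
                                  device=hidden.device)
        classes = torch.cat([targets, negatives])          # candidate classes
        logits = hidden @ self.weight[classes].t() + self.bias[classes]
        # The true class for row i sits in column i of the candidate set.
        labels = torch.arange(targets.size(0), device=hidden.device)
        return F.cross_entropy(logits, labels)
```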
This codebase requires Python 3.5 and PyTorch.
Please download the GloVe embeddings from word2vec-api, or download them directly: Wikipedia+Gigaword 5.
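Exactly how main.py reads these files may differ, but loading GloVe text vectors into a PyTorch embedding layer generally looks like the sketch below (glove_path and word2idx are placeholder names for the downloaded file and the corpus vocabulary).

```python
import numpy as np
import torch
import torch.nn as nn

def load_glove_embeddings(glove_path, word2idx, emsize=300):
    """Build an embedding matrix from a GloVe text file; words missing
    from GloVe keep a small random vector."""
    weights = np.random.uniform(-0.1, 0.1, (len(word2idx), emsize)).astype("float32")
    with open(glove_path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split(" ")
            word, values = parts[0], parts[1:]
            if word in word2idx and len(values) == emsize:
                weights[word2idx[word]] = np.asarray(values, dtype="float32")
    return nn.Embedding.from_pretrained(torch.from_numpy(weights), freeze=False)
```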
python main.py --soft --adagrad --lr 0.01 # Train an LSTM on PTB with sampled softmax and the Adagrad optimizer (lr = 0.01); see the optimizer sketch below
python main.py --pre --emsize 300 # Train an LSTM on PTB with pre-trained embeddings of size 300
python generate.py # Generate samples from the trained LSTM model.
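For reference, the setup implied by the --adagrad --lr 0.01 flags together with the L2 weight decay mentioned above amounts to roughly the following (the decay coefficient and model sizes here are only illustrative values, not the repository's defaults).

```python
import torch
import torch.nn as nn

# Stand-in model; in the repository this would be the LSTM language model.
model = nn.LSTM(input_size=200, hidden_size=200, num_layers=2)

# Adagrad with L2 regularisation via weight_decay (1e-5 is an assumed value).
optimizer = torch.optim.Adagrad(model.parameters(), lr=0.01, weight_decay=1e-5)
```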
This repository contains code originally forked from the Word-level language modeling RNN example, modified to add an attention layer to the model.
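The exact attention variant added in this fork is not described here; as a rough, hypothetical sketch, a dot-product-style attention in which the current hidden state attends over earlier LSTM outputs could look like this:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Attention(nn.Module):
    """Hypothetical sketch: the current hidden state (query) attends over a
    window of earlier LSTM outputs (memory) via a bilinear score."""

    def __init__(self, hidden_size):
        super().__init__()
        self.score = nn.Linear(hidden_size, hidden_size, bias=False)

    def forward(self, query, memory):
        # query: (batch, hidden), memory: (batch, seq_len, hidden)
        scores = torch.bmm(memory, self.score(query).unsqueeze(2)).squeeze(2)
        weights = F.softmax(scores, dim=1)                    # (batch, seq_len)
        context = torch.bmm(weights.unsqueeze(1), memory).squeeze(1)
        return context, weights
```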