Implementation of a language model for contextual Chinese stroke embeddings with PyTorch

This repository contains a PyTorch implementation of the sequence model presented in the paper "Contextual String Embeddings for Sequence Labeling" by Alan Akbik et al. (2018).

source code: Flair
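As a rough illustration of the idea from the paper, a contextual embedding is read off the hidden states of a character-level (here, stroke-level) language model. The sketch below is illustrative only: the class and variable names are hypothetical and the dimensions are arbitrary, not the actual classes in this repository (which live under `pyLM/model`).

```python
import torch
import torch.nn as nn

class StrokeLM(nn.Module):
    """Hypothetical stroke-level LM sketch in the spirit of Akbik et al. (2018);
    names and sizes are illustrative, not this repository's actual code."""
    def __init__(self, vocab_size, emb_dim=64, hidden_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.decoder = nn.Linear(hidden_dim, vocab_size)

    def forward(self, stroke_ids):
        h, _ = self.lstm(self.embed(stroke_ids))   # (batch, seq_len, hidden_dim)
        return self.decoder(h), h                  # next-stroke logits + hidden states

# A word's contextual embedding is taken from the hidden state at its boundary:
# a forward LM contributes the state after the word's last stroke (a backward LM
# would contribute the state before its first stroke).
model = StrokeLM(vocab_size=100)
ids = torch.randint(0, 100, (1, 20))   # one sequence of 20 stroke ids
logits, states = model(ids)
word_end = 7                           # position of a word's last stroke (example)
embedding = states[0, word_end]        # (hidden_dim,) contextual vector
```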
At the root of the project, you will see:
├── pyLM
│   ├── callback
│   │   ├── lrscheduler.py
│   │   ├── trainingmonitor.py
│   │   └── ...
│   ├── config
│   │   └── basic_config.py  # configuration file for storing model parameters
│   ├── dataset
│   ├── io
│   │   ├── dataset.py
│   │   └── data_transformer.py
│   ├── model
│   │   ├── nn
│   │   └── layers
│   ├── output  # saves the output of the model
│   ├── preprocessing  # text preprocessing
│   ├── train  # used for training a model
│   │   ├── trainer.py
│   │   └── ...
│   ├── test
│   │   └── embedding.py
│   └── utils  # a set of utility functions
├── obtain_word_embedding.py
└── train_stroke_lm.py
Dependencies:

- csv
- tqdm
- numpy
- pickle
- scikit-learn
- PyTorch 1.0
- matplotlib
- pandas
How to use:

1. Prepare the data: you can modify `io.data_transformer.py` to adapt it to your data.
2. Modify the configuration information in `pyLM/config/basic_config.py` (the path of the data, ...).
3. Run `train_stroke_lm.py` to train the language model.
4. Run `obtain_word_embedding.py` to obtain the word embeddings.
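Conceptually, the training script optimizes a next-stroke prediction objective. The fragment below is a minimal sketch of that training step, not the repository's actual loop (which lives in `pyLM/train/trainer.py`); all sizes and names here are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Illustrative next-stroke prediction step (assumed setup, arbitrary sizes).
vocab_size, batch, seq_len = 50, 4, 12
embed = nn.Embedding(vocab_size, 32)
lm = nn.LSTM(input_size=32, hidden_size=64, batch_first=True)
head = nn.Linear(64, vocab_size)
criterion = nn.CrossEntropyLoss()

ids = torch.randint(0, vocab_size, (batch, seq_len))
inputs, targets = ids[:, :-1], ids[:, 1:]    # predict each stroke from its left context
hidden, _ = lm(embed(inputs))
logits = head(hidden)                        # (batch, seq_len - 1, vocab_size)
loss = criterion(logits.reshape(-1, vocab_size), targets.reshape(-1))
loss.backward()                              # gradients for an optimizer step
```

After training, `obtain_word_embedding.py` would then read word vectors off the trained model's hidden states, as described in the paper.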