Chinese-Text-Classification-Pytorch-Tuning

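Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer, built on PyTorch and ready to use out of the box.

Bert is now supported as well.

Hyperparameter optimization for each model is implemented with ray.tune and is simple to use.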

Environment

python 3.7
pytorch 1.1
tqdm
sklearn
tensorboardX
ray

Usage

Step 1: Install ray - pip install ray

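Step 2: Choose the model whose hyperparameters you want to optimize, e.g. TextRNN
(Bert requires downloading additional files as described in the linked instructions; skip this if you are not using Bert.)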

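Step 3: For the model chosen in Step 2, define the search_space for its hyperparameters in run.py. See the linked reference for the exact syntax. For example: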

search_space = {
    'learning_rate': tune.loguniform(1e-5, 1e-2),
    'num_epochs': tune.randint(5, 21),
    'dropout': tune.uniform(0, 0.5),
    'hidden_size': tune.randint(32, 257),
    'num_layers': tune.randint(1, 3)
}
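To make the ranges above concrete, here is a minimal standard-library sketch of what each sampler draws. It does not use Ray, and the helper names (`sample_loguniform`, `sample_uniform`, `sample_randint`, `sample_config`) are hypothetical stand-ins, not part of run.py. It relies on two properties of the Ray Tune samplers: `tune.randint` excludes its upper bound, and `tune.loguniform` samples uniformly in log space.

```python
import math
import random


def sample_loguniform(rng, low, high):
    """Draw a value whose logarithm is uniform on [log(low), log(high)]."""
    return math.exp(rng.uniform(math.log(low), math.log(high)))


def sample_uniform(rng, low, high):
    """Draw a float uniformly between low and high."""
    return rng.uniform(low, high)


def sample_randint(rng, low, high):
    """Draw an integer from [low, high): the upper bound is excluded."""
    return rng.randrange(low, high)


def sample_config(rng):
    """One draw that mirrors the example search_space (stand-in samplers)."""
    return {
        'learning_rate': sample_loguniform(rng, 1e-5, 1e-2),
        'num_epochs': sample_randint(rng, 5, 21),     # 5..20
        'dropout': sample_uniform(rng, 0, 0.5),
        'hidden_size': sample_randint(rng, 32, 257),  # 32..256
        'num_layers': sample_randint(rng, 1, 3),      # 1 or 2
    }


rng = random.Random(0)  # fixed seed so the draws are repeatable
configs = [sample_config(rng) for _ in range(50)]
```

Because the learning rate is drawn in log space, about a third of the draws fall in each decade between 1e-5 and 1e-2, rather than almost all of them landing near 1e-2.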

Make sure every parameter in search_space applies to the selected model; otherwise an error is raised.
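One way to catch a mismatch before launching any trials is to compare the search-space keys with the attributes of the model's config object. The sketch below assumes that each key corresponds to an attribute of the same name on the config; that mapping, the helper `unknown_params`, and the two example config classes with their placeholder values are all hypothetical and are not taken from run.py.

```python
def unknown_params(search_space, model_config):
    """Return the search-space keys that are not attributes of model_config."""
    return sorted(k for k in search_space if not hasattr(model_config, k))


class ExampleRNNConfig:
    """Hypothetical stand-in for a recurrent model's config (placeholder values)."""

    def __init__(self):
        self.learning_rate = 1e-3
        self.num_epochs = 10
        self.dropout = 0.5
        self.hidden_size = 128
        self.num_layers = 2


class ExampleCNNConfig:
    """Hypothetical stand-in for a convolutional model's config (placeholder values)."""

    def __init__(self):
        self.learning_rate = 1e-3
        self.num_epochs = 20
        self.dropout = 0.5
        self.num_filters = 256


# Only the keys matter for this check, so the values are left as None.
space_keys = dict.fromkeys(
    ['learning_rate', 'num_epochs', 'dropout', 'hidden_size', 'num_layers'])

rnn_unknown = unknown_params(space_keys, ExampleRNNConfig())
cnn_unknown = unknown_params(space_keys, ExampleCNNConfig())
```

With these stand-ins, the recurrent config accepts every key, while the convolutional one reports `hidden_size` and `num_layers` as unknown.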

Step 4: Launch 50 hyperparameter optimization trials

python run.py --model TextRNN --tune_param True --tune_samples 50

Step 5: Review the trial records in the automatically generated results file tune_results_.csv
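The results file can also be inspected programmatically. The sketch below picks the row with the best value in a chosen column using only the standard library. The helper name `best_row` is hypothetical, and since the CSV's column names are not listed here, the metric name must be read from the file's header first.

```python
import csv


def best_row(path, metric, mode="max"):
    """Return the CSV row (as a dict) with the best value in column `metric`."""
    with open(path, newline="", encoding="utf-8") as f:
        # Skip rows where the metric is missing or empty (e.g. failed trials).
        rows = [r for r in csv.DictReader(f) if r.get(metric) not in (None, "")]
    if not rows:
        raise ValueError(f"no rows with a value for {metric!r}")
    pick = max if mode == "max" else min
    return pick(rows, key=lambda r: float(r[metric]))


# Example call; the column name "acc" is a placeholder, so check the CSV header:
# best = best_row("tune_results_.csv", metric="acc")
```

Use `mode="min"` for metrics where lower is better, such as a loss.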

More usage

# Use a GPU
python run.py --model TextRNN --tune_param True --tune_gpu True

# Use Bert
python run.py --model bert --tune_param True --tune_gpu True

# Set a custom suffix for the results file name
python run.py --model TextRNN --tune_param True --tune_file rnn_char

# Use the ASHA scheduler for early stopping
python run.py --model TextRNN --tune_param True --tune_asha True

# Train the model with the current hyperparameters, without hyperparameter optimization
python run.py --model TextRNN --tune_param False
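The commands above pass booleans as the literal strings True and False. With argparse, `type=bool` would not handle this, because `bool("False")` is `True`, so an explicit converter is needed. The following is a hypothetical sketch of how such flags could be parsed. It is not the parser from run.py, and the helper `str2bool` and all default values are assumptions made for illustration.

```python
import argparse


def str2bool(value):
    """Convert 'True'/'False'-style strings to bool."""
    text = str(value).strip().lower()
    if text in ("true", "1", "yes"):
        return True
    if text in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected True or False, got {value!r}")


def build_parser():
    """Build a parser for the flags shown above (defaults are assumed)."""
    parser = argparse.ArgumentParser(description="Sketch of the tuning flags")
    parser.add_argument("--model", type=str, required=True)
    parser.add_argument("--tune_param", type=str2bool, default=False)
    parser.add_argument("--tune_samples", type=int, default=1)
    parser.add_argument("--tune_gpu", type=str2bool, default=False)
    parser.add_argument("--tune_file", type=str, default="")
    parser.add_argument("--tune_asha", type=str2bool, default=False)
    return parser


args = build_parser().parse_args(["--model", "TextRNN", "--tune_param", "False"])
```

Here `args.tune_param` is the boolean `False`, not the truthy string `"False"`.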

For more details, see the original documentation.
