BERT-BiLSMT-CRF-NER

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning

使用谷歌的BERT模型在BLSTM-CRF模型上进行预训练用于中文命名实体识别的Tensorflow代码'

Welcome to star this repository!

The Chinese training data($PATH/NERdata/) come from:https://github.com/zjy-ucas/ChineseNER

The CoNLL-2003 data($PATH/NERdata/ori/) come from:https://github.com/kyzhouhzau/BERT-NER

The evaluation codes come from:https://github.com/guillaumegenthial/tf_metrics/blob/master/tf_metrics/__init__.py

Try to implement NER work based on google's BERT code and BiLSTM-CRF network!

add crf only model

Just alter bert_lstm_crf.py line 450, the params of the function of add_blstm_crf_layer: crf_only=True or False

ONLY CRF output layer:

    blstm_crf = BLSTM_CRF(embedded_chars=embedding, hidden_unit=FLAGS.lstm_size, cell_type=FLAGS.cell, num_layers=FLAGS.num_layers,
                          dropout_rate=FLAGS.droupout_rate, initializers=initializers, num_labels=num_labels,
                          seq_length=max_seq_length, labels=labels, lengths=lengths, is_training=is_training)
    rst = blstm_crf.add_blstm_crf_layer(crf_only=True)

BiLSTM with CRF output layer

    blstm_crf = BLSTM_CRF(embedded_chars=embedding, hidden_unit=FLAGS.lstm_size, cell_type=FLAGS.cell, num_layers=FLAGS.num_layers,
                          dropout_rate=FLAGS.droupout_rate, initializers=initializers, num_labels=num_labels,
                          seq_length=max_seq_length, labels=labels, lengths=lengths, is_training=is_training)
    rst = blstm_crf.add_blstm_crf_layer(crf_only=False)

How to train

using config param in terminal

  python3 bert_lstm_ner.py   \
                  --task_name="NER"  \ 
                  --do_train=True   \
                  --do_eval=True   \
                  --do_predict=True
                  --data_dir=NERdata   \
                  --vocab_file=checkpoint/vocab.txt  \ 
                  --bert_config_file=checkpoint/bert_config.json \  
                  --init_checkpoint=checkpoint/bert_model.ckpt   \
                  --max_seq_length=128   \
                  --train_batch_size=32   \
                  --learning_rate=2e-5   \
                  --num_train_epochs=3.0   \
                  --output_dir=./output/result_dir/

OR replace the BERT path and project path in bert_lstm_ner.py.py

if os.name == 'nt':
   bert_path = '{your BERT model path}'
   root_path = '{project path}'
else:
   bert_path = '{your BERT model path}'
   root_path = '{project path}'

result:

all params using default

In dev data set:

In test data set

entity leval result:

last two result are label level result, the entitly level result in code of line 796-798,this result will be output in predict process. show my entitl level result :

reference:

The evaluation codes come from:https://github.com/guillaumegenthial/tf_metrics/blob/master/tf_metrics/__init__.py
https://github.com/google-research/bert
https://github.com/kyzhouhzau/BERT-NER
https://github.com/zjy-ucas/ChineseNER

Any problem please email me([email protected])

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
NERdata		NERdata
bert		bert
03E18A6A9C16082CF22A9E8837F7E35F.png		03E18A6A9C16082CF22A9E8837F7E35F.png
README.md		README.md
bert_lstm_ner.py		bert_lstm_ner.py
conlleval.pl		conlleval.pl
conlleval.py		conlleval.py
lstm_crf_layer.py		lstm_crf_layer.py
picture1.png		picture1.png
picture2.png		picture2.png
tf_metrics.py		tf_metrics.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BERT-BiLSMT-CRF-NER

add crf only model

How to train

using config param in terminal

OR replace the BERT path and project path in bert_lstm_ner.py.py

result:

In dev data set:

In test data set

entity leval result:

reference:

About

Releases

Packages

Languages

iceelor/BERT-BiLSTM-CRF-NER

Folders and files

Latest commit

History

Repository files navigation

BERT-BiLSMT-CRF-NER

add crf only model

How to train

using config param in terminal

OR replace the BERT path and project path in bert_lstm_ner.py.py

result:

In dev data set:

In test data set

entity leval result:

reference:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages