
BERT-BiLSTM-CRF-NER

TensorFlow solution for the NER task, using a BiLSTM-CRF model with Google BERT fine-tuning

How to train

1. Download the BERT Chinese model:

wget https://storage.googleapis.com/bert_models/2018_11_03/chinese_L-12_H-768_A-12.zip  
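After unzipping, the flags in step 3 expect the model files to sit in a checkpoint/ directory. A quick sanity check (a minimal sketch, assuming you copied the unzipped chinese_L-12_H-768_A-12 contents into checkpoint/):

    import os

    # Sanity check: step 3 below points --vocab_file, --bert_config_file and
    # --init_checkpoint at a "checkpoint" directory, so the unzipped files are
    # assumed to have been copied there.
    bert_dir = 'checkpoint'
    for name in ('vocab.txt', 'bert_config.json', 'bert_model.ckpt.index'):
        path = os.path.join(bert_dir, name)
        print(path, 'found' if os.path.exists(path) else 'MISSING')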

2. Create the output directory

Create the output path inside the project directory:

mkdir output

3. Train model

1) Direct training:
  python3 bert_lstm_ner.py   \
                  --task_name="NER"   \
                  --do_train=True   \
                  --do_eval=True   \
                  --do_predict=True   \
                  --data_dir=NERdata   \
                  --vocab_file=checkpoint/vocab.txt   \
                  --bert_config_file=checkpoint/bert_config.json   \
                  --init_checkpoint=checkpoint/bert_model.ckpt   \
                  --max_seq_length=128   \
                  --train_batch_size=32   \
                  --learning_rate=2e-5   \
                  --num_train_epochs=3.0   \
                  --output_dir=./output/result_dir/
2) Or set the BERT model path and project path directly in bert_lstm_ner.py:

if os.name == 'nt':  # Windows path config
    bert_path = '{your BERT model path}'
    root_path = '{project path}'
else:  # Linux path config
    bert_path = '{your BERT model path}'
    root_path = '{project path}'

Then run:

python3 bert_lstm_ner.py
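The files under NERdata are assumed to be in the usual CoNLL-style layout: one character and its BIO tag per line, separated by whitespace, with a blank line between sentences. A minimal reader sketch under that assumption (read_conll_file is an illustrative helper name, not part of the repo):

    import codecs

    def read_conll_file(path):
        """Read a CoNLL-style file: one "char label" pair per line,
        blank lines separating sentences (assumed NERdata layout)."""
        sentences, chars, labels = [], [], []
        with codecs.open(path, 'r', encoding='utf-8') as f:
            for line in f:
                line = line.strip()
                if not line:                      # sentence boundary
                    if chars:
                        sentences.append((chars, labels))
                        chars, labels = [], []
                    continue
                parts = line.split()
                chars.append(parts[0])
                labels.append(parts[-1])
        if chars:                                 # flush the last sentence
            sentences.append((chars, labels))
        return sentences

    # e.g. sentences = read_conll_file('NERdata/train.txt')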

USING BiLSTM-CRF OR CRF-ONLY DECODING

Just change the crf_only parameter of the add_blstm_crf_layer call around line 450 of bert_lstm_ner.py: crf_only=True or crf_only=False.

CRF-only output layer:

    blstm_crf = BLSTM_CRF(embedded_chars=embedding, hidden_unit=FLAGS.lstm_size, cell_type=FLAGS.cell, num_layers=FLAGS.num_layers,
                          dropout_rate=FLAGS.droupout_rate, initializers=initializers, num_labels=num_labels,
                          seq_length=max_seq_length, labels=labels, lengths=lengths, is_training=is_training)
    rst = blstm_crf.add_blstm_crf_layer(crf_only=True)

BiLSTM with CRF output layer:

    blstm_crf = BLSTM_CRF(embedded_chars=embedding, hidden_unit=FLAGS.lstm_size, cell_type=FLAGS.cell, num_layers=FLAGS.num_layers,
                          dropout_rate=FLAGS.droupout_rate, initializers=initializers, num_labels=num_labels,
                          seq_length=max_seq_length, labels=labels, lengths=lengths, is_training=is_training)
    rst = blstm_crf.add_blstm_crf_layer(crf_only=False)

Result:

All parameters use their default values.

On the dev set:

On the test set:

Entity-level result:

The first two results are label-level results; the entity-level result (computed at lines 796-798 of the code) is printed during the predict step.
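For context, entity-level scoring counts a prediction as correct only when the whole span and its type match exactly, rather than scoring each tag on its own. A small illustration of the difference (just a sketch, not the evaluation code used by the repo):

    def extract_entities(tags):
        """Collect (start, end, type) spans from a BIO tag sequence."""
        entities, start, etype = [], None, None
        for i, tag in enumerate(tags + ['O']):    # trailing 'O' flushes the last span
            boundary = (tag == 'O' or tag.startswith('B-')
                        or (tag.startswith('I-') and etype != tag[2:]))
            if boundary and start is not None:
                entities.append((start, i, etype))
                start, etype = None, None
            if tag.startswith('B-'):
                start, etype = i, tag[2:]
        return set(entities)

    gold = ['B-PER', 'I-PER', 'O', 'B-LOC']
    pred = ['B-PER', 'I-PER', 'O', 'B-ORG']
    matched = extract_entities(gold) & extract_entities(pred)
    # 3 of 4 tags are right, but only 1 of 2 entities matches exactly
    print(len(matched), 'of', len(extract_entities(gold)), 'entities matched')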

ONLINE PREDICT

Once the model has finished training, just run:

python3 terminal_predict.py

Using your own data to train

If you want to train the NER model on your own data, you just need to modify the get_labels function.

def get_labels(self):
       return ["O", "B-PER", "I-PER", "B-ORG", "I-ORG", "B-LOC", "I-LOC", "X", "[CLS]", "[SEP]"]

NOTE: "X", "[CLS]", and "[SEP]" are required; keep them and replace the other labels in the returned list with the labels used in your data.
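These labels are needed because of BERT's input format: every sequence is wrapped in [CLS] and [SEP], and when WordPiece tokenization splits a word, only the first piece keeps the real tag while the remaining pieces are tagged "X". A hypothetical alignment, just to show the idea:

    # Hypothetical token/label alignment after BERT tokenization
    # (English tokens used only for readability).
    tokens = ['[CLS]', 'Jim',   'Hen',   '##son', 'works', '[SEP]']
    labels = ['[CLS]', 'B-PER', 'I-PER', 'X',     'O',     '[SEP]']
    for token, label in zip(tokens, labels):
        print('%-8s -> %s' % (token, label))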
Alternatively, you can use the code below to let the program collect the labels from the training data automatically:


def get_labels(self):
        # Reading the labels from the train file in this way carries some risk.
        if os.path.exists(os.path.join(FLAGS.output_dir, 'label_list.pkl')):
            with codecs.open(os.path.join(FLAGS.output_dir, 'label_list.pkl'), 'rb') as rf:
                self.labels = pickle.load(rf)
        else:
            if len(self.labels) > 0:
                self.labels = self.labels.union(set(["X", "[CLS]", "[SEP]"]))
                with codecs.open(os.path.join(FLAGS.output_dir, 'label_list.pkl'), 'wb') as rf:
                    pickle.dump(self.labels, rf)
            else:
                self.labels = ["O", 'B-TIM', 'I-TIM', "B-PER", "I-PER", "B-ORG", "I-ORG", "B-LOC", "I-LOC", "X", "[CLS]", "[SEP]"]
        return self.labels
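The branch above that calls self.labels.union(...) assumes self.labels was already filled with every tag seen while the training file was read. A minimal sketch of that collection step (collect_labels is an illustrative name, not the repo's actual method):

    def collect_labels(sentences):
        """Gather every tag that appears in the training data.
        `sentences` is a list of (chars, tags) pairs, e.g. from a
        CoNLL-style reader like the one sketched earlier."""
        labels = set()
        for _, tags in sentences:
            labels.update(tags)
        return labels

    # e.g. processor.labels = collect_labels(read_conll_file('NERdata/train.txt'))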

NEW UPDATE

2019.1.9: Added code to remove the Adam-related parameters from the model, reducing the model file size from 1.3 GB to 400 MB (a rough sketch of the idea follows below).

2019.1.3: Added online predict code.
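The 2019.1.9 change works by dropping the optimizer's adam_m / adam_v slot variables from the checkpoint, since they are only needed during training. A minimal TF 1.x sketch of that general idea (not the repo's exact script):

    import tensorflow as tf  # TF 1.x API

    def strip_adam_vars(ckpt_in, ckpt_out):
        """Re-save a checkpoint without the Adam slot variables."""
        with tf.Session() as sess:
            kept = []
            for name, _ in tf.train.list_variables(ckpt_in):
                if 'adam_m' in name or 'adam_v' in name:
                    continue                      # drop optimizer state
                value = tf.train.load_variable(ckpt_in, name)
                kept.append(tf.Variable(value, name=name))
            sess.run(tf.global_variables_initializer())
            tf.train.Saver(kept).save(sess, ckpt_out)

    # e.g. strip_adam_vars('output/result_dir/model.ckpt-0', 'output/slim/model.ckpt')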

