-
how-to-train-tokenizer Public
Forked from yanqiangmiffy/how-to-train-tokenizer怎么训练一个LLM分词器
Python UpdatedJul 13, 2023 -
BiSET Public
BiSET: Bi-directional Selective Encoding with Template for Abstractive Summarization (ACL 2019)
-
notebooks Public
Forked from huggingface/notebooksNotebooks using the Hugging Face libraries 🤗
Jupyter Notebook Apache License 2.0 UpdatedFeb 20, 2022 -
-
python-pinyin Public
Forked from mozillazg/python-pinyin汉字转拼音(pypinyin)
Python MIT License UpdatedOct 23, 2021 -
MacBERT Public
Forked from ymcui/MacBERTRevisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP 2020)
Apache License 2.0 UpdatedJul 21, 2021 -
transformers Public
Forked from huggingface/transformers🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Python Apache License 2.0 UpdatedDec 7, 2020 -
-
MarCo-Dialog Public
The code of ACL 2020 paper "Multi-Domain Dialogue Acts and Response Co-Generation"
-
Guyu Public
Forked from lipiji/Guyupre-training and fine-tuning framework for text generation
Python MIT License UpdatedApr 26, 2020 -
-
Mem2Seq Public
Forked from HLTCHKUST/Mem2SeqMem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems
Python MIT License UpdatedMar 24, 2020 -
-
PLMpapers Public
Forked from thunlp/PLMpapersMust-read Papers on pre-trained language models.
MIT License UpdatedNov 4, 2019 -
NLP-progress Public
Forked from sebastianruder/NLP-progressRepository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
-