Skip to content
View lihongzheng-nlp's full-sized avatar
  • Beijing Institute of Technology
  • Beijing

Block or report lihongzheng-nlp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

中国大模型

5,686 479 Updated Nov 30, 2024

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 34,214 10,268 Updated Dec 29, 2024

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)

676 97 Updated Jul 23, 2024

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,329 895 Updated Dec 29, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,415 1,181 Updated Dec 1, 2024

Fast BPE

C++ 658 98 Updated Jun 18, 2024

12306智能刷票,订票

Python 33,931 9,788 Updated Apr 2, 2023

A tool for extracting plain text from Wikipedia dumps

Python 3,775 966 Updated May 23, 2024

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,548 1,548 Updated May 23, 2024

MASS: Masked Sequence to Sequence Pre-training for Language Generation

Python 1,119 206 Updated Nov 28, 2022

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,896 497 Updated Feb 14, 2023

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

Python 1,654 280 Updated Mar 29, 2023

Phrase-Based & Neural Unsupervised Machine Translation

Python 1,505 262 Updated Sep 15, 2021

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,741 6,434 Updated Oct 18, 2024

General purpose unsupervised sentence representations

C++ 1,195 256 Updated Aug 3, 2022

Language-Agnostic SEntence Representations

Jupyter Notebook 3,608 463 Updated May 2, 2024

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,766 1,390 Updated Jul 31, 2023

Library for fast text representation and classification.

HTML 26,002 4,726 Updated Mar 22, 2024

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,801 2,244 Updated Jun 27, 2024

TensorFlow code and pre-trained models for BERT

Python 38,458 9,631 Updated Jul 23, 2024

bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目

1,852 347 Updated Mar 21, 2021

Unsupervised Neural Machine Translation

Python 474 78 Updated Jul 8, 2020

A framework to learn cross-lingual word embedding mappings

Python 648 131 Updated Apr 22, 2023

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2,434 449 Updated Aug 9, 2024

Pre-trained ELMo Representations for Many Languages

Python 1,462 243 Updated May 19, 2021

An open-source NLP research library, built on PyTorch.

Python 11,780 2,252 Updated Nov 22, 2022

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,899 27,409 Updated Dec 29, 2024

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

270,064 21,116 Updated Oct 3, 2024

📖 [译] scikit-learn(sklearn) 中文文档

CSS 5,134 1,474 Updated Jul 21, 2023

A python tool for evaluating the quality of sentence embeddings.

Python 2,088 310 Updated Mar 19, 2024
Next