Starred repositories
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
TensorFlow code and pre-trained models for BERT
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
PyTorch implementations of Generative Adversarial Networks.
100+ Chinese Word Vectors 上百种预训练中文词向量
A natural language modeling framework based on PyTorch
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
An Open-Source Package for Neural Relation Extraction (NRE)
农业知识图谱(AgriKG):农业领域的信息检索,命名实体识别,关系抽取,智能问答,辅助决策
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类
Multi-Task Deep Neural Networks for Natural Language Understanding
A python tool for evaluating the quality of sentence embeddings.
An Efficient Lexical Analyzer for Chinese
sentence embedding by Smooth Inverse Frequency weighting scheme
Alibaba Cloud SDK for Python
About Muti-Label Text Classification Based on Neural Network.
文本匹配的相关模型DSSM,ESIM,ABCNN,BIMPM等,数据集为LCQMC官方数据
Speech Enhancement Generative Adversarial Network in PyTorch
第三届魔镜杯 智能客服问题相似性算法设计 第12名解决方案
self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。
Time entity recognition tool based on regular expression 基于正则表达式的中文时间实体识别(时间提取)工具