Starred repositories
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2
TensorFlow code and pre-trained models for BERT
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
100+ Chinese Word Vectors 上百种预训练中文词向量
all kinds of text classification models and more with deep learning
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
A tutorial and implement of disease centered Medical knowledge graph and qa system based on it。知识图谱构建,自动问答,基于kg的自动问答。以疾病为中心的一定规模医药领域知识图谱,并以该知识图谱完成自动问答与分析服务。
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services
CNN-RNN中文文本分类,基于TensorFlow
使用Bert,ERNIE,进行中文文本分类
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类
Multilingual text (NLP) processing toolkit
An Efficient Lexical Analyzer for Chinese
Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
Turn Chinese natural language into structured data 中文自然语言理解
Text Content Grapher based on keyinfo extraction by NLP method。输入一篇文档,将文档进行关键信息提取,进行结构化,并最终组织成图谱组织形式,形成对文章语义信息的图谱化展示。
该项目是基于医疗领域知识图谱的问答系统。实现比较简单。
Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
A python implementation of the Rapid Automatic Keyword Extraction