Starred repositories
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
The Python micro framework for building web applications.
TensorFlow code and pre-trained models for BERT
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
OpenMMLab Detection Toolbox and Benchmark
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
100+ Chinese Word Vectors 上百种预训练中文词向量
A paper list of object detection using deep learning.
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
Google AI 2018 BERT pytorch implementation
A TensorFlow Implementation of the Transformer: Attention Is All You Need
CNN-RNN中文文本分类,基于TensorFlow
使用Bert,ERNIE,进行中文文本分类
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet
Starter code for working with the YouTube-8M dataset.
HanLP作者的新书《自然语言处理入门》详细笔记!业界良心之作,书中不是枯燥无味的公式罗列,而是用白话阐述的通俗易懂的算法模型。从基本概念出发,逐步介绍中文分词、词性标注、命名实体识别、信息抽取、文本聚类、文本分类、句法分析这几个热门问题的算法原理与工程实现。
中文命名实体识别(包括多种模型:HMM,CRF,BiLSTM,BiLSTM+CRF的具体实现)
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit