Starred repositories

53 results for source starred repositories written in Python

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 141,130 28,265 Updated Mar 13, 2025

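Chinese and English sensitive-word lists, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese personal-name databases, Chinese abbreviation database, character-decomposition dictionary, word sentiment values, stop words, reactionary-word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, job-title lexicon, synonym lexicon, antonym lexicon, negation-word lexicon, car-brand lexicon, car-parts lexicon, continuous English text segmentation, various Chinese word vectors, company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place-name lexicon, …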

Python 71,600 14,734 Updated May 10, 2024

The Python micro framework for building web applications.

Python 69,046 16,317 Updated Jan 5, 2025

Deepfakes Software For All

Python 53,452 13,350 Updated Feb 26, 2025

TensorFlow code and pre-trained models for BERT

Python 38,818 9,671 Updated Jul 23, 2024

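Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional conversion, natural language processing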

Python 34,580 10,426 Updated Jan 15, 2025

Jieba Chinese word segmentation

Python 33,830 6,729 Updated Aug 21, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,138 6,490 Updated Jan 9, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 30,508 9,585 Updated Aug 21, 2024

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…

Python 24,338 4,636 Updated Oct 15, 2023

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,644 4,720 Updated Mar 11, 2025

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,600 2,076 Updated Jan 23, 2024

100+ Chinese Word Vectors (over a hundred sets of pre-trained Chinese word vectors)

Python 11,952 2,326 Updated Oct 30, 2023

A paper list of object detection using deep learning.

Python 11,363 2,778 Updated Feb 12, 2024

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,351 1,286 Updated Aug 31, 2024

Google AI 2018 BERT pytorch implementation

Python 6,326 1,322 Updated Sep 15, 2023

Language Technology Platform

Python 5,058 1,047 Updated Mar 11, 2025

🌿 Chinese synonyms: a toolkit for chatbots and intelligent question answering

Python 5,057 900 Updated Nov 24, 2023

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,339 1,309 Updated May 21, 2023

CNN-RNN Chinese text classification, based on TensorFlow

Python 4,196 1,468 Updated Mar 31, 2024

Chinese text classification using BERT and ERNIE

Python 4,172 905 Updated Jun 28, 2024

Public Chinese chat corpus

Python 4,085 788 Updated Apr 23, 2024

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,082 545 Updated May 23, 2024

Chinese word segmentation

Python 3,165 808 Updated Jan 16, 2025

Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet

Python 3,059 514 Updated Apr 20, 2022

RoBERTa for Chinese: pre-trained RoBERTa models for Chinese

Python 2,681 412 Updated Jul 22, 2024

Starter code for working with the YouTube-8M dataset.

Python 2,339 848 Updated Oct 25, 2021

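Detailed notes on "Introduction to Natural Language Processing" (《自然语言处理入门》), the new book by the author of HanLP! A conscientious work for the field: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it walks step by step through the algorithmic principles and engineering implementations of several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.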

Python 2,220 547 Updated Jan 5, 2022

Chinese named entity recognition (with implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF)

Python 2,186 537 Updated Jun 21, 2022

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Python 1,875 410 Updated Sep 6, 2023