Starred repositories

Pre-trained Chinese ELECTRA models

Python 1,409 172 Updated Apr 6, 2023

Bypassing the firewall: "scientific" internet access

Kotlin 39,304 7,321 Updated Mar 12, 2025

This repository mainly collects interview questions for NLP algorithm engineers

2,512 513 Updated Oct 10, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,132 6,490 Updated Jan 9, 2025

Chinese-English machine text translation

Python 155 41 Updated Jul 2, 2019

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

Python 1,803 169 Updated Apr 15, 2024

Pre-trained Chinese RoBERTa models: RoBERTa for Chinese

Python 2,681 412 Updated Jul 22, 2024

Google AI 2018 BERT pytorch implementation

Python 6,326 1,322 Updated Sep 15, 2023

Toutiao Chinese news (text) classification dataset

Python 371 63 Updated May 19, 2021

Chinese text classification using BERT and ERNIE

Python 4,171 905 Updated Jun 28, 2024

State of the Art Natural Language Processing

Scala 3,936 722 Updated Mar 11, 2025

A Toolkit for Industrial Topic Modeling

C++ 2,639 593 Updated Jul 1, 2021

🍟 A notebook SQL client. What you get when you have a lot of sequels.

JavaScript 4,010 263 Updated Dec 10, 2022

Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance

C++ 3,912 595 Updated May 25, 2021

100 Days of ML Coding

46,643 10,813 Updated Dec 29, 2023

Chinese word segmentation

Python 3,165 808 Updated Jan 16, 2025

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,082 545 Updated May 23, 2024

CLUENER2020: Chinese fine-grained named entity recognition

Python 1,480 303 Updated Nov 21, 2022

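Detailed notes on "Introduction to Natural Language Processing", the new book by the author of HanLP. Rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it progressively introduces the algorithmic principles and engineering implementations behind several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.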

Python 2,220 547 Updated Jan 5, 2022

Toutiao Chinese news text (multi-level) classification dataset

Python 396 124 Updated May 6, 2021

Chinese named entity recognition (with concrete implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF)

Python 2,185 537 Updated Jun 21, 2022

Chinese named entity recognition based on deep learning with TensorFlow

Python 1,048 400 Updated Mar 11, 2018

Chinese named entity recognition, entity extraction, TensorFlow, PyTorch, BiLSTM+CRF

Python 1,424 398 Updated Mar 15, 2020

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Python 1,874 410 Updated Sep 6, 2023

National standard of the People's Republic of China GB/T 2260: administrative division codes

Python 1,516 206 Updated May 22, 2023

🌿 Chinese synonyms: a toolkit for chatbots and intelligent question answering

Python 5,057 900 Updated Nov 24, 2023

pyltp: the python extension for LTP

C++ 1,542 351 Updated Jul 24, 2022

Algorithm Engineer Toolbox, for the purpose of quickly iterating new ideas

Python 420 201 Updated Jun 8, 2020

A powerful Bilibili enhancement script

TypeScript 24,795 1,621 Updated Mar 12, 2025

A curated list of awesome awesomeness

Ruby 32,419 3,567 Updated Jun 2, 2024