KaguraTyan

Kagura KaguraTyan

3 followers · 4 following

Starred repositories

100 results for source starred repositories

ymcui / Chinese-ELECTRA

Pre-trained Chinese ELECTRA models

Python 1,409 172 Updated Apr 6, 2023

bannedbook / fanqiang

Bypassing the firewall: unrestricted internet access

Kotlin 39,307 7,321 Updated Mar 12, 2025

km1994 / NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers

2,512 513 Updated Oct 10, 2023

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,133 6,490 Updated Jan 9, 2025

foamliu / Machine-Translation

Chinese-English machine text translation

Python 155 41 Updated Jul 2, 2019

425776024 / nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

Python 1,803 169 Updated Apr 15, 2024

brightmart / roberta_zh

Pre-trained Chinese RoBERTa models: RoBERTa for Chinese

Python 2,681 412 Updated Jul 22, 2024

codertimo / BERT-pytorch

Google AI 2018 BERT pytorch implementation

Python 6,326 1,322 Updated Sep 15, 2023

aceimnorstuvwxz / toutiao-text-classfication-dataset

Toutiao Chinese news (text) classification dataset

Python 371 63 Updated May 19, 2021

649453932 / Bert-Chinese-Text-Classification-Pytorch

Chinese text classification using BERT and ERNIE

Python 4,171 905 Updated Jun 28, 2024

JohnSnowLabs / spark-nlp

State of the Art Natural Language Processing

Scala 3,936 722 Updated Mar 11, 2025

baidu / Familia

A Toolkit for Industrial Topic Modeling

C++ 2,639 593 Updated Jul 1, 2021

HVF / franchise

🍟 a notebook sql client. what you get when you have a lot of sequels.

JavaScript 4,010 263 Updated Dec 10, 2022

baidu / lac

Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance

C++ 3,912 595 Updated May 25, 2021

Avik-Jain / 100-Days-Of-ML-Code

100 Days of ML Coding

46,644 10,814 Updated Dec 29, 2023

hankcs / pyhanlp

Chinese word segmentation

Python 3,165 808 Updated Jan 16, 2025

CLUEbenchmark / CLUE

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,082 545 Updated May 23, 2024

CLUEbenchmark / CLUENER2020

CLUENER2020: Fine-Grained Named Entity Recognition for Chinese

Python 1,480 303 Updated Nov 21, 2022

NLP-LOVE / Introduction-NLP

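Detailed notes on "Introduction to Natural Language Processing", the new book by the author of HanLP. A conscientious piece of work: rather than a dry list of formulas, the book explains its algorithms and models in plain, accessible language. Starting from basic concepts, it works through the algorithmic principles and engineering implementations of several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.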

Python 2,220 547 Updated Jan 5, 2022

aceimnorstuvwxz / toutiao-multilevel-text-classfication-dataset

Toutiao Chinese news text (multi-level) classification dataset

Python 396 124 Updated May 6, 2021

luopeixiang / named_entity_recognition

Chinese named entity recognition (with implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF)

Python 2,185 537 Updated Jun 21, 2022

shiyybua / NER

Chinese named entity recognition based on deep learning with TensorFlow

Python 1,048 400 Updated Mar 11, 2018

buppt / ChineseNER

Chinese named entity recognition, entity extraction, TensorFlow, PyTorch, BiLSTM+CRF

Python 1,424 398 Updated Mar 15, 2020

Tencent / NeuralNLP-NeuralClassifier

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Python 1,874 410 Updated Sep 6, 2023

cn / GB2260

National standard of the People's Republic of China GB/T 2260: administrative division codes

Python 1,517 206 Updated May 22, 2023

chatopera / Synonyms

🌿 Chinese synonyms: a toolkit for chatbots and intelligent question answering

Python 5,057 900 Updated Nov 24, 2023

monkeyDemon / AI-Toolbox

Algorithm Engineer Toolbox, for the purpose of quickly iterating new ideas

Python 420 201 Updated Jun 8, 2020

the1812 / Bilibili-Evolved

A powerful Bilibili enhancement script

TypeScript 24,795 1,621 Updated Mar 12, 2025

bayandin / awesome-awesomeness

A curated list of awesome awesomeness

Ruby 32,419 3,567 Updated Jun 2, 2024

apachecn / Interview

Interview = resume guide + algorithm problems + standard rote interview questions + source code analysis

Jupyter Notebook 8,801 2,181 Updated Oct 20, 2023
