Stars
Hackable and optimized Transformers building blocks, supporting a composable construction.
MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. MNBVC covers not only mainstream culture but also niche subcultures and even "Martian script" text. It includes Chinese plain text of every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki articles, classical poetry, lyrics, product descriptions, jokes, embarrassing stories, chat logs, and more.
A playbook for systematically maximizing the performance of deep learning models.
A practical interactive interface for LLMs such as GPT/GLM, specially optimized for paper reading/polishing/writing. Modular design with customizable shortcut buttons & function plugins; supports analysis & self-translation of Python, C++ and other projects; PDF/LaTeX paper translation & summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…
Making large AI models cheaper, faster and more accessible
Stable Diffusion web UI
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
A curated list of awesome papers on dataset distillation and related applications.
Code for the paper "How to Fine-Tune BERT for Text Classification?"
PyTorch-based Chinese semantic similarity matching models (ABCNN, Albert, Bert, BIMPM, DecomposableAttention, DistilBert, ESIM, RE2, Roberta, SiaGRU, XlNet)
A curated list of resources for Learning with Noisy Labels
An NLP toolset with a focus on explainable inference
3,000,000+ examples of semantic understanding and matching data, usable for unsupervised contrastive learning, semi-supervised learning, etc., to build the best-performing Chinese pre-trained models.
A recap of top solutions from all NLP competitions. Focused exclusively on NLP competitions; continuously updated!
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
A PyTorch implementation of SmoothGrad [https://arxiv.org/pdf/1706.03825.pdf] and Integrated Gradients [https://arxiv.org/pdf/1703.01365.pdf] for NLP models.
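As a rough illustration of what the Integrated Gradients method computes (not the linked repo's actual code): attributions are the input-minus-baseline difference scaled by the average gradient along the straight-line path from baseline to input. A minimal NumPy sketch, with the toy function and all names being illustrative assumptions:

```python
import numpy as np

def integrated_gradients(grad_f, x, baseline, steps=100):
    """Approximate Integrated Gradients via the midpoint Riemann sum:
    IG_i = (x_i - baseline_i) * mean over the path of dF/dx_i."""
    alphas = (np.arange(steps) + 0.5) / steps  # midpoints of [0, 1]
    total = np.zeros_like(x, dtype=float)
    for a in alphas:
        # Gradient evaluated at an interpolated point on the path.
        total += grad_f(baseline + a * (x - baseline))
    return (x - baseline) * total / steps

# Toy check with f(x) = sum(x**2), whose gradient is 2x. By the
# completeness property, attributions sum to f(x) - f(baseline).
f = lambda x: np.sum(x ** 2)
grad_f = lambda x: 2 * x
x = np.array([1.0, 2.0])
baseline = np.zeros_like(x)
attr = integrated_gradients(grad_f, x, baseline)
```

For real models the hand-written `grad_f` would be replaced by autograd (e.g. PyTorch's `backward()`), but the path integral itself is exactly this loop.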