  • cuhk
  • china


Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,853 633 Updated Jan 2, 2025

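MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripted dialogue, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.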

3,622 250 Updated Dec 17, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,699 2,289 Updated Jun 18, 2024

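Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…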

Python 66,701 8,176 Updated Jan 2, 2025

Making large AI models cheaper, faster and more accessible

Python 38,990 4,347 Updated Jan 3, 2025

Stable Diffusion web UI

Python 145,285 27,265 Updated Dec 28, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,495 23,018 Updated Jan 3, 2025

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 981 101 Updated Jul 29, 2024

An ultra-lightweight PyTorch version of BERT, with extensive Chinese comments, an easily modified structure, and ongoing updates.

Python 410 66 Updated Apr 18, 2022

PromptCLUE, a zero-shot learning model with full support for Chinese tasks

Jupyter Notebook 658 67 Updated Jun 16, 2023

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python 1,803 252 Updated Jun 12, 2023

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Python 2,122 429 Updated Mar 11, 2023

This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.

70 4 Updated Nov 1, 2022

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…

Python 12,261 2,971 Updated Jan 3, 2025

cloud-native distributed storage

Go 4,757 675 Updated Jan 3, 2025

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,473 136 Updated Dec 23, 2024
Python 25 3 Updated Apr 18, 2022
Python 3 1 Updated Aug 6, 2021

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 624 100 Updated Oct 19, 2021

PyTorch-based Chinese semantic similarity matching models (ABCNN, Albert, Bert, BIMPM, DecomposableAttention, DistilBert, ESIM, RE2, Roberta, SiaGRU, XlNet)

Python 791 148 Updated Mar 22, 2020

A curated list of resources for Learning with Noisy Labels

2,650 352 Updated May 3, 2024

Focused on explainable NLP techniques: An NLP Toolset With A Focus on Explainable Inference

Java 626 113 Updated Feb 3, 2021

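A dataset of 3,000,000+ examples for semantic understanding and matching. It can be used with unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for the Chinese domain.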

Python 288 40 Updated Oct 11, 2022

The most complete corpus of modern Chinese-language poetry on Earth: 3k+ poets, 80K+ poems, 15M+ characters

Python 682 83 Updated Jan 3, 2025

A review of the top solutions from all NLP competitions. Focused solely on NLP competitions and continuously updated!

2,692 420 Updated Dec 18, 2024

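📙 Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.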

Python 11,009 2,592 Updated Dec 26, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,095 4,180 Updated Jan 3, 2025

Efficient Inference for Big Models

Python 572 67 Updated Jan 24, 2023

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Jupyter Notebook 1,309 98 Updated Aug 30, 2023

The PyTorch implementation of Smooth Grad [https://arxiv.org/pdf/1706.03825.pdf] and Integrated Gradients [https://arxiv.org/pdf/1703.01365.pdf] for NLP Models.

Python 46 8 Updated Oct 30, 2020