oYoungCo

42 stars written in Python

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 137,846 27,634 Updated Jan 21, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,147 5,924 Updated Aug 24, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,558 4,740 Updated Jan 21, 2025

TensorFlow code and pre-trained models for BERT

Python 38,539 9,648 Updated Jul 23, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,547 4,598 Updated Jan 18, 2025

Jieba (结巴) Chinese word segmentation

Python 33,607 6,726 Updated Aug 21, 2024

PyTorch implementations of Generative Adversarial Networks.

Python 16,696 4,101 Updated Jun 18, 2024

A list of all named GANs!

Python 14,418 2,560 Updated Oct 6, 2023

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,102 912 Updated Oct 3, 2024

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,533 2,075 Updated Jan 23, 2024

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm series models)

Python 9,792 1,392 Updated Jul 31, 2023

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,713 477 Updated Jan 16, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,813 481 Updated Aug 6, 2024

An Open-Source Framework for Prompt-Learning.

Python 4,439 458 Updated Jul 16, 2024

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,353 789 Updated Nov 21, 2023

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Python 4,217 617 Updated Nov 21, 2022

Seamlessly integrate LLMs into scikit-learn.

Python 3,404 277 Updated Jan 5, 2025

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Python 3,044 236 Updated Apr 14, 2024

PyTorch tutorial for beginners

Python 3,002 1,088 Updated Feb 12, 2022

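Detailed notes on "Introduction to Natural Language Processing" (《自然语言处理入门》), the new book by the author of HanLP. A conscientious work: rather than a dry list of formulas, it explains algorithms and models in plain, accessible language. Starting from basic concepts, it works through the algorithmic principles and engineering implementation of several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.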

Python 2,207 546 Updated Jan 5, 2022

A Python tool for evaluating the quality of sentence embeddings.

Python 2,091 309 Updated Mar 19, 2024

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,089 255 Updated Nov 27, 2024

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

Python 1,795 168 Updated Apr 15, 2024

An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes.

Python 1,364 240 Updated May 31, 2022

mixup: Beyond Empirical Risk Minimization

Python 1,171 227 Updated Oct 12, 2021

High-speed downloads from a mirror site using HuggingFace's official download tool.

Python 928 83 Updated Oct 12, 2024

unified embedding model

Python 847 66 Updated Sep 1, 2023

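A summary of recent event extraction methods, covering Chinese event extraction, open-domain event extraction, event data generation, cross-lingual event extraction, few-shot event extraction, zero-shot event extraction, and other types, with methods including DMCNN, FramNet, DLRNN, DBRNN, GCN, DAG-GRU, JMEE, and PLMEE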

Python 765 130 Updated Aug 23, 2022

Toolbox to integrate optimal transport loss functions using automatic differentiation and Sinkhorn's algorithm

Python 437 39 Updated May 14, 2018

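A dataset of 3,000,000+ examples for semantic understanding and matching. It can be used with unsupervised contrastive learning, semi-supervised learning, and similar approaches to build the best-performing pre-trained models for Chinese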

Python 289 40 Updated Oct 11, 2022