oYoungCo

oYoungCo

1 follower · 3 following

Stars

42 stars written in Python

Clear filter

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,846 27,634 Updated Jan 21, 2025

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,147 5,924 Updated Aug 24, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,558 4,740 Updated Jan 21, 2025

google-research / bert

TensorFlow code and pre-trained models for BERT

Python 38,539 9,648 Updated Jul 23, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,547 4,598 Updated Jan 18, 2025

fxsjy / jieba

结巴中文分词

Python 33,607 6,726 Updated Aug 21, 2024

eriklindernoren / PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Python 16,696 4,101 Updated Jun 18, 2024

hindupuravinash / the-gan-zoo

A list of all named GANs!

Python 14,418 2,560 Updated Oct 6, 2023

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,102 912 Updated Oct 3, 2024

jina-ai / clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,533 2,075 Updated Jan 23, 2024

ymcui / Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）

Python 9,792 1,392 Updated Jul 31, 2023

InternLM / InternLM

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,713 477 Updated Jan 16, 2025

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,813 481 Updated Aug 6, 2024

thunlp / OpenPrompt

An Open-Source Framework for Prompt-Learning.

Python 4,439 458 Updated Jul 16, 2024

InsaneLife / ChineseNLPCorpus

中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。

Python 4,353 789 Updated Nov 21, 2023

CLUEbenchmark / CLUEDatasetSearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Python 4,217 617 Updated Nov 21, 2022

BeastByteAI / scikit-llm

Seamlessly integrate LLMs into scikit-learn.

Python 3,404 277 Updated Jan 5, 2025

CVI-SZU / Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集

Python 3,044 236 Updated Apr 14, 2024

L1aoXingyu / pytorch-beginner

pytorch tutorial for beginners

Python 3,002 1,088 Updated Feb 12, 2022

NLP-LOVE / Introduction-NLP

HanLP作者的新书《自然语言处理入门》详细笔记！业界良心之作，书中不是枯燥无味的公式罗列，而是用白话阐述的通俗易懂的算法模型。从基本概念出发，逐步介绍中文分词、词性标注、命名实体识别、信息抽取、文本聚类、文本分类、句法分析这几个热门问题的算法原理与工程实现。

Python 2,207 546 Updated Jan 5, 2022

facebookresearch / SentEval

A python tool for evaluating the quality of sentence embeddings.

Python 2,091 309 Updated Mar 19, 2024

alibaba / EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,089 255 Updated Nov 27, 2024

425776024 / nlpcda

一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda

Python 1,795 168 Updated Apr 15, 2024

zhanlaoban / EDA_NLP_for_Chinese

An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

Python 1,364 240 Updated May 31, 2022

facebookresearch / mixup-cifar10

mixup: Beyond Empirical Risk Minimization

Python 1,171 227 Updated Oct 12, 2021

LetheSec / HuggingFace-Download-Accelerator

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

Python 928 83 Updated Oct 12, 2024

wangyuxinwhy / uniem

unified embedding model

Python 847 66 Updated Sep 1, 2023

xiaoqian19940510 / Event-Extraction

近年来事件抽取方法总结，包括中文事件抽取、开放域事件抽取、事件数据生成、跨语言事件抽取、小样本事件抽取、零样本事件抽取等类型，DMCNN、FramNet、DLRNN、DBRNN、GCN、DAG-GRU、JMEE、PLMEE等方法

Python 765 130 Updated Aug 23, 2022

gpeyre / SinkhornAutoDiff

Toolbox to integrate optimal transport loss functions using automatic differentiation and Sinkhorn's algorithm

Python 437 39 Updated May 14, 2018

CLUEbenchmark / SimCLUE

3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型

Python 289 40 Updated Oct 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly