A playbook for systematically maximizing the performance of deep learning models.

27,700 2,290 Updated Jun 18, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,247 4,590 Updated Jan 4, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 37,143 4,760 Updated Nov 18, 2024

TEXTOIR is the first open-source toolkit for text open intent recognition. (ACL 2021)

Python 210 31 Updated Jun 26, 2024

Foundations of large models: a one-article overview of large-model fundamentals

3,369 302 Updated Dec 25, 2024

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

2,204 125 Updated Dec 17, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,320 306 Updated Dec 10, 2024

A useful list of NLP (Natural Language Processing) resources

293 75 Updated Jul 7, 2020

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,894 888 Updated Oct 3, 2024

High-speed downloads from a mirror site using HuggingFace's official download tool.

Python 902 83 Updated Oct 12, 2024

A curated list of research papers in Sentence Representation Learning and an STS leaderboard of sentence embeddings.

309 21 Updated Oct 18, 2023

LLM guided text clustering

Python 78 11 Updated Oct 18, 2023

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,616 464 Updated Nov 21, 2024

WanJuan 1.0 multimodal corpus

550 28 Updated Oct 20, 2023

A Python tool for evaluating the quality of sentence embeddings.

Python 2,090 310 Updated Mar 19, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 32,285 9,281 Updated Oct 7, 2024

This repository mainly collects reading notes on top-conference papers relevant to NLP algorithm engineers

C++ 3,946 659 Updated Aug 18, 2023

Unified embedding model

Python 844 66 Updated Sep 1, 2023

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,753 480 Updated Aug 6, 2024

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 8,011 765 Updated Oct 16, 2024

A record of some datasets I have compiled

1,015 132 Updated Jun 16, 2022

Dataset and Baseline for SMP-MCC2020

Python 23 6 Updated Jul 6, 2023

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,344 789 Updated Nov 21, 2023

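A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.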

17,238 1,644 Updated Sep 19, 2024

A collection of ChatGPT-related resources

54 6 Updated Apr 24, 2023

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese llama+lora approach, with a structure modeled on alpaca

C 4,151 419 Updated Nov 14, 2024

Seamlessly integrate LLMs into scikit-learn.

Python 3,398 276 Updated Jan 4, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,414 4,576 Updated Dec 26, 2024

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Python 3,042 237 Updated Apr 14, 2024

Browser extension and cross-platform desktop application for select-to-translate translation based on ChatGPT API.

TypeScript 24,059 1,748 Updated Nov 16, 2024