bug-orz

Follow

bug-orz

Follow

寻找大语言模型相关工作中！

18 followers · 14 following

Fudan University
Shanghai, China

Achievements

Achievements

Lists (1)

Sort

🔮 Future ideas

Stars

RL-MLDM / alphagen

Generating sets of formulaic alpha (predictive) stock factors via reinforcement learning.

Python 558 177 Updated Dec 18, 2024

theZJD / AspectMMKG

8 1 Updated Oct 21, 2023

yinzhangyue / SelfAware

Do Large Language Models Know What They Don’t Know?

Python 85 5 Updated Nov 8, 2024

zhushiyun88 / teaching-boyfriend-llm

228 11 Updated Dec 29, 2024

tianyi-lab / Reflection_Tuning

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 343 30 Updated Sep 6, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,509 331 Updated Jan 3, 2025

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,622 250 Updated Dec 17, 2024

TIGER-AI-Lab / MAmmoTH2

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 129 9 Updated Oct 27, 2024

RUCAIBox / LLMBox

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 667 85 Updated Dec 9, 2024

charent / ChatLM-mini-Chinese

中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。

Python 1,320 159 Updated Apr 20, 2024

DLLXW / baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,599 320 Updated May 21, 2024

openai / simple-evals

Python 2,115 182 Updated Dec 18, 2024

CLUEbenchmark / SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3,054 99 Updated May 23, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 7,365 1,981 Updated Jan 2, 2025

Tongjilibo / build_MiniLLM_from_scratch

从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)

Python 350 44 Updated Aug 29, 2024

HqWu-HITCS / Awesome-LLM-Survey

An Awesome Collection for LLM Survey

317 31 Updated Sep 12, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,455 694 Updated Dec 24, 2024

KenyonY / openai-forward

🚀 大语言模型高效转发服务 · An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Python 864 291 Updated Oct 7, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,198 4,585 Updated Jan 3, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,151 1,223 Updated Dec 12, 2024

wangcunxiang / QA-Eval

The repository for paper <Evaluating Open-QA Evaluation>

Python 23 Updated Apr 9, 2024

wangcunxiang / LLM-Factuality-Survey

The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>

330 29 Updated Apr 25, 2024

kaixindelele / ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,634 1,946 Updated Apr 4, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,512 2,592 Updated Dec 15, 2024

telmo-correa / all-of-statistics

Self-study on Larry Wasserman's "All of Statistics"

Jupyter Notebook 1,020 283 Updated Dec 11, 2022

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,776 3,384 Updated Jul 23, 2024

KaiyangZhou / CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,839 204 Updated May 20, 2024

ga642381 / ML2021-Spring

**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring

Jupyter Notebook 853 324 Updated Nov 9, 2023

craigary / nobelium

A static blog build on top of Notion and NextJS, deployed on Vercel.

JavaScript 3,022 1,977 Updated Aug 20, 2024

Raki-j / nlp-beginner-Raki

复旦大学nlp实验室入门小实验nlp-beginner

Python 23 3 Updated Jan 22, 2022