Skip to content
View bug-orz's full-sized avatar
  • Fudan University
  • Shanghai, China

Block or report bug-orz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Generating sets of formulaic alpha (predictive) stock factors via reinforcement learning.

Python 558 177 Updated Dec 18, 2024
8 1 Updated Oct 21, 2023

Do Large Language Models Know What They Don’t Know?

Python 85 5 Updated Nov 8, 2024

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 343 30 Updated Sep 6, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,509 331 Updated Jan 3, 2025

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,622 250 Updated Dec 17, 2024

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 129 9 Updated Oct 27, 2024

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 667 85 Updated Dec 9, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,320 159 Updated Apr 20, 2024

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,599 320 Updated May 21, 2024
Python 2,115 182 Updated Dec 18, 2024

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3,054 99 Updated May 23, 2024

A framework for few-shot evaluation of language models.

Python 7,365 1,981 Updated Jan 2, 2025

从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)

Python 350 44 Updated Aug 29, 2024

An Awesome Collection for LLM Survey

317 31 Updated Sep 12, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,455 694 Updated Dec 24, 2024

🚀 大语言模型高效转发服务 · An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Python 864 291 Updated Oct 7, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,198 4,585 Updated Jan 3, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,151 1,223 Updated Dec 12, 2024

The repository for paper <Evaluating Open-QA Evaluation>

Python 23 Updated Apr 9, 2024

The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>

330 29 Updated Apr 25, 2024

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,634 1,946 Updated Apr 4, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,512 2,592 Updated Dec 15, 2024

Self-study on Larry Wasserman's "All of Statistics"

Jupyter Notebook 1,020 283 Updated Dec 11, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,776 3,384 Updated Jul 23, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,839 204 Updated May 20, 2024

**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring

Jupyter Notebook 853 324 Updated Nov 9, 2023

A static blog build on top of Notion and NextJS, deployed on Vercel.

JavaScript 3,022 1,977 Updated Aug 20, 2024

复旦大学nlp实验室入门小实验nlp-beginner

Python 23 3 Updated Jan 22, 2022
Next