We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.

Python 54 8 Updated Oct 27, 2024

Linguistics Olympiad collection (Chinese version only)

19 2 Updated Oct 26, 2024

Must-read Papers on Knowledge Editing for Large Language Models.

957 60 Updated Nov 20, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,531 215 Updated Dec 5, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,006 285 Updated Dec 15, 2024
JavaScript 15 2 Updated Feb 29, 2024

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Python 1,083 165 Updated Aug 17, 2024

👨‍💻 An awesome, curated list of the best code LLMs for research.

1,014 60 Updated Dec 10, 2024

Google Research

Jupyter Notebook 34,488 7,948 Updated Dec 13, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,247 186 Updated Aug 11, 2024

Code for "Learning to summarize from human feedback"

Python 999 144 Updated Sep 5, 2023

ChatGLM3 series: Open Bilingual Chat LLMs

Python 13,547 1,576 Updated Jul 10, 2024

Example models using DeepSpeed

Python 6,151 1,050 Updated Dec 14, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,535 472 Updated Jan 8, 2024

Efficient fine-tuning of ChatGLM-6B with PEFT

Python 3,677 473 Updated Oct 12, 2023

Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.

894 115 Updated Mar 25, 2021
Python 262 21 Updated Nov 22, 2023

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,308 101 Updated Mar 3, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,866 4,167 Updated Dec 14, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,273 400 Updated Sep 13, 2024

Fast and memory-efficient exact attention

Python 14,634 1,373 Updated Dec 13, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM

Python 15,736 1,855 Updated Jun 27, 2024

Fine-tuning for specific downstream tasks based on the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more

Python 2,686 301 Updated Dec 12, 2023

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 40,816 5,233 Updated Jun 27, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,672 608 Updated Jul 25, 2023

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,969 130 Updated Oct 5, 2023

Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"

Python 97 10 Updated Nov 27, 2022

A Python library that makes AMR parsing, generation and visualization simple.

Python 227 34 Updated Jan 22, 2024

SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in PyTorch. Includes checkpoints and other tools such as statistical significance Smatch.

Python 246 48 Updated Jun 15, 2023

An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Python 255 37 Updated Apr 15, 2023