  • USTC | Big Data
  • HeFei

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,869 370 Updated Jan 20, 2025
Python 863 101 Updated Jan 10, 2025

Scalable RL solution for advanced reasoning of language models

Python 936 60 Updated Jan 17, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,582 4,742 Updated Jan 21, 2025
Python 107 19 Updated Jun 18, 2024

DSPy: The framework for programming—not prompting—language models

Python 21,260 1,612 Updated Jan 20, 2025
Jupyter Notebook 87 1 Updated Dec 16, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,727 1,070 Updated Oct 9, 2024

A collection of resources on multimodal knowledge graph, including datasets, papers and contests.

146 16 Updated Jun 25, 2024