Javkonline

Javkonline

A freshman of kg and llm

Stars

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,869 370 Updated Jan 20, 2025

Python 863 101 Updated Jan 10, 2025

Scalable RL solution for advanced reasoning of language models

Python 936 60 Updated Jan 17, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,582 4,742 Updated Jan 21, 2025

Python 107 19 Updated Jun 18, 2024

DSPy: The framework for programming—not prompting—language models

Python 21,260 1,612 Updated Jan 20, 2025

Jupyter Notebook 87 1 Updated Dec 16, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,727 1,070 Updated Oct 9, 2024

A collection of resources on multimodal knowledge graph, including datasets, papers and contests.