Stars
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Train transformer language models with reinforcement learning.
A curated list of reinforcement learning with human feedback resources (continually updated)
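This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and real-world deployment of large-model applications)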
Reference implementation for DPO (Direct Preference Optimization)
A comprehensive list of awesome document image rectification papers.
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Convert images of LaTeX math equations into LaTeX code.
PyTorch implementation of large network design in continuous control RL.
M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
This repository contains implementations and illustrative code to accompany DeepMind publications
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
PRML algorithms implemented in Python
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
Python Cookbook, 3rd Edition (translation)
Repo for counting stars and contributing. Press F to pay respect to glorious developers.