orangeadegit

Mengzhang Cai orangeadegit

11 followers · 27 following

Stars

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,889 3,193 Updated Aug 12, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 25,003 2,849 Updated Oct 2, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,698 354 Updated Jan 8, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,570 1,367 Updated Jan 11, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,610 221 Updated Dec 5, 2024

liguodongiot / llm-action

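This project shares the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications).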

HTML 12,759 1,409 Updated Jan 4, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,314 190 Updated Aug 11, 2024

fh2019ustc / Awesome-Document-Image-Rectification

A comprehensive list of awesome document image rectification papers.

384 29 Updated Apr 2, 2024

orangeadegit / RL-Algo-Zoo

Forked from teslacool/RL-Algo-Zoo

Python 3 Updated Jul 28, 2020

fh2019ustc / DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Python 364 50 Updated Jul 21, 2024

kingyiusuen / image-to-latex

Convert images of LaTeX math equations into LaTeX code.

Python 2,085 312 Updated Oct 4, 2022

djiajunustc / TransVG

Python 171 27 Updated Feb 27, 2024

djiajunustc / H-23D_R-CNN

Python 65 4 Updated Aug 11, 2021

LQNew / Deeper_Larger_Actor-Critic_RL

PyTorch implementation of large network design in continuous control RL.

Python 19 Updated Jan 5, 2022

djiajunustc / Voxel-R-CNN

Python 265 41 Updated Feb 12, 2022

teslacool / m-curl

M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning

Python 28 4 Updated Nov 5, 2020

teslacool / preprocess_iwslt

data preprocess for fairseq input

Shell 8 Updated Apr 5, 2020

google-deepmind / deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 13,400 2,616 Updated Nov 18, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,793 6,442 Updated Jan 9, 2025

google-deepmind / dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Python 3,880 678 Updated Jan 6, 2025

teslacool / RL-Algo-Zoo

Python 4 1 Updated Dec 29, 2020

ctgk / PRML

PRML algorithms implemented in Python

Jupyter Notebook 11,522 3,252 Updated Sep 27, 2024

fuck-xuexiqiangguo / Fuck-XueXiQiangGuo

A point-farming tool for Xuexi Qiangguo that automates study sessions for lazy users.

8,347 1,793 Updated Jul 12, 2021

knazeri / edge-connect

EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212

Python 2,538 533 Updated Feb 3, 2024

lib-pku / libpku

Unofficial, community-compiled course materials for "your esteemed university".

TeX 30,492 8,254 Updated Jan 5, 2022

yidao620c / python3-cookbook

《Python Cookbook》 3rd Edition Translation

Jupyter Notebook 11,778 2,971 Updated Jul 24, 2024

996icu / 996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

270,065 21,113 Updated Oct 3, 2024
