Skip to content
View orangeadegit's full-sized avatar

Block or report orangeadegit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official Meta Llama 3 GitHub site

Python 27,889 3,193 Updated Aug 12, 2024

LLM training in simple, raw C/CUDA

Cuda 25,003 2,849 Updated Oct 2, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,698 354 Updated Jan 8, 2025

Train transformer language models with reinforcement learning.

Python 10,570 1,367 Updated Jan 11, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

3,610 221 Updated Dec 5, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 12,759 1,409 Updated Jan 4, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,314 190 Updated Aug 11, 2024

A comprehensive list of awesome document image rectification papers.

384 29 Updated Apr 2, 2024
Python 3 Updated Jul 28, 2020

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Python 364 50 Updated Jul 21, 2024

Convert images of LaTex math equations into LaTex code.

Python 2,085 312 Updated Oct 4, 2022
Python 171 27 Updated Feb 27, 2024
Python 65 4 Updated Aug 11, 2021

Pytorch implementation of large network design in continous control RL.

Python 19 Updated Jan 5, 2022
Python 265 41 Updated Feb 12, 2022

M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning

Python 28 4 Updated Nov 5, 2020

data preprocess for fairseq input

Shell 8 Updated Apr 5, 2020

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 13,400 2,616 Updated Nov 18, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,793 6,442 Updated Jan 9, 2025

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Python 3,880 678 Updated Jan 6, 2025
Python 4 1 Updated Dec 29, 2020

PRML algorithms implemented in Python

Jupyter Notebook 11,522 3,252 Updated Sep 27, 2024

学习强国 懒人刷分工具 自动学习

8,347 1,793 Updated Jul 12, 2021

EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212

Python 2,538 533 Updated Feb 3, 2024

贵校课程资料民间整理

TeX 30,492 8,254 Updated Jan 5, 2022

《Python Cookbook》 3rd Edition Translation

Jupyter Notebook 11,778 2,971 Updated Jul 24, 2024

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

270,065 21,113 Updated Oct 3, 2024