Skip to content
View SatireY's full-sized avatar

Highlights

  • Pro

Block or report SatireY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 1,837 101 Updated Nov 20, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,747 10,715 Updated Mar 4, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,653 3,467 Updated Jul 23, 2024

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,058 273 Updated Dec 11, 2024

Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)

Python 98 7 Updated Oct 19, 2023

[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

Python 76 2 Updated Sep 26, 2024

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 2,570 338 Updated Mar 3, 2025

RL implementations

Jupyter Notebook 1,025 168 Updated Feb 11, 2025

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,902 264 Updated May 3, 2024
Jupyter Notebook 59 11 Updated Mar 9, 2024

A Zsh theme

Shell 48,159 2,254 Updated Jan 29, 2025

Isaac Gym Reinforcement Learning Environments

Python 2,247 453 Updated Oct 26, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 526 25 Updated Feb 10, 2024

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 808 48 Updated Aug 12, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 6,439 712 Updated Mar 3, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 9,946 1,774 Updated Feb 20, 2025

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 985 99 Updated Apr 27, 2024

The official Meta Llama 3 GitHub site

Python 28,428 3,297 Updated Jan 26, 2025

Example models using DeepSpeed

Python 6,326 1,067 Updated Feb 14, 2025

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Python 175 13 Updated Dec 16, 2023

An open source implementation of CLIP.

Python 11,128 1,051 Updated Mar 1, 2025

Code for RoboFlamingo

Python 348 30 Updated May 8, 2024

SMAC: The StarCraft Multi-Agent Challenge

Python 1,157 232 Updated Feb 18, 2024

An open-source framework for training large multimodal models.

Python 3,835 297 Updated Aug 31, 2024
Jupyter Notebook 87 12 Updated Feb 1, 2024

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,776 181 Updated Feb 8, 2025

Train transformer language models with reinforcement learning.

Python 12,165 1,649 Updated Mar 3, 2025

Official Code for RVT-2 and RVT

Jupyter Notebook 317 41 Updated Feb 14, 2025

Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"

Python 278 37 Updated Aug 17, 2024

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 487 68 Updated Feb 14, 2025
Next