SatireY

Follow

MrClownC SatireY

Follow

4 followers · 11 following

Highlights

Pro

Lists (4)

Sort

competition

竞赛相关仓库

Mystudy

25 repositories

🔨tools

一些拿来用的工具项目.

22 repositories

📄web

开发web工具

Starred repositories

Tele-AI / Telechat

Python 1,837 101 Updated Nov 20, 2024

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,747 10,715 Updated Mar 4, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,653 3,467 Updated Jul 23, 2024

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,058 273 Updated Dec 11, 2024

penn-pal-lab / LIV

Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)

Python 98 7 Updated Oct 19, 2023

2toinf / DecisionNCE

[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

Python 76 2 Updated Sep 26, 2024

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 2,570 338 Updated Mar 3, 2025

Denys88 / rl_games

RL implementations

Jupyter Notebook 1,025 168 Updated Feb 11, 2025

eureka-research / Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,902 264 Updated May 3, 2024

MurpheyLab / MaxDiffRL

Jupyter Notebook 59 11 Updated Mar 9, 2024

romkatv / powerlevel10k

A Zsh theme

Shell 48,159 2,254 Updated Jan 29, 2025

isaac-sim / IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Python 2,247 453 Updated Oct 26, 2024

corl-team / CORL

Forked from tinkoff-ai/CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 526 25 Updated Feb 10, 2024

google-research / rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 808 48 Updated Aug 12, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 6,439 712 Updated Mar 3, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 9,946 1,774 Updated Feb 20, 2025

beyondguo / LLM-Tuning

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 985 99 Updated Apr 27, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,428 3,297 Updated Jan 26, 2025

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,326 1,067 Updated Feb 14, 2025

liziniu / ReMax

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Python 175 13 Updated Dec 16, 2023

mlfoundations / open_clip

An open source implementation of CLIP.

Python 11,128 1,051 Updated Mar 1, 2025

RoboFlamingo / RoboFlamingo

Code for RoboFlamingo

Python 348 30 Updated May 8, 2024

oxwhirl / smac

SMAC: The StarCraft Multi-Agent Challenge

Python 1,157 232 Updated Feb 18, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,835 297 Updated Aug 31, 2024

geronimi73 / phi2-finetune

Jupyter Notebook 87 12 Updated Feb 1, 2024

facebookresearch / Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,776 181 Updated Feb 8, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 12,165 1,649 Updated Mar 3, 2025

NVlabs / RVT

Official Code for RVT-2 and RVT

Jupyter Notebook 317 41 Updated Feb 14, 2025

nickgkan / 3d_diffuser_actor

Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"

Python 278 37 Updated Aug 17, 2024

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 487 68 Updated Feb 14, 2025

Starred topics

openai