Official repository of our paper "Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning" (NeurIPS 2024)

Python 4 Updated Jan 14, 2025

PyTorch implementation of SAC-Discrete.

Python 298 35 Updated Jul 25, 2024

A collection of MARL benchmarks based on TorchRL

Python 355 60 Updated Feb 27, 2025

Mean Field Multi-Agent Reinforcement Learning

Python 389 100 Updated Mar 11, 2020

Official inference repo for FLUX.1 models

Python 20,637 1,455 Updated Feb 6, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,848 6,082 Updated Mar 8, 2025

Collection of reinforcement learning algorithms

Python 2,605 557 Updated Jun 17, 2024

Neural Turing Machines (NTM) - PyTorch Implementation

Jupyter Notebook 597 129 Updated Jun 26, 2018

Ultralytics YOLO11 🚀

Python 37,575 7,300 Updated Mar 8, 2025

SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms

Python 404 37 Updated Mar 3, 2025

Implementation of the POIS algorithm

Python 14 3 Updated Apr 9, 2019

Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.

Python 31 9 Updated Nov 23, 2021

Massively parallel rigid-body physics simulation on accelerator hardware.

Jupyter Notebook 2,553 272 Updated Feb 5, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 8,481 945 Updated Mar 6, 2025

Aruco Marker Calibration and Pose Estimation using OpenCV in Python

Python 39 6 Updated Dec 3, 2019

Acceptance rates for the major AI conferences

Jupyter Notebook 4,394 306 Updated Jan 24, 2025

Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction

Python 159 38 Updated Jul 17, 2020

Code for conservative Q-learning

Python 426 71 Updated Dec 7, 2021

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,600 711 Updated Mar 8, 2025

Count the MACs / FLOPs of your PyTorch model.

Python 4,963 529 Updated Jul 8, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,168 143 Updated Aug 3, 2023

An offline deep reinforcement learning library

Python 1,401 245 Updated Mar 7, 2025

An elegant PyTorch offline reinforcement learning library for researchers.

Python 307 35 Updated Apr 17, 2024

Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022

Python 26 6 Updated Jun 2, 2023

The unofficial LaTeX Beamer template for presentation slides at Singapore Management University (SMU).

TeX 1 1 Updated Apr 20, 2023

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 352 51 Updated Feb 23, 2025

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Python 684 58 Updated Mar 7, 2025

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,272 245 Updated Nov 29, 2023

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,294 167 Updated Jul 25, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,461 705 Updated Jan 28, 2025