High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,119 134 Updated Aug 3, 2023

takuseno / d3rlpy

An offline deep reinforcement learning library

Python 1,350 243 Updated Nov 24, 2024

yihaosun1124 / OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Python 291 35 Updated Apr 17, 2024

marc-rigter / rambo

Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022

Python 25 6 Updated Jun 2, 2023

felixnie / smu-beamer

The unofficial LaTeX Beamer template for presentation slides at Singapore Management University (SMU).

TeX 1 1 Updated Apr 20, 2023

Farama-Foundation / Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 312 45 Updated Dec 9, 2024

AgileRL / AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Python 609 48 Updated Dec 18, 2024

rail-berkeley / softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,239 244 Updated Nov 29, 2023

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,245 164 Updated Jul 25, 2023

BlinkDL / ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,440 705 Updated Dec 7, 2024

gpt-engineer-org / gpt-engineer

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,670 6,845 Updated Nov 17, 2024

josStorer / RWKV-Runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…

TypeScript 5,394 515 Updated Dec 13, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,801 874 Updated Dec 17, 2024

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 34,988 8,615 Updated Oct 11, 2024

geekyutao / Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,716 567 Updated Feb 29, 2024

booydar / recurrent-memory-transformer

Forked from yurakuratov/t5-experiments

[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.

Jupyter Notebook 758 61 Updated Oct 25, 2024

lucidrains / imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,130 775 Updated Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nguyen Minh Quang nbtpj

Block or report nbtpj

Stars

loudinthecloud / pytorch-ntm

ultralytics / ultralytics

araffin / sbx

T3p / pois

Rondorf / BOReL

google / brax

Farama-Foundation / Gymnasium

ddelago / Aruco-Marker-Calibration-and-Pose-Estimation

lixin4ever / Conference-Acceptance-Rate

aviralkumar2907 / BEAR

aviralkumar2907 / CQL

wandb / wandb

Lyken17 / pytorch-OpCounter

tinkoff-ai / CORL