nbtpj

Follow

Nguyen Minh Quang nbtpj

Follow

I am PhD student in Computer Science. My research focuses on policy optimisation in Reinforcement Learning.

4 followers · 8 following

Singapore Management University
Singapore
12:49 (UTC +08:00)
[email protected]
https://orcid.org/0009-0006-5080-0702
in/mquang-nguyen

Stars

marvinalles / c-lap

Official repository of our paper "Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning" (NeurIPS 2024)

Python 4 Updated Jan 14, 2025

toshikwa / sac-discrete.pytorch

PyTorch implementation of SAC-Discrete.

Python 298 35 Updated Jul 25, 2024

facebookresearch / BenchMARL

A collection of MARL benchmarks based on TorchRL

Python 355 60 Updated Feb 27, 2025

mlii / mfrl

Mean Field Multi-Agent Reinforcement Learning

Python 389 100 Updated Mar 11, 2020

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 20,637 1,455 Updated Feb 6, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,848 6,082 Updated Mar 8, 2025

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,605 557 Updated Jun 17, 2024

loudinthecloud / pytorch-ntm

Neural Turing Machines (NTM) - PyTorch Implementation

Jupyter Notebook 597 129 Updated Jun 26, 2018

ultralytics / ultralytics

Ultralytics YOLO11 🚀

Python 37,575 7,300 Updated Mar 8, 2025

araffin / sbx

SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms

Python 404 37 Updated Mar 3, 2025

T3p / pois

Implementation of the POIS algorithm

Python 14 3 Updated Apr 9, 2019

Rondorf / BOReL

Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.

Python 31 9 Updated Nov 23, 2021

google / brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,553 272 Updated Feb 5, 2025

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 8,481 945 Updated Mar 6, 2025

ddelago / Aruco-Marker-Calibration-and-Pose-Estimation

Aruco Marker Calibration and Pose Estimation using OpenCV in Python

Python 39 6 Updated Dec 3, 2019

lixin4ever / Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Jupyter Notebook 4,394 306 Updated Jan 24, 2025

aviralkumar2907 / BEAR

Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction

Python 159 38 Updated Jul 17, 2020

aviralkumar2907 / CQL

Code for conservative Q-learning

Python 426 71 Updated Dec 7, 2021

wandb / wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,600 711 Updated Mar 8, 2025

Lyken17 / pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Python 4,963 529 Updated Jul 8, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,168 143 Updated Aug 3, 2023

takuseno / d3rlpy

An offline deep reinforcement learning library

Python 1,401 245 Updated Mar 7, 2025

yihaosun1124 / OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Python 307 35 Updated Apr 17, 2024

marc-rigter / rambo

Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022

Python 26 6 Updated Jun 2, 2023

felixnie / smu-beamer

The unofficial LaTeX Beamer template for presentation slides at Singapore Management University (SMU).

TeX 1 1 Updated Apr 20, 2023

Farama-Foundation / Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 352 51 Updated Feb 23, 2025

AgileRL / AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Python 684 58 Updated Mar 7, 2025

rail-berkeley / softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,272 245 Updated Nov 29, 2023

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,294 167 Updated Jul 25, 2023

BlinkDL / ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,461 705 Updated Jan 28, 2025