nbtpj

Nguyen Minh Quang nbtpj

I am PhD student in Computer Science. My research focuses on policy optimisation in Reinforcement Learning.

5 followers · 8 following

Singapore Management University
Singapore
15:49 (UTC +08:00)
[email protected]
https://orcid.org/0009-0006-5080-0702
in/mquang-nguyen

Stars

42 stars written in Python

Clear filter

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,485 27,329 Updated Dec 18, 2024

gpt-engineer-org / gpt-engineer

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,671 6,845 Updated Nov 17, 2024

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 34,988 8,615 Updated Oct 11, 2024

ultralytics / ultralytics

Ultralytics YOLO11 🚀

Python 34,186 6,574 Updated Dec 19, 2024

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,899 4,880 Updated Aug 1, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,804 874 Updated Dec 17, 2024

openai / DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Python 10,807 1,940 Updated Jan 31, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,360 1,331 Updated Dec 18, 2024

BlinkDL / ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,440 705 Updated Dec 7, 2024

wandb / wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,273 682 Updated Dec 19, 2024

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,575 1,057 Updated Oct 9, 2024

nebuly-ai / optimate

A collection of libraries to optimise AI model performances

Python 8,374 637 Updated Jul 22, 2024

lucidrains / imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,130 775 Updated Oct 7, 2024

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 7,640 859 Updated Dec 17, 2024

zihangdai / xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,184 1,178 Updated May 28, 2023

Lyken17 / pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Python 4,912 528 Updated Jul 8, 2024

kimiyoung / transformer-xl

Python 3,620 762 Updated Sep 21, 2022

lucidrains / lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,066 52 Updated Nov 27, 2024

idiap / fast-transformers

Pytorch library for fast transformer implementations

Python 1,657 179 Updated Mar 23, 2023

takuseno / d3rlpy

An offline deep reinforcement learning library

Python 1,350 243 Updated Nov 24, 2024

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,245 164 Updated Jul 25, 2023

rail-berkeley / softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,239 244 Updated Nov 29, 2023

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,120 134 Updated Aug 3, 2023

allenai / natural-instructions

Expanding natural instructions

Python 964 190 Updated Dec 11, 2023

locuslab / deq

[NeurIPS'19] Deep Equilibrium Models

Python 732 79 Updated Jul 4, 2022

mks0601 / I2L-MeshNet_RELEASE

Official PyTorch implementation of "I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image", ECCV 2020

Python 724 127 Updated Jul 10, 2024

AgileRL / AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Python 609 48 Updated Dec 18, 2024

aviralkumar2907 / CQL

Code for conservative Q-learning

Python 415 71 Updated Dec 7, 2021

lucidrains / recurrent-memory-transformer-pytorch

Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch

Python 397 15 Updated Nov 19, 2024

araffin / sbx

SBX: Stable Baselines Jax (SB3 + Jax)

Python 360 35 Updated Dec 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly