Neural Turing Machines (NTM) - PyTorch Implementation

Jupyter Notebook 592 129 Updated Jun 26, 2018

Ultralytics YOLO11 🚀

Python 34,182 6,574 Updated Dec 18, 2024

SBX: Stable Baselines Jax (SB3 + Jax)

Python 360 35 Updated Dec 6, 2024

Implementation of the POIS algorithm

Python 14 2 Updated Apr 9, 2019

Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.

Python 29 9 Updated Nov 23, 2021

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,407 260 Updated Dec 12, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 7,638 859 Updated Dec 17, 2024

Aruco Marker Calibration and Pose Estimation using OpenCV in Python

Python 38 6 Updated Dec 3, 2019

Acceptance rates for the major AI conferences

Jupyter Notebook 4,299 303 Updated Dec 10, 2024

Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction

Python 158 38 Updated Jul 17, 2020

Code for conservative Q-learning

Python 415 71 Updated Dec 7, 2021

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,273 682 Updated Dec 19, 2024

Count the MACs / FLOPs of your PyTorch model.

Python 4,912 528 Updated Jul 8, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,119 134 Updated Aug 3, 2023

An offline deep reinforcement learning library

Python 1,350 243 Updated Nov 24, 2024

An elegant PyTorch offline reinforcement learning library for researchers.

Python 291 35 Updated Apr 17, 2024

Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022

Python 25 6 Updated Jun 2, 2023

The unofficial LaTeX Beamer template for presentation slides at Singapore Management University (SMU).

TeX 1 1 Updated Apr 20, 2023

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 312 45 Updated Dec 9, 2024

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Python 609 48 Updated Dec 18, 2024

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,239 244 Updated Nov 29, 2023

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,245 164 Updated Jul 25, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,440 705 Updated Dec 7, 2024

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,670 6,845 Updated Nov 17, 2024

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…

TypeScript 5,394 515 Updated Dec 13, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,801 874 Updated Dec 17, 2024

A toolkit for developing and comparing reinforcement learning algorithms.

Python 34,988 8,615 Updated Oct 11, 2024

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,716 567 Updated Feb 29, 2024

[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.

Jupyter Notebook 758 61 Updated Oct 25, 2024

Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch

Python 8,130 775 Updated Oct 7, 2024