Skip to content
View hutslib's full-sized avatar

Highlights

  • Pro

Block or report hutslib

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 5,468 668 Updated Feb 11, 2025

[ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling"

Python 8 Updated Feb 24, 2025

An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.

Python 191 33 Updated Mar 4, 2025

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 1,982 243 Updated Feb 28, 2025

Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]

Python 68 5 Updated Nov 21, 2024
Python 543 42 Updated Feb 26, 2025

Famous Vision Language Models and Their Architectures

Markdown 683 35 Updated Feb 24, 2025

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random g…

Python 758 134 Updated Aug 9, 2024

A goal-driven autonomous exploration through deep reinforcement learning (ICRA 2022) system that combines reactive and planned robot navigation in unknown environments

Python 148 16 Updated Feb 5, 2022

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,821 5,727 Updated Mar 5, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,096 99 Updated Jan 2, 2025

The official implementation of flow Q-learning (FQL)

Python 105 7 Updated Feb 13, 2025

X-MOBILITY

Python 18 1 Updated Feb 25, 2025

Witness the aha moment of VLM with less than $3.

Python 3,036 241 Updated Mar 1, 2025

s1: Simple test-time scaling

Python 5,837 664 Updated Mar 4, 2025
3 Updated Feb 21, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,059 225 Updated Feb 19, 2025

Official implementation of "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"

Python 712 66 Updated Feb 24, 2025

[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"

Python 278 32 Updated Feb 28, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,940 1,394 Updated Feb 1, 2025

[RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Python 86 4 Updated Feb 7, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Python 1,593 205 Updated Mar 3, 2025

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

803 23 Updated Jan 14, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,945 5,986 Updated Aug 24, 2024

[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding

Python 124 3 Updated Nov 20, 2023

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,260 49 Updated Jan 12, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,249 2,716 Updated Mar 5, 2025

Helpful tools and examples for working with flex-attention

Python 672 36 Updated Feb 18, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,614 487 Updated Feb 28, 2025
Next