Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Foundational Models for State-of-the-Art Speech and Text Translation
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Master programming by recreating your favorite technologies from scratch.
TuShare is a utility for crawling historical data of China stocks
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Open source Python library for converting PDF to DOCX.
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)
A GUI client for Windows, Linux and macOS, support Xray and sing-box and others
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
An elegant PyTorch deep reinforcement learning library.
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉