Stars
DeepSeek LLM: Let there be answers
A desktop floating-window application that displays current network speed, CPU and memory usage, with support for taskbar display and switchable skins.
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
PyTorch implementations of deep reinforcement learning algorithms and environments
A toolkit for developing and comparing reinforcement learning algorithms. (A minimal interaction-loop sketch follows after this list.)
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Simple reinforcement learning tutorials; Chinese AI teaching materials by 莫烦Python.
Global search algorithms for finding optimal tensor network contraction sequences.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
🚀 Train a 27M-parameter visual multimodal VLM from scratch in just 3 hours!
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
High-Resolution Image Synthesis with Latent Diffusion Models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image. (See the CLIP usage sketch after this list.)
An Autonomous LLM Agent for Complex Task Solving
Create Customized Software using a Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🚀🚀 Train a small 26M-parameter GPT completely from scratch in just 2 hours!
PyTorch implementation of Google AI's 2018 BERT
Easy and efficient fine-tuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
The official repo of the Qwen (通义千问) chat and pretrained large language models proposed by Alibaba Cloud.
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
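For the OpenAI Gym entry above, a minimal sketch of the standard environment interaction loop. It is written against the gymnasium package (the maintained successor of gym; recent gym releases expose the same reset/step API); the CartPole-v1 environment and the random policy are placeholders, not anything prescribed by the toolkit.

```python
# Minimal Gym/Gymnasium interaction loop (sketch, assuming the modern API:
# reset() -> (obs, info), step() -> (obs, reward, terminated, truncated, info)).
import gymnasium as gym

env = gym.make("CartPole-v1")
obs, info = env.reset(seed=0)

for _ in range(200):
    action = env.action_space.sample()  # placeholder: random policy
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()

env.close()
```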
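For the CLIP entry above, a minimal zero-shot image-text matching sketch following the usage pattern shown in the openai/CLIP README; the image path and the candidate captions are placeholders.

```python
# Zero-shot matching with CLIP: score an image against candidate captions
# (sketch; "cat.jpg" and the caption list are placeholders).
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("cat.jpg")).unsqueeze(0).to(device)
text = clip.tokenize(["a photo of a cat", "a photo of a dog"]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1).cpu().numpy()

print(probs)  # probability that the image matches each caption
```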