Skip to content
View DrZero0's full-sized avatar
🍉
🍉

Block or report DrZero0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Autonomous LLM Agent for Complex Task Solving

Python 8,250 850 Updated Aug 12, 2024

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Python 161 13 Updated Dec 16, 2023

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,767 104 Updated Jun 1, 2023

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 2,438 325 Updated Dec 20, 2024

Community interface for generative AI

TypeScript 8,876 888 Updated Apr 30, 2024

Unified Reinforcement Learning Framework

Python 661 63 Updated Sep 6, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,521 2,924 Updated Sep 2, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,885 44,698 Updated Dec 28, 2024

Official repo for consistency models.

Python 6,220 425 Updated Mar 22, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,035 4,172 Updated Dec 28, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,309 5,710 Updated Sep 18, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,837 1,982 Updated Sep 26, 2024

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini/Claude LLM 应用。

TypeScript 78,068 59,819 Updated Dec 29, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,616 103 Updated Aug 30, 2023

A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.

Python 22 3 Updated Dec 9, 2024

Foundation Model for MineDojo

Python 247 32 Updated Apr 2, 2023

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 66,604 8,154 Updated Dec 28, 2024

OpenFE: automated feature generation with expert-level performance

Python 810 101 Updated May 27, 2024

Toolkit of Causal Model-based Reinforcement Learning.

Python 32 1 Updated Jun 5, 2023

Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”

Python 23 7 Updated Mar 6, 2023
7 Updated Oct 14, 2022

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 26,869 5,523 Updated Dec 28, 2024
Python 225 37 Updated Feb 15, 2024
Python 11 Updated May 14, 2024

D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.

Python 9 1 Updated Jun 2, 2022

The official code of "Adversarial Counterfactual Environment Model Learning" (NeurIPS'23 spotlight)

Python 4 1 Updated Jan 10, 2024

健康学习到150岁 - 人体系统调优不完全指南

13,222 968 Updated May 9, 2024

A python module designed for agile RL algorithm developing.

Python 26 3 Updated Jul 11, 2024

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Markdown 126,312 23,269 Updated Sep 22, 2024
Next