fzp0424

fzppp fzp0424

13 followers · 13 following

Zhejiang University
16:01 (UTC +08:00)

Achievements

Lists (4)

Sort

Stars

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 1,919 167 Updated Feb 23, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 951 44 Updated Feb 25, 2025

pgf-tikz / pgf

A Portable Graphic Format for TeX

TeX 1,179 110 Updated Dec 20, 2024

HuaizhengZhang / AI-System-School

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

2,809 321 Updated Aug 14, 2024

SU-JIAYUAN / M-MAD

Repo for paper "M-MAD: Multidimensional Multi-Agent Debate for Advanced Machine Translation Evaluation"

Python 11 Updated Feb 19, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 5,316 656 Updated Feb 11, 2025

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 1,968 173 Updated Feb 21, 2025

HAITrans-lab / instruction-tuned-medical-LLM

Python 2 Updated Oct 9, 2024

zhentingqi / rStar

Python 895 105 Updated Jan 23, 2025

vincen-github / mlimpl

This repository collects some codes that encapsulates commonly used algorithms in the field of machine learning. Most of them are based on Numpy, Pandas or Torch. You can deepen your understanding …

Shell 464 115 Updated Feb 18, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,023 497 Updated Feb 22, 2025

kengz / awesome-deep-rl

A curated list of awesome Deep Reinforcement Learning resources.

726 73 Updated Jul 30, 2024

aiwaves-cn / agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,476 436 Updated Sep 26, 2024

fzp0424 / EC-Guide-KDDUP-2024

The solution and dataset of Team ZJU_AI4H in Amazon KDDCUP 2024 (Track 2 Top 2; Track 5 Top 5)

7 Updated Aug 12, 2024

thunlp / ChatEval

Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"

Python 260 16 Updated Oct 19, 2024

OpenBMB / AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript 4,364 428 Updated Sep 9, 2024

jiangsongtao / Med-MoE

[EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"

Python 88 6 Updated Dec 20, 2024

FloridSleeves / LLMDebugger

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step

Python 502 51 Updated Sep 10, 2024

allwefantasy / byzer-llm

Easy, fast, and cheap pretrain,finetune, serving for everyone

Python 282 42 Updated Feb 21, 2025

prometheus-eval / prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 876 54 Updated Jan 7, 2025

RLHFlow / Online-RLHF

A recipe for online RLHF and online iterative DPO.

Python 485 47 Updated Dec 28, 2024

YutongWang1216 / ReflectionLLMMT

Code and data realeases for the paper -- TasTe: Teaching Large Language Models to Translate through Self-Reflection

Python 9 2 Updated Aug 17, 2024

andrewyng / translation-agent

Python 5,176 626 Updated Aug 4, 2024

fzp0424 / MT-Ladder

[EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level"

Python 17 2 Updated Jun 29, 2024

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 27,122 3,689 Updated Feb 24, 2025

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 72,099 10,522 Updated Feb 25, 2025