Skip to content
View zzh8241102's full-sized avatar
:shipit:
work
:shipit:
work

Highlights

  • Pro

Block or report zzh8241102

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

2,511 243 Updated Dec 17, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,680 311 Updated Dec 18, 2024

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

HTML 9,230 1,446 Updated Apr 15, 2023

Official implementation of paper "Controllable 3D Outdoor Scene Generation"

3 Updated Nov 16, 2024

👨‍💻 An awesome and curated list of best code-LLM for research.

1,018 60 Updated Dec 10, 2024

Efficient Triton Kernels for LLM Training

Python 3,866 229 Updated Dec 18, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,651 227 Updated Dec 4, 2024

Ongoing research training transformer models at scale

Python 10,845 2,423 Updated Dec 18, 2024

Robust recipes to align language models with human and AI preferences

Python 4,803 418 Updated Nov 21, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,256 187 Updated Aug 11, 2024

Train transformer language models with reinforcement learning.

Python 10,346 1,327 Updated Dec 18, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

1,994 150 Updated Oct 28, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,341 163 Updated Jun 25, 2024
Python 94 6 Updated Mar 20, 2024

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 310 36 Updated Dec 18, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,673 409 Updated Dec 18, 2024

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 3,506 240 Updated Dec 12, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,205 395 Updated Aug 7, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 10,977 676 Updated Dec 4, 2024

A efficient and effective few-shot NL2SQL method on GPT-4.

Python 455 73 Updated Jun 4, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 11,932 1,254 Updated Dec 17, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 11,621 1,678 Updated Dec 7, 2024

Collection of Summer 2025 tech internships!

35,475 2,800 Updated Dec 18, 2024

[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

Python 348 21 Updated Apr 27, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 602 33 Updated Oct 22, 2024
Python 441 44 Updated Jul 15, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,737 1,035 Updated Dec 16, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,062 979 Updated Nov 18, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,778 376 Updated Mar 14, 2024
Next