zzh8241102

work

Zihan Zhou zzh8241102

work

Code is the language of creativity. So let's create more :) Prev MLE intern @ {Alibaba, DiDi} | Interested in (M)LLM Post Training & Disruptive Innovation.

13 followers · 27 following

University of California San Diego
La Jolla, CA
in/zihan-zhou-cs

Achievements

Highlights

Starred repositories

OpenDriveLab / End-to-end-Autonomous-Driving

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

2,511 243 Updated Dec 17, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,680 311 Updated Dec 18, 2024

chiphuyen / machine-learning-systems-design

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

HTML 9,230 1,446 Updated Apr 15, 2023

yuhengliu02 / control-3d-scene

Official implementation of paper "Controllable 3D Outdoor Scene Generation"

3 Updated Nov 16, 2024

ibm-granite / granite-3.0-language-models

233 22 Updated Dec 4, 2024

huybery / Awesome-Code-LLM

👨‍💻 An awesome and curated list of best code-LLM for research.

1,018 60 Updated Dec 10, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 3,866 229 Updated Dec 18, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,651 227 Updated Dec 4, 2024

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 10,845 2,423 Updated Dec 18, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,803 418 Updated Nov 21, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,256 187 Updated Aug 11, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,346 1,327 Updated Dec 18, 2024

eosphoros-ai / Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

1,994 150 Updated Oct 28, 2024

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,341 163 Updated Jun 25, 2024

mutonix / RefGPT

Python 94 6 Updated Mar 20, 2024

modelscope / evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 310 36 Updated Dec 18, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,673 409 Updated Dec 18, 2024