Skip to content
View cccvs's full-sized avatar
  • Peking University
  • Beijing, China
  • 10:53 (UTC +08:00)

Highlights

  • Pro

Block or report cccvs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Python 1,138 111 Updated Mar 10, 2024

DeepEP: an efficient expert-parallel communication library

Cuda 7,163 625 Updated Mar 13, 2025
Python 166 11 Updated Dec 2, 2024

A generalized framework for subspace tuning methods in parameter efficient fine-tuning.

Python 130 7 Updated Feb 7, 2025

Overseas Summer Research Guidance 海外暑研申请指南

263 2 Updated Oct 30, 2024

DeMo: Decoupled Momentum Optimization

Python 182 9 Updated Dec 2, 2024

official implementation of paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training

Python 30 7 Updated Dec 11, 2024

[MLSys 2024] Does Compressing Activations Help Model Parallel Training?

Python 4 2 Updated May 8, 2024

Code for experiments with activations and gradients compression for model-parallel training.

Jupyter Notebook 3 2 Updated Aug 31, 2023

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 18,658 2,264 Updated Nov 13, 2024

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 929 70 Updated Mar 7, 2025

Simple Implementation of the CVPR 2024 Paper "JointSQ: Joint Sparsification-Quantization for Distributed Learning"

Python 10 Updated Dec 29, 2024

Making large AI models cheaper, faster and more accessible

Python 40,591 4,477 Updated Mar 13, 2025

LDAdam - Adaptive Optimization from Low-Dimensional Gradient Statistics

Python 6 Updated Nov 6, 2024

Pytorch distributed backend extension with compression support

C++ 15 Updated Mar 11, 2025

GRACE - GRAdient ComprEssion for distributed deep learning

Python 139 44 Updated Jul 23, 2024

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 146 32 Updated Oct 29, 2024

A PyTorch native library for large model training

Python 3,446 311 Updated Mar 14, 2025

nanoGPT style version of Llama 3.1

Python 1,335 80 Updated Aug 8, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 353 24 Updated Aug 7, 2024

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 13,673 1,574 Updated Mar 13, 2025
Python 6 5 Updated Mar 11, 2019

Machine Learning and Computer Vision Engineer - Technical Interview Questions

3,355 549 Updated Jan 4, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,301 837 Updated Jun 10, 2024

SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)

Python 30 1 Updated Nov 1, 2024
Jupyter Notebook 3,618 1,064 Updated Jul 9, 2024

Retrieval-Augmented Theorem Provers for Lean

Python 258 57 Updated Jan 30, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 44,056 5,386 Updated Mar 13, 2025

[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models

Python 244 13 Updated Mar 4, 2025

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,578 1,434 Updated Feb 26, 2025
Next