Stars
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
DeepEP: an efficient expert-parallel communication library
A generalized framework for subspace tuning methods in parameter-efficient fine-tuning.
Overseas Summer Research Guidance (海外暑研申请指南): a guide to applying for overseas summer research positions
Official implementation of the paper "SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training"
[MLSys 2024] Does Compressing Activations Help Model Parallel Training?
Code for experiments with activation and gradient compression for model-parallel training.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Minimalistic 4D-parallelism distributed training framework for educational purposes
Simple Implementation of the CVPR 2024 Paper "JointSQ: Joint Sparsification-Quantization for Distributed Learning"
Making large AI models cheaper, faster and more accessible
LDAdam - Adaptive Optimization from Low-Dimensional Gradient Statistics
PyTorch distributed backend extension with compression support
GRACE - GRAdient ComprEssion for distributed deep learning
Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727
A PyTorch native library for large model training
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
"A Usage Guide to Open-Source LLMs" (《开源大模型食用指南》): a tutorial, tailored for Chinese users, on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
Machine Learning and Computer Vision Engineer - Technical Interview Questions
QLoRA: Efficient Finetuning of Quantized LLMs
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
Retrieval-Augmented Theorem Provers for Lean
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch