Stars
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
End-to-end speech synthesis system with knowledge distillation
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Official implementation of Half-Quadratic Quantization (HQQ)
[ICML 2024] CLLMs: Consistency Large Language Models
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization (https://arxiv.org/pdf/2401.06118.pdf) and PV-Tuning: Beyond Straight-Through Estimation for Ext…
[ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
[ICLR 2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Simple implementation of Speculative Sampling in NumPy for GPT-2.
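The repo above implements speculative sampling's accept/reject rule in NumPy for GPT-2. A minimal sketch of that rule alone, using toy probability vectors in place of real model outputs (the function name `speculative_step` is mine, not the repo's), might look like:

```python
import numpy as np

rng = np.random.default_rng(0)

def speculative_step(p_target, q_draft, proposed_token):
    """Accept or reject one draft-model token.

    p_target, q_draft: probability vectors over the vocabulary from the
    target and draft models (toy stand-ins for real GPT-2 outputs here).
    proposed_token: token index sampled from q_draft.
    Returns (token, accepted).
    """
    # Accept the draft token with probability min(1, p/q) at that token.
    accept_prob = min(1.0, p_target[proposed_token] / q_draft[proposed_token])
    if rng.random() < accept_prob:
        return proposed_token, True
    # On rejection, resample from the residual max(0, p - q), renormalized.
    # This correction keeps the final sample distributed exactly as p_target.
    residual = np.maximum(p_target - q_draft, 0.0)
    residual /= residual.sum()
    return int(rng.choice(len(p_target), p=residual)), False
```

For example, with `p_target = [0.5, 0.5, 0, 0]` and a uniform draft `q_draft = [0.25, 0.25, 0.25, 0.25]`, a proposed token 0 is always accepted (ratio 0.5/0.25 ≥ 1), while a proposed token 2 is always rejected and resampled from the residual over tokens 0 and 1.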
A curated list for Efficient Large Language Models
Implementation of the 2023 CVPR Award Candidate: On Distillation of Guided Diffusion Models
A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
General technology for enabling AI capabilities with LLMs and MLLMs
Universal LLM Deployment Engine with ML Compilation