Skip to content
View hydrochloricacid's full-sized avatar

Block or report hydrochloricacid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 32,572 2,165 Updated Feb 27, 2025

Awesome coreset/core-set/subset/sample selection works.

172 9 Updated Jun 30, 2024

Code for coreset selection methods

Python 221 41 Updated Feb 27, 2023

Fully open reproduction of DeepSeek-R1

Python 21,660 1,919 Updated Feb 27, 2025

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

850 50 Updated Feb 27, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 22,910 2,277 Updated Feb 27, 2025

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,628 1,670 Updated Feb 27, 2025

Exclusively Dark (ExDARK) dataset which to the best of our knowledge, is the largest collection of low-light images taken in very low-light environments to twilight (i.e 10 different conditions) to…

MATLAB 552 104 Updated Aug 29, 2023

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 2,381 347 Updated Aug 15, 2024

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

Jupyter Notebook 716 49 Updated Feb 22, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,411 2,158 Updated Feb 1, 2025

Ongoing research training transformer models at scale

Python 11,567 2,595 Updated Feb 26, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 10,963 1,092 Updated Feb 27, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 79,429 9,476 Updated Feb 27, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 129,942 10,625 Updated Feb 27, 2025

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML …

Python 335 33 Updated Feb 26, 2024
Jupyter Notebook 79 12 Updated Oct 15, 2023

Data validation using Python type hints

Python 22,610 2,024 Updated Feb 27, 2025

the LLM vulnerability scanner

Python 3,949 359 Updated Feb 26, 2025

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,464 445 Updated Feb 27, 2025

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Python 88 12 Updated Feb 24, 2025

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,309 75 Updated Feb 21, 2025

Fork of https://huggingface.co/hexgrad/Kokoro-82M

Python 26 7 Updated Jan 12, 2025

SoftVC VITS Singing Voice Conversion

Python 26,614 4,915 Updated Nov 11, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 41,340 4,614 Updated Feb 27, 2025

Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything

Python 1,217 80 Updated Nov 7, 2024

[CVPR 2023] DepGraph: Towards Any Structural Pruning

Python 2,887 342 Updated Feb 20, 2025

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Python 1,919 209 Updated Mar 21, 2024
Next
Showing results