Stars
Python3 package for Chinese/English OCR with a PaddleOCR-v4 ONNX model (~14 MB). Inference is based on the ppocr-v4-onnx model, enabling accurate millisecond-level OCR on CPU; achieves open-source SOTA for general-purpose Chinese/English OCR.
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
An Open Large Reasoning Model for Real-World Solutions
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Use PEFT or full-parameter training to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Java version of LangChain, empowering LLMs for Big Data.
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
High-quality datasets, tools, and concepts for LLM fine-tuning.
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Robust recipes to align language models with human and AI preferences
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
[ICLR2022] official implementation of UniFormer
TrustRAG: a RAG framework with reliable input and trusted output.
Phase 3 of the Chinese LLaMA & Alpaca large-model project (Chinese Llama-3 LLMs), developed from Meta Llama 3.
A Strong FuxiCTR Baseline for News CTR Challenge at RecSys 2024
Firefly: a training tool for large models, supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models.
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.