submartingales

21 followers · 194 following

Stars

democratize-llms

10 repositories

ggerganov / llama.cpp

LLM inference in C/C++

C++ 69,383 9,994 Updated Dec 18, 2024

CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,538 472 Updated Jan 8, 2024

meta-llama / llama

Inference code for Llama models

Python 56,846 9,614 Updated Aug 18, 2024

shibing624 / textgen

TextGen: Implementation of text generation models, including LLaMA, ChatGLM, BLOOM, GPT2, Seq2Seq, BART, T5, UDA, SongNet, and so on. Supports training and prediction out of the box.

Python 942 108 Updated Sep 14, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,272 4,468 Updated Dec 17, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,737 2,281 Updated Aug 12, 2024

baichuan-inc / Baichuan-13B

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,976 236 Updated Sep 6, 2023

yangjianxin1 / Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 5,953 532 Updated Oct 24, 2024

nlpxucan / WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,296 722 Updated Aug 5, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,805 1,196 Updated Dec 12, 2024
