Skip to content
View submartingales's full-sized avatar

Block or report submartingales

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

democratize-llms

10 repositories

LLM inference in C/C++

C++ 69,383 9,994 Updated Dec 18, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,538 472 Updated Jan 8, 2024

Inference code for Llama models

Python 56,846 9,614 Updated Aug 18, 2024

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。

Python 942 108 Updated Sep 14, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,272 4,468 Updated Dec 17, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,737 2,281 Updated Aug 12, 2024

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,976 236 Updated Sep 6, 2023

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,953 532 Updated Oct 24, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,296 722 Updated Aug 5, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,805 1,196 Updated Dec 12, 2024