Starred repositories
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A quick guide (especially) for trending instruction finetuning datasets
Curated list of datasets and tools for post-training.
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Decentralized Autonomous Regulated Company (DARC), a company virtual machine that runs on any EVM-compatible blockchain, with an on-chain law system, multi-level tokens, and a dividends mechanism.
Transformer-related optimization, including BERT and GPT
A Python implementation of global optimization with Gaussian processes.
Collection of common code shared among different research projects in the FAIR computer vision team.
Arithmetic Intensity calculator for PyTorch models