Official inference framework for 1-bit LLMs

C++ 12,463 873 Updated Dec 20, 2024

A bibliography and survey of the papers surrounding o1

TeX 981 39 Updated Nov 16, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,185 1,749 Updated Oct 15, 2024

Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

Python 141 5 Updated Dec 16, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,906 322 Updated Dec 26, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 19,687 1,383 Updated Dec 26, 2024

Jupyter Notebook 579 65 Updated Dec 10, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 678 48 Updated Sep 27, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,186 555 Updated Oct 19, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 155 13 Updated Oct 28, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 796 53 Updated Dec 16, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,886 113 Updated Jul 29, 2024

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 36,385 5,258 Updated Dec 26, 2024

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Python 1,512 125 Updated Dec 26, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,624 216 Updated Dec 20, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 577 42 Updated Nov 18, 2024

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

TypeScript 17,674 1,656 Updated Dec 26, 2024

LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)

Python 121 6 Updated Nov 9, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,601 4,960 Updated Dec 26, 2024

Grok open release

Python 49,750 8,345 Updated Aug 30, 2024

Robust recipes to align language models with human and AI preferences

Python 4,830 419 Updated Nov 21, 2024

Large Context Attention

Python 659 53 Updated Aug 12, 2024

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,011 64 Updated Sep 27, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,318 514 Updated Jul 31, 2024

Train Models Contrastively in PyTorch

Python 554 41 Updated Nov 18, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,097 41 Updated Dec 25, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,622 177 Updated Aug 17, 2024

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…

Python 126 16 Updated Apr 11, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,320 56 Updated Dec 10, 2024