Stars
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
Pretraining code for a large-scale depth-recurrent language model
Code and datasets for the paper "Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment"
Align Anything: Training All-modality Model with Feedback
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
[ICML 2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
Medical o1, Towards medical complex reasoning with LLMs
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…
🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
List of Molecular and Material design using Generative AI and Deep Learning
[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
Large Concept Models: Language modeling in a sentence representation space
Instruct-tune LLaMA on consumer hardware