Stars
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
A tool to configure, launch and manage your machine learning experiments.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery. CVPRW 2024
Scalable data pre processing and curation toolkit for LLMs
A collection of design patterns/idioms in Python
✨✨Latest Advances on Multimodal Large Language Models
Robust recipes to align language models with human and AI preferences
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Minimalistic large language model 3D-parallelism training
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Scalable toolkit for efficient model alignment
Ongoing research training transformer models at scale
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Machine Learning Engineering Open Book
Resources from the EleutherAI Math Reading Group
neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
chatbot does what you ask, like open Google search, post a Tweet, etc.
LAVIS - A One-stop Library for Language-Vision Intelligence
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.