Lists (5)
Sort Name ascending (A-Z)
Starred repositories
A project to improve skills of large language models
Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
DSPy: The framework for programming—not prompting—language models
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Everything about the SmolLM & SmolLM2 family of models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Code and documentation to train Stanford's Alpaca models, and generate the data.
RAG applications repo for Uplimit course
Build and run Docker containers leveraging NVIDIA GPUs
C++ HPC Tutorial materials
A cloud-native vector database, storage for next generation AI applications
A tool to configure, launch and manage your machine learning experiments.
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
cuVS - a library for vector search and clustering on the GPU
Scalable data pre processing and curation toolkit for LLMs
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.