Skip to content
View jxtngx's full-sized avatar

Block or report jxtngx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A project to improve skills of large language models

Python 205 44 Updated Dec 12, 2024

Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.

Python 52 2 Updated Dec 12, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,363 2,536 Updated Aug 15, 2024

DSPy: The framework for programming—not prompting—language models

Python 20,136 1,526 Updated Dec 13, 2024

A library of extension and helper modules for Python's data analysis and machine learning libraries.

Python 4,926 875 Updated Nov 15, 2024

Everything about the SmolLM & SmolLM2 family of models

Python 1,411 66 Updated Dec 2, 2024

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 2,964 652 Updated Dec 10, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 789 66 Updated Dec 13, 2024

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 11,784 818 Updated Dec 13, 2024

BioNeMo Framework: For building and adapting AI models in drug discovery at scale

Python 222 24 Updated Dec 13, 2024

NVIDIA ACE samples, workflows, and resources

HCL 213 50 Updated Dec 12, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,615 3,401 Updated Dec 12, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,460 1,407 Updated Dec 9, 2024

structured outputs for llms

Python 8,551 678 Updated Dec 12, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,710 520 Updated Dec 13, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,653 4,057 Updated Jul 17, 2024

cuML - RAPIDS Machine Learning Library

C++ 4,292 536 Updated Dec 13, 2024

cuDF - GPU DataFrame Library

C++ 8,525 912 Updated Dec 13, 2024

RAG applications repo for Uplimit course

Python 5 93 Updated Dec 10, 2024

Build and run Docker containers leveraging NVIDIA GPUs

17,274 2,028 Updated Dec 6, 2023

C++ HPC Tutorial materials

C++ 6 Updated Dec 22, 2022

A cloud-native vector database, storage for next generation AI applications

Go 31,283 2,966 Updated Dec 13, 2024
Jupyter Notebook 57 21 Updated Dec 13, 2024

A tool to configure, launch and manage your machine learning experiments.

Python 85 21 Updated Dec 13, 2024

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

C++ 464 89 Updated Oct 23, 2024

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 803 197 Updated Dec 12, 2024

cuVS - a library for vector search and clustering on the GPU

Cuda 241 67 Updated Dec 13, 2024

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 668 89 Updated Dec 13, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,256 405 Updated Dec 13, 2024

CUDA Python Low-level Bindings

Python 996 82 Updated Dec 13, 2024
Next