Skip to content
Change the repository type filter

All

    Repositories list

    • The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
      LLVM
      Other
      13k000Updated Jan 7, 2025Jan 7, 2025
    • HugeCTR

      Public
      HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
      C++
      Apache License 2.0
      200001Updated Nov 3, 2024Nov 3, 2024
    • ucx

      Public
      Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
      C
      Other
      437000Updated Nov 3, 2024Nov 3, 2024
    • Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
      Jupyter Notebook
      2.3k002Updated Nov 3, 2024Nov 3, 2024
    • verilator

      Public
      Verilator open-source SystemVerilog simulator and lint system
      C++
      GNU Lesser General Public License v3.0
      635000Updated Oct 12, 2024Oct 12, 2024
    • protobuf

      Public
      Protocol Buffers - Google's data interchange format
      C++
      Other
      16k004Updated Sep 19, 2024Sep 19, 2024
    • TensorRT

      Public
      NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
      C++
      Apache License 2.0
      2.2k104Updated Sep 10, 2024Sep 10, 2024
    • Merlin Models is a collection of deep learning recommender system model reference implementations
      Python
      Apache License 2.0
      50001Updated Sep 3, 2024Sep 3, 2024
    • openlane2

      Public
      The next generation of OpenLane, rewritten from scratch with a modular architecture
      Python
      Apache License 2.0
      46001Updated Sep 3, 2024Sep 3, 2024
    • The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
      Jupyter Notebook
      Apache License 2.0
      1.5k000Updated Aug 18, 2024Aug 18, 2024
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
      C++
      Apache License 2.0
      1.1k001Updated Aug 11, 2024Aug 11, 2024
    • cutlass

      Public
      CUDA Templates for Linear Algebra Subroutines
      C++
      Other
      1.1k000Updated Jul 11, 2024Jul 11, 2024
    • Python
      Apache License 2.0
      207000Updated May 11, 2024May 11, 2024
    • TensorDict is a pytorch dedicated tensor container.
      Python
      MIT License
      81001Updated May 6, 2024May 6, 2024
    • sionna

      Public
      Sionna: An Open-Source Library for Next-Generation Physical Layer Research
      Jupyter Notebook
      Other
      265000Updated Apr 25, 2024Apr 25, 2024
    • rl

      Public
      A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
      Python
      MIT License
      335000Updated Apr 14, 2024Apr 14, 2024
    • mujoco

      Public
      Multi-Joint dynamics with Contact. A general purpose physics simulator.
      C++
      Apache License 2.0
      899000Updated Mar 29, 2024Mar 29, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.9k100Updated Mar 29, 2024Mar 29, 2024
    • memray

      Public
      Memray is a memory profiler for Python
      Python
      Apache License 2.0
      397000Updated Mar 29, 2024Mar 29, 2024
    • projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data
      C++
      Apache License 2.0
      76000Updated Mar 21, 2024Mar 21, 2024
    • A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
      Jupyter Notebook
      Other
      254000Updated Mar 12, 2024Mar 12, 2024
    • The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
      Jupyter Notebook
      Apache License 2.0
      5.8k000Updated Feb 21, 2024Feb 21, 2024
    • locust

      Public
      Write scalable load tests in plain Python 🚗💨
      Python
      MIT License
      3k000Updated Feb 4, 2024Feb 4, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.6k000Updated Jan 16, 2024Jan 16, 2024
    • llm-awq

      Public
      AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
      Python
      MIT License
      232000Updated Jan 13, 2024Jan 13, 2024
    • cusignal

      Public
      cuSignal - RAPIDS Signal Processing Library
      Python
      Other
      132000Updated Dec 22, 2023Dec 22, 2023
    • MatX

      Public
      An efficient C++17 GPU numerical computing library with Python-like syntax
      C++
      BSD 3-Clause "New" or "Revised" License
      93000Updated Dec 19, 2023Dec 19, 2023
    • C++/CUDA/Python multimedia utilities for NVIDIA Jetson
      C++
      MIT License
      300000Updated Dec 3, 2023Dec 3, 2023
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      MIT License
      2.1k000Updated Nov 14, 2023Nov 14, 2023
    • linux

      Public
      Linux kernel source tree
      C
      Other
      55k000Updated Nov 8, 2023Nov 8, 2023