@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is a machine learning infrastructure platform for serving models of any size and modality in production, with a focus on performance, scalability, and cost.


Truss

Truss, an open-source project by Baseten, is the simplest way to serve AI/ML models in production.

Why Truss?

  • Write once, run anywhere: Package and test model code, weights, and dependencies with a model server that behaves the same in development and production.
  • Fast developer loop: Implement your model with fast feedback from a live reload server, and skip Docker and Kubernetes configuration with Truss' done-for-you model serving environment.
  • Support for all Python frameworks: From transformers and diffusers to PyTorch and TensorFlow to XGBoost and sklearn, Truss supports models created with any framework, even entirely custom models.
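
To make the packaging idea above concrete, here is a minimal sketch of the kind of model class Truss serves. It assumes the Truss convention of a `Model` class with `load` and `predict` methods (conventionally placed in `model/model.py`). The uppercase "model" and the `text`/`output` keys are hypothetical placeholders, not part of Truss itself.

```python
class Model:
    """Toy stand-in for a model packaged with Truss (hypothetical example)."""

    def __init__(self, **kwargs):
        # Configuration arrives as keyword arguments; this sketch ignores it.
        self._model = None

    def load(self):
        # Runs once before serving; a real model would load its weights here.
        self._model = lambda text: text.upper()

    def predict(self, model_input):
        # Called per request with the parsed request body.
        return {"output": self._model(model_input["text"])}


if __name__ == "__main__":
    model = Model()
    model.load()
    print(model.predict({"text": "hello"}))
```

Because the class is plain Python, it can be exercised locally exactly as shown before it is handed to a model server.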


Pinned

  1. truss Public

    The simplest way to serve AI/ML models in production

    Python · 929 stars · 74 forks

  2. truss-examples Public

    Examples of models deployable with Truss

    Python · 147 stars · 37 forks

Repositories

Showing 10 of 45 repositories
  • truss Public

    The simplest way to serve AI/ML models in production

    Python · 929 stars · MIT · 74 forks · 60 issues (5 need help) · 19 pull requests · Updated Dec 9, 2024
  • truss-examples Public

    Examples of models deployable with Truss

    Python · 147 stars · MIT · 37 forks · 11 issues · 56 pull requests · Updated Dec 7, 2024
  • autoscaler Public Forked from kubernetes/autoscaler

    Autoscaling components for Kubernetes

    Go · 0 stars · Apache-2.0 · 4,057 forks · 0 issues · 2 pull requests · Updated Nov 12, 2024
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    Python · 0 stars · Apache-2.0 · 898 forks · 0 issues · 2 pull requests · Updated Nov 7, 2024
  • HackMIT-2024 Public
    Jupyter Notebook · 2 stars · 1 fork · 0 issues · 0 pull requests · Updated Sep 14, 2024
  • Workshop-TRT-LLM
    Python · 16 stars · 11 forks · 0 issues · 0 pull requests · Updated Jun 26, 2024
  • gpu-operator Public Forked from NVIDIA/gpu-operator

    NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

    Go · 0 stars · Apache-2.0 · 316 forks · 0 issues · 3 pull requests · Updated Apr 19, 2024
  • TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

    C++ · 0 stars · Apache-2.0 · 1,030 forks · 0 issues · 0 pull requests · Updated Apr 2, 2024
  • .github Public
    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Jan 26, 2024
  • triton-inference-server Public Forked from triton-inference-server/server

    The Triton Inference Server provides an optimized cloud and edge inferencing solution.

    Python · 0 stars · BSD-3-Clause · 1,512 forks · 0 issues · 0 pull requests · Updated Jan 11, 2024
