⚙️ MLOps
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
An Engine-Agnostic Deep Learning Framework in Java
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
⚗️ Instill Model contains components for AI model orchestration
prompt2model - Generate Deployable Models from Natural Language Instructions
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Kubernetes-friendly ML model management, deployment, and serving.
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
A series of Terraform based recipes to provision popular MLOps stacks on the cloud.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning models
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
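A minimal sketch of how a Sacred experiment is typically declared; the experiment name, hyperparameters, and returned metric below are illustrative placeholders, not taken from any particular project.

```python
from sacred import Experiment

ex = Experiment("toy_training")  # hypothetical experiment name

@ex.config
def config():
    # Values defined here are captured and logged by Sacred, and can be
    # overridden from the command line.
    learning_rate = 0.01
    epochs = 5

@ex.automain
def run(learning_rate, epochs):
    # Sacred injects config values by name and records the run so it
    # can be reproduced later.
    print(f"training for {epochs} epochs at lr={learning_rate}")
    return 0.97  # hypothetical final metric, stored as the run result
```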
😎 A curated list of awesome MLOps tools
JVector: the most advanced embedded vector search engine
The open source Solver AI for Java, Python and Kotlin to optimize scheduling and routing. Solve the vehicle routing problem, employee rostering, task assignment, maintenance scheduling and other planning problems.
An open-source ML pipeline development platform
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
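A small sketch of the core distributed runtime mentioned above: an ordinary Python function is turned into a remote task and scheduled by Ray; the function and inputs are placeholders.

```python
import ray

ray.init()  # starts a local Ray runtime; pass an address to join a cluster

@ray.remote
def square(x):
    # Runs as a distributed task; Ray schedules it on an available worker.
    return x * x

# .remote() returns futures immediately; ray.get() blocks for the results.
futures = [square.remote(i) for i in range(8)]
print(ray.get(futures))  # [0, 1, 4, 9, 16, 25, 36, 49]
```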
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
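A brief sketch of what a Kedro pipeline looks like; the node functions and dataset names (`raw_data`, `clean_data`, `features`) are assumptions and would normally be backed by entries in the project's Data Catalog.

```python
from kedro.pipeline import Pipeline, node

def clean(raw_df):
    # Drop incomplete rows; stands in for real preprocessing logic.
    return raw_df.dropna()

def featurize(clean_df):
    # Add a simple derived column as a placeholder feature.
    return clean_df.assign(total=clean_df.sum(axis=1))

# Dataset names refer to entries that would live in the Data Catalog.
data_pipeline = Pipeline([
    node(clean, inputs="raw_data", outputs="clean_data"),
    node(featurize, inputs="clean_data", outputs="features"),
])
```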
Run any open-source LLM, such as Llama or Mistral, as an OpenAI-compatible API endpoint in the cloud.
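A sketch of how such an OpenAI-compatible endpoint is usually consumed from the official `openai` client; the base URL, API key, and model name are placeholders for whatever the server actually exposes.

```python
from openai import OpenAI

# The endpoint URL, API key, and model name are placeholders for
# whatever the locally served model actually reports.
client = OpenAI(base_url="http://localhost:3000/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="my-local-llama",
    messages=[{"role": "user", "content": "Summarize MLOps in one sentence."}],
)
print(resp.choices[0].message.content)
```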
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Running large language models on a single GPU for throughput-oriented scenarios.
A voice-enabled chatbot application built using 🦜️🔗 LangChain, text-to-speech and speech-to-text models from 🤗 Hugging Face, and 🍱 BentoML.
Interact with your documents using the power of GPT, 100% privately, no data leaks
Large Language Model Text Generation Inference
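A small sketch of querying a running Text Generation Inference server with `huggingface_hub.InferenceClient`; the local URL and generation parameters are assumptions and depend on how the server was launched.

```python
from huggingface_hub import InferenceClient

# Assumes a TGI server is already listening locally, e.g. started from
# the official Docker image; the URL and parameters are placeholders.
client = InferenceClient("http://localhost:8080")
print(client.text_generation("What is MLOps?", max_new_tokens=64))
```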