#

serving

Here are 112 public repositories matching this topic...

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Updated Dec 4, 2024
Python

tensorflow / serving

A flexible, high-performance serving system for machine learning models

python machine-learning deep-neural-networks deep-learning neural-network cpp tensorflow ml serving

Updated Dec 1, 2024
C++

vespa

vespa-engine / vespa

AI + Data, online. https://vespa.ai

java search-engine machine-learning big-data ai server cpp tensorflow vespa serving serving-recommendation vector-search

Updated Dec 4, 2024
Java

SeldonIO / seldon-core

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

kubernetes machine-learning deployment serving aiops production-machine-learning mlops machine-learning-operations

Updated Dec 2, 2024
HTML

ahkarami / Deep-Learning-in-Production

In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

Updated Nov 9, 2024

pytorch / serve

Serve, optimize and scale PyTorch models in production

docker kubernetes machine-learning cpu deep-learning metrics gpu optimization pytorch serving mlops

Updated Dec 1, 2024
Java

PaddlePaddle / FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

android intel rockchip object-detection jetson tensorrt serving onnx openvino onnxruntime graphcore yolov5 kunlun uie picodet stable-diffusion yolov8

Updated Nov 21, 2024
C++

evadb

georgia-tech-db / evadb

Database system for AI-powered apps

agent database ai data-analysis eva object-detection labeling hacktoberfest video-analytics serving huggingface gpt-4 llm chatgpt langchain gpt4all auto-gpt

Updated May 17, 2024
Python

Lightning-AI / LitServe

Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.

api web ai deep-learning rest-api artificial-intelligence developer-tools serving fastapi

Updated Dec 4, 2024
Python

tobegit3hub / tensorflow_template_application

TensorFlow template application for deep learning

machine-learning csv deep-learning tensorflow inference cnn lstm tensorboard mlp libsvm tfrecords wide-and-deep serving

Updated Jul 5, 2023
Python

llm-applications

ray-project / llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

machine-learning openai ray serving fine-tuning anyscale llms llama2

Updated Aug 2, 2024
Jupyter Notebook

dingodb / dingo

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

structured-data serving unstructured-data unified-sql vector-database mysql-compatibility embedding-search embedding-store key-value-distributed-store vector-ocean real-time-semantic-search

Updated Dec 4, 2024
Java

Delta-ML / delta

DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

Updated Apr 19, 2024
Python

ray-project / ray-llm

RayLLM - LLMs on Ray

distributed-systems transformers ray serving large-language-models llm llmops llm-serving llm-inference

Updated May 28, 2024
Python

PaddlePaddle / Serving

A flexible, high-performance carrier for machine learning models（『飞桨』服务化部署框架）

python docker deep-learning pipeline gpu prediction micro-service rpc-service dag paddle microservice-toolkit predictor serving online-service paddle-serving

Updated May 6, 2024
C++

tobegit3hub / simple_tensorflow_serving

Generic and easy-to-use serving service for machine learning models

http client machine-learning deep-learning tensorflow tensorflow-models serving savedmodel

Updated Jan 3, 2021
JavaScript

openvinotoolkit / model_server

A scalable inference server for models optimized with OpenVINO™

kubernetes machine-learning cloud ai deep-learning inference edge dag model-serving serving openvino

Updated Dec 4, 2024
C++

meta-soul / MetaSpore

A unified end-to-end machine intelligence platform

training ai machinelearning deeplearning abtesting serving

Updated Sep 2, 2024
Python

underneathall / pinferencia

Python + Inference - Model Deployment library in Python. Simplest model inference server ever.

Updated Feb 14, 2023
Python

polyaxon / haupt

Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon

Updated Nov 27, 2024
Python

Improve this page

Add a description, image, and links to the serving topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the serving topic, visit your repo's landing page and select "manage topics."