-
serve Public
Forked from pytorch/serveServe, optimize and scale PyTorch models in production
Java Apache License 2.0 UpdatedAug 12, 2024 -
deep-learning-containers Public
Forked from aws/deep-learning-containersAWS Deep Learning Containers (DLCs) are a set of Docker images for training and serving models in TensorFlow, TensorFlow 2, PyTorch, and MXNet.
Python Other UpdatedAug 9, 2024 -
djl-serving Public
Forked from deepjavalibrary/djl-servingA universal scalable machine learning model deployment solution
Java Apache License 2.0 UpdatedJul 23, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJun 25, 2024 -
djl-demo Public
Forked from deepjavalibrary/djl-demoDemo applications showcasing DJL
Jupyter Notebook Apache License 2.0 UpdatedJan 18, 2024 -
djl Public
Forked from deepjavalibrary/djlAn Engine-Agnostic Deep Learning Framework in Java
Java Apache License 2.0 UpdatedSep 22, 2023 -
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 UpdatedSep 10, 2023 -
fastertransformer_backend Public
Forked from triton-inference-server/fastertransformer_backendPython BSD 3-Clause "New" or "Revised" License UpdatedAug 1, 2023 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
C++ Apache License 2.0 UpdatedAug 1, 2023 -
sagemaker-inference-toolkit Public
Forked from aws/sagemaker-inference-toolkitServe machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.
Python Apache License 2.0 UpdatedApr 3, 2023 -
sagemaker-pytorch-inference-toolkit Public
Forked from aws/sagemaker-pytorch-inference-toolkitToolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at https://github.com/aws/deep-learning-containers.
Python Apache License 2.0 UpdatedApr 1, 2023 -
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python MIT License UpdatedJan 10, 2023 -
amazon-sagemaker-examples Public
Forked from aws/amazon-sagemaker-examplesExample 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Jupyter Notebook Apache License 2.0 UpdatedJul 18, 2022 -
server Public
Forked from triton-inference-server/serverThe Triton Inference Server provides an optimized cloud and edge inferencing solution.
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 25, 2022 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedNov 9, 2020 -
pytorch-micro-benchmarking Public
Forked from ROCm/pytorch-micro-benchmarkingPython UpdatedOct 14, 2020 -
CppND-Memory-Management-Chatbot Public
Forked from udacity/CppND-Memory-Management-ChatbotC++ UpdatedJul 6, 2020 -
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 18, 2020 -
-
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesDeep Learning Examples
Python UpdatedJun 9, 2020 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Python Apache License 2.0 UpdatedApr 21, 2020 -
dlrm Public
Forked from facebookresearch/dlrmAn implementation of a deep learning recommendation model (DLRM)
-
CppND-Garbage-Collector Public
Forked from udacity/CppND-Garbage-CollectorProject starter code for the Garbage Collector project.
C++ UpdatedSep 11, 2019 -
rocm-caffe2 Public
Forked from rocmarchive/realcaffe2The official Caffe2 port on AMD platform, rocm-caffe2 is
C++ Apache License 2.0 UpdatedAug 10, 2018 -
-
-
-
-
Thrust Public
Forked from ROCm/ThrustHIP back-end for Thrust
C++ Apache License 2.0 UpdatedApr 16, 2018 -
caffe2 Public
Forked from facebookarchive/caffe2Caffe2 is a lightweight, modular, and scalable deep learning framework.
C++ Apache License 2.0 UpdatedMar 26, 2018