title	description	services	ms.service	ms.component	ms.topic	ms.reviewer	ms.author	author	ms.date
ONNX and Azure Machine Learning \| Create and deploy models	Learn about ONNX and how to use Azure Machine Learning to create and deploy ONNX models	machine-learning	machine-learning	core	conceptual	jmartens	prasantp	prasanthpul	09/24/2018

ONNX and Azure Machine Learning: Create and deploy interoperable AI models

The Open Neural Network Exchange (ONNX) format is an open standard for representing machine learning models. ONNX is supported by a community of partners, including Microsoft, who create compatible frameworks and tools. Microsoft is committed to open and interoperable AI so that data scientists and developers can:

Use the framework of their choice to create and train models
Deploy models cross-platform with minimal integration work

Microsoft supports ONNX across its products including Azure and Windows to help you achieve these goals.

Why choose ONNX?

The interoperability you get with ONNX makes it possible to get great ideas into production faster. With ONNX, data scientists can choose their preferred framework for the job. Similarly, developers can spend less time getting models ready for production, and deploy across the cloud and edge.

You can create ONNX models from many frameworks, including PyTorch, Chainer, Microsoft Cognitive Toolkit (CNTK), MXNet, ML.Net, TensorFlow, Keras, SciKit-Learn, and more.

There is also an ecosystem of tools for visualizing and accelerating ONNX models. A number of pre-trained ONNX models are also available for common scenarios.

ONNX models can be deployed to the cloud using Azure Machine Learning and ONNX Runtime. They can also be deployed to Windows 10 devices using Windows ML. They can even be deployed to other platforms using converters that are available from the ONNX community.

[ ] (./media/concept-onnx/onnx.png#lightbox)

Get ONNX models

You can obtain ONNX models in several ways:

Get a pre-trained ONNX model from the ONNX Model Zoo (see example at the bottom of this article)
Generate a customized ONNX model from Azure Custom Vision service
Convert existing model from another format to ONNX (see example at the bottom of this article)
Train a new ONNX model in Azure Machine Learning service (see example at the bottom of this article)

Save/convert your models to ONNX

You can convert existing models to ONNX or save them as ONNX at the end of your training.

Framework for model	Conversion example or tool
PyTorch	Jupyter notebook
Microsoft Cognitive Toolkit (CNTK)	Jupyter notebook
TensorFlow	tensorflow-onnx converter
Chainer	Jupyter notebook
MXNet	Jupyter notebook
Keras, ScitKit-Learn, CoreML XGBoost, and libSVM	WinMLTools

You can find the latest list of supported frameworks and converters at the ONNX Tutorials site.

Deploy ONNX models in Azure

With Azure Machine Learning service, you can deploy, manage, and monitor your ONNX models. Using the standard deployment workflow and ONNX Runtime, you can create a REST endpoint hosted in the cloud. See a full example Jupyter notebook at the end of this article to try it out for yourself.

Install and configure ONNX Runtime

ONNX Runtime is an open source high-performance inference engine for ONNX models. It provides hardware acceleration on both CPU and GPU, with APIs available for Python, C#, and C. ONNX Runtime supports ONNX 1.2+ models and runs on Linux, Windows, and Mac. Python packages are available on PyPi.org (CPU, GPU), and C# package is on Nuget.org. See more about the project on Github.

To install ONNX Runtime for Python, use:

pip install onnxruntime

To call ONNX Runtime in your Python script, use:

import onnxruntime

session = onnxruntime.InferenceSession("path to model")

The documentation accompanying the model usually tells you the inputs and outputs for using the model. You can also use a visualization tool such as Netron to view the model. ONNX Runtime also lets you query the model metadata, inputs, and outputs:

session.get_modelmeta()
first_input_name = session.get_inputs()[0].name
first_output_name = session.get_outputs()[0].name

To inference your model, use run and pass in the list of outputs you want returned (leave empty if you want all of them) and a map of the input values. The result is a list of the outputs.

results = session.run(["output1", "output2"], {"input1": indata1, "input2": indata2})
results = session.run([], {"input1": indata1, "input2": indata2})

For the complete Python API reference, see the ONNX Runtime reference docs.

Example deployment steps

Here is an example for deploying an ONNX model:

Initialize your Azure Machine Learning service workspace. If you don't have one yet, learn how to create a workspace in this quickstart.

from azureml.core import Workspace

ws = Workspace.from_config()
print(ws.name, ws.resource_group, ws.location, ws.subscription_id, sep = '\n')

Register the model with Azure Machine Learning.

from azureml.core.model import Model

model = Model.register(model_path = "model.onnx",
                       model_name = "MyONNXmodel",
                       tags = ["onnx"],
                       description = "test",
                       workspace = ws)

Create an image with the model and any dependencies.

from azureml.core.image import ContainerImage

image_config = ContainerImage.image_configuration(execution_script = "score.py",
                                                  runtime = "python",
                                                  conda_file = "myenv.yml",
                                                  description = "test",
                                                  tags = ["onnx"]
                                                 )

image = ContainerImage.create(name = "myonnxmodelimage",
                              # this is the model object
                              models = [model],
                              image_config = image_config,
                              workspace = ws)

image.wait_for_creation(show_output = True)

The file score.py contains the scoring logic and needs to be included in the image. This file is used to run the model in the image. See this tutorial for instructions on how to create a scoring script. An example file for an ONNX model is shown below:

import onnxruntime
import json
import numpy as np
import sys
from azureml.core.model import Model

def init():
    global model_path
    model_path = Model.get_model_path(model_name = 'MyONNXmodel')

def run(raw_data):
    try:
        data = json.loads(raw_data)['data']
        data = np.array(data)
     
        sess = onnxruntime.InferenceSession(model_path)
        result = sess.run(["outY"], {"inX": data})
     
        return json.dumps({"result": result.tolist()})
    except Exception as e:
        result = str(e)
        return json.dumps({"error": result})

The file myenv.yml describes the dependencies needed for the image. See this tutorial for instructions on how to create an environment file, such as this sample file:

from azureml.core.conda_dependencies import CondaDependencies 

myenv = CondaDependencies()
myenv.add_pip_package("numpy")
myenv.add_pip_package("azureml-core")
myenv.add_pip_package("onnxruntime")

with open("myenv.yml","w") as f:
 f.write(myenv.serialize_to_string())

Deploy your ONNX model with Azure Machine Learning to:
- Azure Container Instances (ACI): Learn how...
- Azure Kubernetes Service (AKS): Learn how...

Examples

See how-to-use-azureml/deployment/onnx for example notebooks that create and deploy ONNX models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how-to-build-deploy-onnx.md

how-to-build-deploy-onnx.md

ONNX and Azure Machine Learning: Create and deploy interoperable AI models

Why choose ONNX?

Get ONNX models

Save/convert your models to ONNX

Deploy ONNX models in Azure

Install and configure ONNX Runtime

Example deployment steps

Examples

More info

Files

how-to-build-deploy-onnx.md

Latest commit

History

how-to-build-deploy-onnx.md

File metadata and controls

ONNX and Azure Machine Learning: Create and deploy interoperable AI models

Why choose ONNX?

Get ONNX models

Save/convert your models to ONNX

Deploy ONNX models in Azure

Install and configure ONNX Runtime

Example deployment steps

Examples

More info