Operationalization is the process of publishing models as web services, and the consumption of these services to produce business results.

Azure Machine Learning Operationalization is a component of the Azure CLI that enables deployment of models that use the CNTK, SPARK, and Python machine learning platforms.

You can use the Operationalization CLIs from the desktop app (File->Open Command-Line Interface) or install the CLIs direclty from the command line. They are also pre-installed on Azure DSVMs [https://docs.microsoft.com/en-us/azure/machine-learning/machine-learning-data-science-virtual-machine-overview].

Basic Concepts

Use of Docker for deployment

Azure ML Operationalization CLIs use Docker containers for packaging and deploying the model and its dependencies. Once the trained model is saved, it can be deployed using a CLI command.

Local and cluster deployment modes

Azure ML Operationalization provides two deployment modes: local and cluster.

Local mode deployments run in Docker containers on a single computer, whether that is your personal machine or a VM running on Azure. It is recommended to use local mode for development and testing.

In cluster mode, your service is run in Azure Container Service (ACS). The operationalization environment provisions Docker and Kubernetes in the cluster to manage web service deployment. Deploying to ACS allows you to scale your service as needed to meet your business needs.

The CLIs provide commands to set up the environments (local and cluster) for deployment. The envrionment services such as Storage, ACR, and ACS are created in your subscription.

Realtime and Batch processing

Depending on your business needs, you can deploy your service in one of two processing modes:

Realtime: Provides a web service that you call synchronously to score data on an as-needed basis.
Batch: Provides a web service that you call asynchronously to score a batch of data records.

Environment requirements

Before you can operationalize your model, you must set up your environment.

To set up the machine learning operationalization environment, you must have access to an Azure subscription with sufficient permissions to create Azure assets (e.g. Contributor or Admin access).

By default, the environment configuration only supports local deployments. Requirements to run in local mode are:

Azure CLI version 2.0
Azure Machine Learning CLI component
Python 3.5
Docker

Optionally, you can configure the environment to support cluster deployments. To do this you will need:

Spark for Spark based models
Microsoft Cognitive Tool Kit for CNTK-based models

Creating the Web Service

You can create a new web service in the Azure CLI using the ml service create command, which deploys the web service and returns a scoring endpoint URL.

When creating a realtime web service, you must supply a name for the service and a scoring script. The scoring script loads the data and runs it against the model to provide predictions.

Additionally, you can specify the following options for a realtime service:

Files and directories required by the service.
Enable Applications Insights logging.
The model to be deployed.
Add packages needed by the code file.
The runtime of the web service. Valid runtimes are spark-py, cntk-py, tensorflow-py, scikit-py. The default runtime is spark-py.
The input and output schema of the web service.
Number of nodes for the Kubernetes service in a cluster. The default is 1.

For a batch web service you can specify the following options:

Inputs for service to expect.
Outputs for service to expect.
Parameters for service to expect.
The title of service, defaults to service_name.
Files and directories required by the service.

Consuming the web service

Once the web service has been successfully deployed, you can call it using the CLI or by calling the scoring endpoint.

To test or use your new web service, you call it from the Vienna CLI using the ml service run command.

For more information on how to consume the web service, see Consuming your operationalized web service.

Additional Concepts

logging

You can troubleshoot your web service by viewing the logs. In local mode, you can inspect the Docker logs to diagnose issues with your deployment.

In cluster mode, you can specify that logging in performed using Azure Application Insights.

For more information on Docker logs, seeLogs and troubleshooting on the Docker web site.

For more information on logging in cluster mode, see How to enable logging.

Remote models, input data, and output data

Creating and running a web service in cluster mode uses models and data that is stored remotely. The CLI uses the credentials stored in your environment configuration to access blobs located in Azure storage. The CLI uses the same credentials to store output in blobs in the storage account.

For more information on accessing remote data, see How to use remotely stored data in a batch service call and Operationalizing Batch Processing of Machine Learning Models on Azure (Preview).

Scaling your web service

Operationalized models are deployed on ACS clusters on which Kubernetes has been installed, and can be scaled in two ways:

You can increase the number of agent nodes in the cluster.
You can increase the number of Kubernetes pods

For more information, see How to scale operationalization on your ACS cluster.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

operationalization-overview.md

operationalization-overview.md

Basic Concepts

Use of Docker for deployment

Local and cluster deployment modes

Realtime and Batch processing

Environment requirements

Creating the Web Service

Consuming the web service

Additional Concepts

logging

Remote models, input data, and output data

Scaling your web service

Files

operationalization-overview.md

Latest commit

History

operationalization-overview.md

File metadata and controls

Basic Concepts

Use of Docker for deployment

Local and cluster deployment modes

Realtime and Batch processing

Environment requirements

Creating the Web Service

Consuming the web service

Additional Concepts

logging

Remote models, input data, and output data

Scaling your web service