Skip to content

Latest commit

 

History

History
475 lines (282 loc) · 18.8 KB

File metadata and controls

475 lines (282 loc) · 18.8 KB

AI Reference Implementation Baseline Pattern Module

Overview

The AI Reference Implementation Baseline Pattern Module provides a secure, observable by default, scalable, and highly configurable foundation for deploying AI workloads on Azure. This pattern module integrates multiple Azure resources, following best practices and architectural standards, to deliver a comprehensive AI Reference Implementation. The goal is to accelerate the deployment of AI solutions by providing a ready-to-use infrastructure that adheres to Azure's Well-Architected Framework.

This pattern module is opinionated, meaning it comes with pre-configured defaults for security, observability, and essential AI resources. However, it remains flexible, allowing users to customize the environment to meet specific project needs by enabling or disabling various components.

Architecture

The pattern module is designed to be modular and composable. The default deployment includes a minimum set of resources required to establish a secure and observable AI environment, but additional resources can be added based on project requirements. Below is a high-level architecture diagram:

AI Reference Implementation Baseline Architecture

Key Features and Goals

This AI Reference Implementation pattern module is designed to accelerate the deployment of AI solutions on Azure, while ensuring security, compliance, flexibility, and observability. The primary objectives and functionalities of this module include:

  1. Security by Default: Ensuring that all deployed resources adhere to Azure's security best practices, including network isolation, encryption, and identity management. This guarantees that AI environments are secure from the outset. For more details, see Security practices.

  2. Observability: Providing out-of-the-box integration of logging, monitoring, and alerting, making the AI environment fully observable from day one. This ensures that all deployments are transparent and issues can be quickly identified and resolved. More details are available in Observability practices.

  3. Modular and Flexible Design: Composed of several modules that can be individually enabled or disabled, this pattern offers a flexible architecture that can be tailored to specific project needs. This modular approach allows teams to start with a minimal setup and expand as required, ensuring the AI environment is scalable and adaptable.

  4. Compliance with Azure Best Practices: Adhering to the recommendations of the Azure Well-Architected Framework, this module ensures that all resources and configurations are optimized for performance, reliability, security, and cost management.

  5. Rapid Deployment and Consistency: By providing a standardized reference implementation, this module accelerates the deployment process and ensures consistency across different projects and teams. This reduces variability and guarantees that best practices are consistently applied in all AI environments.

Usage

This pattern module is designed for both data scientists and engineers who need to quickly stand up a secure, scalable AI environment on Azure. It is also suitable for organizations that require a compliant and secure environment for their AI workloads with the flexibility to customize the setup based on project-specific needs.

Example Usage

To deploy the AI Reference Implementation Baseline with minimal configuration:

module "ai_reference_implementation" {
  source  = "Azure/avm-ptn-ai-reference-implementation/azurerm"
  version = "x.x.x"

  resource_group_name = "<your_resource_group>"
  location            = "<your_location>"
}

This example sets up the AI Reference Implementation with all default resources.

Use Cases

This module is ideal for:

  • Data Scientists: Who need a secure, scalable, and integrated environment to experiment, develop, and train machine learning models.
  • ML Engineers: Looking to deploy machine learning models into production with robust monitoring, scaling, and management capabilities.
  • Organizations: That require a compliant and secure environment for their machine learning workloads, with the flexibility to integrate with existing Azure services.

Extending the Pattern Module

The AI Reference Implementation Baseline Pattern Module is designed to be extended. You can add additional resources or services by integrating other Azure Verified Modules (AVM). For example, you can include additional machine learning environments, data lakes, or advanced AI services by simply integrating their respective modules and configuring them within the pattern module.

Additional Resources

  • Azure Well-Architected Framework: Azure WAF
  • Azure AI Documentation: Azure AI Services
  • Terraform Registry: Terraform AzureRM Provider

Contributing

This module is part of the Azure Verified Modules (AVM) ecosystem, and contributions are welcome. Please follow the standard contribution guidelines if you wish to submit enhancements or report issues.

Requirements

The following requirements are needed by this module:

Resources

The following resources are used by this module:

Required Inputs

The following input variables are required:

Description: The location/region where the resources will be deployed.

Type: string

Description: The name of the this resource.

Type: string

Description: The resource group where the resources will be deployed.

Type: string

Optional Inputs

The following input variables are optional (have default values):

Description: The address space that is used for the Azure Bastion subnet

Type: list(string)

Default:

[
  "10.1.3.0/24"
]

Description: The name of the Azure Bastion resource. if not provided, a name will be generated.

Type: string

Default: ""

Description: The name of the Network Security Group for the Azure Bastion subnet. If not provided, a name will be generated.

Type: string

Default: ""

Description: The name of the Azure Container Registry. If not provided, a name will be generated.

Type: string

Default: ""

Description: This variable controls whether or not telemetry is enabled for the module.
For more information see https://aka.ms/avm/telemetryinfo.
If it is set to false, then no telemetry will be collected.

Type: bool

Default: true

Description: This creates a jumpbox if configured with jumpbox.create = true and defaults to a Windows machine.

Type:

object({
    create  = bool
    name    = optional(string, "jumpbox")
    os_type = optional(string, "Windows")
    size    = optional(string, "Standard_D2s_v3")
    zone    = optional(string, "1")
    image_ref = optional(object({
      publisher = string
      offer     = string
      sku       = string
      version   = string
      }), {
      publisher = "microsoftwindowsdesktop"
      offer     = "windows-11"
      sku       = "win11-22h2-ent"
      version   = "latest"
    })
  })

Default:

{
  "create": false
}

Description: The name of the Azure Key Vault. If not provided, a name will be generated.

Type: string

Default: ""

Description: Controls the Resource Lock configuration for this resource. The following properties can be specified:

  • kind - (Required) The type of lock. Possible values are \"CanNotDelete\" and \"ReadOnly\".
  • name - (Optional) The name of the lock. If not specified, a name will be generated based on the kind value. Changing this forces the creation of a new resource.

Type:

object({
    kind = string
    name = optional(string, null)
  })

Default: null

Description: The name of the Log Analytics Workspace. If not provided, a name will be generated.

Type: string

Default: ""

Description: The name of the Azure Machine Learning Workspace. If not provided, a name will be generated.

Type: string

Default: ""

Description: The name of the Network Security Group for the private endpoints subnet. If not provided, a name will be generated.

Type: string

Default: ""

Description: The address space that is used for the private endpoints subnet

Type: list(string)

Default:

[
  "10.1.2.0/24"
]

Description: A map of role assignments to create on the . The map key is deliberately arbitrary to avoid issues where map keys maybe unknown at plan time.

  • role_definition_id_or_name - The ID or name of the role definition to assign to the principal.
  • principal_id - The ID of the principal to assign the role to.
  • description - (Optional) The description of the role assignment.
  • skip_service_principal_aad_check - (Optional) If set to true, skips the Azure Active Directory check for the service principal in the tenant. Defaults to false.
  • condition - (Optional) The condition which will be used to scope the role assignment.
  • condition_version - (Optional) The version of the condition syntax. Leave as null if you are not using a condition, if you are then valid values are '2.0'.
  • delegated_managed_identity_resource_id - (Optional) The delegated Azure Resource Id which contains a Managed Identity. Changing this forces a new resource to be created. This field is only used in cross-tenant scenario.
  • principal_type - (Optional) The type of the principal_id. Possible values are User, Group and ServicePrincipal. It is necessary to explicitly set this attribute when creating role assignments if the principal creating the assignment is constrained by ABAC rules that filters on the PrincipalType attribute.

Note: only set skip_service_principal_aad_check to true if you are assigning a role to a service principal.

Type:

map(object({
    role_definition_id_or_name             = string
    principal_id                           = string
    description                            = optional(string, null)
    skip_service_principal_aad_check       = optional(bool, false)
    condition                              = optional(string, null)
    condition_version                      = optional(string, null)
    delegated_managed_identity_resource_id = optional(string, null)
    principal_type                         = optional(string, null)
  }))

Default: {}

Description: The name of the Azure Storage Account. If not provided, a name will be generated.

Type: string

Default: ""

Description: A map of tags to add to all resources

Type: map(string)

Default: null

Description: The address space that is used for the virtual machines subnet

Type: list(string)

Default:

[
  "10.1.1.0/24"
]

Description: The name of the Virtual Network. If not provided, a name will be generated.

Type: string

Default: ""

Description: The name of the Network Security Group for the virtual machines subnet. If not provided, a name will be generated.

Type: string

Default: ""

Description: The address space that is used the virtual network

Type: list(string)

Default:

[
  "10.1.0.0/16"
]

Outputs

The following outputs are exported:

Description: This is the full output for the resource.

Description: The Azure resource id of the resource.

Modules

The following Modules are called:

Source: Azure/avm-res-machinelearningservices-workspace/azurerm

Version: 0.1.1

Source: Azure/avm-res-containerregistry-registry/azurerm

Version: ~> 0.2

Source: Azure/avm-res-network-bastionhost/azurerm

Version: 0.3.0

Source: Azure/avm-res-network-networksecuritygroup/azurerm

Version: ~> 0.2.0

Source: Azure/avm-res-compute-virtualmachine/azurerm

Version: 0.15.1

Source: Azure/avm-res-keyvault-vault/azurerm

Version: ~> 0.5

Source: Azure/avm-res-operationalinsights-workspace/azurerm

Version: ~> 0.1

Source: Azure/avm-res-network-networksecuritygroup/azurerm

Version: ~> 0.2.0

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

Source: Azure/avm-res-storage-storageaccount/azurerm

Version: 0.2.1

Source: Azure/avm-res-network-virtualnetwork/azurerm

Version: ~> 0.2.0

Source: Azure/avm-res-network-networksecuritygroup/azurerm

Version: ~> 0.2.0

Data Collection

The software may collect information about you and your use of the software and send it to Microsoft. Microsoft may use this information to provide services and improve our products and services. You may turn off the telemetry as described in the repository. There are also some features in the software that may enable you and Microsoft to collect data from users of your applications. If you use these features, you must comply with applicable law, including providing appropriate notices to users of your applications together with a copy of Microsoft’s privacy statement. Our privacy statement is located at https://go.microsoft.com/fwlink/?LinkID=824704. You can learn more about data collection and use in the help documentation and our privacy statement. Your use of the software operates as your consent to these practices.