This repository presents a solution and reference architecture for the Knowledge Mining solution accelerator. Please note that the provided code serves as a demonstration and is not an officially supported Microsoft offering.
For additional security, please review how to use Azure API Management with microservices deployed in Azure Kubernetes Service.
-
PowerShell (v5.1+) - available for Windows, macOS, and Linux.
-
Azure CLI (v2.0+) - command-line tool for managing Azure resources.
2a. kubectl - command-line tool for interacting with Kubernetes clusters.
In PowerShell, run the following command:az aks install-cli
2b. aks-preview - extension for Azure CLI to manage Azure Kubernetes Service.
In PowerShell, run the following command:az extension add --name aks-preview
-
Helm - package manager for Kubernetes
-
Docker Desktop: service to containerize and publish into Azure Container Registry. Please make sure Docker desktop is running before executing Deployment script.
-
Azure Access - subscription-level
Owner
orUser Access Administrator
role required.
Due to model availability within various data center regions, the following services have been hard-coded to specific regions.
-
Azure Open AI (GPT 4o mini):
The solution relies onGPT-4o mini
andtext-embedding-3-large
models which are all currently available in the 'WestUS3', 'EastUS', 'EastUS2', 'SwedenCentral' region.
Please check the model summary table and region availability if needed. -
Azure AI Document Intelligence (East US):
The solution relies on a2023-10-31-preview
or later that is currently available inEast US
region.
The deployment region for this model is fixed in 'East US'
The automated deployment process is very straightforward and simplified via a single deployment script that completes in approximately 10-15 minutes:
- Deploy Azure resources.
- Get secret information from Azure resources.
- Update application configuration files with secrets.
- Set Application Configuration in Azure App Configuration.
- Compile application, build image, and push to Azure Container Registry.
- Configure Kubernetes cluster infrastructure.
- Update Kubernetes configuration files.
- Deploy certificates, ingress controller and then application images from Azure Container Registry.
Open PowerShell, change directory where you code cloned, then run the deploy script:
cd .\Deployment\
.\resourdedeployment.ps1
If you run into issue with PowerShell script file not being digitally signed, you can execute below command:
powershell.exe -ExecutionPolicy Bypass -File ".\resourdedeployment.ps1"
You will be prompted for the following parameters with this Screen :
-
Subscription ID - copy/paste from Azure portal
-
Location - Azure data center where resources will be deployed.
*Please check Azure resource availability and note hardcoded regions. The following locations are currently supported:
'EastUS', 'EastUS2', 'WestUS', 'WestUS2', 'WestUS3', 'CentralUS', 'NorthCentralUS', 'SouthCentralUS','WestEurope', 'NorthEurope', 'SoutheastAsia', 'EastAsia', 'JapanEast', 'JapanWest', 'AustraliaEast', 'AustraliaSoutheast', 'CentralIndia', 'SouthIndia', 'CanadaCentral','CanadaEast', 'UKSouth', 'UKWest', 'FranceCentral', 'FranceSouth', 'KoreaCentral','KoreaSouth', 'GermanyWestCentral', 'GermanyNorth', 'NorwayWest', 'NorwayEast', 'SwitzerlandNorth', 'SwitzerlandWest', 'UAENorth', 'UAECentral', 'SouthAfricaNorth','SouthAfricaWest', 'BrazilSouth','BrazilSoutheast', 'QatarCentral', 'ChinaNorth', 'ChinaEast', 'ChinaNorth2', 'ChinaEast2'
-
ModelLocation - Azure data center where GPT model will be deployed.
The following locations are currently available :'WestUS3', 'EastUS', 'EastUS2', 'SwedenCentral'
-
Email - used for issuing certificates in Kubernetes clusters from the Let's Encrypt service. Email address should be valid.
-
GO ! - Deployment Script executes Azure deployment, Azure Infrastructure configuration, Application code compile and publish into Kubernetes Cluster.
Create Content Filter - Please follow below steps
- Navigate to Azure AI Studio
- Click on Create Content Filter and set the filters to a high threshold for the following categories:
Hate, Sexual, Self-harm, Violence
- Please select the checkbox of profanity
- Leave all other configurations at their default settings and click on create
Let's check the message and configure your model's TPM rate higher to get better performance.
You can check the Application URL from the final console message.
Don't miss this Url information. This is the application's endpoint URL and it should be used for your data importing process.
Capacity Note:
- The deployment script creates models with a setting of 1 token per minute (TPM) rate limit.
- Faster performance can be achieved by increasing the TPM limit with Azure OpenAI Studio.
- Capacity varies for regional quota limits as well as for provisioned throughput.
- As a starting point, we recommend the following quota threshold be set up for this service run.
Model Name | TPM Threshold |
---|---|
GPT-4o-mini | 100K TPM |
text-embedding-3-large | 200K TPM |
- Browse to Azure OpenAI Studio and select each of the 2 models within the
Deployments
menu:
- Increase the TPM value for each model for faster report generation:
After increasing the TPM limit for each model, let's upload and process the sample documents.
cd .\Deployment\
Execute uploadfiles.ps1 file with -EndpointUrl parameter as URL in console message.
.\uploadfiles.ps1 -EndpointUrl https://kmgs<your dns name>.<datacenter>.cloudapp.azure.com