Use GCP AI Platform Pipelines to Build an AutoML NLP Training and Deployment Pipeline

Objective

This project uses AI Platform Pipelines to orchestrate an AutoML NLP workflow, automating data import, training, evaluation, and deployment.

Architecture

This project uses Cloud Functions with HTTP triggers. There are two functions: one deploys the pipeline, and the other queries the pipeline's status.

The whole workflow uses AI Platform Pipelines (based on Kubeflow) as the orchestrator. The core service in the pipeline is AutoML NLP, which trains a custom text classification model. After the AutoML model is deployed, its model_id is stored in Firestore.

Deployment Steps

Create a project in GCP.
Use env.sh to set the project_id variables in the CLI. Running env.sh adds all of the variables to the environment.

In service_account.sh, modify the ADMIN_NAME parameter, then use the script to configure the service account.

Enable the related GCP services using config.sh.
Create the App Engine application and Firestore using config.sh.

Use config.sh to create a GCS bucket, copy the local Kubeflow template to that bucket, and make it public.

Next, use the commands in config.sh to deploy the two functions. automl_deploy is the AutoML pipeline function, and get_automl_status queries the pipeline's running status.

Open the AI Platform Pipelines service in the GCP console, create an instance, and get the client URL of this instance.

With this client URL and the GCS path of the training data, we can call the HTTP function to start the end-to-end AutoML workflow. test_function_command.sh, under the function-kubeflow folder, shows how to form the function request body.

To deploy the AutoML pipeline, we put the AI Platform Pipelines client URL, the Kubeflow template URL, the AutoML NLP dataset display name, the training data GCS path, and a Kubeflow name (used to name this Kubeflow pipeline and run) in the body.
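
As a rough illustration of the request described above, the following Python sketch assembles the five fields into a JSON body and wraps it in a POST request without sending it. The JSON key names (`client_url`, `template_url`, and so on) and the example URLs are hypothetical placeholders, not taken from this repository; the actual names are whatever test_function_command.sh uses.

```python
import json
import urllib.request


def build_deploy_request(function_url, client_url, template_url,
                         dataset_display_name, training_data_gcs_path,
                         kubeflow_name):
    """Build (but do not send) a POST request for the automl_deploy function.

    Every JSON key name below is an assumption made for illustration;
    check test_function_command.sh for the real request format.
    """
    body = {
        "client_url": client_url,                          # AI Platform Pipelines client URL
        "template_url": template_url,                      # Kubeflow template URL
        "dataset_display_name": dataset_display_name,      # AutoML NLP dataset display name
        "training_data_gcs_path": training_data_gcs_path,  # gs:// path to the training data
        "kubeflow_name": kubeflow_name,                    # names this pipeline and run
    }
    return urllib.request.Request(
        function_url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # All values here are placeholders.
    request = build_deploy_request(
        function_url="https://example.com/automl_deploy",
        client_url="https://example-pipelines-host.example.com",
        template_url="https://example.com/template.tar.gz",
        dataset_display_name="example_dataset",
        training_data_gcs_path="gs://example-bucket/training_data.csv",
        kubeflow_name="example_run",
    )
    print(request.get_method(), request.full_url)
    print(request.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would send the request; building it separately makes it easy to inspect the body first.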

To query the pipeline status, we put the AI Platform Pipelines client URL and the run ID (returned in the response body of the deploy request above) in the body.
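
A status check like this is usually repeated until the run finishes. The sketch below shows one way to do that in Python, under several assumptions: the key names `client_url` and `run_id`, the status strings, and the `fetch_status` callable (which stands in for the actual HTTP call to get_automl_status) are all hypothetical and not taken from this repository.

```python
import time


def build_status_request_body(client_url, run_id):
    """Return the status request body; both key names are assumptions."""
    return {"client_url": client_url, "run_id": run_id}


def poll_run_status(fetch_status, body, interval_seconds=30.0, max_attempts=20,
                    terminal_states=("Succeeded", "Failed", "Error", "Skipped")):
    """Call fetch_status(body) until it returns a terminal state.

    fetch_status is a caller-supplied function that sends the body to the
    status function and returns a status string. The terminal state names
    are assumed; adjust them to what the function actually returns.
    Returns the last status seen, even if it is not terminal.
    """
    status = None
    for attempt in range(max_attempts):
        status = fetch_status(body)
        if status in terminal_states:
            return status
        # Wait between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)
    return status


if __name__ == "__main__":
    # Simulated responses in place of a real HTTP call.
    fake_responses = iter(["Running", "Running", "Succeeded"])
    body = build_status_request_body("https://example-pipelines-host.example.com",
                                     "example-run-id")
    print(poll_run_status(lambda _body: next(fake_responses), body,
                          interval_seconds=0))
```

Keeping the HTTP call behind `fetch_status` lets the polling logic be tried out locally before pointing it at a deployed function.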

After starting automl_deploy, we can also check the status directly in the AI Platform Pipelines (Kubeflow) monitoring UI.

Code Description

config.sh, env.sh, and service_account.sh are all deployment-related scripts.
function-kubeflow contains the code for the two functions. function_deploy_command.sh shows how to deploy the functions (config.sh also contains the commands), and test_function_command.sh shows how to call them.
kubeflow-automl is the source code of the pipeline template. You can modify the source code and generate your own Kubeflow template using template_generate.sh. config.sh uploads the ready-made template directly from the local directory to GCS.

cloudymoma/kubeflow-automl-nlp-example
