
Add the MLOps platform as part of the GenAI infra #326

Open
wants to merge 7 commits into base: main
Conversation

andreeamun

Description

Add an MLOps platform to the proposed infrastructure to enable organisations to automate their ML workloads.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

Dependencies

  • Requires a Kubernetes (K8s) cluster underneath.

Tests

Describe the tests that you ran to verify your changes.

@lianhao requested a review from mkbhanda on August 23, 2024
@mkbhanda
Collaborator

@andreeamun thank you for your PR. I would like to understand better how MLOps currently fits in with OPEA, which is focused on RAG/GenAI pipelines, and likewise the Data Science kit. I agree they both pertain to machine learning, but we need some use cases to justify pulling them in here. This might be a logical evolution of OPEA, and we may also pull in Jupyter Notebooks. Further, if we are bringing in Ubuntu-related materials, there will need to be more documentation: currently we have not placed any requirements on the operating system running on the platforms/nodes.

@andreeamun
Author

Thank you @mkbhanda. An MLOps platform such as Charmed Kubeflow is crucial for RAG / GenAI pipelines because it automates some of the workloads and enables easy iteration. To be more precise, MLOps platforms can be used to:

  • Automate pipelines used for data ingestion
  • Perform model optimisation (e.g. hyperparameter tuning, fine-tuning, p-tuning)
  • Train models and automate some of those pipelines to make the work more efficient
  • Benefit from user management, network isolation, and further security enhancements for larger teams
  • Use a model registry and experiment tracking with MLflow
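
To make the experiment-tracking point concrete, here is a minimal stdlib-only sketch of the kind of per-run record an MLflow tracking server stores. The helper and field names are illustrative only, not MLflow's actual API:

```python
import json
import time

def log_run(experiment, params, metrics):
    """Record one training run's parameters and metrics, the kind of
    data an experiment tracker such as MLflow stores per run."""
    return {
        "experiment": experiment,
        "timestamp": time.time(),
        "params": params,    # e.g. the hyperparameters being tuned
        "metrics": metrics,  # e.g. validation loss for this run
    }

# Two runs of a hypothetical fine-tuning sweep:
run_a = log_run("llm-finetune", {"lr": 1e-4, "epochs": 3}, {"val_loss": 0.42})
run_b = log_run("llm-finetune", {"lr": 5e-5, "epochs": 3}, {"val_loss": 0.38})

# Picking the best run becomes a query over tracked metrics:
best = min([run_a, run_b], key=lambda r: r["metrics"]["val_loss"])
print(json.dumps(best["params"]))
```

With a real tracker the records would be written to a server and queried through its UI or API rather than compared in memory.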

Such a platform is a cloud-native application that runs on any CNCF-conformant Kubernetes, giving enterprises the freedom to build on their existing infrastructure regardless of the OS running underneath. However, I am happy to provide more details on the benefits of using Ubuntu as the recommended operating system.

Please let me know if this answers your question.

@daisy-ycguo
Contributor

@andreeamun Thank you for the contribution. Will you rebase and sign off your PR? Refer to the guidance here: https://github.com/opea-project/GenAIInfra/pull/326/checks?check_run_id=29702935232

@KfreeZ
Collaborator

KfreeZ commented Sep 6, 2024

> An MLOps platform such as Charmed Kubeflow is crucial when it comes to RAG / GenAI pipelines ... (full reply from @andreeamun quoted above)

If we want to support Kubeflow or DSS, we should provide detailed use cases, examples, and whatever code changes or scripts are needed.
Once we can see the OPEA examples running on top of Kubeflow or DSS, we can document that we support it.
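
As a sketch of what such an example might look like, here is a pure-Python decomposition of a RAG data-prep flow into pipeline steps. In a real Kubeflow pipeline each step would be a containerized component wired together with the kfp SDK; every name and step below is hypothetical:

```python
# A RAG data-prep flow expressed as composable steps, the shape a
# Kubeflow pipeline would automate. All step names are hypothetical.

def ingest(source: str) -> list[str]:
    # Stand-in for a data-ingestion component (e.g. loading documents).
    return [f"doc from {source}"]

def chunk(docs: list[str], size: int = 2) -> list[str]:
    # Stand-in for splitting documents into retrieval-sized chunks.
    return [d[i:i + size] for d in docs for i in range(0, len(d), size)]

def embed(chunks: list[str]) -> list[tuple[str, int]]:
    # Stand-in for an embedding component; here just chunk lengths.
    return [(c, len(c)) for c in chunks]

def pipeline(source: str) -> list[tuple[str, int]]:
    # In Kubeflow, each call below would be a component and this
    # function the pipeline definition that chains their outputs.
    return embed(chunk(ingest(source)))

index = pipeline("s3://corpus")
```

An OPEA example would replace these stand-ins with the project's actual data-prep and embedding services, which is the kind of concrete artifact being requested here.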

@mkbhanda
Collaborator

mkbhanda commented Sep 6, 2024

> An MLOps platform such as Charmed Kubeflow is crucial when it comes to RAG / GenAI pipelines ... (full reply from @andreeamun quoted above)

> If we want to support Kubeflow or DSS, we should provide detailed use cases ... (full reply from @KfreeZ quoted above)

Thank you @KfreeZ for your input! @andreeamun, OPEA starts from solving end-user problems with reference implementations. A use case illustrated in GenAIExamples, such as fine-tuning or data prep, would be more compelling than just documenting how to install Kubeflow. We have RFCs as PRs in https://github.com/opea-project/docs where we can collaborate on any use case you choose, to highlight its value.
