whylabs / whylogs

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

python data-science machine-learning analytics logging constraints dataset dataops data-pipeline data-quality calculate-statistics data-constraints mlops model-performance ml-pipelines ai-pipelines approximate-statistics statistical-properties

Updated Jan 10, 2025
Jupyter Notebook

sematic-ai / sematic

An open-source ML pipeline development platform

python data-science machine-learning ai pipeline ml python3 mlops ml-pipeline ml-ops ml-pipelines

Updated Jan 9, 2025
Python

zetane / ZetaForge

Open source AI platform for rapid development of advanced AI and AGI pipelines.

python kubernetes workflow data-science machine-learning ai ml agi developer-tools gpt claude mlops workflow-orchestration ml-pipelines llm zetaforge

Updated Jun 23, 2025
Python

evidentlyai / ml_observability_course

Free, open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in production.

data-quality-checks data-quality production-machine-learning mlops model-monitoring machine-learning-operations model-performance data-drift ml-pipelines ml-monitoring ml-observability llmops

Updated Dec 17, 2023
Jupyter Notebook

udellgroup / oboe

An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.

collaborative-filtering automl ml-pipelines

Updated Oct 23, 2021
Python

opctl / opctl

Free and open source automation platform

devops development automation containers ml-pipelines

Updated May 22, 2025
Go

bodywork-ml / ml-pipeline-engineering

Best practices for engineering ML pipelines.

devops machine-learning tutorial mlops ml-pipelines

Updated Jun 20, 2022
Jupyter Notebook

IBM / sail

Library for streaming data and incremental learning algorithms.

machine-learning deep-learning distributed-computing ray incremental-learning ml-pipelines automl-pipeline

Updated May 7, 2025
Python

Ark-kun / pipeline_components

Components that I have created for Kubeflow Pipelines. Try them in https://cloud-pipelines.net/pipeline-editor/

machine-learning pipelines mlops kubeflow-pipelines ml-pipelines kfp cloud-pipelines

Updated Aug 3, 2025
Python

banzuzi-carioni / cross-border-electricity-flow-prediction

Serverless ML system to predict the direction and volume of electricity flows to and from the Netherlands and its energy transmission partners.

machine-learning serverless etl energy-data xgboost-regression mlops ml-pipelines hopsworks

Updated Apr 7, 2025
Python

prateeksawhney97 / Disaster-Response-Pipeline

This project is part of the Data Science Nanodegree Program by Udacity, in collaboration with Figure Eight. The initial dataset contains pre-labelled tweets and messages from real-life disasters. The aim of the project is to build a Natural Language Processing tool that categorizes messages.

data-science machine-learning deep-learning flask-application flask-sqlalchemy disaster-management disaster-response etl-pipeline ml-pipelines

Updated May 14, 2020
Jupyter Notebook

leosmerling-hopeit / fraud-poc

Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/

machine-learning microservices dask fraud-detection dask-ml dask-distributed nbdev ml-pipelines

Updated Apr 12, 2023
Jupyter Notebook

Elkinmt19 / airflow-master

This is a repo that was created to learn more about Airflow and develop awesome data engineering projects. 🚀🚀

python docker airflow orchestration data-engineering data-pipelines dags ml-pipelines

Updated Oct 27, 2023
Python

saurabh-kudesia / real-world-ai-projects

A collection of real-world machine learning and AI projects. Explore hands-on implementations of cutting-edge models, practical solutions, and techniques to tackle real-world challenges using AI.

Updated Aug 13, 2025
Jupyter Notebook

souravlouha / ML_Practice_Studio

🧠 A hands-on workspace for practicing machine learning concepts, data preprocessing, and experimenting with small ML projects. This repo includes foundational Python scripts, real-world mini-projects, and experiments that reflect a progressive learning journey in applied machine learning.

image-processing data-visualization image-classification data-analysis image-recognition data-collection data-cleaning ai-framework ai-agents ml-project ml-engineering ml-models ml-pipelines nural-network ml-frameworks

Updated Aug 6, 2025
Jupyter Notebook

rochitasundar / DeepLearning.AI-Practical-Data-Science-On-AWS-Cloud-Specialization

This repository contains my code solutions to DeepLearning.AI's Practical Data Science on AWS Cloud Specialization.

aws glue s3-bucket ground-truth model-deployment human-in-the-loop-machine-learning mlops feature-store bert-model ml-pipelines blazingtext sagemaker-clarify sagemaker-autopilot

Updated Sep 26, 2023
Jupyter Notebook

chrisliatas / dsnd-ml-pipeline

ML pipeline to categorize emergency messages based on the needs communicated by the sender.

nlp text-classification etl-pipeline ml-pipelines

Updated Aug 26, 2025
Jupyter Notebook

yvgupta03 / Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

Big data application of machine learning concepts for sentiment classification of US Airlines tweets. The focus is on using PySpark libraries (ml-lib) on big data to solve a problem with machine learning algorithms, rather than on the choice of algorithm used to create the ML model. It also involves data pre-processing using NLP techniqu…

big-data twitter-sentiment-analysis databricks-notebooks pyspark-mllib ml-pipelines

Updated Jul 8, 2022
Jupyter Notebook

tbsraja / Personalized_Cancer_Treatment

Develop algorithms to classify genetic mutations based on clinical evidence (text).

data-analysis logistic-regression nlp-machine-learning svm-classifier random-forest-classifier ml-pipelines

Updated May 11, 2023
Jupyter Notebook

zacharyvunguyen / Production-Ready-ML-Pipeline-on-GCP-Baby-Weight-Prediction

In this project, I developed a complete pipeline with the Vertex AI and Kubeflow Pipelines SDKs to build and deploy an AutoML / BigQuery ML regression model for online predictions. Using this ML pipeline, I was able to develop, deploy, and manage the production ML lifecycle efficiently and reliably.

bigquery machine-learning google machine google-cloud-platform automl end-to-end-machine-learning streamlit bigqueryml ml-pipelines vertex-ai productionml

Updated May 15, 2025
Jupyter Notebook

ml-pipelines

Here are 34 public repositories matching this topic...
