This repository contains my submission for the Final Project: Build a Machine Learning Pipeline for Airfoil Noise Prediction. The original files were provided by the IBM Skills Network as part of the Machine Learning with Apache Spark course on Coursera. I have made modifications to fulfill the project requirements.
-
You are welcome to use this repository as a reference or starting point for your own project.
-
If you choose to fork this repository, please ensure that you comply with the terms of the Apache License and give proper credit to the original authors.
As a data engineer at an aeronautics consulting company, we take pride in our ability to efficiently design airfoils for use in both planes and sports cars. While our data scientists excel at Machine Learning, they rely on me to handle ETL (Extract, Transform, Load) tasks and construct ML pipelines.
- Clean the dataset
- Create a Machine Learning pipeline
- Evaluate the model's performance
- Persist it for future use
For this project, we will use a modified version of the NASA Airfoil Self-Noise dataset NASA_airfoil_noise_raw.csv
, which is available in this repository.
The original dataset can be found here: NASA Airfoil Self-Noise dataset. https://archive.ics.uci.edu/dataset/291/airfoil+self+noise
The dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Diagram of an airfoil (Source: IBM Skills Network).
Diagram showing the Angle of attack (Source: IBM Skills Network).
The metric values presented in the Final_Project.ipynb
notebook can vary across different Python versions. These variations may occur due to changes in underlying libraries, algorithms, or default behavior. To ensure successful completion of the Quiz: Final Project - Evaluation Submitted, it is essential to complete this project using the Python version available in the Skill Network Labs (SN Labs) environment. Python 3.7.12.
Install the required libraries using the provided requirements.txt
file. The command syntax is:
python3 -m pip install -r requirements.txt
Download the required NASA_airfoil_noise_raw.csv
file using the terminal command:
wget https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMSkillsNetwork-BD0231EN-Coursera/datasets/NASA_airfoil_noise_raw.csv
Create folder to save model:
mkdir -p Final_Project
Execute the code using the command:
python3 Final_Project.py
- IBM Skills Network © IBM Corporation 2023. All rights reserved.