Sklearn Pipeline Tutorial

Overview

This Jupyter notebook provides a quick tutorial on using the scikit-learn library for machine learning in Python. It combines discussions on the impact of hardware, specifically processing cores, with practical machine learning applications. Key sections include:

AI Generated Pipeline: An introduction into a Chatgpt-generated scikit-learn pipeline for logistic regression. This code is an example of a basic scikit-learn pipeline but greedily utilizes all cores which can cause issues on shared systems like the Yens.
Why Cores Matter: This section shows the significance of processing cores in machine learning, providing insights into computational efficiency and performance. A quick comparative study on the performance and implications of using 10 and 40 processing cores.
Predictions: This section shows one way you might perform inference with pre-trained pipelines
Scoring: A quick look into one way to evaluate these machine learning pipeline outputs

Prerequisites

To get the most out of this tutorial, you should have:

Basic knowledge of Python programming.
An understanding of fundamental machine learning concepts.
Python 3.x installed on your machine.
Jupyter Notebook or Jupyter Lab installed.

Installation

Before starting, ensure you have the following Python libraries installed:

scikit-learn: for machine learning algorithms and tools.
numpy: for numerical computations.
pandas: for data manipulation and analysis.

You can install these packages using pip:

pip install scikit-learn numpy pandas

Running the Notebook

Clone or download this repository to your local machine or the Yens.
Navigate to the directory containing the notebook.
Open the Sklearn_Pipeline.ipynb file in the Jupyter interface.
Execute the cells in order to follow along with the tutorial.

Contributing

Contributions to this tutorial are welcome! If you have suggestions or improvements, feel free to submit a pull request or open an issue.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.ipynb_checkpoints		.ipynb_checkpoints
README.md		README.md
Sklearn_Pipeline.ipynb		Sklearn_Pipeline.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sklearn Pipeline Tutorial

Overview

Prerequisites

Installation

Running the Notebook

Contributing

About

Releases

Packages

Languages

gsbdarc/sklearn-pipeline

Folders and files

Latest commit

History

Repository files navigation

Sklearn Pipeline Tutorial

Overview

Prerequisites

Installation

Running the Notebook

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages