Amazon SageMaker is a fully-managed service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. Amazon SageMaker removes all the barriers that typically slow down developers who want to use machine learning.
Machine learning often feels a lot harder than it should be to most developers because the process to build and train models, and then deploy them into production is too complicated and too slow. First, you need to collect and prepare your training data to discover which elements of your data set are important. Then, you need to select which algorithm and framework you’ll use. After deciding on your approach, you need to teach the model how to make predictions by training, which requires a lot of compute. Then, you need to tune the model so it delivers the best possible predictions, which is often a tedious and manual effort. After you’ve developed a fully trained model, you need to integrate the model with your application and deploy this application on infrastructure that will scale. All of this takes a lot of specialized expertise, access to large amounts of compute and storage, and a lot of time to experiment and optimize every part of the process. In the end, it's not a surprise that the whole thing feels out of reach for most developers.
Amazon SageMaker removes the complexity that holds back developer success with each of these steps. Amazon SageMaker includes modules that can be used together or independently to build, train, and deploy your machine learning models.
In this section, we will walk you through creating a fully managed Jupyter notebook instance with Amazon SageMaker, which will be used to run our experiments and build the machine learning model.
Before creating the notebook instance, we will create an Amazon S3 bucket that will serve as our storage area. Amazon SageMaker and AWS Glue can both use Amazon S3 as the main storage for data and artifacts.
- Sign into the AWS Management Console using the Event Engine dashboard at https://dashboard.eventengine.run and the hashcode provided by the workshop instructors. [Or access it at https://console.aws.amazon.com/ if you are using your own AWS account.]
- In the upper-right corner of the AWS Management Console, confirm you are in the desired AWS region. The instructions in this workshop assume EU West (Ireland) [eu-west-1], but feel free to change the region at your convenience. The only constraints when changing the AWS region are that the region settings stay consistent across all services used and that those services are available in the selected region (please check this if you plan to execute the workshop in another AWS region).
- Open the Amazon S3 console by choosing the Amazon S3 service in the menu.
- Click Create bucket. For the Bucket name, type endtoendml-workshop-[your-initials] in the text box (take note of the bucket name, as it will be needed later for loading data in the notebook instance), then press Next to move to the next screen. An equivalent AWS CLI sketch is shown after this list.
  Note: if the bucket name is already taken, feel free to add an extra suffix.
- Enable versioning of the objects in the bucket as shown in the screen below. This is not required for the workshop, but it is a suggested best practice to ensure consistency and reproducibility of the experiments. Press Next, and then Next again, leaving the settings as they are in the following screen.
- Finally, click Create Bucket in the Review page.
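If you prefer working from a terminal, the same bucket setup can be sketched with the AWS CLI. This is an optional alternative to the console steps above; the bucket name below is a placeholder that you should replace with the name you chose, and the region is assumed to be eu-west-1.

```bash
# Placeholder: replace with the bucket name you chose in the console step,
# e.g. endtoendml-workshop-<your-initials>.
BUCKET_NAME="endtoendml-workshop-your-initials"
REGION="eu-west-1"

# Create the bucket in eu-west-1 (a LocationConstraint is required outside us-east-1).
aws s3api create-bucket \
    --bucket "$BUCKET_NAME" \
    --region "$REGION" \
    --create-bucket-configuration LocationConstraint="$REGION"

# Enable object versioning, matching the optional console setting described above.
aws s3api put-bucket-versioning \
    --bucket "$BUCKET_NAME" \
    --versioning-configuration Status=Enabled
```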
We are now ready to create an Amazon SageMaker managed Jupyter notebook instance. An Amazon SageMaker notebook instance is a fully managed ML compute instance running the Jupyter Notebook application. Amazon SageMaker manages creating the instance and related resources.
- In the AWS Management Console, click on Services, type “SageMaker” and press Enter.
- You’ll be placed in the Amazon SageMaker dashboard. Click on Notebook instances either in the landing page or in the left menu.
- Once in the Notebook instances screen, click on the Create notebook instance button at the top right.
- In the Create notebook instance screen:
  - Give the notebook instance a name like endtoendml-nb-[your-initials]
  - Choose ml.t2.medium as the Notebook instance type
  - In the IAM role dropdown list you need to select an AWS IAM role that is configured with security policies allowing access to Amazon SageMaker, AWS Glue and Amazon S3. The role has been pre-configured for you, so you just need to select AmazonSageMaker-ExecutionRole-endtoendml in the dropdown list
  - Keep No VPC selected in the VPC dropdown list
  - Keep No configuration selected in the Lifecycle configuration dropdown list
  - Keep No Custom Encryption selected in the Encryption key dropdown list
  - Finally, click on Create notebook instance
You will be redirected to the Notebook instances screen, where you will see the new notebook instance in Pending status.
Wait until the notebook instance status is InService, then click on the Open button to be redirected to Jupyter.
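The notebook instance can also be created and opened from the AWS CLI. The sketch below is an optional alternative to the console steps, assuming the pre-configured AmazonSageMaker-ExecutionRole-endtoendml role mentioned above; the account ID and the instance name suffix are placeholders you must replace.

```bash
# Placeholders: replace the account ID (123456789012) and the name suffix with your own values.
NOTEBOOK_NAME="endtoendml-nb-your-initials"
ROLE_ARN="arn:aws:iam::123456789012:role/AmazonSageMaker-ExecutionRole-endtoendml"

# Create the notebook instance with the same settings chosen in the console.
aws sagemaker create-notebook-instance \
    --notebook-instance-name "$NOTEBOOK_NAME" \
    --instance-type ml.t2.medium \
    --role-arn "$ROLE_ARN"

# Block until the instance status is InService.
aws sagemaker wait notebook-instance-in-service \
    --notebook-instance-name "$NOTEBOOK_NAME"

# Generate a pre-signed URL that opens Jupyter in the browser.
aws sagemaker create-presigned-notebook-instance-url \
    --notebook-instance-name "$NOTEBOOK_NAME"
```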
All the code for this workshop is pre-implemented and available for download from GitHub.
In this section we will therefore clone the GitHub repository into the Amazon SageMaker notebook instance and access the Jupyter notebooks to build our model.
- Click on New > Terminal in the right-hand side of the Jupyter interface. This will open a terminal window in the Jupyter interface.
- Execute the following commands in the terminal:

  ```bash
  cd SageMaker/
  git clone https://github.com/giuseppeporcelli/end-to-end-ml-application.git
  ```
- When the clone operation completes, close the terminal window and return to the Jupyter landing page. The folder end-to-end-ml-application will appear automatically (if not, you can hit the Refresh button).
- Browse to the folder 02_data_exploration_and_feature_eng and open the file 02_data_exploration_and_feature_eng.ipynb to start the data exploration, preparation and feature engineering steps.