kaggledays-2019-gbdt

Original workshop in Paris:

First part of the updated workshop (basics of Skopt and GBM):

Second part of the updated workshop (LightGBM, XGBoost, CAtBoost, NAS):

Video of the workshop on Kaggle Youtube channel

https://www.youtube.com/watch?v=YQL45hDuP-o

Kaggle Days Paris

Competitive GBDT Specification and Optimization Workshop

Instructors

Luca Massaron @lmassaron - Data Scientist / Author / Google Developer Expert in Machine Learning

Pietro Marinelli @pietro-marinelli-0098b427 - Freelance Data Scientist

About the workshop

Gradient Boosting Decision Trees (GBDT) presently represent the state of the art for building predictors for flat table data. However, they seldom perform the best out-of-the-box (using default values) because of the many hyper-parameters to tune. Especially in the most recent GBDT implementations, such as LightGBM, the over-sophistication of hyper-parameters renders finding the optimal settings by hand or simple grid search difficult because of high combinatorial complexity and long running times for experiments.

Random Optimization (BERGSTRA, James; BENGIO, Yoshua. Random search for hyper-parameter optimization. Journal of Machine Learning Research, 2012, 13.Feb: 281-305.) and Bayesian Optimization (SNOEK, Jasper; LAROCHELLE, Hugo; ADAMS, Ryan P. Practical bayesian optimization of machine learning algorithms. In: Advances in neural information processing systems. 2012. p. 2951-2959) are often the answer you'll find from experts.

In this workshop we demonstrate how to use different optimization approaches based on Scikit-Optimize, a library built on top of NumPy, SciPy and Scikit-Learn, and we present an easy and fast approach to set them ready and usable.

Prerequisites

You should be aware of the role and importance of hyper-parameter optimization in machine learning.

Obtaining the Tutorial Material

In order to make the workshop easily accessible, we are offering cloud access:

Using Google Colab
Using Kaggle Kernels

We also have a brief exercise that can be found at:

Using Google Colab
Using Kaggle Kernels (with solution)

The solution can be found here.

All the materials can be cloned from Github at the kaggledays-2019-gbdt repository. We also have prepared a stand-alone Windows installation using WinPython (just require us for the link).

Local installation notes

In order to successfully run this workshop on your local computer, you need a Python3 installation (we suggest installing the most recent Anaconda distribution) and at least the following packages:

numpy >= 1.15.4
pandas >= 0.23.4
scipy >= 1.1.0
skopt >= 0.5.2
sklearn >= 0.20.2
lightgbm >= 2.2.2
xgboost >= 0.81
catboost >= 0.12.2

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
Kaggle Days Paris - GBDT workshop.ipynb		Kaggle Days Paris - GBDT workshop.ipynb
Kaggle Days Paris - Skopt + CatBoost exercise.ipynb		Kaggle Days Paris - Skopt + CatBoost exercise.ipynb
Kaggle Days Paris - Skopt + CatBoost solution.ipynb		Kaggle Days Paris - Skopt + CatBoost solution.ipynb
README.md		README.md
amazon_submission.csv		amazon_submission.csv
kaggle_youtube.PNG		kaggle_youtube.PNG
kaggledaysparis.png		kaggledaysparis.png
skopt_workshop_part1.ipynb		skopt_workshop_part1.ipynb
skopt_workshop_part2.ipynb		skopt_workshop_part2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kaggledays-2019-gbdt

Video of the workshop on Kaggle Youtube channel

Kaggle Days Paris

Competitive GBDT Specification and Optimization Workshop

Instructors

About the workshop

Prerequisites

Obtaining the Tutorial Material

Local installation notes

About

Releases

Packages

Languages

pierrelandry/kaggledays-2019-gbdt

Folders and files

Latest commit

History

Repository files navigation

kaggledays-2019-gbdt

Video of the workshop on Kaggle Youtube channel

Kaggle Days Paris

Competitive GBDT Specification and Optimization Workshop

Instructors

About the workshop

Prerequisites

Obtaining the Tutorial Material

Local installation notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages