Bosch Kaggle competition: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance)
This code gives a public leaderboard score of around 0.42.
It contains PySpark code that uses the Random Forest classifier from the Spark ML library.
It contains a Scala package that uses Spark and XGBoost.
It contains code to transform the probabilities output by the Scala XGBoost model into 1/0 labels.
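
The last step, turning predicted probabilities into 1/0 labels, can be sketched in plain Python as below. This is only an illustration, not the code shipped in this repository: the function name `probabilities_to_labels` and the `threshold` parameter are hypothetical, and the default of 0.5 is a placeholder rather than the cutoff actually used to reach the score above.

```python
from typing import Iterable, List


def probabilities_to_labels(probabilities: Iterable[float],
                            threshold: float = 0.5) -> List[int]:
    """Map each predicted failure probability to a 1/0 label.

    A probability at or above `threshold` becomes 1 (failure),
    anything below becomes 0. The threshold value is an assumption
    and would normally be tuned on validation data.
    """
    return [1 if p >= threshold else 0 for p in probabilities]


if __name__ == "__main__":
    # Made-up probabilities, for illustration only.
    example = [0.02, 0.40, 0.75]
    print(probabilities_to_labels(example, threshold=0.3))
```

Because failures are rare in this kind of data, a cutoff well below 0.5 may work better than the default; that is a general observation, not a result from this repository.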