Kmeans clustering in Spark Process stackoverflow public data and cluster different languages based on questions/answers ratings.