kmeans in hadoop use mapreduce to implement kmeans algorithm to due with large web page. For more information please goto http://blog.csdn.net/lawrencesgj/article/details/8606532