1. Hou X R. An improved K-means clustering algorithm based on hadoop platform[C]//Proc of Cyber Security Intelligence and Analytics. Cham: Springer, 2020: 1101-1109.
2. Text mining with lucene and Hadoop: document clustering with updated rules of NMF non negative matrix factorization[J];Lydia E L;International Journal of Pure and Applied Mathematics
3. Genetic algorithm based parallel K-means data clustering algorithm using Map Reduce programming paradigm on Hadoop environment (GAPKCA)[C]//Proc of the 4th Annual International Conference on Soft Computing and Data Mining. Cham;Alshammari S;Springer
4. A MapReduce-based K-means clustering algorithm
5. Abdalla H B, Ahmed A M, Al Sibahee M A. Optimization driven Map Reduce framework for indexingand retrieval of big data[J]. KSII Transactions on Internet and Information Systems (TIIS), 2020, 14(5):1886-1908.