1. Research on data cleaning method based on SNM algorithm
2. The Detection Algorithms for Similar Duplicate Data
3. Similar Duplicate Record Detection of Massive Data Based on Partition [J];li;Computer Systems & Applications Jiangsu University,2019
4. Efficient Duplicate Detection Approach for High Dimensional Big Data;zhu;Journal of Computer research and development,2016
5. Detection of approximately duplicated records based on entropy feature selection grouping clustering;zhang;Transducer and Microsystem Technologies,2011