1. Alamuri, M., Surampudi, B.R., Negi, A.: A survey of distance/similarity measures for categorical data. In: International Joint Conference on Neural Networks (IJCNN), pp. 1907–1914. IEEE (2014)
2. Andrzejewski, W., Bębel, B., Boiński, P., Sienkiewicz, M., Wrembel, R.: Text similarity measures in a data deduplication pipeline for customers records. In: International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP), co-located with EDBT/ICDT. CEUR Workshop Proceedings, CEUR-WS.org (2023, to appear)
3. Baxter, R., Christen, P.: A comparison of fast blocking methods for record linkage. In: ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation (2003)
4. Bilenko, M., Kamath, B., Mooney, R.J.: Adaptive blocking: learning to scale up record linkage. In: The IEEE International Conference on Data Mining (ICDM), pp. 87–96. IEEE Computer Society (2006)
5. Boiński, P., Sienkiewicz, M., Bębel, B., Wrembel, R., Gałęzowski, D., Graniszewski, W.: On customer data deduplication: lessons learned from a R&D project in the financial sector. In: Workshops of the EDBT/ICDT 2022 Joint Conference. CEUR Workshop Proceedings, vol. 3135. CEUR-WS.org (2022)