1. James Reserve Data Management System. http://cens.jamesreserve.edu/jrcensweb/cmstest/CMS_env_data_list.php.
2. Database-friendly random projections: Johnson-lindenstrauss with binary coins;Achlioptas;Journal of Computer and System Sciences,2003
3. S. Agarwal and A. Trachtenberg, Estimating the number of differences between remote sets, in: IEEE Information Theory Workshop (ITW), Punta del Este, Uruguay, 2006.
4. A.Z. Broder, Identifying and filtering near-duplicate documents, CPM 2000, LNCS 1848, pp. 1–10, 2000.
5. Min-wise independent permutations;Broder;Journal of Computer and System Sciences,2000