1. The design of a similarity based deduplication system;Aronovich,2009
2. Introduction to Algorithms;Cormen,1990
3. Efficient randomized pattern-matching algorithms;Karp;IBM Journal of Research and Development,1987
4. Primes just less than a power of two: http://primes.utm.edu/lists/2small/
5. Venti: a new approach to archival storage;Quinlan,2002