1. Arasu A, Ganti V, Kaushik R. Efficient exact set-similarity joins. In Proc. the 32nd VLDB, September 2006, pp.918-929.
2. Hadjieleftheriou M, Yu X, Koudas N, Srivastava D. Hashed samples: Selectivity estimators for set similarity selection queries. PVLDB, 2008, 1(1): 201-212.
3. Lee H, Ng R T, Shim K. Power-law based estimation of set similarity join size. PVLDB, 2009, 2(1): 658-669.
4. White R W, Jose J M. A study of topic similarity measures. In Proc. the 27th SIGIR, July 2004, pp.520-521.
5. Zhu X, Song S, Lian X, Wang J, Zou L. Matching heterogeneous event data. In Proc. SIGMOD, June 2014, pp.1211-1222.