1. Cleaning the GenBank Arabidopsis thaliana data set;PG Korning;Nucleic acids research,1996
2. CD-HIT: accelerated for clustering the next-generation sequencing data;L Fu;Bioinformatics,2012
3. Duplicate detection in biological data using association rule mining;JL Koh;Locus,2004
4. Web-Age Information Management;W Fan,2012
5. Understanding fraud: The nature of fraud offences recorded by NSW Police;W Macdonald;NSW Bureau of Crime Statistics and Research,2014