1. Azevedo, A. I. R. L., & Santos, M. F. (2008). KDD, SEMMA and CRISP-DM: A parallel overview. In Paper Presented at the IADIS European Conference on Data Mining. Amsterdam, The Netherlands.
2. Batini, C., Cappiello, C., Francalanci, C., & Maurino, A. (2009). Methodologies for data quality assessment and improvement. ACM Computing Surveys, 41(3), 16–16.52. doi:10.1145/1541880.1541883.
3. Boyd, D. F. (1950). Applying the group chart for X and R. Industrial Quality Control, 7 (3), 22–25.
4. Bradley, P. S., Fayyad, U., & Reina, C. (1998). Scaling clustering algorithms to large databases. In Proceedings of the 4th International Conference on Knowledge Discovery & Data Mining Knowledge Discovery and Data Mining (pp. 9–15).
http://www.aaai.org/Papers/KDD/1998/KDD98-002.pdf
.
5. Brownstein, J. S., Freifeld, C. C., & Madoff, L. C. (2009). Digital disease detection — harnessing the web for public health surveillance. New England Journal of Medicine, 360 (21), 2153–2157. doi:10.1056/NEJMp0900702.