1. Wang, J., Zhu, L., Li, C.: Discussion about the core of search engine again—web crawler. In: 2011 International Conference on Computer Science and Service System (CSSS), pp. 3188–3191. IEEE (2011)
2. Khare, R., Cutting, D., Sitaker, K., Rifkin, A.: Nutch: a flexible and scalable open-source web search engine. Or. State Univ. 1, 32 (2004)
3. Brin, S., Page, L.: Reprint of: the anatomy of a large-scale hypertextual web search engine. Comput. Netw. 56(18), 3825–3833 (2012)
4. http://blog.csdn.net/chaishen10000/article/details/50776662
5. Mohr, G., Stack, M., Ranitovic, I., et al.: An introduction to Heritrix: an open source archival quality web crawler. In: IWAW 2004, 4th International Web Archiving Workshop (2004)