1. Chakrabarti S.: Mining the Web: Discovering Knowledge from Hypertext Data. Morgan Kaufmann, Burlington (2003)
2. Eichmann, D.: The RBSE spider-balancing effective search against web load. In: First World Wide Web Conference, Geneva, Switzerland April 20 1994
3. Qureshi,P.A.R.; Memon, N: Hybrid model of content extraction. J. Comput. Syst. Sci. 78(4), 1248–1257 (2012); ISSN 0022-0000
4. Weninger, T.; Hsu, W.; Han, J.: CETR: content extraction via tag ratios. In: Proceedings of the 19th International Conference on World Wide Web, WWW ’10, ACM, New York, NY, USA, pp. 971–980 (2010)
5. Rahman, A.F.R.; Alam, H.; Hartono, R.: Content extraction from HTML documents. In: 1st International Workshop on Web Document Analysis (WDA2001) (2001)