1. Information extraction from world wide web—a survey;Eikvil,1999
2. Wrapper induction for information extraction;Shm,1997
3. Fu L, Meng Y, Xia Y J. Web content extraction based on webpage layout analysis. 2010 Second International Conference on Information Technology and Computer Science (ITCS), 2010: 40–43.
4. Cai D, Yu S P, Wen J R, et al. VIPS: a vision based on page segmentation algorithm. [S. l.]: Microsoft Co., Tech. Rep.: MSR-TR-2003-79, 2003.
5. Basic semantic units based web page content extraction. SMC'08;Wang,2008