1. Yang, L., Li, X., Geng, G.: Study of web pages content extraction based on layout similarity. Appl. Res. Comput. 32(9), 2581–2586 (2015)
2. Xiong, Z., Zhang, H., Lin, M.: An extraction algorithm of Chinese HTML content based on similarity. J. Southwest Univ. Sci. Technol. 25(1), 80–84 (2010)
3. Chang, Y., Zheng, Y., Chen, Y.: Content extraction technique for web pages based on HTML-tags. J. Comput. Eng. Des. 31(24), 5187–5191 (2010)
4. Cai, D., Yu, S., Wen, J., et al.: VIPS: a vision-based page segmentation algorithm (2003)
5. Cai, D.: Lecture Notes in Computer Science (2003)