1. Web2text: Deep structured boilerplate removal;T Vogels;LNCS,2018
2. Web scraping or web crawling: State of art, techniques, approaches and application;M A Khder;International Journal of Advances in Soft Computing and its Applications,2021
3. Main content extraction from web pages based on node characteristics;Q Liu;Journal of Computing Science and Engineering,2017
4. Web page structured content detection using supervised machine learning;R P Velloso,2019