1. Alarte, J., Insa, D., Silva, J., Tamarit, S.: Web template extraction based on hyperlink analysis. In: Escobar, S. (ed.): XIV Jornadas sobre Programación Y Lenguajes, PROLE 2014, Revised Selected Papers EPTCS, vol. 173, pp. 16–26 (2015)
2. Liu, Q., Shao, M., Wu, L., Zhao, G., Fan, G.: Main content extraction from web pages based on node characteristics. J. Comput. Sci. Eng. 11(2), 39–48 (2017)
3. Ferrara, E., De Meob, P., Fiumarac, G., Baumgartnerd, R.F.: Web Data Extraction, Applications and Techniques: A Survey.
arXiv:1207.0246v4
[cs.IR], 10 June 2014
4. OpenNLP Documentation.
https://opennlp.apache.org/docs/
. Accessed 2 Nov 2019
5. Uzun, E., Doruk, A., Nusret Buluş, H., Özhan, E.: Evaluation of HAP, AngleSharp and HTML document in web content extraction. In: International Scientific Conference, Gabrovo, 18 November 2017 (2017)