1. Arc90. 2010. Readability. https://web.archive.org/web/20100420092540/http://lab.arc90.com/experiments/readability Accessed: 2021-05-11.
2. Codrut-Georgian Artene, Marius Nicolae Tibeica, Dumitru Daniel Vecliuc, and Florin Leon. 2021. Convolutional Neural Networks for Web Documents Classification. In Intelligent Information and Database Systems - 13th Asian Conference. 289--302. https://doi.org/10.1007/978-3-030-73280-6_23
3. Yuri Baburov. 2021. python-readability. https://github.com/buriy/python-readability Accessed: 2021-05-17. Note: This is a Python port of the original Arc90 open-sourced Readability project.
4. Marco Baroni, Francis Chantree, Adam Kilgarriff, and Serge Sharoff. 2008. CleanEval: a Competition for Cleaning Web Pages. In Proceedings of the Sixth International Conference on Language Resources and Evaluation. http://www.lrec-conf.org/proceedings/lrec2008/pdf/162_paper.pdf
5. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation