1. Armen Aghajanyan, Dmytro Okhonko, Mike Lewis, Mandar Joshi, Hu Xu, Gargi Ghosh, and Luke Zettlemoyer. 2021. HTLM: Hyper-Text Pre-Training and Prompting of Language Models. CoRR, Vol. abs/2107.06955 (2021). arXiv:2107.06955 https://arxiv.org/abs/2107.06955
2. Ammar Al-Dallal and Rasha S Abdul-Wahab. 2011. Achieving high recall and precision with HTLM documents: an innovation approach in information retrieval. In Proceedings of the World Congress on Engineering, Vol. 3.
3. Leveraging HTML in Free Text Web Named Entity Recognition
4. Iz Beltagy, Matthew E. Peters, and Arman Cohan. 2020. Longformer: The Long-Document Transformer. CoRR, Vol. abs/2004.05150 (2020). arXiv:2004.05150 https://arxiv.org/abs/2004.05150
5. Sebastian Blohm. 2011. Large-scale pattern-based information extraction from the world wide web. KIT Scientific Publishing.