Webformer

Published: 2022-07-06
Container-title: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

Author:

Guo Yu¹, Ma Zhengyi¹, Mao Jiaxin¹, Qian Hongjin¹, Zhang Xinyu², Jiang Hao², Cao Zhao², Dou Zhicheng³

Affiliation:

1. Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China

2. Distributed and Parallel Software Lab, Huawei, Beijing, China

3. Gaoling School of Artificial Intelligence, Renmin University of China & Beijing Key Laboratory of Big Data Management and Analysis Methods, Beijing, China

Funder

Beijing Academy of Artificial Intelligence (BAAI)

National Natural Science Foundation of China

China Unicom Innovation Ecological Cooperation Plan

Beijing Outstanding Young Scientist Program

Intelligent Social Governance Platform, Major Innovation & Planning Interdisciplinary Platform for the "Double-First Class" Initiative, Renmin University of China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3477495.3532086

References: 47 articles.

1. Armen Aghajanyan, Dmytro Okhonko, Mike Lewis, Mandar Joshi, Hu Xu, Gargi Ghosh, and Luke Zettlemoyer. 2021. HTLM: Hyper-Text Pre-Training and Prompting of Language Models. CoRR, Vol. abs/2107.06955 (2021). arXiv:2107.06955 https://arxiv.org/abs/2107.06955

2. Ammar Al-Dallal and Rasha S Abdul-Wahab. 2011. Achieving high recall and precision with HTLM documents: an innovation approach in information retrieval. In Proceedings of the World Congress on Engineering, Vol. 3.

3. Leveraging HTML in Free Text Web Named Entity Recognition

4. Iz Beltagy, Matthew E. Peters, and Arman Cohan. 2020. Longformer: The Long-Document Transformer. CoRR, Vol. abs/2004.05150 (2020). arXiv:2004.05150 https://arxiv.org/abs/2004.05150

5. Sebastian Blohm. 2011. Large-scale pattern-based information extraction from the world wide web. KIT Scientific Publishing.

Cited by 12 articles.

1. Improving First-stage Retrieval of Point-of-interest Search by Pre-training Models;ACM Transactions on Information Systems;2023-12-29

2. Generative AI Frameworks for Web for Good: Food Pantry Information Seeking as an Example;2023 IEEE International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT);2023-10-26

3. FF-IR: An information retrieval system for flash flood events developed by integrating public-domain data and machine learning;Environmental Modelling & Software;2023-09

4. Self-Training for Label-Efficient Information Extraction from Semi-Structured Web-Pages;Proceedings of the VLDB Endowment;2023-07

5. Structure-inducing pre-training;Nature Machine Intelligence;2023-06-01