Machine Reading at Scale: A Search Engine for Scientific and Academic Research-Reference-Cited by-同舟云学术

Machine Reading at Scale: A Search Engine for Scientific and Academic Research

Published:2022-04-05 Issue:2 Volume:10 Page:43
ISSN:2079-8954
Container-title:Systems
language:en
Short-container-title:Systems

Author:

Sousa Norberto^ORCID,Oliveira Nuno^ORCID,Praça Isabel^ORCID

Abstract

The Internet, much like our universe, is ever-expanding. Information, in the most varied formats, is continuously added to the point of information overload. Consequently, the ability to navigate this ocean of data is crucial in our day-to-day lives, with familiar tools such as search engines carving a path through this unknown. In the research world, articles on a myriad of topics with distinct complexity levels are published daily, requiring specialized tools to facilitate the access and assessment of the information within. Recent endeavors in artificial intelligence, and in natural language processing in particular, can be seen as potential solutions for breaking information overload and provide enhanced search mechanisms by means of advanced algorithms. As the advent of transformer-based language models contributed to a more comprehensive analysis of both text-encoded intents and true document semantic meaning, there is simultaneously a need for additional computational resources. Information retrieval methods can act as low-complexity, yet reliable, filters to feed heavier algorithms, thus reducing computational requirements substantially. In this work, a new search engine is proposed, addressing machine reading at scale in the context of scientific and academic research. It combines state-of-the-art algorithms for information retrieval and reading comprehension tasks to extract meaningful answers from a corpus of scientific documents. The solution is then tested on two current and relevant topics, cybersecurity and energy, proving that the system is able to perform under distinct knowledge domains while achieving competent performance.

Funder

FCT

Publisher

MDPI AG

Subject

Information Systems and Management,Computer Networks and Communications,Modeling and Simulation,Control and Systems Engineering,Software

Link

https://www.mdpi.com/2079-8954/10/2/43/pdf

Reference62 articles.

1. Overload and Boredom: Essays on the Quality of Life in the Information Society;Klapp,1986

2. Information overload and coping strategies in the big data context: Evidence from the hospitality sector

3. Semantic Matching in Search

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Health Information Retrieval with Large Language Models: A Study on MedQuAD Dataset;2023 International Conference on Machine Learning and Applications (ICMLA);2023-12-15

2. Contextual Reranking of Search Engine Results;2022 2nd International Conference on Computing and Machine Intelligence (ICMI);2022-04-15