Affiliation:
1. Instituto Nacional de Estadística y Geografía, Aguascalientes, Mexico
2. Department of Computer Science and Engineering, Universidad Carlos III de Madrid, Madrid, Spain
Abstract
Mexico’s National Institute of Statistics and Geography (INEGI) is exploring new opportunities to improve its information search service, with the aim of increasing the accessibility of official statistical data. The upgraded search engine will include a new component that offers more sophisticated search capabilities. These include the ability to conduct intelligent searches that do not require an exact match of the search text, as well as the expansion of searches using related ad-hoc terms. Additionally, the new component will provide feedback through the most appropriate relations. To achieve this, the system will utilize neural network-based distributional word representation systems to identify relationships between related terms. The vector spaces and representation will be manipulated to keep connections within the most relevant vocabulary for the institute’s type of searches. The usability testing department at the institute conducted blind pilot tests to compare the quality reported by users with and without the new enhancements. Although the evaluation survey showed significant improvements in the search engine’s performance, the tool presented is just the first step towards a system that allows continuous interaction and feedback with users to improve the quality of the responses presented. This strategy is not currently implemented by the institute, making this an immediate and easy-to-replicate approach for obtaining useful interactions with users.
Subject
Statistics, Probability and Uncertainty,Economics and Econometrics,Management Information Systems
Reference27 articles.
1. The use of metadata modelling for the modernisation of information management of statistical classifications;Hancock;Statistical Journal of the IAOS,2020
2. Official statistics as a safeguard against fake news;Sæbø;Statistical Journal of the IAOS,2020
3. A Deep Look into neural ranking models for information retrieval;Guo;Information Processing & Management,2020
4. Apache Lucene 4;Białecki;SIGIR 2012 workshop on open source information retrieval,2012
5. Anserini;Yang;Journal of Data and Information Quality,2018