Affiliation:
1. Department of Civil and Environmental Engineering, Seoul National University, Seoul 08826, Republic of Korea
2. Institute of Engineering Research, Seoul National University, Seoul 08826, Republic of Korea
Abstract
Recently, open-domain question-answering systems have achieved tremendous progress because of developments in large language models (LLMs), and have successfully been applied to question-answering (QA) systems, or Chatbots. However, there has been little progress in open-domain question answering in the geographic domain. Existing open-domain question-answering research in the geographic domain relies heavily on rule-based semantic parsing approaches using few data. To develop intelligent GeoQA agents, it is crucial to build QA systems upon datasets that reflect the real users’ needs regarding the geographic domain. Existing studies have analyzed geographic questions using the geographic question corpora Microsoft MAchine Reading Comprehension (MS MARCO), comprising real-world user queries from Bing in terms of structural similarity, which does not discover the users’ interests. Therefore, we aimed to analyze location-related questions in MS MARCO based on semantic similarity, group similar questions into a cluster, and utilize the results to discover the users’ interests in the geographic domain. Using a sentence-embedding-based topic modeling approach to cluster semantically similar questions, we successfully obtained topic models that could gather semantically similar documents into a single cluster. Furthermore, we successfully discovered latent topics within a large collection of questions to guide practical GeoQA systems on relevant questions.
Funder
Korea Agency for Infrastructure Technology Advancement
Subject
Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development
Reference32 articles.
1. QA on structured data using NLIDB approach;Wudaru;Proceedings of the 5th International Conference on Advanced Computing & Communication Systems (ICACCS),2019
2. Geographic question answering: Challenges, uniqueness, classification, and future directions;Mai;AGILE GIScience Ser.,2021
3. Geo-analytical question-answering with GIS;Scheider;Int. J. Digit. Earth,2021
4. Extracting interrogative intents and concepts from geo-analytic questions;Xu;AGILE GIScience Ser.,2020
5. Punjani, D., Singh, K., Both, A., Koubarakis, M., Angelidis, I., Bereta, K., Bilidas, D., Ioannidis, T., Karalis, N., and Lange, C. (2018, January 6). Template-based question answering over linked geospatial data. Proceedings of the 12th Workshop on Geographic Information Retrieval, Seattle, WA, USA.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献