Affiliation:
1. Statistics Netherlands, Research and Development, Henri Faasdreef 312, The Hague 2492 JP, the Netherlands.
Abstract
A prototype of a question answering (QA) system, called Farseer, for the real-time calculation and dissemination of aggregate statistics is introduced. Using techniques from natural language processing (NLP), machine learning (ML), artificial intelligence (AI) and formal semantics, the framework is capable of correctly interpreting a written request for (aggregate) statistics and subsequently generating appropriate results. It is shown that the framework operates independently of any specific statistical domain, because domain-specific information is captured in a knowledge graph that is supplied as input to the framework. It is also shown that the prototype still has limitations: it lacks statistical disclosure control, and searching the knowledge graph is still time-consuming.
Subject
Statistics and Probability