Automating Government Report Generation: A Generative AI Approach for Efficient Data Extraction, Analysis, and Visualization

Author:

Gupta Rajan1ORCID,Pandey Gaurav2ORCID,Pal Saibal Kumar3ORCID

Affiliation:

1. Artificial Intelligence & Innovation (AI&I) Lab, Autonomous University of Tamaulipas, Ciudad Victoria, Mexico

2. School of Artificial Intelligence and Data Science, IIT Jodhpur, Jodhpur, India

3. SAG Lab, Defence Research and Development Organisation, New Delhi, India

Abstract

This application paper introduces a transformative solution to address the labour-intensive manual report generation, data searching & report revision process in government entities. Traditional methods of data extraction, analysis, and graph creation for annual reports are not only time-consuming but also prone to human errors. To mitigate these challenges, we propose an innovative system leveraging Generative Artificial Intelligence (GenAI), with a specific focus on large language models (LLMs). Our solution incorporates automated data extraction from diverse sources designated as internal knowledge base, text analysis, summarization using advanced language models, and the generation as well as revisions of informative graphs. Different LLMs like Google's Gemini Pro and OpenAI's GPT 4.0 has been used to read different data visualisation graphs and fetching the information from internal knowledge base, respectively, to update the graphs in automated manner. Solution implementation using Python language on the World Economic Situation and Prospects report by Department of Economic & Social Affairs, United Nation shows that the early result produces almost 0.87% to 17.46% average error rates in the task of factual data visualisation graph. Key benefit of this approach include improved time efficiency, consistency in report format, enhanced insights, and a user-friendly interface. A review and approval workflow facilitate user feedback, contributing to continuous model performance improvement.

Publisher

Association for Computing Machinery (ACM)

Reference16 articles.

1. CAG (2019). Challenges in using data across government. National Audit office. Report accessed from https://www.nao.org.uk/wp-content/uploads/2019/06/Challenges-in-using-data-across-government.pdf on 10th January 2024

2. Chen, Z., Wang, Y., Wang, Q., Wang, Y., & Qu, H. (2019). Towards automated infographic design: Deep learning-based auto-extraction of extensible timeline. IEEE transactions on visualization and computer graphics, 26(1), 917-926.

3. Generative AI: Here to stay, but for good?

4. Team, G., Anil, R., Borgeaud, S., Wu, Y., Alayrac, J. B., Yu, J., ... & Ahn, J. (2023). Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:2312.11805.

5. Ye, J., Chen, X., Xu, N., Zu, C., Shao, Z., Liu, S., ... & Huang, X. (2023). A comprehensive capability analysis of gpt-3 and gpt-3.5 series models. arXiv preprint arXiv:2303.10420.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3