Modern Methods of Extracting Key Information From Regulatory Documents

Author:

Milkova Maria A.1ORCID,Nevolin Ivan V.2,Pigorev Dmitriy P.1

Affiliation:

1. Central Economics and Mathematics Institute of Russian Academy of Sciences, Moscow

2. Central Economics and Mathematics Institute of Russian Academy of Sciences, Moscow

Abstract

This article is an attempt to comprehend the difficulties and propose approaches to eliminate them when analyzing legal documents in the framework of economic and interdisciplinary research. The utmost goal is to seek incorporating advances in computational linguistics and natural language analysis into the discourse of the digital economy in order to develop methods involved in decision-making and strategy development based on the analysis of textual information. In conditions when the amount of information is too large, is constantly updated and / or the area of study is new, the most expedient at the first stage is to obtain the general structure of the entire collection of documents, some kind of semantic compression of information. The practical part contains the development of an approach for the analysis of regulations governing food and nutrition issues, in particular, related to the prevention of the development of iron deficiency anemia (IDA). The approach includes the extraction of key information of voluminous texts (keywords and key sentences) based on the TextRank graph algorithm. An important link contributing to cognition is also the visualization of semantic relationships between words within documents. In our opinion, it is the combination of semantic compression and visualization of information as a “close-up” of text documents, as well as the possibility of further detailing by linear reading and analysis, which are the most relevant approach in conditions of information overload and attention deficit. The active introduction of text analytics methods for systems that are not involved in attention markets, which lag significantly behind in the convenience of extracting meaningful information, is especially important. Approaches to improve the understanding of large volumes of regulations will be of significant value to researchers in economic, legal or multidisciplinary research.

Publisher

RPO for the Promotion of Institutes DE RAS

Reference38 articles.

1. Vanyushkin A.S., Grashchenko L.A. (2016). Metody i algoritmy izvlecheniya klyuchevyh slov // Novye informacionnye tekhnologii v avtomatizirovannyh sistemah-2016, 85-93. (In Russian)

2. Varshavskij A.E. (2015). Innovacionnye riski v oblasti produktov pitaniya dlya Rossii v usloviyah globalizacii i liberalizacii rynkov // Economics of Contemporary Russia, 2015(4), 91-108. (In Russian)

3. Gusev A.A. (2019). Ekonomicheskie i institucional'nye osnovy obespecheniya ekologicheskoj bezopasnosti // Economics of Contemporary Russia, 2019;(1), 70-81. https://doi.org/10.33293/1609-1442-2019-1(84)-70-81 (In Russian)

4. Isakov, V.B. (2018). Pravovaya analitika kak informacionnyj process // Pravo i informaciya: voprosy teorii i praktiki: sb. mater. mezhdunar. nauch.-prakt. konf. / nauch. red. N. A. Shevelyova. – 271 s. (In Russian)

5. Milkova M.A. (2018). Izvlechenie klyuchevyh terminov napravleniya «Cifrovaya ekonomika»: grafoorientirovannyj podhod // Cifrovaya ekonomika, 4(4), 57-65. (In Russian)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3