A Frequent Construction Mining Scheme Based on Syntax Tree

Author:

CHEN Bob, PENG Weiming, SONG Jihua

Abstract

Natural language processing (NLP) is one of the main research directions in artificial intelligence, and one of its goals is to identify the semantic information in text. Current mainstream semantic recognition tasks focus on using the semantic information of each word to analyze the meaning of the whole sentence. Research on semantics in cognitive linguistics, however, indicates that meaning is determined both by the words in a sentence and by their arrangement. Linguists refer to arrangements and combinations that carry semantic information of their own as constructions. Because constructions play an essential role in conveying meaning, identifying the constructions in a text is a crucial part of semantic recognition. Against this background, the paper makes two main contributions: 1) it proposes a definition and program representation of constructions, together with the corresponding constraints, for NLP tasks; 2) it proposes a frequent construction mining algorithm that extracts frequent structures meeting the construction requirements from the grammar structure tree. With these, a construction database can be extracted from a specified natural language corpus, which helps make text semantic analysis more effective.
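The abstract does not spell out the mining algorithm, so the following is only a minimal illustrative sketch of the general idea of counting frequent structures in syntax trees, not the authors' method. It assumes a hypothetical tuple encoding of trees (`(label, child, ...)` with plain strings as leaf words), restricts candidate structures to a node plus its immediate children, and uses a hypothetical `min_support` threshold counted per tree. All function names are invented for illustration.

```python
from collections import Counter

# Hypothetical encoding (an assumption, not from the paper):
# an internal node is a tuple (label, child, child, ...); a leaf word is a str.

def label(node):
    """Return the label of an internal node, or the word itself for a leaf."""
    return node if isinstance(node, str) else node[0]

def local_patterns(tree):
    """Yield (parent_label, child_labels) for every internal node of the tree."""
    if isinstance(tree, str):
        return
    yield (tree[0], tuple(label(child) for child in tree[1:]))
    for child in tree[1:]:
        yield from local_patterns(child)

def mine_frequent_patterns(trees, min_support):
    """Keep patterns that occur in at least `min_support` distinct trees."""
    support = Counter()
    for tree in trees:
        # A set ensures each pattern is counted once per tree.
        support.update(set(local_patterns(tree)))
    return {pattern: n for pattern, n in support.items() if n >= min_support}

# Toy corpus of three hand-written syntax trees (made up for illustration).
corpus = [
    ("S", ("NP", ("N", "she")),
          ("VP", ("V", "gave"), ("NP", ("N", "him")), ("NP", ("N", "books")))),
    ("S", ("NP", ("N", "he")),
          ("VP", ("V", "sent"), ("NP", ("N", "her")), ("NP", ("N", "letters")))),
    ("S", ("NP", ("N", "they")),
          ("VP", ("V", "slept"))),
]

frequent = mine_frequent_patterns(corpus, min_support=2)
for pattern, count in sorted(frequent.items()):
    print(pattern, count)
```

On this toy corpus the ditransitive frame `VP -> V NP NP` survives the threshold because it appears in two trees, while the single-word patterns do not. A fuller miner would also grow larger subtrees and apply the construction constraints the paper defines, which this sketch leaves out.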

Publisher

Editura Academiei Romane

