Discourse analysis based segregation of relevant document segments for knowledge acquisition

Author:

Madhusudanan N.,Chakrabarti Amaresh,Gurumoorthy B.

Abstract

AbstractDocuments are a useful source of expert knowledge in organizations and can be used to foresee, in an earlier stage of a product's life cycle, potential issues and solutions that might occur in later stages of its life cycle. In this research, these stages are, respectively, design and assembly. Even if these documents are available online, it is rather difficult for users to access the knowledge contained in these documents. It is therefore desirable to automatically extract the knowledge contained in these documents and store them in a computer accessible or manipulable form. This paper describes an approach for the first step in this acquisition process: automatically identifying segments of documents that are relevant to aircraft assembly, so that they can be further processed for acquiring expert knowledge. Such identification of relevant segments is necessary for avoiding processing of unrelated information that is costly and possibly distracting for domain relevance. The approach to extracting relevant segments has two steps. The first step is the identification of sentences that form a coherent segment of text, within which the topic does not shift. The second step is to classify segments that are within the topics of interest for knowledge acquisition, that is, aircraft assembly in this instance. These steps filter out segments that are unrelated, and therefore need not be processed for subsequent knowledge acquisition. The steps are implemented by understanding the contents of documents. Using methods of discourse analysis, in particular, discourse representation theory, a list of discourse entities is obtained. The difference in discourse entities between sentences is used to distinguish between segments. The list of discourse entities in a segment is compared against a domain ontology for classification. The implementation and results of validation on sample texts for these steps are described.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Industrial and Manufacturing Engineering

Reference53 articles.

1. Mozina M. , Guid M. , Krivec J. , Sadikov A. , & Bratko I. (2008). Fighting knowledge acquisition bottleneck with argument based machine learning. Proc. European Conf. Artificial Intelligence, pp. 234–238, Patras, Greece, July 21–25.

2. Design for machining using expert system and fuzzy logic approach

3. Loftus C. , Hicks B. , & McMahon C. (2009). Capturing key relationships and stakeholders over the product life cycle: an email based approach. Proc. 6th In. Conf. Project Life Cycle Management (PLM 09), Bath, July 6–8.

4. Segmented Discourse Representation Theory: Dynamic Semantics With Discourse Structure

5. Internet-based DFX for rapid and economical tool/mould making

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3