Modified CNN with Transfer Learning for Multi-Document Summarization: Proposing Co-Occurrence Matrix Generation-Based Knowledge Extraction

Author:

Ketineni Sunilkumar1,Sheela J.2

Affiliation:

1. Research Scholar, School of Computer Science and Engineering, VIT-AP University, Amaravathi, Andhra Pradesh 522237, India

2. Assistant Professor, School of Computer Science and Engineering, VIT-AP University, Amaravathi, Andhra Pradesh 522237, India

Abstract

The rapid growth of the internet and computing devices significantly increases the rise of the quantity of information. The major technique which is utilized to lower the limit of information quantity is ATS. By this technique, the text documents are precisely summarized within a short time as per user requirements. However, its effectiveness is limited due to various issues like subject identification, elucidation, summary generation and analysis of the generated summary. To tackle these issues and to obtain precise and concise summarized text, this proposed approach results as the efficient multi-document summarization approach by focusing on those issues. This proposed approach is incorporated with preprocessing, feature extraction, knowledge extraction and summarization processes. Initially, preprocessing is done by BERT tokenization and it results in preprocessed tokens from multiple documents. The features extracted from preprocessed tokens are the proposed aspect term feature, TF-IDF feature, Word2Vec feature and average word length feature. After, relevant features are extracted from the preprocessed token to train modified CNN in the feature extraction process. Instantaneously, the preprocessed tokens are employed in the knowledge extraction process for extracting essential knowledge to improve the training ability of modified CNN. This modified CNN is trained with the obtained weight, extracted feature set and knowledge to obtain precise and concise summarized text from multiple documents. Thus, the final summarized outcome has been from modified CNN with transfer learning (MCNN-TL). Finally, a variety of analyses are used to assess the suggested multi-document summary methodology, demonstrating its superiority over current summarization methods.

Publisher

World Scientific Pub Co Pte Ltd

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3