Generating Summaries Through Unigram and Bigram

Author:

Alsharman Nesreen Mohammad1,Pivkina Inna V.2

Affiliation:

1. WISE, Amman, Jordan

2. NMSU, USA

Abstract

This article describes a new method for generating extractive summaries directly via unigram and bigram extraction techniques. The methodology uses the selective part of speech tagging to extract significant unigrams and bigrams from a set of sentences. Extracted unigrams and bigrams along with other features are used to build a final summary. A new selective rule-based part of speech tagging system is developed that concentrates on the most important parts of speech for summarizations: noun, verb, and adjective. Other parts of speech such as prepositions, articles, adverbs, etc., play a lesser role in determining the meaning of sentences; therefore, they are not considered when choosing significant unigrams and bigrams. The proposed method is tested on two problem domains: citations and opinosis data sets. Results show that the proposed method performs better than Text-Rank, LexRank, and Edmundson summarization methods. The proposed method is general enough to summarize texts from any domain.

Publisher

IGI Global

Subject

General Computer Science

Reference32 articles.

1. A rule-based approach for tagging nonvocalized Arabic words.;A.Al-Taani;The International Arab Journal of Information Technology,2009

2. Improving Performance of Text Summarization.;S.Babar;Procedia Computer Science,2015

3. Belica, M. (2014). sumy 0.4.1 is Module for Automatic Summarization of Text Documents and html Pages. Retrieved from http://pydoc.net/Python/sumy/0.4.1/

4. Binwahlan, M. S., Suanmali, L., & Salim, N. (2009). Sentence Features Fusion for Text Summarization using Fuzzy Logic. In Proceedings of the International Conference on Hybrid Intelligent Systems (pp. 142–146). Academic Press.

5. Brill, E. (1992). A simple rule-based part of speech tagger. Proceedings of the Workshop on Speech and Natural Language - HLT ’91 (p. 112). Academic Press.

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. The AILA Methodology for Automated and Intelligent Likelihood Assignment in Risk Assessment;IEEE Access;2023

2. EDCOSUM: Text extractive summarization framework based on edge information with coreference resolution;Journal of Intelligent & Fuzzy Systems;2022-06-01

3. The AILA Methodology for Automated and Intelligent Likelihood Assignment;2022 6th International Conference on Cryptography, Security and Privacy (CSP);2022-01

4. An analysis of the writing of ‘suicide cult’ members;Digital Scholarship in the Humanities;2021-06-03

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3