KEYWORD EXTRACTION FROM A SINGLE DOCUMENT USING WORD CO-OCCURRENCE STATISTICAL INFORMATION-Reference-Cited by-同舟云学术

KEYWORD EXTRACTION FROM A SINGLE DOCUMENT USING WORD CO-OCCURRENCE STATISTICAL INFORMATION

Published:2004-03 Issue:01 Volume:13 Page:157-169
ISSN:0218-2130
Container-title:International Journal on Artificial Intelligence Tools
language:en
Short-container-title:Int. J. Artif. Intell. Tools

Author:

MATSUO Y.¹,ISHIZUKA M.²

Affiliation:

1. National Institute of Advanced Industrial Science and Technology, Japan

2. University of Tokyo, Japan

Abstract

We present a new keyword extraction algorithm that applies to a single document without using a corpus. Frequent terms are extracted first, then a set of co-occurrences between each term and the frequent terms, i.e., occurrences in the same sentences, is generated. Co-occurrence distribution shows importance of a term in the document as follows. If the probability distribution of co-occurrence between term a and the frequent terms is biased to a particular subset of frequent terms, then term a is likely to be a keyword. The degree of bias of a distribution is measured by the χ2-measure. Our algorithm shows comparable performance to tfidf without using a corpus.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Artificial Intelligence

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218213004001466

Reference12 articles.

1. Methods of automatic term recognition

Cited by 381 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research progress analysis of live streaming commerce based on CiteSpace;Heliyon;2024-08

2. Keyphrase Extraction from Scientific Articles;International Journal of Scientific Research in Computer Science, Engineering and Information Technology;2024-06-20

3. A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph Construction;Mathematics;2024-04-29

4. A Context-Supported Hyperlink Navigation Process;2024 IEEE International Conference on Big Data and Smart Computing (BigComp);2024-02-18

5. A review of techniques for semantic understanding of the text with term weighting;AIP Conference Proceedings;2024