INCLUSION OF RELEVANCE INFORMATION IN THE TERM DISCRIMINATION MODEL-Reference-Cited by-同舟云学术

INCLUSION OF RELEVANCE INFORMATION IN THE TERM DISCRIMINATION MODEL

Published:1989-02-01 Issue:2 Volume:45 Page:85-109
ISSN:0022-0418
Container-title:Journal of Documentation
language:en
Short-container-title:

Author:

BIRU TESFAYE,EL‐HAMDOUCHI ABDELMOULA,REES RODNEY S.,WILLETT PETER

Abstract

The term discrimination value of an index term has been proposed as a quantitative measure of the extent to which that term can discriminate between documents in bibliographic databases. Previous work has suggested that the most discriminating terms are those with medium frequencies of occurrence. This paper discusses the effect of including relevance data on the calculation of term discrimination values. Two algorithms are described that calculate the ability of index terms to discriminate between relevant documents, between non‐relevant documents or between relevant and non‐relevant documents. The application of these algorithms to several standard document test collections demonstrates that the exact form of the relationship between term frequency and term discrimination depends upon the particular type of discrimination which is being measured; in particular, medium frequency terms are not necessarily the best discriminators when relevance data is available. These results are compared with the discriminatory ability of terms as measured by their relevance weights, where the most discriminating terms are those with low frequencies of occurrence.

Publisher

Emerald

Subject

Library and Information Sciences,Information Systems

Reference22 articles.

1. A Theory of Indexing

2. A vector space model for automatic indexing

3. A theory of term importance in automatic text analysis

4. Automatic indexing using term discrimination and term precision measurements

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning semantic relatedness from term discrimination information;Expert Systems with Applications;2009-03

2. Integrating information retrieval and data mining to discover project team coordination patterns;Decision Support Systems;2006-11

3. Knowledge map creation and maintenance for virtual communities of practice;Information Processing & Management;2006-03

4. A Conceptual Model for Virtual Organizational Learning;Journal of Organizational Computing and Electronic Commerce;2001-09

5. Visualization of term discrimination analysis;Journal of the American Society for Information Science and Technology;2001