TRANSDUCTIVE LEARNING FOR SHORT-TEXT CLASSIFICATION PROBLEMS USING LATENT SEMANTIC INDEXING-Reference-Cited by-同舟云学术

TRANSDUCTIVE LEARNING FOR SHORT-TEXT CLASSIFICATION PROBLEMS USING LATENT SEMANTIC INDEXING

Published:2005-03 Issue:02 Volume:19 Page:143-163
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

ZELIKOVITZ SARAH¹,MARQUEZ FINELLA¹

Affiliation:

1. Computer Science Department, College of Staten Island of CUNY, 2800 Victory Blvd, Staten Island, NY 10314, USA

Abstract

This paper presents work that uses Transductive Latent Semantic Indexing (LSI) for text classification. In addition to relying on labeled training data, we improve classification accuracy by incorporating the set of test examples in the classification process. Rather than performing LSI's singular value decomposition (SVD) process solely on the training data, we instead use an expanded term-by-document matrix that includes both the labeled data as well as any available test examples. We report the performance of LSI on data sets both with and without the inclusion of the test examples, and we show that tailoring the SVD process to the test examples can be even more useful than adding additional training data. This method can be especially useful to combat possible inclusion of unrelated data in the original corpus, and to compensate for limited amounts of data. Additionally, we evaluate the vocabulary of the training and test sets and present the results of a series of experiments to illustrate how the test set is used in an advantageous way.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001405003971

Reference9 articles.

1. Semi-Supervised Learning on Riemannian Manifolds

2. Adv. Neural Inform. Process. Syst.;Bennet K.,1998

3. Using Linear Algebra for Intelligent Information Retrieval

4. Indexing by latent semantic analysis

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on a Capsule Network Text Classification Method with a Self-Attention Mechanism;Symmetry;2024-04-24

2. Graph Receptive Transformer Encoder for Text Classification;IEEE Transactions on Signal and Information Processing over Networks;2024

3. A Novel Fuzzy Logic-Based Text Classification Method for Tracking Rare Events on Twitter;IEEE Transactions on Systems, Man, and Cybernetics: Systems;2021-07

4. Feature selection for classifying multi-labeled past events;International Journal on Digital Libraries;2020-09-08

5. Modeling Latent Relation to Boost Things Categorization Service;IEEE Transactions on Services Computing;2020-09-01