Abstract
Analyzing collections of research papers can offer insight into whether a research area is stalled on the same problems or produces substantial novelty each year. Previous research has addressed similar tasks by analyzing keywords or reference lists, with varying degrees of human intervention. In this paper, we demonstrate how Normalized Relative Compression, together with a set of automated data-processing tasks, allows us to visually compare research articles and document collections. We also achieve very similar results with Normalized Conditional Compression, which can be applied with a regular compressor. With our approach, we can group papers from different disciplines, analyze how a conference evolves across its editions, or see how the profile of a researcher changes over time. We provide a set of tests that validate our technique and show that it performs better on these tasks than previously proposed techniques.
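To make the idea of a compression-based comparison with a regular compressor concrete, the following is a minimal sketch and not the authors' implementation. It assumes that the conditional compression C(x|y) can be approximated as C(yx) - C(y), a common approximation for off-the-shelf compressors, and that the result is normalized by C(x); the abstract does not state the exact definition, so both choices are assumptions. The use of `zlib` and the function names `compressed_size`, `conditional_compression`, and `ncc` are hypothetical choices for illustration.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Return the size in bytes of data after zlib compression (level 9)."""
    return len(zlib.compress(data, 9))


def conditional_compression(x: bytes, y: bytes) -> int:
    """Approximate C(x|y) as C(yx) - C(y).

    Assumption: this approximation is a stand-in for a true conditional
    compressor; it is not taken from the paper.
    """
    return max(compressed_size(y + x) - compressed_size(y), 0)


def ncc(x: bytes, y: bytes) -> float:
    """Illustrative normalized conditional compression of x given y.

    Assumption: normalization by C(x). Values near 0 suggest that y
    already describes x well; values near 1 suggest little shared content.
    """
    return conditional_compression(x, y) / compressed_size(x)


if __name__ == "__main__":
    paper_a = (b"We compare research articles with compression-based "
               b"measures and visualize the resulting similarity matrix "
               b"to explore document collections.")
    paper_b = (b"Galaxy rotation curves hint at unseen mass; telescopes "
               b"map faint halos, yet the particle nature of dark matter "
               b"eludes every detector built so far.")
    print("NCC(a|a):", ncc(paper_a, paper_a))
    print("NCC(a|b):", ncc(paper_a, paper_b))
```

Note that zlib only looks back over a 32 KB window, so for full-length papers a compressor with a larger context (for example `lzma` or `bz2` from the Python standard library) may be a more suitable stand-in.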
Subject
General Physics and Astronomy