The Download Estimation task on KDD Cup 2003

Published: 2003-12 Issue: 2 Volume: 5 Page: 160-162
ISSN: 1931-0145
Container-title: ACM SIGKDD Explorations Newsletter
Language: en
Short-container-title: SIGKDD Explor. Newsl.

Author:

Brank Janez¹, Leskovec Jure¹

Affiliation:

1. Jožef Stefan Institute, Ljubljana, Slovenia

Abstract

This paper describes our work on the Download Estimation task for KDD Cup 2003. The task requires us to estimate how many times a paper is downloaded in the first 60 days after it is published on arXiv.org, a preprint server for papers on physics and related areas. The training data consists of approximately 29000 papers, the citation graph, and information about the downloads of a subset of these papers. Our approach is based on an extension of the bag-of-words model, with linear SVM regression as the learning algorithm. We describe our experiments with various kinds of features. We focus particularly on issues of feature construction and weighting, which turn out to be quite important for this task.
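
To make the approach in the abstract concrete, the following is a minimal sketch of bag-of-words features fed into a linear regressor with an epsilon-insensitive (SVR-style) loss. It is not the authors' implementation: the TF-IDF weighting, the subgradient-descent trainer (a stand-in for an SVM package such as LIBSVM), the hyperparameters, the toy paper texts, and the target values are all assumptions made for illustration. The paper's extended features, such as those derived from the citation graph, are not reproduced here.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive whitespace tokenizer (assumption; the paper's preprocessing is not given here).
    return text.lower().split()

def fit_idf(docs):
    """Return {term: log(N / document_frequency)} over the training documents."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    n = len(docs)
    return {term: math.log(n / count) for term, count in df.items()}

def vectorize(doc, idf):
    """Sparse TF-IDF vector, L2-normalized; terms unseen in training are dropped."""
    counts = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def predict(w, b, x):
    """Linear model: bias plus dot product of sparse weights and features."""
    return b + sum(w.get(t, 0.0) * v for t, v in x.items())

def train_linear_svr(X, y, epsilon=0.1, lam=1e-4, lr=0.02, epochs=2000):
    """Subgradient descent on the epsilon-insensitive loss with a small L2 penalty.

    Hypothetical trainer for illustration only; all hyperparameters are assumptions.
    """
    w, b = {}, 0.0
    for _ in range(epochs):
        for x, target in zip(X, y):
            err = predict(w, b, x) - target
            for t in w:                      # L2 shrinkage on every step
                w[t] *= 1.0 - lr * lam
            if abs(err) > epsilon:           # loss is zero inside the epsilon tube
                step = lr if err > 0 else -lr
                for t, v in x.items():
                    w[t] = w.get(t, 0.0) - step * v
                b -= step
    return w, b

# Toy data: invented texts and invented targets (e.g. a scaled download count).
papers = [
    "quantum field theory",
    "string theory duality",
    "lattice gauge simulation",
    "string theory black hole",
]
targets = [1.0, 2.0, 0.5, 2.5]

idf = fit_idf(papers)
X = [vectorize(p, idf) for p in papers]
w, b = train_linear_svr(X, targets)
fitted = [predict(w, b, x) for x in X]

# Estimate for a previously unseen (also invented) paper text.
estimate = predict(w, b, vectorize("string field theory", idf))
```

The sketch keeps feature weighting (`fit_idf`, `vectorize`) separate from the learner, which is the part the abstract singles out as most influential: swapping the weighting scheme only requires changing `vectorize`.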

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/980972.980997

References: 4 articles.

1. LIBSVM

2. Authoritative sources in a hyperlinked environment

3. A. J. Smola, B. Schölkopf: A tutorial on support vector regression. NeuroCOLT2 Rept. NC2-TR-1998-030, Oct. 1998.

Cited by 2 articles.

1. Analyzing readers behavior in downloading articles from IEEE digital library: a study of two selected journals in the field of education;Scientometrics;2017-01-11

2. References;Machine Learning and Data Mining;2007