Blocking objectionable web content by leveraging multiple information sources-Reference-Cited by-同舟云学术

Blocking objectionable web content by leveraging multiple information sources

Published:2006-06 Issue:1 Volume:8 Page:17-26
ISSN:1931-0145
Container-title:ACM SIGKDD Explorations Newsletter
language:en
Short-container-title:SIGKDD Explor. Newsl.

Author:

Agarwal Nitin¹,Liu Huan¹,Zhang Jianping²

Affiliation:

1. Arizona State University, Tempe, AZ

2. AOL, Inc., Dulles, VA

Abstract

The World Wide Web has now become a humongous archive of various contents. The inordinate amount of information found on the web presents a challenge to deliver right information to the right users. On one hand, the abundant information is freely accessible to all web denizens; on the other hand, much of such information may be irrelevant or even deleterious to some users. For example, some control and filtering mechanisms are desired to prevent inappropriate or offensive materials such as pornographic websites from reaching children. Ways of accessing websites are termed as Access Scenarios . An Access Scenario can include using search engines (e.g., image search that has very little textual content), URL redirection to some websites, or directly typing (porn) website URLs. In this paper we propose a framework to analyze a website from several different aspects or information sources, and generate a classification model aiming to accurately classify such content irrespective of access scenarios. Extensive experiments are performed to evaluate the resulting system, which illustrates the promise of the proposed approach.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/1147234.1147238

Reference31 articles.

1. A Subspace Clustering Framework for Research Group Collaboration

2. Combining labeled and unlabeled data with co-training

3. Maximum Likelihood from Incomplete Data Via theEMAlgorithm

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Big Data Analytics: Deep Content-Based Prediction with Sampling Perspective;Computer Systems Science and Engineering;2023

2. Feature Selection Techniques for Big Data Analytics;Electronics;2022-10-03

3. A BTM-Based Adaptive Objectionable Short Text Filtering Framework;Wireless Communications and Mobile Computing;2022-01-22

4. Attributes Reduction in Big Data;Applied Sciences;2020-07-17

5. LWCR: multi-Layered Wikipedia representation for Computing word Relatedness;Neurocomputing;2016-12