A framework for learning web wrappers from the crowd
Author:
Affiliation:
1. Università Roma Tre, Rome, Italy
Publisher
ACM Press
Reference16 articles.
1. D. Angluin. Queries revisited. Theor. Comput. Sci., 313(2):175--194, 2004.
2. A. Arasu and H. Garcia-Molina. Extracting structured data from web pages. In SIGMOD Conference, pages 337--348. ACM, 2003.
3. M.-F. Balcan, S. Hanneke, and J. W. Vaughan. The true sample complexity of active learning. Machine Learning, 80(2-3):111--139, 2010.
4. C.-H. Chang and S.-C. Lui. IEPAD: information extraction based on pattern discovery. In WWW, pages 681--688, 2001.
5. R. Creo, V. Crescenzi, D. Qiu, and P. Merialdo. Minimizing the costs of the training data for learning web wrappers. In VLDS, pages 35--40, 2012.
Cited by 13 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Hybrid Crowd-Machine Wrapper Inference;ACM Transactions on Knowledge Discovery from Data;2019-10-31
2. Big Data Linkage for Product Specification Pages;Proceedings of the 2018 International Conference on Management of Data;2018-05-27
3. Crowdsourcing for data management;Knowledge and Information Systems;2017-05-05
4. Random Query Answering with the Crowd;Journal on Data Semantics;2015-10-27
5. Intensional data on the web;ACM SIGWEB Newsletter;2015-08-17
1.学者识别学者识别
2.学术分析学术分析
3.人才评估人才评估
"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370
www.globalauthorid.com
TOP
Copyright © 2019-2024 北京同舟云网络信息技术有限公司 京公网安备11010802033243号 京ICP备18003416号-3