Affiliation:
1. Technische Universität Braunschweig, Braunschweig, Germany
Abstract
By incorporating human workers into the query execution process, crowd-enabled databases facilitate intelligent, social capabilities like completing missing data at query time or performing cognitive operators. But despite all their flexibility, crowd-enabled databases still maintain rigid schemas. In this paper, we extend crowd-enabled databases by flexible query-driven schema expansion, allowing the addition of new attributes to the database at query time. However, the number of crowd-sourced mini-tasks needed to fill in missing values may often be prohibitively large, and the resulting data quality is doubtful. Instead of simple crowd-sourcing to obtain all values individually, we leverage the user-generated data found in the Social Web: by exploiting user ratings, we build perceptual spaces, i.e., highly compressed representations of the opinions, impressions, and perceptions of large numbers of users. Using a few training samples obtained by expert crowd-sourcing, we can then extract all missing data automatically from the perceptual space with high quality and at low cost. Extensive experiments show that our approach can boost both the performance and the quality of crowd-enabled databases, while also providing the flexibility to expand schemas in a query-driven fashion.
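The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the perceptual space is built or how values are extracted from it, so everything below is an assumption made for illustration: a plain SGD matrix factorization of user ratings stands in for the perceptual space, and a nearest-labeled-neighbor rule stands in for the extraction step. The function names `build_perceptual_space` and `fill_attribute`, the toy ratings, and the attribute labels are all hypothetical.

```python
import random


def build_perceptual_space(ratings, n_users, n_items, dims=2,
                           epochs=500, lr=0.02, reg=0.02, seed=0):
    """Factorize sparse (user, item, rating) triples with plain SGD.

    Returns one low-dimensional vector per item. In this sketch these
    vectors play the role of the perceptual space (an assumption, not
    necessarily the construction used in the paper).
    """
    rng = random.Random(seed)
    users = [[rng.uniform(-0.1, 0.1) for _ in range(dims)]
             for _ in range(n_users)]
    items = [[rng.uniform(-0.1, 0.1) for _ in range(dims)]
             for _ in range(n_items)]
    for _ in range(epochs):
        for u, i, r in ratings:
            pred = sum(a * b for a, b in zip(users[u], items[i]))
            err = r - pred
            for k in range(dims):
                pu, qi = users[u][k], items[i][k]
                users[u][k] += lr * (err * qi - reg * pu)
                items[i][k] += lr * (err * pu - reg * qi)
    return items


def fill_attribute(item_vectors, labeled):
    """Give every unlabeled item the label of its nearest labeled item."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    filled = dict(labeled)
    for i, vec in enumerate(item_vectors):
        if i not in labeled:
            nearest = min(labeled, key=lambda j: dist2(vec, item_vectors[j]))
            filled[i] = labeled[nearest]
    return filled


# Toy data: users 0-2 like items 0-2, users 3-5 like items 3-5.
# Some ratings are left out to mimic sparse Social Web data.
ratings = [(u, i, 5 if (u < 3) == (i < 3) else 1)
           for u in range(6) for i in range(6) if (u + i) % 5 != 0]

space = build_perceptual_space(ratings, n_users=6, n_items=6)

# Two "expert" judgments for a new, query-driven attribute (hypothetical).
labeled = {0: True, 3: False}
filled = fill_attribute(space, labeled)
print(filled)
```

Only two items are labeled by hand here; the remaining four values of the new attribute are derived from the item vectors, which mirrors the idea of replacing one crowd task per value with a small training set.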
Subject
General Earth and Planetary Sciences; Water Science and Technology; Geography, Planning and Development
Cited by
24 articles.
1. Visual Life Sciences Workflow Design using Distributed and Heterogeneous Resources;IEEE/ACM Transactions on Computational Biology and Bioinformatics;2019
2. Beyond Micro-Tasks;Crowdsourcing;2019
3. Beyond Micro-Tasks;Social Entrepreneurship;2019
4. Beyond Micro-Tasks;Journal of Database Management;2018-01
5. Crowdsourcing for data management;Knowledge and Information Systems;2017-05-05