Correlates of Representation Errors in Internet Data Sources for Real Estate Market

Author:

Beręsewicz Maciej¹

Affiliation:

1. Poznań University of Economics and Business, Department of Statistics, al. Niepodległości 10, 61-875 Poznań, Poland.

Abstract

New data sources, namely big data and the Internet, have become an important issue in statistics and for official statistics in particular. However, before these sources can be used for statistics, it is necessary to conduct a thorough analysis of sources of nonrepresentativeness. In the article, we focus on detecting correlates of the selection mechanism that underlies Internet data sources for the secondary real estate market in Poland and results in representation errors (frame and selection errors). To identify characteristics of properties offered online, we link data collected from the two largest advertisement services in Poland with the Register of Real Estate Prices and Values, which covers all transactions made in Poland. Quarterly data for 2016 were linked at a domain level defined by local administrative units (LAU1), the urban/rural distinction, and usable floor area (UFA) categorized into four groups. To identify correlates of representation error, we used a generalized additive mixed model based on almost 5,500 domains, including quarters. Results indicate that properties not advertised online differ significantly from those advertised on the Internet in terms of UFA and location. A non-linear relationship with the average price per m² can be observed, which diminishes after accounting for LAU1 units.

Publisher

Walter de Gruyter GmbH


Cited by 4 articles.
