Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data-Reference-Cited by-同舟云学术

Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data

Published:2020-10-14 Issue: Volume:2020 Page:1-14
ISSN:1099-0526
Container-title:Complexity
language:en
Short-container-title:Complexity

Author:

Liu Hui¹^ORCID,Jiang Jingqing²,Hou Yaowei³,Song Jie³

Affiliation:

1. School of Metallurgy, Northeastern University, Shenyang, China

2. College of Computer Science and Technology, Inner Mongolia University for the Nationalities, Tongliao, China

3. Software College, Northeastern University, Shenyang, China

Abstract

Cities in the big data era hold the massive urban data to create valuable information and digitally enhanced services. Sources of urban data are generally categorized as one of the three types: official, social, and sensorial, which are from the government and enterprises, social networks of citizens, and the sensor network. These types typically differ significantly from each other but are consolidated together for the smart urban services. Based on the sophisticated consolidation approaches, we argue that a new challenge, fragment complexity that represents a well-integrated data has appropriate but fragmentary schema and difficult to be queried, is ignored in the state-of-art urban data management. Comparing with predefined and rigid schema, fragmentary schema means a dataset contains millions of attributes but nonorthogonally distributed among tables, and of course, values of these attributes are even massive. As far as a query is concerned, locating where these attributes are being stored is the first encountered problem, while traditional value-based query optimization has no contributions. To address this problem, we propose an index on massive attributes as an attributes-oriented optimization, namely, attribute index. Attribute index is a secondary index for locating files in which the target attributes are stored. It contains three parts: ATree for searching keys, DTree for locating keys among files, and ADLinks as a mapping table between ATree and DTree. In this paper, the index architecture, logical structure and algorithms, the implementation details, the creation process, the integration to the existing key-value store, and the urban application scenario are described. Experiments show that, in comparison with B + -Tree, LSM-Tree, and AVL-Tree, the query time of ATree is 1.1x, 1.5x, and 1.2x faster, respectively. Finally, we integrate our proposition with HBase, namely, UrbanBase, whose query performance is 1.3x faster than the original HBase.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Multidisciplinary,General Computer Science

Link

http://downloads.hindawi.com/journals/complexity/2020/8914757.pdf

Reference25 articles.

1. Urban big data fusion based on deep learning: An overview

2. Simulating Intraurban Land Use Dynamics under Multiple Scenarios Based on Fuzzy Cellular Automata: A Case Study of Jinzhou District, Dalian

3. Optimization of Planning Layout of Urban Building Based on Improved Logit and PSO Algorithms

4. In-Depth Analysis of Railway and Company Evolution of Yangtze River Delta with Deep Learning

5. Urban Big Data and the Development of City Intelligence