Author:
Chen Jian, Chen Jianpeng, She Xiangrong, Mao Jian, Chen Gang
Abstract
An address is a structured description used to identify a specific place or point of interest, and it provides an effective way to locate people or objects. The standardization of Chinese place names and addresses plays an important role in the construction of smart cities. Traditional address standardization techniques typically rely on text similarity or rule bases, which cannot handle complex, missing, or redundant address information well. This paper recasts address standardization as the task of computing the similarity of address pairs, and proposes a contrastive learning address matching model based on an attention-Bi-LSTM-CNN network (ABLC). First, ABLC uses a Trie syntax tree algorithm to extract Chinese address elements. Next, following the basic idea of contrastive learning, a hybrid neural network is applied to learn the semantic information in the addresses. Finally, the Manhattan distance between the two learned representations is used to compute the similarity of the two addresses. Experiments on a self-constructed dataset with data augmentation demonstrate that the proposed model is more stable and performs better than the other baselines.
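As a rough illustration of the first and last steps described in the abstract, the sketch below builds a small trie, splits an address into elements by longest match, and turns a Manhattan distance into a similarity score. It is a minimal sketch under stated assumptions, not the authors' implementation: the vocabulary, the helper names (`Trie`, `extract_elements`, `manhattan_similarity`), the single-character fallback for unknown text, and the `exp(-distance)` mapping are all hypothetical choices made for this example, and the vectors stand in for the output of the hybrid network, which is not reproduced here.

```python
import math


class Trie:
    """Character-level trie holding known address elements (hypothetical vocabulary)."""

    def __init__(self):
        self.root = {}

    def insert(self, word):
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node["$end"] = True  # multi-character key, cannot clash with a single character

    def longest_match_end(self, text, start):
        """Return the end index of the longest known element starting at `start`, or -1."""
        node = self.root
        end = -1
        for i in range(start, len(text)):
            node = node.get(text[i])
            if node is None:
                break
            if "$end" in node:
                end = i + 1
        return end


def extract_elements(trie, address):
    """Greedy longest-match segmentation; unknown characters are emitted one by one (assumption)."""
    elements = []
    i = 0
    while i < len(address):
        end = trie.longest_match_end(address, i)
        if end == -1:
            elements.append(address[i])
            i += 1
        else:
            elements.append(address[i:end])
            i = end
    return elements


def manhattan_similarity(u, v):
    """Map the Manhattan (L1) distance to (0, 1] with exp(-d); the mapping is an assumption."""
    distance = sum(abs(a - b) for a, b in zip(u, v))
    return math.exp(-distance)


# Illustrative vocabulary and address only
trie = Trie()
for element in ["安徽", "安徽省", "芜湖市", "镜湖区"]:
    trie.insert(element)

elements = extract_elements(trie, "安徽省芜湖市镜湖区1号")
identical = manhattan_similarity([1.0, 2.0], [1.0, 2.0])
different = manhattan_similarity([0.0, 0.0], [1.0, 1.0])
```

With this mapping, identical vectors score 1 and the score decays toward 0 as the L1 distance grows, which gives a bounded target for a pairwise matching loss.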
Funder
Key R&D Projects of Wuhu Science and Technology Plan in 2020
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
9 articles.