Quantifying syntax similarity with a polynomial representation of dependency trees
-
Published:2022
Issue:
Volume:53
Page:59-79
-
ISSN:2625-8226
-
Container-title:Glottometrics
-
language:
-
Short-container-title:Glottometrics
Author:
Liu PengyuORCID,
Feng TinghaoORCID,
Liu RuiORCID
Abstract
We introduce a graph polynomial that distinguishes tree structures to represent dependency grammar and a measure based on the polynomial representation to quantify syntax similarity. The polynomial encodes accurate and comprehensive information about the dependency structure and dependency relations of words in a sentence, which enables in-depth analysis of dependency trees with data analysis tools. We apply the polynomial-based methods to analyze sentences in the ParallelUniversal Dependencies treebanks. Specifically, we compare the syntax of sentences and their translations in different languages, and we perform a syntactic typology study of available languages in the Parallel Universal Dependencies treebanks. We also demonstrate and discuss the potential of the methods in measuring syntax diversity of corpora.
Publisher
International Quantitative Linguistics Association
Subject
Applied Mathematics,Linguistics and Language,Language and Linguistics
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献