Improving Transition-Based Dependency Parsing of Hindi and Urdu by Modeling Syntactically Relevant Phenomena-Reference-Cited by-同舟云学术

Improving Transition-Based Dependency Parsing of Hindi and Urdu by Modeling Syntactically Relevant Phenomena

Published:2017-04-06 Issue:3 Volume:16 Page:1-35
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Bhat Riyaz Ahmad¹,Bhat Irshad Ahmad¹,Sharma Dipti Misra¹

Affiliation:

1. LTRC, IIIT-H, Hyderabad, India

Abstract

In recent years, transition-based parsers have shown promise in terms of efficiency and accuracy. Though these parsers have been extensively explored for multiple Indian languages, there is still considerable scope for improvement by properly incorporating syntactically relevant information. In this article, we enhance transition-based parsing of Hindi and Urdu by redefining the features and feature extraction procedures that have been previously proposed in the parsing literature of Indian languages. We propose and empirically show that properly incorporating syntactically relevant information like case marking, complex predication and grammatical agreement in an arc-eager parsing model can significantly improve parsing accuracy. Our experiments show an absolute improvement of ∼2% LAS for parsing of both Hindi and Urdu over a competitive baseline which uses rich features like part-of-speech (POS) tags, chunk tags, cluster ids and lemmas. We also propose some heuristics to identify ezafe constructions in Urdu texts which show promising results in parsing these constructions.

Funder

NSF

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3005447

Reference76 articles.

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Children learn ergative case marking in Hindi using statistical preemption and clause-level semantics (intentionality): evidence from acceptability judgment and elicited production studies with children and adults;Open Research Europe;2023-09-13

2. Hybrid embeddings for transition-based dependency parsing of free word order languages;Information Processing & Management;2023-05

3. Children learn ergative case marking in Hindi using statistical preemption and clause-level semantics (intentionality): evidence from acceptability judgment and elicited production studies with children and adults;Open Research Europe;2023-03-29

4. Comparative Analysis of Path-based Similarity Measures for Word Sense Disambiguation;2023 3rd International conference on Artificial Intelligence and Signal Processing (AISP);2023-03-18

5. UrduAI: Writeprints for Urdu Authorship Identification;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-03-31