Affiliation:
1. Allameh Tabataba'i University, Tehran, Iran
2. Alzahra University, Tehran, Iran
3. Shahid Beheshti University, Tehran, Iran
Abstract
In this article, we present an automatic semantic role labeling system in Persian consisting of two modules: argument identification for specifying argument spans and argument classification for categorizing their semantic roles. Our modules have been trained on Persian Proposition Bank in which predicate-argument information is manually added as a layer on top of Persian Dependency Treebank with about 30,000 sentences. Therefore, our system was trained on 216,871 verbal predicates and 42,386 nonverbal ones consisting of 40,813 nouns and 1,573 adjectives with 33 semantic classes. As a supervised method, we used maximum entropy for building an argument identifier that results in human-level accuracy of 99% and support vector machine for an argument classifier with an F1 of 84. Regarding both verbal and nonverbal predicates with an expanded role set, we achieved reasonable results.
Funder
Computer Research Center of Islamic Sciences
Publisher
Association for Computing Machinery (ACM)
Reference27 articles.
1. Lessons from building a Persian written corpus: Peykare;Khan M. Bi Jen;Language Resources and Evaluation,2011
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献