Affiliation:
1. Centre of Excellence in Artificial Intelligence Indian Institute of Technology Kharagpur India
Abstract
AbstractInferring the gender of named entities present in a text has several practical applications in information sciences. Existing approaches toward name gender identification rely exclusively on using the gender distributions from labeled data. In the absence of such labeled data, these methods fail. In this article, we propose a two‐stage model that is able to infer the gender of names present in text without requiring explicit name‐gender labels. We use coreference resolution as the backbone for our proposed model. To aid coreference resolution where the existing contextual information does not suffice, we use a retrieval‐assisted context aggregation framework. We demonstrate that state‐of‐the‐art name gender inference is possible without supervision. Our proposed method matches or outperforms several supervised approaches and commercially used methods on five English language datasets from different domains.
Subject
Library and Information Sciences,Information Systems and Management,Computer Networks and Communications,Information Systems