Generalizing and Improving Bilingual Word Embedding Mappings with a Multi-Step Framework of Linear Transformations

Published:2018-04-27 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Artetxe Mikel, Labaka Gorka, Agirre Eneko

Abstract

Using a dictionary to map independently trained word embeddings to a shared space has been shown to be an effective approach to learning bilingual word embeddings. In this work, we propose a multi-step framework of linear transformations that generalizes a substantial body of previous work. The core step of the framework is an orthogonal transformation, and existing methods can be explained in terms of the additional normalization, whitening, re-weighting, de-whitening and dimensionality reduction steps. This allows us to gain new insights into the behavior of existing methods, including the effectiveness of inverse regression, and to design a novel variant that obtains the best published results in zero-shot bilingual lexicon extraction. The corresponding software is released as an open source project.
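
The core orthogonal step named in the abstract can be illustrated with a minimal sketch. This is not the authors' released software: it assumes the standard orthogonal Procrustes solution via SVD, preceded by length normalization, and it omits the whitening, re-weighting, de-whitening and dimensionality reduction steps. The function names `length_normalize` and `orthogonal_map` are hypothetical, and the data is synthetic, standing in for the embeddings of dictionary word pairs.

```python
import numpy as np

def length_normalize(m):
    """Scale each row (one word embedding) to unit Euclidean length."""
    norms = np.linalg.norm(m, axis=1, keepdims=True)
    return m / np.maximum(norms, 1e-12)

def orthogonal_map(x, z):
    """Return the orthogonal W minimizing ||x @ W - z||_F (Procrustes).

    x and z hold source and target embeddings; row i of x and row i of z
    are assumed to be a dictionary translation pair.
    """
    u, _, vt = np.linalg.svd(x.T @ z)
    return u @ vt

# Synthetic stand-in for a seed dictionary: the "target" embeddings are
# the "source" embeddings under a hidden rotation q.
rng = np.random.default_rng(0)
x = length_normalize(rng.standard_normal((200, 5)))
q, _ = np.linalg.qr(rng.standard_normal((5, 5)))
z = x @ q

w = orthogonal_map(x, z)
mapped = x @ w  # source embeddings expressed in the target space
```

Because `w` is orthogonal, the mapping preserves dot products and lengths within the source space, so the normalization applied beforehand still holds after mapping.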

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 17 articles.

1. Cross-Lingual Word Embedding Generation Based on Procrustes-Hungarian Linear Projection;2024 International Conference on Asian Language Processing (IALP);2024-08-04

2. SeNSe: embedding alignment via semantic anchors selection;International Journal of Data Science and Analytics;2024-03-20

3. Bilingual Lexicon Induction From Comparable and Parallel Data: A Comparative Analysis;Lecture Notes in Computer Science;2024

4. A Scalable Approach to Aligning Natural Language and Knowledge Graph Representations: Batched Information Guided Optimal Transport;2023 IEEE International Conference on Big Data (BigData);2023-12-15

5. Automating the Transition from Dialectal to Literary Forms in Uzbek Language Texts: An Algorithmic Perspective;2023 IEEE XVI International Scientific and Technical Conference Actual Problems of Electronic Instrument Engineering (APEIE);2023-11-10