Complex-valued Neural Network-based Quantum Language Models-Reference-Cited by-同舟云学术

Complex-valued Neural Network-based Quantum Language Models

Published:2022-03-09 Issue:4 Volume:40 Page:1-31
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Zhang Peng¹,Hui Wenjie¹^ORCID,Wang Benyou²,Zhao Donghao¹,Song Dawei³,Lioma Christina⁴,Simonsen Jakob Grue⁴

Affiliation:

1. Tianjin University, Tianjin, China

2. University of Padua, Padua, Italy

3. Beijing Institute of Technology, Beijing, China

4. University of Copenhagen, Copenhagen, Denmark

Abstract

Language modeling is essential in Natural Language Processing and Information Retrieval related tasks. After the statistical language models, Quantum Language Model (QLM) has been proposed to unify both single words and compound terms in the same probability space without extending term space exponentially. Although QLM achieved good performance in ad hoc retrieval, it still has two major limitations: (1) QLM cannot make use of supervised information, mainly due to the iterative and non-differentiable estimation of the density matrix, which represents both queries and documents in QLM. (2) QLM assumes the exchangeability of words or word dependencies, neglecting the order or position information of words. This article aims to generalize QLM and make it applicable to more complicated matching tasks (e.g., Question Answering) beyond ad hoc retrieval. We propose a complex-valued neural network-based QLM solution called C-NNQLM to employ an end-to-end approach to build and train density matrices in a light-weight and differentiable manner, and it can therefore make use of external well-trained word vectors and supervised labels. Furthermore, C-NNQLM adopts complex-valued word vectors whose phase vectors can directly encode the order (or position) information of words. Note that complex numbers are also essential in the quantum theory. We show that the real-valued NNQLM (R-NNQLM) is a special case of C-NNQLM. The experimental results on the QA task show that both R-NNQLM and C-NNQLM achieve much better performance than the vanilla QLM, and C-NNQLM’s performance is on par with state-of-the-art neural network models. We also evaluate the proposed C-NNQLM on text classification and document retrieval tasks. The results on most datasets show that the C-NNQLM can outperform R-NNQLM, which demonstrates the usefulness of the complex representation for words and sentences in C-NNQLM.

Funder

state key development program of China

Natural Science Foundation of China

European Unions Horizon 2020 research and innovation program under the Marie SkodowskaCurie

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3505138

Reference74 articles.

1. Quantum entanglement in concept combinations;Aerts Diederik;CoRR,2013

2. Martin Arjovsky, Amar Shah, and Yoshua Bengio. 2016. Unitary evolution recurrent neural networks. In Proceedings of the International Conference on International Conference on Machine Learning.

3. Esma Balkr. 2014. Using density matrices in a compositional distributional model of meaning. Master’s thesis. University of Oxford.

4. Towards Quantum Language Models

5. William Blacoe. 2014. Semantic composition inspired by quantum measurement. In Proceedings of the International Symposium on Quantum Interaction. Springer, 41–53.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Quantum-inspired language models based on unitary transformation;Information Processing & Management;2024-07

2. Quantum-inspired Neural Network Based on Stochastic Liouville-von Neumann Equation for Sentiment Classification;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

3. Quantum Topic Model: Topic Modeling Using Variational Quantum Circuits;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

4. Hierarchical Dense Pattern Detection in Tensors;ACM Transactions on Knowledge Discovery from Data;2023-02-28

5. Quantum-Inspired Neural Language Representation, Matching and Understanding;Foundations and Trends® in Information Retrieval;2023