Affiliation:
1. School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China
Abstract
Text matching, as a core technology of natural language processing, plays a key role in tasks such as question-and-answer systems and information retrieval. In recent years, the development of neural networks, attention mechanisms, and large-scale language models has significantly contributed to the advancement of text-matching technology. However, the rapid development of the field also poses challenges in fully understanding the overall impact of these technological improvements. This paper aims to provide a concise, yet in-depth, overview of the field of text matching, sorting out the main ideas, problems, and solutions for text-matching methods based on statistical methods and neural networks, as well as delving into matching methods based on large-scale language models, and discussing the related configurations, API applications, datasets, and evaluation methods. In addition, this paper outlines the applications and classifications of text matching in specific domains and discusses the current open problems that are being faced and future research directions, to provide useful references for further developments in the field.
Funder
intelligent integrated media platform r&d and application demonstration project
Reference121 articles.
1. A Fast Algorithm for Computing Longest Common Subsequences;Hunt;Commun. ACM,1977
2. Binary Codes Capable of Correcting Deletions, Insertions, and Reversals;Levenshtein;Sov. Phys. Dokl.,1966
3. Winkler, W.E. (2024, May 21). String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage, Available online: https://files.eric.ed.gov/fulltext/ED325505.pdf.
4. Measures of the Amount of Ecologic Association between Species;Dice;Ecology,1945
5. The Distribution of the Flora in the Alpine Zone. 1;Jaccard;New Phytol.,1912