Natural language to SQL-Reference-Cited by-同舟云学术

Natural language to SQL

Published:2020-06 Issue:10 Volume:13 Page:1737-1750
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Kim Hyeonji¹,So Byeong-Hoon¹,Han Wook-Shin¹,Lee Hongrae²

Affiliation:

1. POSTECH, Korea

2. Google

Abstract

Translating natural language to SQL (NL2SQL) has received extensive attention lately, especially with the recent success of deep learning technologies. However, despite the large number of studies, we do not have a thorough understanding of how good existing techniques really are and how much is applicable to real-world situations. A key difficulty is that different studies are based on different datasets, which often have their own limitations and assumptions that are implicitly hidden in the context or datasets. Moreover, a couple of evaluation metrics are commonly employed but they are rather simplistic and do not properly depict the accuracy of results, as will be shown in our experiments. To provide a holistic view of NL2SQL technologies and access current advancements, we perform extensive experiments under our unified framework using eleven of recent techniques over 10+ benchmarks including a new benchmark (WTQ) and TPC-H. We provide a comprehensive survey of recent NL2SQL methods, introducing a taxonomy of them. We reveal major assumptions of the methods and classify translation errors through extensive experiments. We also provide a practical tool for validation by using existing, mature database technologies such as query rewrite and database testing. We then suggest future research directions so that the translation can be used in practice.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/3401960.3401970

Cited by 56 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Graph Reasoning Enhanced Language Models for Text-to-SQL;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

2. LLM-PBE: Assessing Data Privacy in Large Language Models;Proceedings of the VLDB Endowment;2024-07

3. MedT5SQL: a transformers-based large language model for text-to-SQL conversion in the healthcare domain;Frontiers in Big Data;2024-06-26

4. Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study;Proceedings of the ACM on Management of Data;2024-05-29

5. Text-to-SQL: A methodical review of challenges and models;Turkish Journal of Electrical Engineering and Computer Sciences;2024-05-20