Enhancing Text to SQL Generation with Dynamic Vector Search
No Thumbnail Available
Date
2024-07
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Indian Statistical Institute, Kolkata
Abstract
Generating accurate SQL from natural language questions (text-to-SQL) is a longstanding
challenge due to the complexities involved in understanding user queries, comprehending
database schemas, and generating SQL statements. Traditional text-to-SQL
systems have utilized human-engineered solutions and deep neural networks. More recently,
pre-trained language models (PLMs) have been employed for text-to-SQL tasks,
showing promising results. However, as modern databases and user queries become increasingly
complex, the limited comprehension capabilities of PLMs can lead to incorrect
SQL generation. This necessitates sophisticated and tailored optimization methods,
which restrict the applicability of PLM-based systems.
In contrast, large language models (LLMs) have demonstrated significant advancements
in natural language understanding as their scale increases. This thesis explores
the integration of LLMs into text-to-SQL systems, highlighting unique opportunities,
challenges, and solutions. We propose a novel approach that leverages examples similar
to user queries, allowing the model to better understand and generate accurate SQL.
This work provides a comprehensive review of LLM-based text-to-SQL systems,
outlining current challenges and the evolutionary process of the field. We introduce
datasets and metrics designed for evaluating text-to-SQL systems. Finally, we discuss
remaining challenges and propose future directions for research in this domain.
Description
Dissertation under the guidance of Jayanta Kumar Mukherjee and Prof. Dipti Prasad Mukherjee
Keywords
pre-trained language models (PLMs), large language models (LLMs), Zero-Shot Experiments
Citation
30p.
