Please use this identifier to cite or link to this item:
http://hdl.handle.net/10263/7530
Title: | Enhancing Text to SQL Generation with Dynamic Vector Search |
Authors: | Mondal, Soumyadweep |
Keywords: | pre-trained language models (PLMs) large language models (LLMs) Zero-Shot Experiments |
Issue Date: | Jul-2024 |
Publisher: | Indian Statistical Institute, Kolkata |
Citation: | 30p. |
Series/Report no.: | Dissertation;;CrS;22-17 |
Abstract: | Generating accurate SQL from natural language questions (text-to-SQL) is a longstanding challenge due to the complexities involved in understanding user queries, comprehending database schemas, and generating SQL statements. Traditional text-to-SQL systems have utilized human-engineered solutions and deep neural networks. More recently, pre-trained language models (PLMs) have been employed for text-to-SQL tasks, showing promising results. However, as modern databases and user queries become increasingly complex, the limited comprehension capabilities of PLMs can lead to incorrect SQL generation. This necessitates sophisticated and tailored optimization methods, which restrict the applicability of PLM-based systems. In contrast, large language models (LLMs) have demonstrated significant advancements in natural language understanding as their scale increases. This thesis explores the integration of LLMs into text-to-SQL systems, highlighting unique opportunities, challenges, and solutions. We propose a novel approach that leverages examples similar to user queries, allowing the model to better understand and generate accurate SQL. This work provides a comprehensive review of LLM-based text-to-SQL systems, outlining current challenges and the evolutionary process of the field. We introduce datasets and metrics designed for evaluating text-to-SQL systems. Finally, we discuss remaining challenges and propose future directions for research in this domain. |
Description: | Dissertation under the guidance of Jayanta Kumar Mukherjee and Prof. Dipti Prasad Mukherjee |
URI: | http://hdl.handle.net/10263/7530 |
Appears in Collections: | Dissertations - M Tech (CRS) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Soumyadweep_CrS2217_2024.pdf | Dissertations - M Tech (CRS) | 601.32 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.