Please use this identifier to cite or link to this item: http://hdl.handle.net/10263/7530
Title: Enhancing Text to SQL Generation with Dynamic Vector Search
Authors: Mondal, Soumyadweep
Keywords: pre-trained language models (PLMs)
large language models (LLMs)
Zero-Shot Experiments
Issue Date: Jul-2024
Publisher: Indian Statistical Institute, Kolkata
Citation: 30p.
Series/Report no.: Dissertation;;CrS;22-17
Abstract: Generating accurate SQL from natural language questions (text-to-SQL) is a longstanding challenge due to the complexities involved in understanding user queries, comprehending database schemas, and generating SQL statements. Traditional text-to-SQL systems have utilized human-engineered solutions and deep neural networks. More recently, pre-trained language models (PLMs) have been employed for text-to-SQL tasks, showing promising results. However, as modern databases and user queries become increasingly complex, the limited comprehension capabilities of PLMs can lead to incorrect SQL generation. This necessitates sophisticated and tailored optimization methods, which restrict the applicability of PLM-based systems. In contrast, large language models (LLMs) have demonstrated significant advancements in natural language understanding as their scale increases. This thesis explores the integration of LLMs into text-to-SQL systems, highlighting unique opportunities, challenges, and solutions. We propose a novel approach that leverages examples similar to user queries, allowing the model to better understand and generate accurate SQL. This work provides a comprehensive review of LLM-based text-to-SQL systems, outlining current challenges and the evolutionary process of the field. We introduce datasets and metrics designed for evaluating text-to-SQL systems. Finally, we discuss remaining challenges and propose future directions for research in this domain.
Description: Dissertation under the guidance of Jayanta Kumar Mukherjee and Prof. Dipti Prasad Mukherjee
URI: http://hdl.handle.net/10263/7530
Appears in Collections:Dissertations - M Tech (CRS)

Files in This Item:
File Description SizeFormat 
Soumyadweep_CrS2217_2024.pdfDissertations - M Tech (CRS)601.32 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.