Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

[TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on large language model-based text-to-SQL.

License

NotificationsYou must be signed in to change notification settings

DEEP-PolyU/Awesome-LLM-based-Text2SQL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

81 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository provides a comprehensive collection of research papers, benchmarks, and open-source projects onlarge language model-based text-to-SQL (LLM-based Text-to-SQL). It includes all the contents from our survey paper 📖"Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL" and will be continuously updated to incorporate the up-to-date advances and notable contributions from the text-to-SQL community. Stay tuned!!

🤗You are vey welcome to contribute to this repository by launching an issue or a pull request. If you find any missing resources or come across interesting new research works, please don’t hesitate to open an issue or submit a PR!

📫Contact us via emails:zijin[dot]hong[at]connect[dot]polyu[dot]hk

📃 Pleasecite our paper if you find our survey or repository helpful!

🔥 News

  • [2025-10-21] 🎉🎉 Our paper"Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation" has beenaccepted byIEEE International Conference on Data Engineering (ICDE)!
  • [2025-09-21] 🔥🔥 Finished building the benchmarks, datasets, and taxonomy for this repository.
  • [2025-09-14] 🔥🔥 Repository launched based on our survey paper to keep track of recent progress in LLM-based text-to-SQL.
  • [2025-09-02] 🎉🎉 Our paper"Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL" has beenaccepted byIEEE Transactions on Knowledge and Data Engineering (TKDE)!
  • [2025-05-01] 🎉🎉 Our paper"Struture-Guided Large Language Models for Text-to-SQL Generation" has beenaccepted byInternational Conference of Machine Learning (ICML)!

Overview of LLM-based Text-to-SQL Workflow

A user asks a question about football leagues. The LLM takes this question together with the schema of the corresponding database as input and generates an SQL query as output. The generated SQL is then executed on the database, retrieving the result"The 5 leagues with the highest matches", which answers the user's question.

📜 Catalog

Awesome-LLM-based-Text2SQL


📈 Trends

A Sketch of Research Trends in the Field of Text-to-SQL with Representative Works

Before 2023, the focus is on a selection of representative traditional studies. However, from 2023 onward, the emphasis shifts to the rapid advancements driven by LLMs, marking a significant acceleration in the field.

📰 Surveys

  • TKDE Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL [Paper] [Code]
  • CSUR2025 A Survey on Employing Large Language Models for Text-to-SQL Tasks [Paper]
  • TKDE A Survey of Text-to-SQL in the Era of LLMs: Where are We, and Where are We Going? [Paper]
  • TKDE Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey [Paper]
  • arXiv2024 Large Language Model Enhanced Text-to-SQL Generation: A Survey [Paper]
  • VLDBJ2023 A Survey on Deep Learning Approaches for Text-to-SQL [Paper]
  • VLDB2023 Natural Language Interfaces for Databases with Deep Learning [Paper]
  • arXiv2022 A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions [Paper]
  • COLING2022 Recent Advances in Text-to-SQL: A Survey of What We Have and What We Expect [Paper]

🏆 Benchmarks

In the era of LLMs, two benchmarks and their variants/extensions are widely recognized for evaluating text-to-SQL capabilities.We will continually update the top five methods on each benchmark to showcase the latest advances in the text-to-SQL community. These benchmarks, along with other text-to-SQL dataset papers, are listed in thedatasets section below.

BIRD - A Big Bench for Large-Scale Database Grounded Text-to-SQL

Method/ModelDev EX (%)Test EX (%)Paper/CodeDate
arXiv2025 Agentar-Scale-SQL74.9081.67[Paper]2025-09-25
arXiv2025 AskData + GPT-4o76.1480.88[Paper]2025-03-11
Proprietary LongData-SQL74.3277.53[Proprietary]2025-07-14
ICLR2025 CHASE-SQL + Gemini74.9076.02[Paper]2025-04-16
Proprietary JoyDataAgent-SQL74.2575.74[Report] [Code]2025-09-22
Proprietary TCDataAgent-SQL74.1275.74[Report]2025-05-30
Proprietary Contextual-SQL73.5075.63[Report] [Code]2025-02-27

Spider1.0 - Semantic Parsing and Text-to-SQL Challenge

Method/ModelDev EX (%)Test EX (%)Paper/CodeDate
Proprietary MiniSeek-91.2[Report]2023-11-02
VLDB2024 DAIL-SQL + GPT-482.486.6[Paper] [Code]2023-08-20
NeurIPS2025 DIN-SQL + GPT-474.285.3[Paper] [Code]2023-04-21
arXiv2023 C3 + ChatGPT81.882.3[Paper] [Code]2023-06-01
AAAI2025 RESDSQL-3B + NatSQL84.179.9[Paper] [Code]2023-02-27

Spider2.0 - Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Method/ModelSnow ScoreLite ScorePaper/CodeDate
arXiv2025 AgenticData + Qwen3-44.5[Paper]2025-08-07
ICLR2025 ReFoRCE + o337.1137.84[Paper] [Code]2025-05-22
arXiv2024 RSL-SQL + o3-33.09[Paper] [Code]2025-07-10
EMNLP2025 LinkAlign + DeepSeek-R1-33.09[Paper] [Code]2025-04-27
ICLR2025 Spider-Agent + Claude-3.7-Sonnet-28.52[Paper] [Code]2025-03-16

BIRD-CRITIC - Can LLMs Fix User Issues in Real-World Database Applications?

ModelSR (%)Date
ByteBrain-Agent43.332025-06-10
GPT-5-High34.962025-09-04
grok-433.682025-07-18
DeepSeek-R133.512025-04-20
o3-Mini33.332025-04-20

BIRD-INTERACT - Re-imagining Text-to-SQL Evaluation via Lens of Dynamic Interactions

Model/MethodRewardDate
Gemini-2.5-Pro20.922025-08-22
o3-Mini20.272025-08-22
Claude-Sonnet-418.352025-08-22
Qwen-3-Coder-480B17.752025-08-22
DeepSeek-V315.152025-08-22

🗃️ Datasets

We categorize the datasets intoOriginal Datasets andPost-annotated Datasets based on whether they were released with the original dataset (question–SQL pairs) and databases, or were developed by adapting existing datasets and databases with special settings. ThePost-annotated Datasets rely on the databases fromSpider 1.0. For each original dataset, we list its characteristics, number of examples, and number of databases under the dataset title.

Original Datasets

  • arXiv2025BIRD-CRITIC | SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications [Paper] [Code] [Dataset]
    Knowledge-augmented, Long-context; #Example: 600; #DB: 95
  • ICLR2025Spider2.0 | Spider 2.0: Evaluating Language Models on Real-world Enterprise Text-to-SQL Workflows [Paper] [Code] [Dataset]
    Knowledge-augmented, Long-context; #Example: 632; #DB: 213
  • SIGMOD2025BULL | FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis [Paper] [Code] [Dataset]
    Knowledge-augmented, Long-context; #Example: 4,966; #DB: 3
  • NeurIPS2023BIRD | Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs [Paper] [Code] [Dataset]
    Cross-domain, Knowledge-augmented; #Example: 12,751; #DB: 95
  • ACL2021KaggleDBQA | KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers [Paper] [Code] [Dataset]
    Cross-domain; #Example: 272; #DB: 8
  • EMNLP2020DuSQL | DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset [Paper] [Dataset]
    Cross-domain, Cross-lingual; #Example: 23,797; #DB: 200
  • Findings2020SQUALL | On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries [Paper] [Code]
    Cross-domain, Cross-lingual; #Example: 11,468; #DB: 1,679
  • EMNLP2019CoSQL | CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases [Paper] [Code] [Dataset]
    Cross-domain, Context-dependent; #Example: 15,598; #DB: 200
  • EMNLP2018Spider | Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task [Paper] [Code] [Dataset]
    Cross-domain; #Example: 10,181; #DB: 200
  • arXiv2017WikiSQL | Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning [Paper] [Code] [Dataset]
    Cross-domain; #Example: 80,654; #DB: 26,521

Post-annotated Datasets

  • ICLR2023Dr. Spider | Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness [Paper] [Code]
    Robustness; Perturbations in DB, query and SQL
  • ACL2022ADVETA | Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation [Paper] [Code] [Dataset]
    Robustness; Adversarial table perturbation
  • Findings2022Spider-SS&CG | Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment [Paper] [Code] [Dataset]
    Context-dependent; Splitting example into sub-examples
  • EMNLP2021Spider-DK | Exploring Underexplored Limitations of Cross-Domain Text-to-SQL Generalization [Paper] [Code]
    Knowledge-augmented; Adding domain knowledge
  • ACL2021Spider-SYN | Towards Robustness of Text-to-SQL Models against Synonym Substitution [Paper] [Code]
    Knowledge-augmented; Adding domain knowledge
  • Findings2020Spider-Vietnames | A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese [Paper] [Code]
    Cross-lingual; Vietnamese version of Spider
  • NAACL2021Spider-Realistic | Structure-Grounded Pretraining for Text-to-SQL [Paper] [Dataset]
    Robustness; Removing column names in question
  • EMNLP2019CSpider | A Pilot Study for Chinese SQL Semantic Parsing [Paper] [Code]
    Cross-lingual; Chinese version of Spider
  • EMNLP2019SParC | SParC: Cross-Domain Semantic Parsing in Context [Paper] [Code] [Dataset]
    Context-dependent; Annotate conversational contents

🪴 Taxonomy

The implementation of recent LLM-based text-to-SQL methods primarily relies on in-context learning and fine-tuning, enabled by the release of both powerful proprietary and well-architected open-source LLMs.A detailed categorization of text-to-SQL methods can be found inour paper, and subsequent latest research papers will be continually updated and aligned with this taxonomy.

In-context Learning

  • arXiv2025 Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling [Paper] [Code]
  • EMNLP2025 LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL [Paper] [Code]
  • ICLR2025 ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Consensus Enforcement, and Column Exploration [Paper] [Code]
  • arXiv2025 CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning [Paper] [Code]
  • arXiv2025 SAFE-SQL: Self-Augmented In-Context Learning with Fine-grained Example Selection for Text-to-SQL [Paper]
  • COLING2025 Gen-SQL: Efficient Text-to-SQL by Bridging Natural Language Question and Database Schema with Pseudo-Schema [Paper] [Code]
  • COLING2025 In-Context Reinforcement Learning based Retrieval-Augmented Generation for Text-to-SQL [Paper]
  • ICLR2025 Spider 2.0: Evaluating Language Models on Real-world Enterprise Text-to-SQL Workflows [Paper] [Code]
  • arXiv2024 RSL-SQL: Robust Schema Linking in Text-to-SQL Generation [Paper] [Code]
  • ICLR2025 CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL [Paper]
  • arXiv2024 E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL [Paper] [Code]
  • NeurIPS202 The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models [Paper]
  • VLDB2024 The Dawn of Natural Language to SQL: Are We Fully Ready? [Paper]
  • arXiv2024 CHESS: Contextual Harnessing for Efficient SQL Synthesis [Paper] [Code]
  • COLING2025 MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation [Paper]
  • Findings2020 Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation [Paper] [Code]
  • arXiv2024 Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL [Paper] [Code]
  • AAAI2025 MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL [Paper] [Code]
  • NAACL2025 You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL [Paper]
  • ICDE2024 PURPLE: Making a Large Language Model a Better SQL Writer [Paper]
  • arXiv2024 PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency [Paper] [Code]
  • ACL2025³: "This is My SQL, Are You With Me?" A Consensus-Based Multi-Agent System for Text-to-SQL Tasks [Paper] [Code]
  • ICDE2024 MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation [Paper] [Code]
  • EMNLP2024 Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments [Paper] [Code]
  • arXiv2024 SQL-CRAFT: Text-to-SQL through Interactive Refinement and Enhanced Reasoning [Paper]
  • ICML2025 Structure-Guided Large Language Models for Text-to-SQL Generation [Paper]
  • Findings2020 Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM [Paper] [Code]
  • Findings2024 Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL [Paper] [Code]
  • Findings2020 Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm [Paper] [Code]
  • COLING2025 MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL [Paper] [Code]
  • Findings2023 ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought [Paper] [Code]
  • Findings2023 Selective Demonstrations for Cross-domain Text-to-SQL [Paper] [Code]
  • VLDB2024 Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation [Paper] [Code]
  • arXiv2023 C3: Zero-shot Text-to-SQL with ChatGPT [Paper] [Code]
  • ICONIP2023 Retrieval-augmented GPT-3.5-based Text-to-SQL Framework with Sample-aware Prompting and Dynamic Revision Chain [Paper]
  • TMLR2024 SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL [Paper]
  • EMNLP2023 Exploring Chain of Thought Style Prompting for Text-to-SQL [Paper]
  • Findings2023 Enhancing Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies [Paper]
  • EMNLP2023 StructGPT: A General Framework for Large Language Model to Reason over Structured Data [Paper] [Code]
  • NeurIPS2023 DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction [Paper] [Code]
  • PRICAI2023 Prompting GPT-3.5 for Text-to-SQL with De-semanticization and Skeleton Retrieval [Paper]
  • ICLR2024 Teaching Large Language Models to Self-Debug [Paper]
  • ICML2023 LEVER: Learning to Verify Language-to-Code Generation with Execution [Paper] [Code]
  • ICML2023 Coder Reviewer Reranking for Code Generation [Paper]
  • EMNLP2022 Natural Language to Code Translation with Execution [Paper] [Code]

Fine-tuning

  • arXiv2025 MARS-SQL: A multi-agent reinforcement learning framework for Text-to-SQL [Paper] [Code]
  • ACL2025 SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL [Paper] [Code]
  • ICLR2025 ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL [Paper] [Code]
  • arXiv2024 A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL [Paper] [Code]
  • NAACL2025 MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation [Paper] [Code]
  • arXiv2025 Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation [Paper]
  • Findings2024 DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models [Paper] [Code]
  • COLING2025 MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL [Paper] [Code]
  • NeurIPS202 The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models [Paper]
  • arXiv2024 Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL [Paper] [Code]
  • COLM2024 StructLM: Towards Building Generalist Models for Structured Knowledge Grounding [Paper] [Code]
  • SIGMOD2024 CodeS: Towards Building Open-source Language Models for Text-to-SQL [Paper] [Code]
  • ACL2024 Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models [Paper] [Code]
  • VLDB2024 Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation [Paper] [Code]
  • ICML2024 CLLMs: Consistency Large Language Models [Paper] [Code]

📦 Projects

📃 Citation

@article{hong2025next,  title={Next-generation database interfaces: A survey of llm-based text-to-sql},  author={Hong, Zijin and Yuan, Zheng and Zhang, Qinggang and Chen, Hao and Dong, Junnan and Huang, Feiran and Huang, Xiao},  journal={IEEE Transactions on Knowledge and Data Engineering},  year={2025}}

About

[TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on large language model-based text-to-SQL.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp