ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.10321
  4. Cited By
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
v1v2v3v4 (latest)

LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?

16 December 2023
Fuheng Zhao
Lawrence Lim
Ishtiyaque Ahmad
D. Agrawal
A. El Abbadi
Amr El Abbadi
ArXiv (abs)PDFHTMLGithub (105★)

Papers citing "LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?"

41 / 41 papers shown
Access Paths for Efficient Ordering with Large Language Models
Access Paths for Efficient Ordering with Large Language Models
Fuheng Zhao
Jiayue Chen
Yiming Pan
Tahseen Rabbani
D. Agrawal
D. Agrawal
A. El Abbadi
Paritosh Aggarwal
Anupam Datta
Dimitris Tsirogiannis
239
1
0
30 Aug 2025
Taming SQL Complexity: LLM-Based Equivalence Evaluation for Text-to-SQL
Taming SQL Complexity: LLM-Based Equivalence Evaluation for Text-to-SQL
Qingyun Zeng
Simin Ma
Arash Niknafs
Ashish Basran
Carol Szabo
191
1
0
11 Jun 2025
QUITE: A Query Rewrite System Beyond Rules with LLM Agents
QUITE: A Query Rewrite System Beyond Rules with LLM Agents
Yuyang Song
Hanxu Yan
Jiale Lao
Yibo Wang
Yufei Li
Yuanchun Zhou
Jianguo Wang
Mingjie Tang
LRM
400
6
0
09 Jun 2025
Text-to-SQL Domain Adaptation via Human-LLM Collaborative Data Annotation
Text-to-SQL Domain Adaptation via Human-LLM Collaborative Data AnnotationInternational Conference on Intelligent User Interfaces (IUI), 2025
Yuan Tian
Daniel Lee
Fei Wu
Tung Mai
Kun Qian
Siddhartha Sahai
Tianyi Zhang
Yunyao Li
SyDa
753
7
0
21 Feb 2025
EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking
EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking
Anjiang Wei
Jiannan Cao
Ran Li
Zeyang Zhang
Yuhui Zhang
...
Yuan Liu
Thiago S. F. X. Teixeira
Diyi Yang
Ke Wang
Ke Wang
LRM
437
1
0
18 Feb 2025
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELMAILaw
1.3K
405
0
25 Nov 2024
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL
  Benchmark
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL Benchmark
Heegyu Kim
Taeyang Jeon
Seunghwan Choi
Seungtaek Choi
Hyunsouk Cho
430
7
0
24 Sep 2024
Hybrid Querying Over Relational Databases and Large Language Models
Hybrid Querying Over Relational Databases and Large Language Models
T. Pham
Cody T. Reynolds
A. El Abbadi
301
6
0
01 Aug 2024
Benchmarking Complex Instruction-Following with Multiple Constraints
  Composition
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
Bosi Wen
Pei Ke
Xiaohan Zhang
Lindong Wu
Hao Huang
...
Jiaxin Xu
Yiming Liu
Jie Tang
Hongning Wang
Minlie Huang
CoGe
517
121
0
04 Jul 2024
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
  Understanding
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table UnderstandingInternational Conference on Learning Representations (ICLR), 2024
Zilong Wang
Hao Zhang
Chun-Liang Li
Julian Martin Eisenschlos
Vincent Perot
...
Lesly Miculicich
Yasuhisa Fujii
Jingbo Shang
Chen-Yu Lee
Tomas Pfister
ReLMLMTDLRM
302
229
0
09 Jan 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning
  Benchmark for Expert AGI
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGIComputer Vision and Pattern Recognition (CVPR), 2023
Xiang Yue
Yuansheng Ni
Kai Zhang
Tianyu Zheng
Ruoqi Liu
...
Yibo Liu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
OSLMELMVLM
950
1,898
0
27 Nov 2023
CodeScope: An Execution-based Multilingual Multitask Multidimensional
  Benchmark for Evaluating LLMs on Code Understanding and Generation
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Weixiang Yan
Haitian Liu
Yunkun Wang
Yunzhe Li
Qian Chen
...
Tingyu Lin
Weishan Zhao
Li Zhu
Hari Sundaram
Shuiguang Deng
ELMLRM
473
57
0
14 Nov 2023
Language Models can be Logical Solvers
Language Models can be Logical Solvers
Jiazhan Feng
Ruochen Xu
Junheng Hao
Hiteshi Sharma
Haoran Pan
Dongyan Zhao
Weizhu Chen
ReLMLRMELM
332
30
0
10 Nov 2023
Rephrase and Respond: Let Large Language Models Ask Better Questions for
  Themselves
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Yihe Deng
Weitong Zhang
Zixiang Chen
Quanquan Gu
LRM
567
146
0
07 Nov 2023
GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks
GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks
Xinlu Zhang
Yujie Lu
Weizhi Wang
An Yan
Jun Yan
Lianke Qin
Heng Wang
Xifeng Yan
William Y. Wang
Linda R. Petzold
LM&MAMLLMELM
274
131
0
02 Nov 2023
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code
  Translation
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Weixiang Yan
Yuchen Tian
Yunzhe Li
Qian Chen
Wen Wang
449
95
0
08 Oct 2023
Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation
Text-to-SQL Empowered by Large Language Models: A Benchmark EvaluationProceedings of the VLDB Endowment (PVLDB), 2023
Dawei Gao
Haibin Wang
Yaliang Li
Xiuyu Sun
Yichen Qian
Bolin Ding
Jingren Zhou
AI4TS
656
554
0
29 Aug 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MHALM
12.1K
16,310
0
18 Jul 2023
C3: Zero-shot Text-to-SQL with ChatGPT
C3: Zero-shot Text-to-SQL with ChatGPT
Xuemei Dong
Chuxu Zhang
Yuhang Ge
Yuren Mao
Yunjun Gao
Lu Chen
Jinshu Lin
Dongfang Lou
424
230
0
14 Jul 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
ToolQA: A Dataset for LLM Question Answering with External ToolsNeural Information Processing Systems (NeurIPS), 2023
Yuchen Zhuang
Yue Yu
Kuan-Chieh Wang
Haotian Sun
Chao Zhang
ELMLLMAG
392
356
0
23 Jun 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Judging LLM-as-a-Judge with MT-Bench and Chatbot ArenaNeural Information Processing Systems (NeurIPS), 2023
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALMOSLMELM
3.4K
7,658
0
09 Jun 2023
Large Language Models are not Fair Evaluators
Large Language Models are not Fair EvaluatorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Peiyi Wang
Lei Li
Liang Chen
Zefan Cai
Dawei Zhu
Binghuai Lin
Yunbo Cao
Qi Liu
Tianyu Liu
Zhifang Sui
ALM
806
880
0
29 May 2023
How Language Model Hallucinations Can Snowball
How Language Model Hallucinations Can SnowballInternational Conference on Machine Learning (ICML), 2023
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILMLRM
414
394
0
22 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for
  Large-Scale Database Grounded Text-to-SQLs
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLsNeural Information Processing Systems (NeurIPS), 2023
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
589
819
0
04 May 2023
Can Large Language Models Be an Alternative to Human Evaluations?
Can Large Language Models Be an Alternative to Human Evaluations?Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Cheng-Han Chiang
Hung-yi Lee
ALMLM&MA
647
930
0
03 May 2023
From Words to Code: Harnessing Data for Program Synthesis from Natural
  Language
From Words to Code: Harnessing Data for Program Synthesis from Natural Language
Anirudh Khatry
Joyce Cahoon
Jordan Henkel
Shaleen Deep
Venkatesh Emani
...
Vu Le
Mohammad Raza
Sherry Shi
Mukul Singh
A. Tiwari
351
16
0
02 May 2023
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with
  Self-Correction
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-CorrectionNeural Information Processing Systems (NeurIPS), 2023
Mohammadreza Pourreza
Davood Rafiei
ReLMLRM
418
616
0
21 Apr 2023
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability
Aiwei Liu
Xuming Hu
Lijie Wen
Philip S. Yu
LMTDAI4MH
311
193
0
12 Mar 2023
Is ChatGPT better than Human Annotators? Potential and Limitations of
  ChatGPT in Explaining Implicit Hate Speech
Is ChatGPT better than Human Annotators? Potential and Limitations of ChatGPT in Explaining Implicit Hate SpeechThe Web Conference (WWW), 2023
Fan Huang
Haewoon Kwak
Jisun An
AI4MH
314
322
0
11 Feb 2023
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of
  Chain-of-Thought
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-ThoughtInternational Conference on Learning Representations (ICLR), 2022
Abulhair Saparov
He He
ELMLRMReLM
1.0K
450
0
03 Oct 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLMBDLLRMAI4CE
3.6K
6,211
0
21 Mar 2022
Evaluating Large Language Models Trained on Code
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELMALM
2.6K
8,889
0
07 Jul 2021
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers
KaggleDBQA: Realistic Evaluation of Text-to-SQL ParsersAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Chia-Hsuan Lee
Oleksandr Polozov
Matthew Richardson
LMTDRALM
328
139
0
22 Jun 2021
Semantic Evaluation for Text-to-SQL with Distilled Test Suites
Semantic Evaluation for Text-to-SQL with Distilled Test Suites
Ruiqi Zhong
Tao Yu
Dan Klein
218
171
0
06 Oct 2020
Grounded Adaptation for Zero-shot Executable Semantic Parsing
Grounded Adaptation for Zero-shot Executable Semantic ParsingConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Victor Zhong
M. Lewis
Sida I. Wang
Luke Zettlemoyer
361
111
0
16 Sep 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot LearnersNeural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
2.3K
55,939
0
28 May 2020
AmbigQA: Answering Ambiguous Open-domain Questions
AmbigQA: Answering Ambiguous Open-domain Questions
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
495
430
0
22 Apr 2020
RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL
  Parsers
RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL ParsersAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
636
767
0
10 Nov 2019
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain
  Semantic Parsing and Text-to-SQL Task
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL TaskConference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Tao Yu
Rui Zhang
Kai-Chou Yang
Michihiro Yasunaga
Dongxu Wang
...
Irene Li
Qingning Yao
Shanelle Roman
Zilin Zhang
Dragomir R. Radev
RALM
896
1,713
0
24 Sep 2018
Seq2SQL: Generating Structured Queries from Natural Language using
  Reinforcement Learning
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
Victor Zhong
Caiming Xiong
R. Socher
RALM
1.2K
1,471
0
31 Aug 2017
Attention Is All You Need
Attention Is All You NeedNeural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
8.3K
171,167
0
12 Jun 2017
1
Page 1 of 1