2312.10321
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
16 December 2023
Fuheng Zhao
Lawrence Lim
Ishtiyaque Ahmad
D. Agrawal
Amr El Abbadi
Github (105★)
Papers citing
"LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?"
41 / 41 papers shown
Access Paths for Efficient Ordering with Large Language Models
Fuheng Zhao
Jiayue Chen
Yiming Pan
Tahseen Rabbani
D. Agrawal
A. El Abbadi
Paritosh Aggarwal
Anupam Datta
Dimitris Tsirogiannis
239
1
0
30 Aug 2025
Taming SQL Complexity: LLM-Based Equivalence Evaluation for Text-to-SQL
Qingyun Zeng
Simin Ma
Arash Niknafs
Ashish Basran
Carol Szabo
191
1
0
11 Jun 2025
QUITE: A Query Rewrite System Beyond Rules with LLM Agents
Yuyang Song
Hanxu Yan
Jiale Lao
Yibo Wang
Yufei Li
Yuanchun Zhou
Jianguo Wang
Mingjie Tang
LRM
400
6
0
09 Jun 2025
Text-to-SQL Domain Adaptation via Human-LLM Collaborative Data Annotation
International Conference on Intelligent User Interfaces (IUI), 2025
Yuan Tian
Daniel Lee
Fei Wu
Tung Mai
Kun Qian
Siddhartha Sahai
Tianyi Zhang
Yunyao Li
SyDa
753
7
0
21 Feb 2025
EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking
Anjiang Wei
Jiannan Cao
Ran Li
Zeyang Zhang
Yuhui Zhang
...
Yuan Liu
Thiago S. F. X. Teixeira
Diyi Yang
Ke Wang
Ke Wang
LRM
437
1
0
18 Feb 2025
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELM
AILaw
1.3K
405
0
25 Nov 2024
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL Benchmark
Heegyu Kim
Taeyang Jeon
Seunghwan Choi
Seungtaek Choi
Hyunsouk Cho
430
7
0
24 Sep 2024
Hybrid Querying Over Relational Databases and Large Language Models
T. Pham
Cody T. Reynolds
A. El Abbadi
301
6
0
01 Aug 2024
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
Bosi Wen
Pei Ke
Xiaohan Zhang
Lindong Wu
Hao Huang
...
Jiaxin Xu
Yiming Liu
Jie Tang
Hongning Wang
Minlie Huang
CoGe
517
121
0
04 Jul 2024
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
International Conference on Learning Representations (ICLR), 2024
Zilong Wang
Hao Zhang
Chun-Liang Li
Julian Martin Eisenschlos
Vincent Perot
...
Lesly Miculicich
Yasuhisa Fujii
Jingbo Shang
Chen-Yu Lee
Tomas Pfister
ReLM
LMTD
LRM
302
229
0
09 Jan 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Computer Vision and Pattern Recognition (CVPR), 2023
Xiang Yue
Yuansheng Ni
Kai Zhang
Tianyu Zheng
Ruoqi Liu
...
Yibo Liu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
OSLM
ELM
VLM
950
1,898
0
27 Nov 2023
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Weixiang Yan
Haitian Liu
Yunkun Wang
Yunzhe Li
Qian Chen
...
Tingyu Lin
Weishan Zhao
Li Zhu
Hari Sundaram
Shuiguang Deng
ELM
LRM
473
57
0
14 Nov 2023
Language Models can be Logical Solvers
Jiazhan Feng
Ruochen Xu
Junheng Hao
Hiteshi Sharma
Haoran Pan
Dongyan Zhao
Weizhu Chen
ReLM
LRM
ELM
332
30
0
10 Nov 2023
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Yihe Deng
Weitong Zhang
Zixiang Chen
Quanquan Gu
LRM
567
146
0
07 Nov 2023
GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks
Xinlu Zhang
Yujie Lu
Weizhi Wang
An Yan
Jun Yan
Lianke Qin
Heng Wang
Xifeng Yan
William Y. Wang
Linda R. Petzold
LM&MA
MLLM
ELM
274
131
0
02 Nov 2023
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Weixiang Yan
Yuchen Tian
Yunzhe Li
Qian Chen
Wen Wang
449
95
0
08 Oct 2023
Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation
Proceedings of the VLDB Endowment (PVLDB), 2023
Dawei Gao
Haibin Wang
Yaliang Li
Xiuyu Sun
Yichen Qian
Bolin Ding
Jingren Zhou
AI4TS
656
554
0
29 Aug 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
12.1K
16,310
0
18 Jul 2023
C3: Zero-shot Text-to-SQL with ChatGPT
Xuemei Dong
Chuxu Zhang
Yuhang Ge
Yuren Mao
Yunjun Gao
Lu Chen
Jinshu Lin
Dongfang Lou
424
230
0
14 Jul 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
Neural Information Processing Systems (NeurIPS), 2023
Yuchen Zhuang
Yue Yu
Kuan-Chieh Wang
Haotian Sun
Chao Zhang
ELM
LLMAG
392
356
0
23 Jun 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Neural Information Processing Systems (NeurIPS), 2023
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
3.4K
7,658
0
09 Jun 2023
Large Language Models are not Fair Evaluators
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Peiyi Wang
Lei Li
Liang Chen
Zefan Cai
Dawei Zhu
Binghuai Lin
Yunbo Cao
Qi Liu
Tianyu Liu
Zhifang Sui
ALM
806
880
0
29 May 2023
How Language Model Hallucinations Can Snowball
International Conference on Machine Learning (ICML), 2023
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM
LRM
414
394
0
22 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Neural Information Processing Systems (NeurIPS), 2023
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
589
819
0
04 May 2023
Can Large Language Models Be an Alternative to Human Evaluations?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Cheng-Han Chiang
Hung-yi Lee
ALM
LM&MA
647
930
0
03 May 2023
From Words to Code: Harnessing Data for Program Synthesis from Natural Language
Anirudh Khatry
Joyce Cahoon
Jordan Henkel
Shaleen Deep
Venkatesh Emani
...
Vu Le
Mohammad Raza
Sherry Shi
Mukul Singh
A. Tiwari
351
16
0
02 May 2023
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction
Neural Information Processing Systems (NeurIPS), 2023
Mohammadreza Pourreza
Davood Rafiei
ReLM
LRM
418
616
0
21 Apr 2023
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability
Aiwei Liu
Xuming Hu
Lijie Wen
Philip S. Yu
LMTD
AI4MH
311
193
0
12 Mar 2023
Is ChatGPT better than Human Annotators? Potential and Limitations of ChatGPT in Explaining Implicit Hate Speech
The Web Conference (WWW), 2023
Fan Huang
Haewoon Kwak
Jisun An
AI4MH
314
322
0
11 Feb 2023
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought
International Conference on Learning Representations (ICLR), 2022
Abulhair Saparov
He He
ELM
LRM
ReLM
1.0K
450
0
03 Oct 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
International Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
3.6K
6,211
0
21 Mar 2022
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
2.6K
8,889
0
07 Jul 2021
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Chia-Hsuan Lee
Oleksandr Polozov
Matthew Richardson
LMTD
RALM
328
139
0
22 Jun 2021
Semantic Evaluation for Text-to-SQL with Distilled Test Suites
Ruiqi Zhong
Tao Yu
Dan Klein
218
171
0
06 Oct 2020
Grounded Adaptation for Zero-shot Executable Semantic Parsing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Victor Zhong
M. Lewis
Sida I. Wang
Luke Zettlemoyer
361
111
0
16 Sep 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
2.3K
55,939
0
28 May 2020
AmbigQA: Answering Ambiguous Open-domain Questions
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
495
430
0
22 Apr 2020
RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
636
767
0
10 Nov 2019
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Tao Yu
Rui Zhang
Kai-Chou Yang
Michihiro Yasunaga
Dongxu Wang
...
Irene Li
Qingning Yao
Shanelle Roman
Zilin Zhang
Dragomir R. Radev
RALM
896
1,713
0
24 Sep 2018
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
Victor Zhong
Caiming Xiong
R. Socher
RALM
1.2K
1,471
0
31 Aug 2017
Attention Is All You Need
Neural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
8.3K
171,167
0
12 Jun 2017