Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.12307
Cited By
v1
v2
v3
v4
v5 (latest)
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
18 June 2024
Seungbin Yang
Yujin Baek
Taehee Kim
Jaegul Choo
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?"
44 / 44 papers shown
Title
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
Wei He
Yueqing Sun
Hongyan Hao
Xueyuan Hao
Zhikang Xia
...
X. Su
Xiaodong Cai
Xunliang Cai
Yu Yang
Yunke Zhao
114
0
0
30 Sep 2025
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models
S. Jung
Donghun Lee
Shinbok Lee
Gaeun Seo
Daniel Lee
Byeongil Ko
Junrae Cho
Kihyun Kim
EungGyun Kim
M. Shin
267
2
0
02 Apr 2025
InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context
Bryan L. M. de Oliveira
Luana G. B. Martins
Bruno Brandão
Luckeciano C. Melo
ELM
794
3
0
17 Feb 2025
Learning Evolving Tools for Large Language Models
International Conference on Learning Representations (ICLR), 2024
Guoxin Chen
Zhong Zhang
Xin Cong
Fangda Guo
Yesai Wu
Yankai Lin
Wenzheng Feng
Yasheng Wang
KELM
471
5
0
09 Oct 2024
Qwen2 Technical Report
An Yang
Baosong Yang
Binyuan Hui
Jian Xu
Bowen Yu
...
Yuqiong Liu
Zeyu Cui
Zhenru Zhang
Zhifang Guo
Zhi-Wei Fan
OSLM
VLM
MU
480
1,604
0
15 Jul 2024
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
Zhiting Fan
Ruizhe Chen
Ruiling Xu
Zuozhu Liu
KELM
222
29
0
14 Jul 2024
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
Kangyun Ning
Yisong Su
Xueqiang Lv
Yuanzhe Zhang
Jian Liu
Kang Liu
Jinan Xu
ELM
LLMAG
104
6
0
02 Jul 2024
Tools Fail: Detecting Silent Errors in Faulty Tools
Jimin Sun
So Yeon Min
Yingshan Chang
Yonatan Bisk
255
14
0
27 Jun 2024
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
Tong Zhang
Peixin Qin
Yang Deng
Chen Huang
Wenqiang Lei
Junhong Liu
Dingnan Jin
Hongru Liang
Tat-Seng Chua
172
22
0
20 May 2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
...
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
481
1,817
0
22 Apr 2024
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
Soyeong Jeong
Jinheon Baek
Sukmin Cho
Sung Ju Hwang
Jong C. Park
RALM
274
304
0
21 Mar 2024
What Are Tools Anyway? A Survey from the Language Model Perspective
Zhiruo Wang
Zhoujun Cheng
Hao Zhu
Daniel Fried
Graham Neubig
258
46
0
18 Mar 2024
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhicheng Guo
Sijie Cheng
Hao Wang
Shihao Liang
Yujia Qin
Peng Li
Zhiyuan Liu
Maosong Sun
Yang Liu
ELM
345
62
0
12 Mar 2024
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments
Yu Gu
Yiheng Shu
Hao Yu
Xiao Liu
Yuxiao Dong
Jie Tang
Jayanth Srinivasa
Hugo Latapie
Yu-Chuan Su
KELM
LLMAG
270
45
0
22 Feb 2024
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
Junjie Ye
Sixian Li
Guanyu Li
Jessica Fan
Songyang Gao
Yilong Wu
Tao Gui
Tao Gui
Xuanjing Huang
LLMAG
322
46
0
16 Feb 2024
Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
Shijue Huang
Wanjun Zhong
Jianqiao Lu
Qi Zhu
Jiahui Gao
...
Yasheng Wang
Lifeng Shang
Xin Jiang
Ruifeng Xu
Qun Liu
LLMAG
163
63
0
30 Jan 2024
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Junjie Ye
Yilong Wu
Songyang Gao
Jessica Fan
Sixian Li
Guanyu Li
Xiaoran Fan
Tao Gui
Tao Gui
Xuanjing Huang
AAML
183
30
0
16 Jan 2024
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Siyu Yuan
Kaitao Song
Jiangjie Chen
Xu Tan
Yongliang Shen
Ren Kan
Dongsheng Li
Deqing Yang
LLMAG
198
103
0
11 Jan 2024
Mistral 7B
Albert Q. Jiang
Alexandre Sablayrolles
A. Mensch
Chris Bamford
Devendra Singh Chaplot
...
Teven Le Scao
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE
LRM
330
2,857
0
10 Oct 2023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
International Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Haoran Pan
Yujiu Yang
Shiyu Huang
Nan Duan
Weizhu Chen
LRM
AI4CE
LLMAG
302
247
0
29 Sep 2023
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
International Conference on Learning Representations (ICLR), 2023
Yujia Qin
Shi Liang
Yining Ye
Kunlun Zhu
Lan Yan
...
Jie Zhou
Mark B. Gerstein
Dahai Li
Zhiyuan Liu
Maosong Sun
CLL
ALM
LLMAG
ELM
LM&MA
500
1,038
0
31 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
5.3K
14,855
0
18 Jul 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Neural Information Processing Systems (NeurIPS), 2023
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
2.2K
6,246
0
09 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
204
247
0
04 Jun 2023
Large Language Models as Tool Makers
International Conference on Learning Representations (ICLR), 2023
Tianle Cai
Xuezhi Wang
Tengyu Ma
Xinyun Chen
Denny Zhou
LLMAG
220
249
0
26 May 2023
MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tatsuro Inaba
Hirokazu Kiyomaru
Fei Cheng
Sadao Kurohashi
KELM
LRM
222
36
0
26 May 2023
On the Tool Manipulation Capability of Open-source Large Language Models
Qiantong Xu
Fenglu Hong
Yangqiu Song
Changran Hu
Zheng Chen
Jian Zhang
LLMAG
196
94
0
25 May 2023
Gorilla: Large Language Model Connected with Massive APIs
Neural Information Processing Systems (NeurIPS), 2023
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELM
CLL
ALM
SyDa
336
817
0
24 May 2023
Selectively Answering Ambiguous Questions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jeremy R. Cole
Michael J.Q. Zhang
D. Gillick
Julian Martin Eisenschlos
Bhuwan Dhingra
Jacob Eisenstein
UQLM
381
46
0
24 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
International Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
337
550
0
19 May 2023
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
Neural Information Processing Systems (NeurIPS), 2023
Shibo Hao
Tianyang Liu
Zhen Wang
Zhiting Hu
RALM
LLMAG
369
230
0
19 May 2023
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Minghao Li
Yingxiu Zhao
Yu Bowen
Feifan Song
Hangyu Li
Haiyang Yu
Zhoujun Li
Fei Huang
Yongbin Li
ELM
RALM
CLL
205
274
0
14 Apr 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
3.4K
20,007
0
15 Mar 2023
Toolformer: Language Models Can Teach Themselves to Use Tools
Neural Information Processing Systems (NeurIPS), 2023
Timo Schick
Jane Dwivedi-Yu
Roberto Dessì
Roberta Raileanu
Maria Lomeli
Luke Zettlemoyer
Nicola Cancedda
Thomas Scialom
SyDa
RALM
373
2,495
0
09 Feb 2023
Large Language Models are Better Reasoners with Self-Verification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Shengping Liu
Bin Sun
Kang Liu
Jun Zhao
ReLM
LRM
360
300
0
19 Dec 2022
ReAct: Synergizing Reasoning and Acting in Language Models
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
1.5K
4,819
0
06 Oct 2022
Decomposed Prompting: A Modular Approach for Solving Complex Tasks
International Conference on Learning Representations (ICLR), 2022
Tushar Khot
H. Trivedi
Matthew Finlayson
Yao Fu
Kyle Richardson
Peter Clark
Ashish Sabharwal
ReLM
LRM
437
572
0
05 Oct 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
2.1K
13,906
0
28 Jan 2022
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw
SSL
677
3,927
0
18 Apr 2021
Selective Question Answering under Domain Shift
Amita Kamath
Robin Jia
Abigail Z. Jacobs
OOD
206
239
0
16 Jun 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.9K
51,003
0
28 May 2020
AmbigQA: Answering Ambiguous Open-domain Questions
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
289
389
0
22 Apr 2020
"None of the Above":Measure Uncertainty in Dialog Response Retrieval
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yulan Feng
Shikib Mehri
M. Eskénazi
Tiancheng Zhao
152
10
0
04 Apr 2020
Reading Wikipedia to Answer Open-Domain Questions
Danqi Chen
Adam Fisch
Jason Weston
Antoine Bordes
RALM
324
2,128
0
31 Mar 2017
1