Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?

18 June 2024
Seungbin Yang
Yujin Baek
Taehee Kim
Jaegul Choo

Papers citing "Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?"

44 / 44 papers shown
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
Wei He
Yueqing Sun
Hongyan Hao
Xueyuan Hao
Zhikang Xia
...
X. Su
Xiaodong Cai
Xunliang Cai
Yu Yang
Yunke Zhao
114
0
0
30 Sep 2025
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models
S. Jung
Donghun Lee
Shinbok Lee
Gaeun Seo
Daniel Lee
Byeongil Ko
Junrae Cho
Kihyun Kim
EungGyun Kim
M. Shin
267
2
0
02 Apr 2025
InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context
Bryan L. M. de Oliveira
Luana G. B. Martins
Bruno Brandão
Luckeciano C. Melo
ELM
794
3
0
17 Feb 2025
Learning Evolving Tools for Large Language Models
International Conference on Learning Representations (ICLR), 2024
Guoxin Chen
Zhong Zhang
Xin Cong
Fangda Guo
Yesai Wu
Yankai Lin
Wenzheng Feng
Yasheng Wang
KELM
471
5
0
09 Oct 2024
Qwen2 Technical Report
An Yang
Baosong Yang
Binyuan Hui
Jian Xu
Bowen Yu
...
Yuqiong Liu
Zeyu Cui
Zhenru Zhang
Zhifang Guo
Zhi-Wei Fan
OSLM, VLM, MU
480
1,604
0
15 Jul 2024
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
Zhiting Fan
Ruizhe Chen
Ruiling Xu
Zuozhu Liu
KELM
222
29
0
14 Jul 2024
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
Kangyun Ning
Yisong Su
Xueqiang Lv
Yuanzhe Zhang
Jian Liu
Kang Liu
Jinan Xu
ELM, LLMAG
104
6
0
02 Jul 2024
Tools Fail: Detecting Silent Errors in Faulty Tools
Jimin Sun
So Yeon Min
Yingshan Chang
Yonatan Bisk
255
14
0
27 Jun 2024
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
Tong Zhang
Peixin Qin
Yang Deng
Chen Huang
Wenqiang Lei
Junhong Liu
Dingnan Jin
Hongru Liang
Tat-Seng Chua
172
22
0
20 May 2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
...
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM, ALM
481
1,817
0
22 Apr 2024
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
Soyeong Jeong
Jinheon Baek
Sukmin Cho
Sung Ju Hwang
Jong C. Park
RALM
274
304
0
21 Mar 2024
What Are Tools Anyway? A Survey from the Language Model Perspective
Zhiruo Wang
Zhoujun Cheng
Hao Zhu
Daniel Fried
Graham Neubig
258
46
0
18 Mar 2024
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhicheng Guo
Sijie Cheng
Hao Wang
Shihao Liang
Yujia Qin
Peng Li
Zhiyuan Liu
Maosong Sun
Yang Liu
ELM
345
62
0
12 Mar 2024
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments
Yu Gu
Yiheng Shu
Hao Yu
Xiao Liu
Yuxiao Dong
Jie Tang
Jayanth Srinivasa
Hugo Latapie
Yu-Chuan Su
KELM, LLMAG
270
45
0
22 Feb 2024
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
Junjie Ye
Sixian Li
Guanyu Li
Jessica Fan
Songyang Gao
Yilong Wu
Tao Gui
Tao Gui
Xuanjing Huang
LLMAG
322
46
0
16 Feb 2024
Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
Shijue Huang
Wanjun Zhong
Jianqiao Lu
Qi Zhu
Jiahui Gao
...
Yasheng Wang
Lifeng Shang
Xin Jiang
Ruifeng Xu
Qun Liu
LLMAG
163
63
0
30 Jan 2024
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Junjie Ye
Yilong Wu
Songyang Gao
Jessica Fan
Sixian Li
Guanyu Li
Xiaoran Fan
Tao Gui
Tao Gui
Xuanjing Huang
AAML
183
30
0
16 Jan 2024
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Siyu Yuan
Kaitao Song
Jiangjie Chen
Xu Tan
Yongliang Shen
Ren Kan
Dongsheng Li
Deqing Yang
LLMAG
198
103
0
11 Jan 2024
Mistral 7B
Albert Q. Jiang
Alexandre Sablayrolles
A. Mensch
Chris Bamford
Devendra Singh Chaplot
...
Teven Le Scao
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE, LRM
330
2,857
0
10 Oct 2023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
International Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Haoran Pan
Yujiu Yang
Shiyu Huang
Nan Duan
Weizhu Chen
LRM, AI4CE, LLMAG
302
247
0
29 Sep 2023
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
International Conference on Learning Representations (ICLR), 2023
Yujia Qin
Shi Liang
Yining Ye
Kunlun Zhu
Lan Yan
...
Jie Zhou
Mark B. Gerstein
Dahai Li
Zhiyuan Liu
Maosong Sun
CLL, ALM, LLMAG, ELM, LM&MA
500
1,038
0
31 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH, ALM
5.3K
14,855
0
18 Jul 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Neural Information Processing Systems (NeurIPS), 2023
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM, OSLM, ELM
2.2K
6,246
0
09 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
204
247
0
04 Jun 2023
Large Language Models as Tool Makers
International Conference on Learning Representations (ICLR), 2023
Tianle Cai
Xuezhi Wang
Tengyu Ma
Xinyun Chen
Denny Zhou
LLMAG
220
249
0
26 May 2023
MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tatsuro Inaba
Hirokazu Kiyomaru
Fei Cheng
Sadao Kurohashi
KELM, LRM
222
36
0
26 May 2023
On the Tool Manipulation Capability of Open-source Large Language Models
Qiantong Xu
Fenglu Hong
Yangqiu Song
Changran Hu
Zheng Chen
Jian Zhang
LLMAG
196
94
0
25 May 2023
Gorilla: Large Language Model Connected with Massive APIs
Neural Information Processing Systems (NeurIPS), 2023
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELM, CLL, ALM, SyDa
336
817
0
24 May 2023
Selectively Answering Ambiguous Questions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jeremy R. Cole
Michael J.Q. Zhang
D. Gillick
Julian Martin Eisenschlos
Bhuwan Dhingra
Jacob Eisenstein
UQLM
381
46
0
24 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
International Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM, LRM
337
550
0
19 May 2023
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
Neural Information Processing Systems (NeurIPS), 2023
Shibo Hao
Tianyang Liu
Zhen Wang
Zhiting Hu
RALM, LLMAG
369
230
0
19 May 2023
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Minghao Li
Yingxiu Zhao
Yu Bowen
Feifan Song
Hangyu Li
Haiyang Yu
Zhoujun Li
Fei Huang
Yongbin Li
ELM, RALM, CLL
205
274
0
14 Apr 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG, MLLM
3.4K
20,007
0
15 Mar 2023
Toolformer: Language Models Can Teach Themselves to Use Tools
Neural Information Processing Systems (NeurIPS), 2023
Timo Schick
Jane Dwivedi-Yu
Roberto Dessì
Roberta Raileanu
Maria Lomeli
Luke Zettlemoyer
Nicola Cancedda
Thomas Scialom
SyDa, RALM
373
2,495
0
09 Feb 2023
Large Language Models are Better Reasoners with Self-Verification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Shengping Liu
Bin Sun
Kang Liu
Jun Zhao
ReLM, LRM
360
300
0
19 Dec 2022
ReAct: Synergizing Reasoning and Acting in Language Models
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG, ReLM, LRM
1.5K
4,819
0
06 Oct 2022
Decomposed Prompting: A Modular Approach for Solving Complex Tasks
International Conference on Learning Representations (ICLR), 2022
Tushar Khot
H. Trivedi
Matthew Finlayson
Yao Fu
Kyle Richardson
Peter Clark
Ashish Sabharwal
ReLM, LRM
437
572
0
05 Oct 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro, LRM, AI4CE, ReLM
2.1K
13,906
0
28 Jan 2022
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw, SSL
677
3,927
0
18 Apr 2021
Selective Question Answering under Domain Shift
Amita Kamath
Robin Jia
Abigail Z. Jacobs
OOD
206
239
0
16 Jun 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.9K
51,003
0
28 May 2020
AmbigQA: Answering Ambiguous Open-domain Questions
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
289
389
0
22 Apr 2020
"None of the Above": Measure Uncertainty in Dialog Response Retrieval
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yulan Feng
Shikib Mehri
M. Eskénazi
Tiancheng Zhao
152
10
0
04 Apr 2020
Reading Wikipedia to Answer Open-Domain Questions
Danqi Chen
Adam Fisch
Jason Weston
Antoine Bordes
RALM
324
2,128
0
31 Mar 2017