arXiv: 2310.05155

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model

8 October 2023
Cheng Qian
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
    LRM

Papers citing "Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model"

14 / 14 papers shown
OTC: Optimal Tool Calls via Reinforcement Learning
Hongru Wang
Cheng Qian
Wanjun Zhong
X. Chen
Jiahao Qiu
Shijue Huang
Bowen Jin
Mengdi Wang
Kam-Fai Wong
Heng Ji
OffRL
LRM
31
0
0
21 Apr 2025
ToolRL: Reward is All Tool Learning Needs
Cheng Qian
Emre Can Acikgoz
Qi He
Hongru Wang
X. Chen
Dilek Hakkani-Tür
Gökhan Tür
Heng Ji
OffRL
LRM
25
3
0
16 Apr 2025
Learning to Generate Structured Output with Schema Reinforcement Learning
Y. Lu
Haolun Li
Xin Cong
Zhong Zhang
Yesai Wu
Yankai Lin
Zhiyuan Liu
Fangming Liu
Maosong Sun
39
0
0
26 Feb 2025
SMART: Self-Aware Agent for Tool Overuse Mitigation
Cheng Qian
Emre Can Acikgoz
H. Wang
X. Chen
Avirup Sil
Dilek Hakkani-Tür
Gökhan Tür
Heng Ji
LLMAG
KELM
LRM
57
4
0
17 Feb 2025
EscapeBench: Pushing Language Models to Think Outside the Box
Cheng Qian
Peixuan Han
Qinyu Luo
Bingxiang He
X. Chen
...
Jiarui Yao
Xiaocheng Yang
Denghui Zhang
Yunzhu Li
Heng Ji
LLMAG
LRM
80
2
0
18 Dec 2024
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Y. Lu
Shenzhi Yang
Cheng Qian
Guirong Chen
Qinyu Luo
...
Weiwen Liu
Yasheng Wang
Zhiyuan Liu
Fangming Liu
Maosong Sun
LLMAG
16
3
0
16 Oct 2024
Tool Learning with Large Language Models: A Survey
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jun Xu
Jirong Wen
LLMAG
31
77
0
28 May 2024
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents
Cheng Qian
Bingxiang He
Zhuang Zhong
Jia Deng
Yujia Qin
...
Zhong Zhang
Jie Zhou
Yankai Lin
Zhiyuan Liu
Maosong Sun
12
27
0
14 Feb 2024
Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution
Cheng Qian
Shihao Liang
Yujia Qin
Yining Ye
Xin Cong
Yankai Lin
Yesai Wu
Zhiyuan Liu
Maosong Sun
LLMAG
11
6
0
25 Jan 2024
A Study on Training and Developing Large Language Models for Behavior Tree Generation
Fu Li
Xueying Wang
Bin Li
Yunlong Wu
Yanzhen Wang
Xiaodong Yi
11
4
0
16 Jan 2024
Instruction Tuning with GPT-4
Baolin Peng
Chunyuan Li
Pengcheng He
Michel Galley
Jianfeng Gao
SyDa
ALM
LM&MA
154
576
0
06 Apr 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
264
1,798
0
14 Dec 2020