Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2505.17612
Cited By
v1
v2 (latest)
Distilling LLM Agent into Small Models with Retrieval and Code Tools
23 May 2025
Minki Kang
Jongwon Jeong
Seanie Lee
Jaewoong Cho
Sung Ju Hwang
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (80 upvotes)
Github (171★)
Papers citing
"Distilling LLM Agent into Small Models with Retrieval and Code Tools"
13 / 63 papers shown
Title
ReAct: Synergizing Reasoning and Acting in Language Models
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
1.7K
4,956
0
06 Oct 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Neural Information Processing Systems (NeurIPS), 2022
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAG
LM&Ro
747
736
0
04 Jul 2022
Large Language Models are Zero-Shot Reasoners
Neural Information Processing Systems (NeurIPS), 2022
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
1.3K
5,946
0
24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
International Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
1.9K
5,363
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
2.2K
14,202
0
28 Jan 2022
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
422
526
0
02 Aug 2021
LoRA: Low-Rank Adaptation of Large Language Models
International Conference on Learning Representations (ICLR), 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
1.5K
15,017
0
17 Jun 2021
Measuring Mathematical Problem Solving With the MATH Dataset
Dan Hendrycks
Collin Burns
Saurav Kadavath
Akul Arora
Steven Basart
Eric Tang
Basel Alomair
Jacob Steinhardt
ReLM
FaML
859
3,811
0
05 Mar 2021
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps
International Conference on Computational Linguistics (COLING), 2020
Xanh Ho
A. Nguyen
Saku Sugawara
Akiko Aizawa
RALM
LRM
394
831
0
02 Nov 2020
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Mohit Shridhar
Xingdi Yuan
Marc-Alexandre Côté
Yonatan Bisk
Adam Trischler
Matthew J. Hausknecht
LM&Ro
LLMAG
395
618
0
08 Oct 2020
The Curious Case of Neural Text Degeneration
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
432
3,706
0
22 Apr 2019
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
871
3,522
0
25 Sep 2018
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
789
22,224
0
09 Mar 2015
Previous
1
2