Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2504.06821
Cited By
v1
v2 (latest)
Inducing Programmatic Skills for Agentic Tasks
9 April 2025
Zora Z. Wang
Apurva Gandhi
Graham Neubig
Daniel Fried
LLMAG
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (39★)
Papers citing
"Inducing Programmatic Skills for Agentic Tasks"
16 / 16 papers shown
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
Genglin Liu
Shijie Geng
Sha Li
Hejie Cui
Sarah Zhang
Xin Liu
Tianyi Liu
CLL
790
5
0
17 Nov 2025
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
219
1
0
20 Oct 2025
PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction
Simon Yu
Gang Li
Weiyan Shi
Peng Qi
LLMAG
228
5
0
17 Oct 2025
Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Michael Y. Hu
Benjamin Van Durme
Jacob Andreas
Harsh Jhamtani
LLMAG
168
3
0
11 Oct 2025
ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Murong Yue
Zhiwei Liu
Liangwei Yang
Jianguo Zhang
Zuxin Liu
...
Ziyu Yao
Silvio Savarese
Caiming Xiong
Shelby Heinecke
Huan Wang
LLMAG
LRM
151
0
0
09 Oct 2025
WALT: Web Agents that Learn Tools
Viraj Prabhu
Yutong Dai
M. Fernández
Jing Gu
Krithika Ramakrishnan
...
Silvio Savarese
Caiming Xiong
Junnan Li
Zeyuan Chen
Ran Xu
LLMAG
CLL
KELM
168
5
0
01 Oct 2025
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
Siru Ouyang
Jun Yan
I-Hung Hsu
Yanfei Chen
Ke Jiang
...
Mahsan Rofouei
Hangfei Lin
Jiawei Han
Chen-Yu Lee
Tomas Pfister
LLMAG
CLL
LRM
248
72
0
29 Sep 2025
MAS-Bench: A Unified Benchmark for Shortcut-Augmented Hybrid Mobile GUI Agents
P. Zhao
Guangyi Liu
Yaozhen Liang
Weiqing He
Z. Lu
...
Liang Liu
Yong Liu
Kexin Zhang
Liang Liu
Yong Liu
LLMAG
ELM
151
4
0
08 Sep 2025
A Compute-Matched Re-Evaluation of TroVE on MATH
Tobias Sesterhenn
Ian Berlot-Attwell
Janis Zenkner
Christian Bartelt
267
1
0
16 Jul 2025
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Junhong Shen
Hao Bai
Lunjun Zhang
Yifei Zhou
Amrith Rajagopal Setlur
...
Diego Caples
Nan Jiang
Tong Zhang
Ameet Talwalkar
Aviral Kumar
LLMAG
LRM
365
28
0
09 Jun 2025
Go-Browse: Training Web Agents with Structured Exploration
Apurva Gandhi
Graham Neubig
LLMAG
304
17
0
04 Jun 2025
DeepShop: A Benchmark for Deep Research Shopping Agents
Yougang Lyu
Xiaoyu Zhang
Lingyong Yan
Maarten de Rijke
Zhaochun Ren
Xiuying Chen
484
25
0
03 Jun 2025
RefTool: Reference-Guided Tool Creation for Knowledge-Intensive Reasoning
Xiao-Yang Liu
Da Yin
Zirui Wu
Yansong Feng
KELM
LRM
291
1
0
27 May 2025
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Zehan Qi
Xiao-Chang Liu
Iat Long Iong
Hanyu Lai
Xingwu Sun
...
Shuntian Yao
Tianjie Zhang
Wei Xu
J. Tang
Yuxiao Dong
675
152
0
28 Jan 2025
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Frank F. Xu
Yufan Song
Boxuan Li
Yuxuan Tang
Kritanjali Jain
...
Wayne Chi
Lawrence Jang
Yiqing Xie
Shuyan Zhou
Graham Neubig
ELM
931
150
0
18 Dec 2024
The BrowserGym Ecosystem for Web Agent Research
Thibault Le Sellier De Chezelles
Maxime Gasse
Alexandre Lacoste
Alexandre Drouin
Massimo Caccia
...
Siva Reddy
Quentin Cappart
Graham Neubig
Ruslan Salakhutdinov
Nicolas Chapados
LLMAG
2.1K
87
0
06 Dec 2024
1
Page 1 of 1