ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.14878
  4. Cited By
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

22 December 2023
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
Jingxuan Chen
Khyati Khandelwal
James Doran
Xidong Feng
Jiacheng Liu
Zheng Xiong
Yicheng Luo
Jianye Hao
Kun Shao
Haitham Bou-Ammar
Jun Wang
ArXivPDFHTML

Papers citing "Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning"

19 / 19 papers shown
Title
Uncertainty in Action: Confidence Elicitation in Embodied Agents
Tianjiao Yu
Vedant Shah
Muntasir Wahed
Kiet A. Nguyen
Adheesh Sunil Juvekar
Tal August
Ismini Lourentzou
40
0
0
13 Mar 2025
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
Lakshmi Nair
Ian Trase
Mark Kim
AIFin
LRM
AI4CE
39
1
0
18 Feb 2025
A Tutorial on LLM Reasoning: Relevant Methods behind ChatGPT o1
A Tutorial on LLM Reasoning: Relevant Methods behind ChatGPT o1
Jun Wang
LRM
KELM
44
1
0
15 Feb 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
H. Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
W. Yu
Dong Yu
LLMAG
54
3
0
03 Jan 2025
DynaSaur: Large Language Agents Beyond Predefined Actions
DynaSaur: Large Language Agents Beyond Predefined Actions
Dang Nguyen
Viet Dac Lai
Seunghyun Yoon
Ryan Rossi
Handong Zhao
...
Nedim Lipka
Yu-Chiang Frank Wang
Trung H. Bui
Franck Dernoncourt
Tianyi Zhou
LM&Ro
ELM
LLMAG
41
6
0
04 Nov 2024
Agentic Information Retrieval
Agentic Information Retrieval
Weinan Zhang
Junwei Liao
Ning Li
Kounianhua Du
Jianghao Lin
AIFin
41
2
0
13 Oct 2024
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
Yuanzhao Zhai
Tingkai Yang
Kele Xu
Feng Dawei
Cheng Yang
Bo Ding
Huaimin Wang
58
9
0
14 Sep 2024
AgentGym: Evolving Large Language Model-based Agents across Diverse
  Environments
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Zhiheng Xi
Yiwen Ding
Wenxiang Chen
Boyang Hong
Honglin Guo
...
Qi Zhang
Xipeng Qiu
Xuanjing Huang
Zuxuan Wu
Yu-Gang Jiang
LLMAG
LM&Ro
38
29
0
06 Jun 2024
Reinforcing Language Agents via Policy Optimization with Action
  Decomposition
Reinforcing Language Agents via Policy Optimization with Action Decomposition
Muning Wen
Ziyu Wan
Weinan Zhang
Jun Wang
Ying Wen
33
7
0
23 May 2024
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models
Xuechen Liang
Meiling Tao
Yinghui Xia
Yiting Xie
Jun Wang
JingSong Yang
LLMAG
21
12
0
02 Apr 2024
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented,
  Diversified, and Generalist
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist
Wentao Zhang
Lingxuan Zhao
Haochong Xia
Shuo Sun
Jiaze Sun
...
Yilei Zhao
Xinyu Cai
Longtao Zheng
Xinrun Wang
Bo An
AIFin
31
33
0
28 Feb 2024
DS-Agent: Automated Data Science by Empowering Large Language Models
  with Case-Based Reasoning
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning
Siyuan Guo
Cheng Deng
Ying Wen
Hechang Chen
Yi-Ju Chang
Jun Wang
ELM
LM&Ro
LLMAG
AI4CE
37
27
0
27 Feb 2024
Natural Language Reinforcement Learning
Natural Language Reinforcement Learning
Xidong Feng
Ziyu Wan
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
35
3
0
11 Feb 2024
A call for embodied AI
A call for embodied AI
Giuseppe Paolo
Jonas Gonzalez-Billandon
Balázs Kégl
LM&Ro
19
9
0
06 Feb 2024
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for
  Reasoning Problems
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems
Kaya Stechly
Matthew Marquez
Subbarao Kambhampati
LRM
155
84
0
19 Oct 2023
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
233
2,470
0
06 Oct 2022
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,217
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,402
0
28 Jan 2022
1