ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.04515
  4. Cited By

The Latent Space Hypothesis: Toward Universal Medical Representation Learning

4 June 2025
Salil Patel
ArXiv (abs)PDFHTML

Papers citing "The Latent Space Hypothesis: Toward Universal Medical Representation Learning"

50 / 60 papers shown
Title
Don't Just Follow MLLM Plans: Robust and Efficient Planning for Open-world Agents
Don't Just Follow MLLM Plans: Robust and Efficient Planning for Open-world Agents
Seungjoon Lee
Suhwan Kim
Minhyeon Oh
Youngsik Yoon
Jungseul Ok
LLMAG
40
0
0
30 May 2025
GraphicBench: A Planning Benchmark for Graphic Design with Language Agents
GraphicBench: A Planning Benchmark for Graphic Design with Language Agents
Dayeon Ki
Dinesh Manocha
Marine Carpuat
Gang Wu
Puneet Mathur
Viswanathan Swaminathan
LLMAGLM&Ro
83
0
0
15 Apr 2025
UI-TARS: Pioneering Automated GUI Interaction with Native Agents
UI-TARS: Pioneering Automated GUI Interaction with Native Agents
Yujia Qin
Yining Ye
Junjie Fang
Han Wang
Shihao Liang
...
Haifeng Liu
F. Lin
Tao Peng
Xin Liu
Guang Shi
LLMAGLM&Ro
111
69
0
21 Jan 2025
PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
Ruosen Li
Teerth Patel
Xinya Du
LLMAGALM
185
102
0
03 Jan 2025
Optimizing Latent Goal by Learning from Trajectory Preference
Optimizing Latent Goal by Learning from Trajectory Preference
Guangyu Zhao
Kewei Lian
Haowei Lin
Haobo Fu
Qiang Fu
Shaofei Cai
Zihao Wang
Yitao Liang
161
4
0
03 Dec 2024
APT: Architectural Planning and Text-to-Blueprint Construction Using
  Large Language Models for Open-World Agents
APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents
Jun Yu Chen
Tao Gao
LM&RoLLMAG
147
3
0
26 Nov 2024
Agent-as-a-Judge: Evaluate Agents with Agents
Agent-as-a-Judge: Evaluate Agents with Agents
Mingchen Zhuge
Changsheng Zhao
Dylan R. Ashley
Wenyi Wang
Dmitrii Khizbullin
...
Raghuraman Krishnamoorthi
Yuandong Tian
Yangyang Shi
Vikas Chandra
Jürgen Schmidhuber
ELM
146
44
0
14 Oct 2024
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in
  Long-Horizon Tasks
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
Zaijing Li
Yuquan Xie
Rui Shao
Gongwei Chen
Dongmei Jiang
Liqiang Nie
150
30
0
07 Aug 2024
Visual Grounding for Object-Level Generalization in Reinforcement
  Learning
Visual Grounding for Object-Level Generalization in Reinforcement Learning
Haobin Jiang
Zongqing Lu
LM&Ro
81
2
0
04 Aug 2024
Autonomous Improvement of Instruction Following Skills via Foundation
  Models
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
95
14
0
30 Jul 2024
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables
  Open-World Instruction Following Agents
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
Zihao Wang
Shaofei Cai
Zhancun Mu
Haowei Lin
Ceyao Zhang
Xuejie Liu
Qing Li
Hoang Trung-Dung
Xiaojian Ma
Yitao Liang
LM&Ro
99
14
0
27 Jun 2024
OpenVLA: An Open-Source Vision-Language-Action Model
OpenVLA: An Open-Source Vision-Language-Action Model
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
...
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&RoVLM
284
533
0
13 Jun 2024
VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating
  Complex Task Dependencies in Minecraft
VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft
Yubo Dong
Xukun Zhu
Zhengzhe Pan
Linchao Zhu
Yi Yang
86
17
0
09 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktaschel
LRM
106
33
0
06 Jun 2024
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Soroush Nasiriany
Abhiram Maddukuri
Lance Zhang
Adeet Parikh
Aaron Lo
Abhishek Joshi
Ajay Mandlekar
Yuke Zhu
LM&Ro
144
90
0
04 Jun 2024
Luban: Building Open-Ended Creative Agents via Autonomous Embodied
  Verification
Luban: Building Open-Ended Creative Agents via Autonomous Embodied Verification
Yuxuan Guo
Shaohui Peng
Jiaming Guo
Di Huang
Xishan Zhang
...
Zihao Zhang
Zidong Du
Qi Guo
Xingui Hu
Yunji Chen
69
4
0
24 May 2024
Autonomous Evaluation and Refinement of Digital Agents
Autonomous Evaluation and Refinement of Digital Agents
Jiayi Pan
Yichi Zhang
Nicholas Tomlin
Yifei Zhou
Sergey Levine
Alane Suhr
ELM
147
66
0
09 Apr 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination
  for Simulated-World Control
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou
Yiran Qin
Zhen-fei Yin
Yuzhou Huang
Ruimao Zhang
Lu Sheng
Yu Qiao
Jing Shao
LM&RoAI4CE
113
36
0
18 Mar 2024
Scaling Instructable Agents Across Many Simulated Worlds
Scaling Instructable Agents Across Many Simulated Worlds
Sima Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
210
42
0
13 Mar 2024
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in
  Long-Horizon Generation
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Zihao Wang
Hoang Trung-Dung
Haowei Lin
Jiaqi Li
Xiaojian Ma
Yitao Liang
ReLMRALMLRM
154
49
0
08 Mar 2024
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Shaoteng Liu
Haoqi Yuan
Minda Hu
Yanwei Li
Yukang Chen
Shu Liu
Zongqing Lu
Jiaya Jia
LLMAG
111
17
0
29 Feb 2024
Computational Experiments Meet Large Language Model Based Agents: A
  Survey and Perspective
Computational Experiments Meet Large Language Model Based Agents: A Survey and Perspective
Qun Ma
Xiao Xue
Deyu Zhou
Xiangning Yu
Donghua Liu
...
Yifan Shen
Peilin Ji
Juanjuan Li
Gang Wang
Wanpeng Ma
AI4CELM&RoLLMAG
84
9
0
01 Feb 2024
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual
  Perception
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
Junyang Wang
Haiyang Xu
Jiabo Ye
Mingshi Yan
Weizhou Shen
Ji Zhang
Fei Huang
Jitao Sang
138
129
0
29 Jan 2024
Holodeck: Language Guided Generation of 3D Embodied AI Environments
Holodeck: Language Guided Generation of 3D Embodied AI Environments
Yue Yang
Fan-Yun Sun
Luca Weihs
Eli VanderBilt
Alvaro Herrasti
...
Lingjie Liu
Chris Callison-Burch
Mark Yatskar
Aniruddha Kembhavi
Christopher Clark
LM&Ro
131
92
0
14 Dec 2023
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active
  Perception
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
Yiran Qin
Enshen Zhou
Qichang Liu
Zhen-fei Yin
Lu Sheng
Ruimao Zhang
Yu Qiao
Jing Shao
LM&Ro
122
50
0
12 Dec 2023
Creative Agents: Empowering Agents with Imagination for Creative Tasks
Creative Agents: Empowering Agents with Imagination for Creative Tasks
Chi Zhang
Penglin Cai
Yuhui Fu
Haoqi Yuan
Zongqing Lu
LM&RoLLMAG
129
24
0
05 Dec 2023
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for
  Training and Benchmarking Agents that Solve Fuzzy Tasks
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
Rohin Shah
75
7
0
05 Dec 2023
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in
  Open Worlds
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds
Sipeng Zheng
Jiazheng Liu
Yicheng Feng
Zongqing Lu
89
34
0
20 Oct 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Jingkang Yang
Yuhao Dong
Shuai Liu
Yue Liu
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
89
49
0
12 Oct 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
159
27
0
12 Oct 2023
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Carlos E. Jimenez
John Yang
Alexander Wettig
Shunyu Yao
Kexin Pei
Ofir Press
Karthik Narasimhan
ELM
138
647
0
10 Oct 2023
Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon
  Decision-Making in Embodied AI
Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI
Emily Jin
Jiaheng Hu
Zhuoyi Huang
Ruohan Zhang
Jiajun Wu
Fei-Fei Li
Roberto Martín-Martín
LM&Ro
75
11
0
03 Oct 2023
GenSim: Generating Robotic Simulation Tasks via Large Language Models
GenSim: Generating Robotic Simulation Tasks via Large Language Models
Lirui Wang
Yiyang Ling
Zhecheng Yuan
Mohit Shridhar
Chen Bao
Yuzhe Qin
Bailin Wang
Huazhe Xu
Xiaolong Wang
LM&Ro
115
84
0
02 Oct 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic
  Control
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&RoLRM
232
1,293
0
28 Jul 2023
WebArena: A Realistic Web Environment for Building Autonomous Agents
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou
Frank F. Xu
Hao Zhu
Xuhui Zhou
Robert Lo
...
Tianyue Ou
Yonatan Bisk
Daniel Fried
Uri Alon
Graham Neubig
LLMAG
212
496
0
25 Jul 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALMOSLMELM
551
4,453
0
09 Jun 2023
Benchmarking Foundation Models with Language-Model-as-an-Examiner
Benchmarking Foundation Models with Language-Model-as-an-Examiner
Yushi Bai
Jiahao Ying
Yixin Cao
Xin Lv
Yuze He
...
Yijia Xiao
Haozhe Lyu
Jiayin Zhang
Juanzi Li
Lei Hou
ALMELM
107
149
0
07 Jun 2023
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Shalev Lifshitz
Keiran Paster
Harris Chan
Jimmy Ba
Sheila A. McIlraith
LM&Ro
137
76
0
01 Jun 2023
Ghost in the Minecraft: Generally Capable Agents for Open-World
  Environments via Large Language Models with Text-based Knowledge and Memory
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Xizhou Zhu
Yuntao Chen
Hao Tian
Chenxin Tao
Weijie Su
...
Lewei Lu
Xiaogang Wang
Yu Qiao
Zhaoxiang Zhang
Jifeng Dai
LLMAGLM&Ro
110
240
0
25 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&RoSyDa
171
843
0
25 May 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
1.6K
14,828
0
15 Mar 2023
Describe, Explain, Plan and Select: Interactive Planning with Large
  Language Models Enables Open-World Multi-Task Agents
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
Zihao Wang
Shaofei Cai
Guanzhou Chen
Hoang Trung-Dung
Xiaojian Ma
Yitao Liang
LM&RoLLMAG
125
340
0
03 Feb 2023
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
153
304
0
23 Jun 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&RoLLMAGAI4CE
215
827
0
12 May 2022
Benchmarking the Spectrum of Agent Capabilities
Benchmarking the Spectrum of Agent Capabilities
Danijar Hafner
ELM
105
140
0
14 Sep 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
124
190
0
27 Jul 2021
The MineRL BASALT Competition on Learning from Human Feedback
The MineRL BASALT Competition on Learning from Human Feedback
Rohin Shah
Cody Wild
Steven H. Wang
Neel Alex
Brandon Houghton
...
Stephanie Milani
Nicholay Topin
Pieter Abbeel
Stuart J. Russell
Anca Dragan
93
32
0
05 Jul 2021
Open-world Machine Learning: Applications, Challenges, and Opportunities
Open-world Machine Learning: Applications, Challenges, and Opportunities
Jitendra Parmar
S. Chouhan
Vaskar Raychoudhury
S. Rathore
OffRL
106
96
0
27 May 2021
High precision control and deep learning-based corn stand counting
  algorithms for agricultural robot
High precision control and deep learning-based corn stand counting algorithms for agricultural robot
Zhong-Zhong Zhang
E. Kayacan
B. Thompson
Girish Chowdhary
60
59
0
21 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.1K
30,032
0
26 Feb 2021
12
Next