ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.04549
  4. Cited By
Vision Language Models are In-Context Value Learners

Vision Language Models are In-Context Value Learners

International Conference on Learning Representations (ICLR), 2024
7 November 2024
Yecheng Jason Ma
Joey Hejna
Ayzaan Wahid
Chuyuan Fu
Dhruv Shah
Jacky Liang
Zhuo Xu
Sean Kirmani
Peng Xu
Danny Driess
Ted Xiao
Jonathan Tompson
Osbert Bastani
Dinesh Jayaraman
Wenhao Yu
Tingnan Zhang
Dorsa Sadigh
Fei Xia
ArXiv (abs)PDFHTMLGithub

Papers citing "Vision Language Models are In-Context Value Learners"

25 / 25 papers shown
Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations
Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations
Chancharik Mitra
Yusen Luo
Raj Saravanan
Dantong Niu
Anirudh Pai
Jesse Thomason
Trevor Darrell
Abrar Anwar
Deva Ramanan
Roei Herzig
95
2
0
27 Nov 2025
T2T-VICL: Unlocking the Boundaries of Cross-Task Visual In-Context Learning via Implicit Text-Driven VLMs
T2T-VICL: Unlocking the Boundaries of Cross-Task Visual In-Context Learning via Implicit Text-Driven VLMs
Shao-Jun Xia
Huixin Zhang
Zhengzhong Tu
MLLMVLM
476
0
0
20 Nov 2025
$π^{*}_{0.6}$: a VLA That Learns From Experience
π0.6∗π^{*}_{0.6}π0.6∗​: a VLA That Learns From Experience
Physical Intelligence
Ali Amin
Raichelle Aniceto
Ashwin Balakrishna
Kevin Black
...
Blake Williams
Sukwon Yoo
Lili Yu
Ury Zhilinsky
Zhiyuan Zhou
OffRLVLM
1.3K
100
0
18 Nov 2025
Learning Affordances at Inference-Time for Vision-Language-Action Models
Learning Affordances at Inference-Time for Vision-Language-Action Models
Ameesh Shah
William Chen
Adwait Godbole
Federico Mora
Sanjit A. Seshia
Sergey Levine
230
1
0
22 Oct 2025
TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance
TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance
Yuyang Liu
Chuan Wen
Yihang Hu
Dinesh Jayaraman
Yang Gao
197
3
0
30 Sep 2025
SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation
SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation
Qianzhong Chen
Justin Yu
Mac Schwager
Pieter Abbeel
Fred Shentu
Philipp Wu
468
13
0
29 Sep 2025
PhysiAgent: An Embodied Agent Framework in Physical World
PhysiAgent: An Embodied Agent Framework in Physical World
Zhihao Wang
Jianxiong Li
Jinliang Zheng
Wencong Zhang
Dongxiu Liu
Yinan Zheng
Haoyi Niu
Junzhi Yu
Xianyuan Zhan
LM&Ro
262
2
0
29 Sep 2025
VLBiMan: Vision-Language Anchored One-Shot Demonstration Enables Generalizable Bimanual Robotic Manipulation
VLBiMan: Vision-Language Anchored One-Shot Demonstration Enables Generalizable Bimanual Robotic Manipulation
Huayi Zhou
Kui Jia
LM&Ro
276
0
0
26 Sep 2025
VLA-Reasoner: Empowering Vision-Language-Action Models with Reasoning via Online Monte Carlo Tree Search
VLA-Reasoner: Empowering Vision-Language-Action Models with Reasoning via Online Monte Carlo Tree Search
Wenkai Guo
Guanxing Lu
Haoyuan Deng
Zhenyu Wu
Yansong Tang
Ziwei Wang
LRM
235
5
0
26 Sep 2025
OpenGVL -- Benchmarking Visual Temporal Progress for Data Curation
OpenGVL -- Benchmarking Visual Temporal Progress for Data Curation
Paweł Budzianowski
Emilia Wisnios
Gracjan Góral
Igor Kulakov
Viktor Petrenko
Krzysztof Walas
Krzysztof Walas
245
2
0
22 Sep 2025
A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
Shaopeng Zhai
Qi Zhang
Tianyi Zhang
Fuxian Huang
Haoran Zhang
Ming Zhou
Shengzhe Zhang
Litao Liu
Sixu Lin
Jiangmiao Pang
OffRL
293
32
0
19 Sep 2025
Self-Improving Embodied Foundation Models
Self-Improving Embodied Foundation Models
Seyed Kamyar Seyed Ghasemipour
Ayzaan Wahid
Jonathan Tompson
Pannag R Sanketi
Igor Mordatch
LM&RoLRM
195
18
0
18 Sep 2025
Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search
Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search
Cyrus Neary
Omar G. Younis
Artur Kuramshin
Ozgur Aslan
Glen Berseth
190
7
0
17 Aug 2025
RICL: Adding In-Context Adaptability to Pre-Trained Vision-Language-Action Models
RICL: Adding In-Context Adaptability to Pre-Trained Vision-Language-Action Models
Kaustubh Sridhar
Souradeep Dutta
Dinesh Jayaraman
Insup Lee
LM&RoVLM
176
15
0
04 Aug 2025
ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks
ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks
Philip Schroeder
Ondrej Biza
Thomas Weng
Hongyin Luo
James Glass
LM&RoLRM
215
2
0
03 Aug 2025
Reinforcement Learning for Flow-Matching Policies
Reinforcement Learning for Flow-Matching Policies
Samuel Pfrommer
Yixiao Huang
Somayeh Sojoudi
214
9
0
20 Jul 2025
Scaffolding Dexterous Manipulation with Vision-Language Models
Scaffolding Dexterous Manipulation with Vision-Language Models
Vincent de Bakker
Joey Hejna
Tyler Ga Wei Lum
Onur Celik
Aleksandar Taranovic
Denis Blessing
Gerhard Neumann
Jeannette Bohg
Dorsa Sadigh
287
9
0
24 Jun 2025
VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models
VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models
Christos Ziakas
Alessandra Russo
TTA
374
0
0
11 Jun 2025
Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Wenhao Wang
Jianheng Song
Chiming Liu
Jiayao Ma
Siyuan Feng
...
Modi Shi
Xindong He
Guanghui Ren
Yang Yang
Maoqing Yao
OffRL
345
8
0
24 May 2025
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
Jiaming Zhou
Ke Ye
Jiayi Liu
Teli Ma
Zifang Wang
Ronghe Qiu
Kun-Yu Lin
Zhilin Zhao
Junwei Liang
487
21
0
21 May 2025
ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations
ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations
Jiahui Zhang
Yusen Luo
Abrar Anwar
Sumedh Anand Sontakke
Joseph J Lim
Jesse Thomason
Erdem Biyik
Jesse Zhang
OffRLLM&Ro
492
35
0
16 May 2025
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsRobotics (RAS), 2025
Qingwen Bu
Yanting Yang
Jisong Cai
Shenyuan Gao
Guanghui Ren
Maoqing Yao
Ping Luo
Hongyang Li
985
207
0
09 May 2025
LuciBot: Automated Robot Policy Learning from Generated Videos
LuciBot: Automated Robot Policy Learning from Generated Videos
Xiaowen Qiu
Yian Wang
Jiting Cai
Zhehuan Chen
Chunru Lin
Tsun-Hsuan Wang
Chuang Gan
LM&RoVGen
432
4
0
12 Mar 2025
TRACE: A Self-Improving Framework for Robot Behavior Forecasting with Vision-Language Models
TRACE: A Self-Improving Framework for Robot Behavior Forecasting with Vision-Language Models
Gokul Puthumanaillam
Paulo Padrao
Jose Fuentes
Pranay Thangeda
William E. Schafer
Jae Hyuk Song
Karan Jagdale
Leonardo Bobadilla
Melkior Ornik
229
5
0
02 Mar 2025
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Alexander Khazatsky
Karl Pertsch
Suraj Nair
Ashwin Balakrishna
Sudeep Dasari
...
Thomas Kollar
Sergey Levine
Chelsea Finn
Sergey Levine
Chelsea Finn
742
647
0
19 Mar 2024
1
Page 1 of 1