Vision Language Models are In-Context Value Learners

International Conference on Learning Representations (ICLR), 2024

7 November 2024

ArXiv (abs)PDF HTML Github

Papers citing "Vision Language Models are In-Context Value Learners"

25 / 25 papers shown

Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations

27 Nov 2025

T2T-VICL: Unlocking the Boundaries of Cross-Task Visual In-Context Learning via Implicit Text-Driven VLMs

476

20 Nov 2025

$$π^{*}_{0.6}$: a VLA That Learns From Experience$

π^{*}_{0.6}

: a VLA That Learns From Experience

Physical Intelligence

...

1.3K

100

18 Nov 2025

Learning Affordances at Inference-Time for Vision-Language-Action Models

230

22 Oct 2025

TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance

197

30 Sep 2025

SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation

468

29 Sep 2025

PhysiAgent: An Embodied Agent Framework in Physical World

262

29 Sep 2025

VLBiMan: Vision-Language Anchored One-Shot Demonstration Enables Generalizable Bimanual Robotic Manipulation

Huayi Zhou

Kui Jia

LM&Ro

276

26 Sep 2025

VLA-Reasoner: Empowering Vision-Language-Action Models with Reasoning via Online Monte Carlo Tree Search

235

26 Sep 2025

OpenGVL -- Benchmarking Visual Temporal Progress for Data Curation

245

22 Sep 2025

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

293

19 Sep 2025

Self-Improving Embodied Foundation Models

Seyed Kamyar Seyed Ghasemipour

195

18 Sep 2025

Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search

190

17 Aug 2025

RICL: Adding In-Context Adaptability to Pre-Trained Vision-Language-Action Models

176

04 Aug 2025

ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks

215

03 Aug 2025

Reinforcement Learning for Flow-Matching Policies

Samuel Pfrommer

Yixiao Huang

Somayeh Sojoudi

214

20 Jul 2025

Scaffolding Dexterous Manipulation with Vision-Language Models

287

24 Jun 2025

VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models

Christos Ziakas

Alessandra Russo

TTA

374

11 Jun 2025

Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance

...

345

24 May 2025

Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization

487

21 May 2025

ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations

Jiahui Zhang

Yusen Luo

Abrar Anwar

Sumedh Anand Sontakke

492

16 May 2025

UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsRobotics (RAS), 2025

985

207

09 May 2025

LuciBot: Automated Robot Policy Learning from Generated Videos

432

12 Mar 2025

TRACE: A Self-Improving Framework for Robot Behavior Forecasting with Vision-Language Models

229

02 Mar 2025

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

...

Sergey Levine

Chelsea Finn

742

647

19 Mar 2024