arXiv: 2312.09187
Vision-Language Models as a Source of Rewards

14 December 2023
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
Harris Chan
Gheorghe Comanici
Sebastian Flennerhag
Maxime Gazeau
Kristian Holsheimer
Dan Horgan
Michael Laskin
Clare Lyle
Hussain Masoom
Kay McKinney
Volodymyr Mnih
Alexander Neitz
Dmitry Nikulin
Fabio Pardo
Jack Parker-Holder
John Quan
Tim Rocktäschel
Himanshu Sahni
Tom Schaul
Yannick Schroecker
Stephen Spencer
Richie Steigerwald
Luyu Wang
Lei Zhang
VLM, LRM
ArXiv (abs) | PDF | HTML | HuggingFace (14 upvotes)

Papers citing "Vision-Language Models as a Source of Rewards"

15 / 15 papers shown
LAGEA: Language Guided Embodied Agents for Robotic Manipulation
Abdul Monaf Chowdhury
Akm Moshiur Rahman Mazumder
Rabeya Akter
S. Arib
LM&Ro
88
0
0
27 Sep 2025
ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks
Philip Schroeder
Ondrej Biza
Thomas Weng
Hongyin Luo
James Glass
LM&Ro, LRM
99
0
0
03 Aug 2025
ReDit: Reward Dithering for Improved LLM Policy Optimization
Chenxing Wei
Jiarui Yu
Y. He
Hande Dong
Yao Shu
Fei Richard Yu
LRM
176
4
0
23 Jun 2025
GoalLadder: Incremental Goal Discovery with Vision-Language Models
Alexey Zakharov
Shimon Whiteson
156
1
0
19 Jun 2025
RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills
Chunru Lin
Haotian Yuan
Yian Wang
Xiaowen Qiu
Tsun-Hsuan Wang
Minghao Guo
Bohan Wang
Yashraj S. Narang
Dieter Fox
Chuang Gan
162
2
0
17 Jun 2025
VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models
Christos Ziakas
Alessandra Russo
TTA
252
0
0
11 Jun 2025
DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving
Dawood Wasif
T. Moore
Chandan K Reddy
Jin-Hee Cho
VLM
171
0
0
01 Jun 2025
Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Wei Chen
Jiahao Zhang
Haipeng Zhu
Boyan Xu
Zijian Li
Keli Zhang
Junjian Ye
Ruichu Cai
234
3
0
30 May 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar
Christian Gumbsch
Antonios Tragoudaras
Pavel Kolev
Georg Martius
LM&Ro, VLM
647
3
0
03 Mar 2025
Offline RLAIF: Piloting VLM Feedback for RL via SFO
Jacob Beck
OffRL
449
0
0
02 Mar 2025
Open-Endedness is Essential for Artificial Superhuman Intelligence
International Conference on Machine Learning (ICML), 2024
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktäschel
LRM
292
53
0
06 Jun 2024
Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
Minttu Alakuijala
Reginald McLean
Isaac Woungang
Nariman Farsad
Samuel Kaski
Pekka Marttinen
Kai Yuan
LM&Ro
299
7
0
30 May 2024
SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Ziniu Hu
Ahmet Iscen
Aashi Jain
Thomas Kipf
Yisong Yue
David A. Ross
Cordelia Schmid
Alireza Fathi
LLMAG
217
64
0
02 Mar 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
Xiyao Wang
Yuhang Zhou
Xiaoyu Liu
Hongjin Lu
Yuancheng Xu
...
Taixi Lu
Gedas Bertasius
Mohit Bansal
Huaxiu Yao
Furong Huang
LRM, VLM
297
95
0
19 Jan 2024
RePLan: Robotic Replanning with Perception and Language Models
Marta Skreta
Zihan Zhou
Jia Lin Yuan
Kourosh Darvish
Alán Aspuru-Guzik
Animesh Garg
LM&Ro, LRM
279
40
0
08 Jan 2024