arXiv: 2503.13817
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences

18 March 2025
Anukriti Singh
Amisha Bhaskar
Peihong Yu
Souradip Chakraborty
Ruthwik Dasyam
Amrit Singh Bedi
Erfaun Noorani

Papers citing "VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences"

Cross-Modal Instructions for Robot Motion Generation
William Barron
Xiaoxiang Dong
Matthew Johnson-Roberson
Weiming Zhi
25 Sep 2025
Perception-Aware Policy Optimization for Multimodal Reasoning
Zhenhailong Wang
Xuehang Guo
Sofia Stoica
Haiyang Xu
Hongru Wang
...
Xiusi Chen
Yangyi Chen
Ming Yan
Fei Huang
Mengyue Yang
OffRL, LRM
08 Jul 2025
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches
Peihong Yu
Amisha Bhaskar
Anukriti Singh
Zahiruddin Mahammad
Erfaun Noorani
14 Mar 2025