ResearchTrend.AI
Scaling Offline RL via Efficient and Expressive Shortcut Models

28 May 2025
Nicolas Espinosa-Dice
Yiyi Zhang
Yiding Chen
Bradley Guo
Owen Oertell
Gokul Swamy
Kianté Brantley
Wen Sun
    OffRL, LRM
arXiv: 2505.22866

Papers citing "Scaling Offline RL via Efficient and Expressive Shortcut Models"

7 / 7 papers shown
Expressive Value Learning for Scalable Offline Reinforcement Learning
Nicolas Espinosa-Dice
Kianté Brantley
Wen Sun
OffRL
12
0
0
09 Oct 2025
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL
Bhavya Agrawalla
Michal Nauman
Khush Agarwal
Aviral Kumar
OffRL
28
0
0
08 Sep 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM, VLM, OffRL, AI4TS, LRM
602
3,880
0
22 Jan 2025
A General Framework for Inference-time Scaling and Steering of Diffusion Models
R. Singhal
Zachary Horvitz
Ryan Teehan
Mengye Ren
Zhou Yu
Kathleen McKeown
Rajesh Ranganath
DiffM
295
68
0
12 Jan 2025
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Bradley Brown
Jordan Juravsky
Ryan Ehrlich
Ronald Clark
Quoc V. Le
Christopher Ré
Azalia Mirhoseini
ALM, LRM
460
470
0
03 Jan 2025
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
267
48
0
26 Oct 2024
One Step Diffusion via Shortcut Models
Kevin Frans
Danijar Hafner
Sergey Levine
Pieter Abbeel
VLM, DiffM
296
99
0
16 Oct 2024