Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2508.15044
Cited By

Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner

v1v2v3 (latest)

Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner

20 August 2025

ArXiv (abs)PDF HTML Github (12★)

Papers citing "Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner"

1 / 1 papers shown

Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning

Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning

OffRL AI4TS LRM

114

1

0

16 Oct 2025

Page 1 of 1