arXiv: 2508.15044
Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner
Versions: v1, v2, v3 (latest)

20 August 2025
Bolian Li
Yanran Wu
Xinyu Luo
Ruqi Zhang
ArXiv (abs) | PDF | HTML | GitHub (12★)

Papers citing "Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner"

1 / 1 papers shown
Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning
Junlin Wu
Xianrui Zhong
Jiashuo Sun
Bolian Li
Bowen Jin
Jiawei Han
Qingkai Zeng
OffRL, AI4TS, LRM
16 Oct 2025