2501.12895
Cited By
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
22 January 2025
Yafu Li, Xuyang Hu, Xiaoye Qu, Linjie Li, Yu-Xi Cheng
Papers citing "Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback"
2 / 2 papers shown
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Xiaoye Qu, Yafu Li, Zhaochen Su, Weigao Sun, Jianhao Yan, ..., Chaochao Lu, Yue Zhang, Xian-Sheng Hua, Bowen Zhou, Yu Cheng
ReLM
OffRL
LRM
76
11
0
27 Mar 2025
Self-Supervised Prompt Optimization
Jinyu Xiang, Jiayi Zhang, Zhaoyang Yu, Fengwei Teng, Jinhao Tu, Xinbing Liang, Sirui Hong, Chenglin Wu, Yuyu Luo
OffRL
LRM
41
5
0
07 Feb 2025