FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL

5 June 2025
Kaihang Pan
Wendong Bu
Y. Wu
Yang Wu
Kai Shen
Yunfei Li
Hang Zhao
Juncheng Billy Li
Siliang Tang
Yueting Zhuang

Papers citing "FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL"

6 / 6 papers shown
Reinforcement Learning for Large Model: A Survey
Weijia Wu
Chen Gao
Joya Chen
Kevin Lin
Qingwei Meng
Yiming Zhang
Yuke Qiu
Hong Zhou
Mike Zheng Shou
24 Dec 2025
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
Dongzhi Jiang
Renrui Zhang
Haodong Li
Zhuofan Zong
Ziyu Guo
...
J. C. Ye
Rongyao Fang
Weijia Li
R. Liu
Hongsheng Li
AI4TS, VLM, LRM
04 Dec 2025
STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation
Xiaoxiao Ma
Haibo Qiu
Guohui Zhang
Zhixiong Zeng
Siqi Yang
Lin Ma
Feng Zhao
29 Sep 2025
MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning
Yapeng Mi
Hengli Li
Yanpeng Zhao
Chenxi Li
Huimin Wu
Xiaojian Ma
Song-Chun Zhu
Ying Nian Wu
Qing Li
LRM, VLM
26 Sep 2025
Group Critical-token Policy Optimization for Autoregressive Image Generation
Guohui Zhang
Hu Yu
Xiaoxiao Ma
Jinghao Zhang
Yaning Pan
Mingde Yao
Jie Xiao
Linjiang Huang
Feng Zhao
26 Sep 2025
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs
Zhaoyu Fan
Kaihang Pan
Mingze Zhou
Bosheng Qin
Juncheng Billy Li
Shengyu Zhang
Wenqiao Zhang
Siliang Tang
Fei Wu
Yueting Zhuang
KELM
06 Sep 2025