Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.25162
Cited By
Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models
29 September 2025
Bowei Chen
Sai Bi
Hao Tan
He Zhang
Tianyuan Zhang
Zhengqi Li
Yuanjun Xiong
Jianming Zhang
Kai Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Github (115★)
Papers citing
"Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models"
1 / 1 papers shown
Title
VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator
Hyojun Go
Dominik Narnhofer
Goutam Bhat
Prune Truong
Federico Tombari
Konrad Schindler
VGen
76
0
0
15 Oct 2025
1