Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.23144
Cited By
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms
30 October 2024
Jordan Meyer
Nick Padgett
Cullen Miller
Laura Exline
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms"
4 / 4 papers shown
Title
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
X. Zhang
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
57
0
0
05 May 2025
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Yang Shi
Jiaheng Liu
Yushuo Guan
Z. Wu
Y. Zhang
...
Bohan Zeng
W. Zhang
Fuzheng Zhang
Wenjing Yang
Di Zhang
VGen
VLM
65
0
0
14 Apr 2025
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Size Wu
W. Zhang
Lumin Xu
Sheng Jin
Zhonghua Wu
Qingyi Tao
Wentao Liu
Wei Li
Chen Change Loy
VGen
53
2
0
27 Mar 2025
The Human-GenAI Value Loop in Human-Centered Innovation: Beyond the Magical Narrative
Camille Grange
Théophile Demazure
Mickael Ringeval
Simon Bourdeau
Cedric Martineau
22
1
0
04 Jul 2024
1