Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

Neural Information Processing Systems (NeurIPS), 2024
30 September 2024
Kaihang Pan
Zhaoyu Fan
Juncheng Li
Qifan Yu
Hao Fei
Siliang Tang
Richang Hong
Hanwang Zhang
Qianru Sun
    KELM

Papers citing "Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration"

8 / 8 papers shown
Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning
Yuyao Ge
Shenghua Liu
Yiwei Wang
Shansong Liu
Baolong Bi
Xuanshan Zhou
Jiayu Yao
Jiafeng Guo
Xueqi Cheng
176
2
0
08 Sep 2025
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities
Wendong Bu
Yang Wu
Qifan Yu
Minghe Gao
Bingchen Miao
...
Mengze Li
Wei Ji
Juncheng Billy Li
Siliang Tang
Yueting Zhuang
ELM
137
1
0
10 Jun 2025
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL
Kaihang Pan
Wendong Bu
Y. Wu
Yang Wu
Kai Shen
Yunfei Li
Hang Zhao
Juncheng Billy Li
Siliang Tang
Yueting Zhuang
194
8
0
05 Jun 2025
ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems
Chenxi Wang
Jizhan Fang
Xiang Chen
Bozhong Tian
Ziwen Xu
Zeyang Zhang
Ningyu Zhang
KELM
299
0
0
26 Mar 2025
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
Bingchen Miao
Y. Wu
Minghe Gao
Qifan Yu
Wendong Bu
Wenqiao Zhang
Yunfei Li
Siliang Tang
Tat-Seng Chua
Juncheng Billy Li
LLMAG, LRM
316
3
0
24 Mar 2025
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Neural Information Processing Systems (NeurIPS), 2024
Hao Fei
Shengqiong Wu
Hao Zhang
Tat-Seng Chua
Shuicheng Yan
407
70
0
31 Dec 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Computer Vision and Pattern Recognition (CVPR), 2024
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
417
98
0
24 Nov 2024
Towards Semantic Equivalence of Tokenization in Multimodal LLM
International Conference on Learning Representations (ICLR), 2024
Shengqiong Wu
Hao Fei
Xiangtai Li
Jiayi Ji
Hanwang Zhang
Tat-Seng Chua
Shuicheng Yan
MLLM
442
55
0
07 Jun 2024