ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.00321
  4. Cited By
A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in
  Text-to-Image Encoders through Causal Analysis and Embedding Optimization
v1v2v3v4v5 (latest)

A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization

Neural Information Processing Systems (NeurIPS), 2024
1 October 2024
Chieh-Yun Chen
Chiang Tseng
Li-Wu Tsao
Hong-Han Shuai
ArXiv (abs)PDFHTML

Papers citing "A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization"

12 / 12 papers shown
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling
Bingjie Gao
Qianli Ma
Xiaoxue Wu
Shuai Yang
Guanzhou Lan
...
Qingyang Liu
Yu Qiao
Xinyuan Chen
Y. Wang
Li Niu
VGen
222
1
0
23 Oct 2025
DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation
DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation
Dongnam Byun
J. Park
Jumgmin Ko
Changin Choi
Wonjong Rhee
DiffM
199
0
0
16 Oct 2025
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Pin-Yen Chiu
I-Sheng Fang
Jun-Cheng Chen
DiffM
153
0
0
23 Sep 2025
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation
Chieh-Yun Chen
Min Shi
Gong Zhang
Humphrey Shi
MLLM
420
0
0
28 Jul 2025
Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models
Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models
L. Chen
Jiner Wang
Zihao Pan
B. Zhu
Xiaofeng Yang
Chi Zhang
DiffM
265
2
0
23 Jul 2025
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
Sanghyun Jo
Wooyeol Lee
Ziseok Lee
Kyungsu Kim
1.1K
0
0
27 May 2025
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation AbilityComputer Vision and Pattern Recognition (CVPR), 2025
Liwen Wang
Senmao Li
Fei Yang
Jianye Wang
Ziheng Zhang
Wenshu Fan
Yijiao Wang
Jian Yang
DiffM
426
2
0
06 May 2025
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Bingjie Gao
Xinyu Gao
Xiaoxue Wu
Yujie Zhou
Yu Qiao
Li Niu
Xinyuan Chen
Yaohui Wang
533
9
0
16 Apr 2025
SPF-Portrait: Towards Pure Text-to-Portrait Customization with Semantic Pollution-Free Fine-Tuning
SPF-Portrait: Towards Pure Text-to-Portrait Customization with Semantic Pollution-Free Fine-Tuning
Xiaole Xian
Zhichao Liao
Qingyu Li
Wenyu Qin
Pengfei Wan
Weicheng Xie
Long Zeng
Linlin Shen
Pingfa Feng
DiffM
569
0
0
01 Apr 2025
Geometrical Properties of Text Token Embeddings for Strong Semantic Binding in Text-to-Image Generation
Geometrical Properties of Text Token Embeddings for Strong Semantic Binding in Text-to-Image Generation
H. Seo
Junseo Bang
Haechang Lee
Joohoon Lee
Byung Hyun Lee
Se Young Chun
418
0
0
29 Mar 2025
FreeCond: Free Lunch in the Input Conditions of Text-Guided Inpainting
FreeCond: Free Lunch in the Input Conditions of Text-Guided Inpainting
Teng-Fang Hsiao
Bo-Kai Ruan
Sung-Lin Tsai
Yi-Lun Wu
Hong-Han Shuai
DiffM
400
2
0
30 Nov 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image
  Synthesis
Token Merging for Training-Free Semantic Binding in Text-to-Image SynthesisNeural Information Processing Systems (NeurIPS), 2024
Taihang Hu
Linxuan Li
Joost van de Weijer
Hongcheng Gao
Fahad Shahbaz Khan
Zhiqiang Wang
Ming-Ming Cheng
Kai Wang
Yaxing Wang
DiffM
415
26
0
11 Nov 2024
1
Page 1 of 1