ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.03498
  4. Cited By
Improved Techniques for Training GANs

Improved Techniques for Training GANs

10 June 2016
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
    GAN
ArXiv (abs)PDFHTML

Papers citing "Improved Techniques for Training GANs"

50 / 4,341 papers shown
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Dongwon Kim
Ju He
Qihang Yu
Chenglin Yang
Xiaohui Shen
Suha Kwak
Liang-Chieh Chen
VLM
433
25
0
13 Jan 2025
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Xiaoying Xing
Avinab Saha
Junfeng He
Susan Hao
Paul Vicol
...
Sahil Singla
Sarah Young
Yinxiao Li
Feng Yang
Deepak Ramachandran
DiffM
306
3
0
11 Jan 2025
MEt3R: Measuring Multi-View Consistency in Generated Images
MEt3R: Measuring Multi-View Consistency in Generated ImagesComputer Vision and Pattern Recognition (CVPR), 2025
Mohammad Asim
Christopher Wewer
Thomas Wimmer
Bernt Schiele
J. E. Lenssen
EGVM3DGSVGen
256
38
0
10 Jan 2025
Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion
Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion
Fan Yang
Jianfeng Zhang
Yichun Shi
Bowen Chen
Chenxu Zhang
Huichao Zhang
Xiaofeng Yang
Xiu Li
Jiashi Feng
Guosheng Lin
403
3
0
10 Jan 2025
CAT: Content-Adaptive Image Tokenization
Junhong Shen
Kushal Tirumala
Michihiro Yasunaga
Ishan Misra
Luke Zettlemoyer
Lili Yu
Chunting Zhou
189
5
0
06 Jan 2025
Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model
Omid Saghatchian
Atiyeh Gh. Moghadam
Ahmad Nickabadi
MoMe
326
6
0
03 Jan 2025
MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware ClassificationAAAI Conference on Artificial Intelligence (AAAI), 2025
Jimin Park
AHyun Ji
Minji Park
Mohammad Saidur Rahman
Se Eun Oh
223
8
0
03 Jan 2025
PQD: Post-training Quantization for Efficient Diffusion Models
Jiaojiao Ye
Zhen Wang
Linnan Jiang
MQ
258
1
0
03 Jan 2025
Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models
Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image ModelsPatterns (Patterns), 2024
Yuzhu Cai
Sheng Yin
Yuxi Wei
Chenxin Xu
Weibo Mao
Felix Juefei Xu
Siheng Chen
Yanfeng Wang
EGVM
447
4
0
03 Jan 2025
LoVA: Long-form Video-to-Audio Generation
LoVA: Long-form Video-to-Audio GenerationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Xin Cheng
Xihua Wang
Yihan Wu
Yuyue Wang
Ruihua Song
VGenDiffM
258
15
0
31 Dec 2024
Grid Diffusion Models for Text-to-Video Generation
Grid Diffusion Models for Text-to-Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Taegyeong Lee
Soyeong Kwon
Taehwan Kim
313
19
0
31 Dec 2024
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
329
19
0
31 Dec 2024
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
Chia-Yu Hung
Navonil Majumder
Zhifeng Kong
Ambuj Mehrish
Rafael Valle
Bryan Catanzaro
Soujanya Poria
Bryan Catanzaro
Soujanya Poria
375
36
0
30 Dec 2024
D-Judge: How Far Are We? Assessing the Discrepancies Between AI-synthesized and Natural Images through Multimodal Guidance
D-Judge: How Far Are We? Assessing the Discrepancies Between AI-synthesized and Natural Images through Multimodal Guidance
Renyang Liu
Ziyu Lyu
Wei Zhou
See-Kiong Ng
EGVM
451
0
0
23 Dec 2024
TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion ModelsAAAI Conference on Artificial Intelligence (AAAI), 2024
Haocheng Huang
Jiaxin Chen
Jinyang Guo
Ruiyi Zhan
Yunhong Wang
DiffMMQ
288
3
0
21 Dec 2024
GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D
  Generators
GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators
Hengjia Li
Yang Liu
Yibo Zhao
Haoran Cheng
Yang Yang
...
Qibo Qiu
Boxi Wu
Tu Zheng
Zheng Yang
Xiaofei He
279
4
0
20 Dec 2024
Next Patch Prediction for Autoregressive Visual Generation
Next Patch Prediction for Autoregressive Visual Generation
Yatian Pang
Peng Jin
Shuo Yang
Bin Lin
Bin Zhu
...
Liuhan Chen
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
629
21
0
19 Dec 2024
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio SynthesisComputer Vision and Pattern Recognition (CVPR), 2024
Ho Kei Cheng
Masato Ishii
Akio Hayakawa
Takashi Shibuya
Alex Schwing
Yuki Mitsufuji
VGen
537
69
0
19 Dec 2024
E-CAR: Efficient Continuous Autoregressive Image Generation via
  Multistage Modeling
E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Zhihang Yuan
Yuzhang Shang
Hao Zhang
Tongcheng Fang
Rui Xie
Bingxin Xu
Yan Yan
Shengen Yan
Guohao Dai
Yu Wang
DiffM
379
4
0
18 Dec 2024
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
VideoDPO: Omni-Preference Alignment for Video Diffusion GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Runtao Liu
Haoyu Wu
Zheng Ziqiang
Chen Wei
Yingqing He
Renjie Pi
Qifeng Chen
VGen
324
65
0
18 Dec 2024
Is Your World Simulator a Good Story Presenter? A Consecutive
  Events-Based Benchmark for Future Long Video Generation
Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Yiping Wang
Xuehai He
Kuan-Chieh Wang
Luyao Ma
Jianwei Yang
Shuohang Wang
Simon Shaolei Du
Yelong Shen
VGen
331
9
0
17 Dec 2024
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection GuidanceAAAI Conference on Artificial Intelligence (AAAI), 2024
Wenhao Sun
Benlei Cui
Xue-Mei Dong
Jingqun Tang
DiffM
789
32
0
17 Dec 2024
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion
  Models
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion ModelsAAAI Conference on Artificial Intelligence (AAAI), 2024
Weilun Feng
Haotong Qin
Chuanguang Yang
Zhulin An
Libo Huang
Boyu Diao
Fei Wang
Renshuai Tao
Yongjun Xu
Michele Magno
DiffMMQ
277
14
0
16 Dec 2024
Scaled Conjugate Gradient Method for Nonconvex Optimization in Deep
  Neural Networks
Scaled Conjugate Gradient Method for Nonconvex Optimization in Deep Neural Networks
Naoki Sato
Koshiro Izumi
Hideaki Iiduka
ODL
241
2
0
16 Dec 2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
SoftVQ-VAE: Efficient 1-Dimensional Continuous TokenizerComputer Vision and Pattern Recognition (CVPR), 2024
Zeyang Zhang
Zihan Wang
Xianrui Li
Xingwu Sun
Fangyi Chen
Jiang Liu
Jiadong Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
664
32
0
14 Dec 2024
A Decade of Deep Learning: A Survey on The Magnificent Seven
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Liang Luo
...
Liang Li
Houcheng Su
Yu Zhong
Wei Liu
Shangsong Liang
OODAI4TSMedIm
300
0
0
13 Dec 2024
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Haonan Qiu
Shiwei Zhang
Yujie Wei
Ruihang Chu
Hangjie Yuan
Xinyu Wang
Yujiao Shi
Ziwei Liu
356
18
0
12 Dec 2024
Unlocking Visual Secrets: Inverting Features with Diffusion Priors for
  Image Reconstruction
Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction
Sai Qian Zhang
Ziyun Li
Chuan Guo
Saeed Mahloujifar
Deeksha Dangwal
Edward Suh
B. D. Salvo
Chiao Liu
DiffM
310
2
0
11 Dec 2024
Intelligent Electric Power Steering: Artificial Intelligence Integration
  Enhances Vehicle Safety and Performance
Intelligent Electric Power Steering: Artificial Intelligence Integration Enhances Vehicle Safety and Performance
Vikas Vyas
Sneha Sudhir Shetiya
LLMSV
166
3
0
11 Dec 2024
CAP: Evaluation of Persuasive and Creative Image Generation
CAP: Evaluation of Persuasive and Creative Image Generation
Aysan Aghazadeh
Adriana Kovashka
EGVM
397
3
0
10 Dec 2024
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal
  Latent Alignment
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment
Kim Sung-Bin
Arda Senocak
Hyunwoo Ha
Tae-Hyun Oh
DiffM
404
3
0
09 Dec 2024
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive ConceptsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Ziwei Huang
Wanggui He
Quanyu Long
Yandi Wang
Haoyuan Li
...
Fangxun Shu
Long Chen
Hao Jiang
Yaoyao Yu
Leilei Gan
EGVM
1.1K
9
0
05 Dec 2024
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion
  Based Generation and Model Pretraining
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model PretrainingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Youssof Nawar
Nouran Soliman
Moustafa Wassel
Mohamed ElHabebe
Noha Adly
Marwan Torki
Ahmed Elmassry
Islam Ahmed
MedIm
289
0
0
04 Dec 2024
BOTracle: A framework for Discriminating Bots and Humans
BOTracle: A framework for Discriminating Bots and Humans
Jan Kadel
August See
Ritwik Sinha
Mathias Fischer
183
1
0
03 Dec 2024
AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation
AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion ExtrapolationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zhihang Lin
Mingbao Lin
Wengyi Zhan
Rongrong Ji
317
2
0
03 Dec 2024
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
Khaled Abud
Sergey Lavrushkin
Alexey Kirillov
D. Vatolin
480
0
0
02 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
RandAR: Decoder-only Autoregressive Visual Generation in Random OrdersComputer Vision and Pattern Recognition (CVPR), 2024
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
392
60
0
02 Dec 2024
BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis
BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis
Seong-Eun Hong
Soobin Lim
Juyeong Hwang
Minwook Chang
Hyeongyeop Kang
612
4
0
28 Nov 2024
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of
  Text-to-Video Generation with LMM
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMMComputer Vision and Pattern Recognition (CVPR), 2024
Jiarui Wang
Huiyu Duan
Guoquan Zheng
Juntong Wang
Xiongkuo Min
EGVM
257
22
0
26 Nov 2024
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient
  Attention and Quantization
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization
Rui Xie
Tianchen Zhao
Zhihang Yuan
Rui Wan
Wenxi Gao
Zhenhua Zhu
Xuefei Ning
Yu Wang
VGenMQ
192
10
0
26 Nov 2024
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis
Xinyu Hou
Zongsheng Yue
Xiaoming Li
Chen Change Loy
VGenDiffM
382
0
0
26 Nov 2024
Factorized Visual Tokenization and Generation
Factorized Visual Tokenization and Generation
Zechen Bai
Jianxiong Gao
Ziteng Gao
Pichao Wang
Zheng Zhang
Tong He
Mike Zheng Shou
274
6
0
25 Nov 2024
Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of
  CycleWGAN, ProGAN, and DCGAN
Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGANBigData Congress [Services Society] (BSS), 2024
Elona Shatri
Kalikidhar Palavala
George Fazekas
281
1
0
25 Nov 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guoquan Zheng
EGVM
533
11
0
25 Nov 2024
ExAL: An Exploration Enhanced Adversarial Learning Algorithm
ExAL: An Exploration Enhanced Adversarial Learning Algorithm
A Vinil
Aneesh Sreevallabh Chivukula
Pranav Chintareddy
AAML
132
0
0
24 Nov 2024
Comparative Analysis of Diffusion Generative Models in Computational
  Pathology
Comparative Analysis of Diffusion Generative Models in Computational Pathology
Denisha Thakkar
Vincent Quoc-Huy Trinh
Sonal Varma
Samira Ebrahimi Kahou
Hassan Rivaz
Mahdi S. Hosseini
MedIm
285
1
0
24 Nov 2024
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
Teng Zhou
Xiaoyu Zhang
Yongchuan Tang
MLLMDiffM
616
2
0
24 Nov 2024
Hierarchical Cross-Attention Network for Virtual Try-On
Hierarchical Cross-Attention Network for Virtual Try-OnIEEE transactions on multimedia (IEEE TMM), 2024
Hao Tang
Bin Ren
Pingping Wu
Andrii Zadaianchuk
301
1
0
23 Nov 2024
Automatic Evaluation for Text-to-image Generation: Task-decomposed
  Framework, Distilled Training, and Meta-evaluation Benchmark
Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Rong-Cheng Tu
Zi-Ao Ma
Tian Lan
Yuehao Zhao
Heyan Huang
Xian-Ling Mao
MLLMVLMEGVM
361
10
0
23 Nov 2024
GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts
Junwen He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Chong Li
Hanyuan Chen
Jin-Peng Lan
Bin Luo
Yifeng Geng
323
2
0
18 Nov 2024
Previous
123...91011...858687
Next