ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.03498
  4. Cited By
Improved Techniques for Training GANs

Improved Techniques for Training GANs

10 June 2016
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
    GAN
ArXivPDFHTML

Papers citing "Improved Techniques for Training GANs"

50 / 3,238 papers shown
Title
Improving Text-To-Audio Models with Synthetic Captions
Improving Text-To-Audio Models with Synthetic Captions
Zhifeng Kong
Sang-gil Lee
Deepanway Ghosal
Navonil Majumder
Ambuj Mehrish
Rafael Valle
Soujanya Poria
Bryan Catanzaro
45
11
0
18 Jun 2024
Autoregressive Image Generation without Vector Quantization
Autoregressive Image Generation without Vector Quantization
Tianhong Li
Yonglong Tian
He Li
Mingyang Deng
Kaiming He
DiffM
53
178
0
17 Jun 2024
ChildDiffusion: Unlocking the Potential of Generative AI and
  Controllable Augmentations for Child Facial Data using Stable Diffusion and
  Large Language Models
ChildDiffusion: Unlocking the Potential of Generative AI and Controllable Augmentations for Child Facial Data using Stable Diffusion and Large Language Models
Muhammad Ali Farooq
Wang Yao
Peter Corcoran
36
1
0
17 Jun 2024
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for
  Vision-Language Models
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Xiyang Wu
Tianrui Guan
Dianqi Li
Shuaiyi Huang
Xiaoyu Liu
...
Abhinav Shrivastava
Furong Huang
Jordan L. Boyd-Graber
Dinesh Manocha
Dinesh Manocha
HILM
LRM
VLM
MLLM
30
14
0
16 Jun 2024
IG2: Integrated Gradient on Iterative Gradient Path for Feature
  Attribution
IG2: Integrated Gradient on Iterative Gradient Path for Feature Attribution
Yue Zhuo
Zhiqiang Ge
26
7
0
16 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis:
  Techniques for Portrait Generation, Driving Mechanisms, and Editing
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
42
1
0
15 Jun 2024
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Wei Chen
Lin Li
Yongqi Yang
Bin Wen
Fan Yang
Tingting Gao
Yu Wu
Long Chen
VLM
VGen
47
6
0
15 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe-nan Lin
Rita Singh
Bhiksha Raj
DiffM
43
21
0
14 Jun 2024
Reinforced Decoder: Towards Training Recurrent Neural Networks for Time
  Series Forecasting
Reinforced Decoder: Towards Training Recurrent Neural Networks for Time Series Forecasting
Qi Sima
Xinze Zhang
Yukun Bao
Siyue Yang
Liang Shen
AI4TS
53
1
0
14 Jun 2024
Alleviating Distortion in Image Generation via Multi-Resolution
  Diffusion Models
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Qihao Liu
Zhanpeng Zeng
Ju He
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
53
19
0
13 Jun 2024
Beyond the Frontier: Predicting Unseen Walls from Occupancy Grids by
  Learning from Floor Plans
Beyond the Frontier: Predicting Unseen Walls from Occupancy Grids by Learning from Floor Plans
Ludvig Ericson
Patric Jensfelt
42
7
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing
  Reliability,Reproducibility, and Practicality
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
72
2
0
13 Jun 2024
DiTFastAttn: Attention Compression for Diffusion Transformer Models
DiTFastAttn: Attention Compression for Diffusion Transformer Models
Zhihang Yuan
Pu Lu
Hanling Zhang
Xuefei Ning
Linfeng Zhang
Tianchen Zhao
Shengen Yan
Guohao Dai
Yu Wang
48
20
0
12 Jun 2024
Dataset Enhancement with Instance-Level Augmentations
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn
Christian Rupprecht
45
9
0
12 Jun 2024
Image and Video Tokenization with Binary Spherical Quantization
Image and Video Tokenization with Binary Spherical Quantization
Yue Zhao
Yuanjun Xiong
Philipp Krahenbuhl
45
17
0
11 Jun 2024
Commonsense-T2I Challenge: Can Text-to-Image Generation Models
  Understand Commonsense?
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Xingyu Fu
Muyu He
Yujie Lu
William Yang Wang
Dan Roth
EGVM
LRM
31
15
0
11 Jun 2024
SAGIPS: A Scalable Asynchronous Generative Inverse Problem Solver
SAGIPS: A Scalable Asynchronous Generative Inverse Problem Solver
Daniel Lersch
Malachi Schram
Zhenyu Dai
Kishansingh Rajput
Xingfu Wu
Nobuo Sato
J. T. Childers
27
0
0
11 Jun 2024
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for
  Sampling
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling
Denis Blessing
Xiaogang Jia
Johannes Esslinger
Francisco Vargas
Gerhard Neumann
50
16
0
11 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with
  Foundation Models
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
54
2
0
11 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image
  Generation
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
66
227
0
10 Jun 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
53
10
0
10 Jun 2024
Can Prompt Modifiers Control Bias? A Comparative Analysis of
  Text-to-Image Generative Models
Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models
P. W. Shin
Jihyun Janice Ahn
Wenpeng Yin
Jack Sampson
Vijaykrishnan Narayanan
32
3
0
09 Jun 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Shiji Song
Yuan Yao
Gao Huang
32
14
0
08 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
GenAI Arena: An Open Evaluation Platform for Generative Models
Dongfu Jiang
Max W.F. Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
39
20
0
06 Jun 2024
Multistep Distillation of Diffusion Models via Moment Matching
Multistep Distillation of Diffusion Models via Moment Matching
Tim Salimans
Thomas Mensink
Jonathan Heek
Emiel Hoogeboom
DiffM
38
23
0
06 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
93
1
0
06 Jun 2024
VideoPhy: Evaluating Physical Commonsense for Video Generation
VideoPhy: Evaluating Physical Commonsense for Video Generation
Hritik Bansal
Zongyu Lin
Tianyi Xie
Zeshun Zong
Michal Yarom
Yonatan Bitton
Chenfanfu Jiang
Ningyu Zhang
Kai-Wei Chang
Aditya Grover
EGVM
VGen
40
36
0
05 Jun 2024
When Spiking neural networks meet temporal attention image decoding and
  adaptive spiking neuron
When Spiking neural networks meet temporal attention image decoding and adaptive spiking neuron
Xuerui Qiu
Zheng Luan
Zhaorui Wang
Rui-jie Zhu
41
5
0
05 Jun 2024
Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Qiaomu Miao
Alexandros Graikos
Jingwei Zhang
Sounak Mondal
Minh Hoai
Dimitris Samaras
38
0
0
04 Jun 2024
Analyzing the Feature Extractor Networks for Face Image Synthesis
Analyzing the Feature Extractor Networks for Face Image Synthesis
Erdi Sarıtaş
H. K. Ekenel
CVBM
EGVM
65
1
0
04 Jun 2024
ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data
  Generation
ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation
Wei Shao
Rongyi Zhu
Cai Yang
Chandra Thapa
Muhammad Ejaz Ahmed
S. Çamtepe
Rui Zhang
DuYong Kim
Hamid Menouar
Flora D. Salim
29
0
0
04 Jun 2024
Rank-based No-reference Quality Assessment for Face Swapping
Rank-based No-reference Quality Assessment for Face Swapping
Xinghui Zhou
Wenbo Zhou
Tianyi Wei
Shen Chen
Taiping Yao
Shouhong Ding
Weiming Zhang
Nenghai Yu
CVBM
37
0
0
04 Jun 2024
L-MAGIC: Language Model Assisted Generation of Images with Coherence
L-MAGIC: Language Model Assisted Generation of Images with Coherence
Zhipeng Cai
Matthias Mueller
R. Birkl
Diana Wofk
Shaoyen Tseng
JunDa Cheng
Gabriela Ben-Melech Stan
Vasudev Lal
Michael Paulitsch
DiffM
MLLM
40
6
0
03 Jun 2024
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Xinyin Ma
Gongfan Fang
Michael Bi Mi
Xinchao Wang
61
30
0
03 Jun 2024
Segmentation-Free Guidance for Text-to-Image Diffusion Models
Segmentation-Free Guidance for Text-to-Image Diffusion Models
K. Azarian
Debasmit Das
Qiqi Hou
Fatih Porikli
VLM
54
0
0
03 Jun 2024
Towards Practical Single-shot Motion Synthesis
Towards Practical Single-shot Motion Synthesis
Konstantinos Roditakis
Spyridon Thermos
N. Zioulis
VGen
43
0
0
03 Jun 2024
$Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion
  Transformers
ΔΔΔ-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Pengtao Chen
Mingzhu Shen
Peng Ye
Jianjian Cao
Chongjun Tu
C. Bouganis
Yiren Zhao
Tao Chen
60
27
0
03 Jun 2024
Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
Hunter Nisonoff
Junhao Xiong
Stephan Allenspach
Jennifer Listgarten
60
31
0
03 Jun 2024
SuperGaussian: Repurposing Video Models for 3D Super Resolution
SuperGaussian: Repurposing Video Models for 3D Super Resolution
Yuan Shen
Duygu Ceylan
Paul Guerrero
Zexiang Xu
Niloy J. Mitra
Shenlong Wang
Anna Frühstück
74
4
0
02 Jun 2024
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature
  Inheritance Strategies
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies
Jinchao Zhu
Yuxuan Wang
Siyuan Pan
Pengfei Wan
Di Zhang
Gao Huang
26
0
0
31 May 2024
Unified Directly Denoising for Both Variance Preserving and Variance
  Exploding Diffusion Models
Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models
Jingjing Wang
Dan Zhang
Feng Luo
DiffM
31
0
0
31 May 2024
Learning Gaze-aware Compositional GAN
Learning Gaze-aware Compositional GAN
Nerea Aranjuelo
Siyu Huang
Fundación Vicomtech
Luis Unzueta
Oihana Otaegui
Hanspeter Pfister
Donglai Wei
GAN
CVBM
28
0
0
31 May 2024
Slight Corruption in Pre-training Data Makes Better Diffusion Models
Slight Corruption in Pre-training Data Makes Better Diffusion Models
Hao Chen
Yujin Han
Diganta Misra
Xiang Li
Kai Hu
Difan Zou
Masashi Sugiyama
Jindong Wang
Bhiksha Raj
DiffM
47
5
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified
  Flow
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
78
6
0
30 May 2024
DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person
  Re-Identification in Real-World
DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World
Wenli Sun
Xinyang Jiang
Dongsheng Li
Cairong Zhao
DiffM
AAML
27
2
0
30 May 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
59
2
0
30 May 2024
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu
Zilong Huang
Bencheng Liao
Jun Hao Liew
Hanshu Yan
Jiashi Feng
Xinggang Wang
70
13
0
28 May 2024
Diffusion Rejection Sampling
Diffusion Rejection Sampling
Byeonghu Na
Yeongmin Kim
Minsang Park
DongHyeok Shin
Wanmo Kang
Il-Chul Moon
46
2
0
28 May 2024
EM Distillation for One-step Diffusion Models
EM Distillation for One-step Diffusion Models
Sirui Xie
Zhisheng Xiao
Diederik P. Kingma
Tingbo Hou
Ying Nian Wu
Kevin Patrick Murphy
Tim Salimans
Ben Poole
Ruiqi Gao
VLM
DiffM
42
24
0
27 May 2024
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Kai Wang
Yukun Zhou
Mingjia Shi
Zhihang Yuan
Yuzhang Shang
Yuzhang Shang
Hanwang Zhang
Hanwang Zhang
Yang You
71
10
0
27 May 2024
Previous
123...8910...636465
Next