ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.09300
  4. Cited By
Autoencoding beyond pixels using a learned similarity metric
v1v2 (latest)

Autoencoding beyond pixels using a learned similarity metric

31 December 2015
Anders Boesen Lindbo Larsen
Søren Kaae Sønderby
Hugo Larochelle
Ole Winther
    GAN
ArXiv (abs)PDFHTML

Papers citing "Autoencoding beyond pixels using a learned similarity metric"

50 / 932 papers shown
Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Tianyu Yuan
Yuanbo Yang
Lin Chen
Yao Yao
Zhuzhong Qian
DiffMVGen
257
0
0
04 Dec 2025
MRI Super-Resolution with Deep Learning: A Comprehensive Survey
MRI Super-Resolution with Deep Learning: A Comprehensive Survey
Mohammad Khateri
Serge Vasylechko
Morteza Ghahremani
Liam Timms
Deniz Kocanaogullari
...
Davood Karimi
Alejandra Sierra
Jussi Tohka
Sila Kurugol
O. Afacan
393
1
0
20 Nov 2025
Decoupling Complexity from Scale in Latent Diffusion Model
Tianxiong Zhong
Xingye Tian
X. Wang
Boyuan Jiang
Xin Tao
Pengfei Wan
DiffM
320
1
0
20 Nov 2025
Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications
Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World ApplicationsJournal of Big Data (JBD), 2025
Shamim Yazdani
Akansha Singh
N. Saxena
Sribala Vidyadhari Chinta
Avash Palikhe
Deng Pan
Umapada Pal
Jie Yang
Wenbin Zhang
203
4
0
23 Oct 2025
Quantum Autoencoders for Anomaly Detection in Cybersecurity
Quantum Autoencoders for Anomaly Detection in Cybersecurity
Rohan Senthil
Swee Liang Wong
102
0
0
22 Oct 2025
Lightweight CycleGAN Models for Cross-Modality Image Transformation and Experimental Quality Assessment in Fluorescence Microscopy
Lightweight CycleGAN Models for Cross-Modality Image Transformation and Experimental Quality Assessment in Fluorescence Microscopy
Mohammad Soltaninezhad
Yashar Rouzbahani
Jhonatan Contreras
Rohan Chippalkatti
Daniel Kwaku Abankwa
Christian Eggeling
Thomas Bocklitz
MedIm
95
0
0
17 Oct 2025
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
Jinchuan Tian
Sang-gil Lee
Zhifeng Kong
Sreyan Ghosh
Arushi Goel
...
Shinji Watanabe
Mohammad Shoeybi
Bryan Catanzaro
Rafael Valle
Wei Ping
AuLLMLRM
290
1
0
13 Oct 2025
O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion
O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion
Huu Tuong Tu
Huan Vu
cuong tien nguyen
Dien Hy Ngo
Nguyen Thi Thu Trang
97
0
0
10 Oct 2025
MelTok: 2D Tokenization for Single-Codebook Audio Compression
MelTok: 2D Tokenization for Single-Codebook Audio Compression
Jingyi Li
Zhiyuan Zhao
Yunfei Liu
Lijian Lin
Ye Zhu
Jiahao Wu
Qiuqiang Kong
Yu Li
Y. Li
312
0
0
02 Oct 2025
Cycle Diffusion Model for Counterfactual Image Generation
Cycle Diffusion Model for Counterfactual Image Generation
Fangrui Huang
Alan Wang
Binxu Li
Bailey Trang
Ridvan Yesiloglu
Tianyu Hua
Wei Peng
Ehsan Adeli
DiffMMedIm
213
1
0
29 Sep 2025
From Autoencoders to CycleGAN: Robust Unpaired Face Manipulation via Adversarial Learning
From Autoencoders to CycleGAN: Robust Unpaired Face Manipulation via Adversarial Learning
Collin Guo
Yi Qian
CVBMGAN
289
0
0
15 Sep 2025
Equivariant Flow Matching for Symmetry-Breaking Bifurcation Problems
Equivariant Flow Matching for Symmetry-Breaking Bifurcation Problems
Fleur Hendriks
O. Rokoš
M. Doškář
M. Geers
Vlado Menkovski
168
0
0
03 Sep 2025
Vocoder-Projected Feature Discriminator
Vocoder-Projected Feature Discriminator
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
DiffM
150
0
0
25 Aug 2025
KB-DMGen: Knowledge-Based Global Guidance and Dynamic Pose Masking for Human Image Generation
KB-DMGen: Knowledge-Based Global Guidance and Dynamic Pose Masking for Human Image Generation
Shibang Liu
Xuemei Xie
G. Shi
DiffM
249
0
0
26 Jul 2025
DOOMGAN:High-Fidelity Dynamic Identity Obfuscation Ocular Generative Morphing
DOOMGAN:High-Fidelity Dynamic Identity Obfuscation Ocular Generative Morphing
Bharath Krishnamurthy
Ajita Rattani
136
1
0
23 Jul 2025
Variational Learning of Disentangled Representations
Variational Learning of Disentangled Representations
Yuli Slavutsky
Ozgur Beker
David Blei
Bianca Dumitrascu
DRLOODCMLCoGe
264
1
0
20 Jun 2025
DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning
DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning
Dongxu Liu
Yuang Peng
Haomiao Tang
Yuwei Chen
Chunrui Han
Zheng Ge
Daxin Jiang
Mingxue Liao
Mingxue Liao
DiffM
293
1
0
11 Jun 2025
VIVAT: Virtuous Improving VAE Training through Artifact Mitigation
VIVAT: Virtuous Improving VAE Training through Artifact Mitigation
Lev Novitskiy
Viacheslav Vasilev
Maria Kovaleva
V. Arkhipkin
Denis Dimitrov
VGen
215
1
0
09 Jun 2025
Beyond the Norm: A Survey of Synthetic Data Generation for Rare Events
Beyond the Norm: A Survey of Synthetic Data Generation for Rare Events
Jingyi Gu
Xuan Zhang
Guiling Wang
SyDa
200
6
0
04 Jun 2025
PseudoVC: Improving One-shot Voice Conversion with Pseudo Paired Data
PseudoVC: Improving One-shot Voice Conversion with Pseudo Paired Data
Songjun Cao
Qinghua Wu
Jie Chen
Jin Li
Long Ma
164
0
0
01 Jun 2025
When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds
When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds
Minsu Kang
Seolhee Lee
Choonghyeon Lee
Namhyun Cho
VLM
129
2
0
30 May 2025
SAEs Are Good for Steering -- If You Select the Right Features
SAEs Are Good for Steering -- If You Select the Right Features
Dana Arad
Aaron Mueller
Yonatan Belinkov
LLMSV
425
20
0
26 May 2025
Source Separation by Flow Matching
Source Separation by Flow Matching
Robin Scheibler
John R. Hershey
Arnaud Doucet
Henry Li
476
3
0
22 May 2025
NSW-EPNews: A News-Augmented Benchmark for Electricity Price Forecasting with LLMs
NSW-EPNews: A News-Augmented Benchmark for Electricity Price Forecasting with LLMs
Zhaoge Bi
Linghan Huang
Haolin Jin
Qingwen Zeng
Huaming Chen
AI4TS
153
0
0
22 May 2025
Towards Generating Realistic Underwater Images
Towards Generating Realistic Underwater Images
Abdul-Kazeem Shamba
GAN
214
0
0
20 May 2025
VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Tianxiong Zhong
Xingye Tian
Boyuan Jiang
Xuebo Wang
Xin Tao
Pengfei Wan
Zhiwei Zhang
308
3
0
17 May 2025
Generative AI for Urban Planning: Synthesizing Satellite Imagery via Diffusion Models
Generative AI for Urban Planning: Synthesizing Satellite Imagery via Diffusion ModelsComputers, Environment and Urban Systems (CEUS), 2025
Qingyi Wang
Yuxuan Liang
Yunhan Zheng
Kaiyuan Xu
Jinhua Zhao
Shenhao Wang
216
5
0
13 May 2025
Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models
Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models
X. Wang
Haoyang Li
Zeyang Zhang
Zeyang Zhang
Wenwu Zhu
LRM
410
5
0
28 Apr 2025
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Sandipan Dhar
N. D. Jana
Swagatam Das
274
4
0
27 Apr 2025
Likelihood-Free Variational Autoencoders
Likelihood-Free Variational Autoencoders
Chen Xu
Qiang Wang
Lijun Sun
DiffMDRL
538
0
0
24 Apr 2025
Hyper-Transforming Latent Diffusion Models
Hyper-Transforming Latent Diffusion Models
I. Peis
Batuhan Koyuncu
Isabel Valera
J. Frellsen
455
1
0
23 Apr 2025
Learning and Generating Diverse Residential Load Patterns Using GAN with Weakly-Supervised Training and Weight Selection
Learning and Generating Diverse Residential Load Patterns Using GAN with Weakly-Supervised Training and Weight SelectionIEEE transactions on consumer electronics (IEEE TCE), 2025
Xinyu Liang
Hao Wang
945
2
0
19 Apr 2025
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment AnimationBritish Machine Vision Conference (BMVC), 2025
R. Vidaurre
Elena Garces
Dan Casas
DiffMAI4CE
283
1
0
24 Mar 2025
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Jiyuan Wang
Chunyu Lin
Cheng Guan
Lang Nie
Jing He
Haodong Li
K. Liao
Yao Zhao
DiffMMDE
466
12
0
20 Mar 2025
QINCODEC: Neural Audio Compression with Implicit Neural Codebooks
QINCODEC: Neural Audio Compression with Implicit Neural Codebooks
Zineb Lahrichi
Gaëtan Hadjeres
Gaël Richard
Geoffroy Peeters
348
2
0
19 Mar 2025
A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation
Forough Fazeliasl
Michael Minyi Zhang
Bei Jiang
Linglong Kong
245
0
0
13 Mar 2025
Memory-Efficient 3D High-Resolution Medical Image Synthesis Using CRF-Guided GANsInternational Conference on Pattern Recognition (ICPR), 2025
Mahshid shiri
Alessandro Bruno
Daniele Loiacono
MedIm3DV
161
0
0
13 Mar 2025
Steered Generation via Gradient Descent on Sparse Features
Steered Generation via Gradient Descent on Sparse Features
Sumanta Bhattacharyya
Pedram Rooshenas
LLMSV
305
0
0
25 Feb 2025
High-Fidelity Music Vocoder using Neural Audio Codecs
High-Fidelity Music Vocoder using Neural Audio CodecsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Luca A. Lanzendörfer
Florian Grötschla
Michael Ungersböck
Roger Wattenhofer
306
2
0
18 Feb 2025
Generative Adversarial Networks for High-Dimensional Item Factor Analysis: A Deep Adversarial Learning Algorithm
Generative Adversarial Networks for High-Dimensional Item Factor Analysis: A Deep Adversarial Learning Algorithm
Nanyu Luo
Feng Ji
DRL
484
0
0
15 Feb 2025
FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
FlashSR: One-step Versatile Audio Super-resolution via Diffusion DistillationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Jaekwon Im
Juhan Nam
DiffM
342
4
0
18 Jan 2025
An Empirical Study of Autoregressive Pre-training from Videos
An Empirical Study of Autoregressive Pre-training from Videos
Jathushan Rajasegaran
Ilija Radosavovic
Rahul Ravishankar
Yossi Gandelsman
Christoph Feichtenhofer
Jitendra Malik
183
16
0
10 Jan 2025
Diffusion Model-Based Data Synthesis Aided Federated Semi-Supervised LearningIEEE Wireless Communications and Networking Conference (WCNC), 2025
Zhongwei Wang
Tong Wu
Zhiyong Chen
Liang Qian
Yin Xu
Meixia Tao
FedML
245
0
0
04 Jan 2025
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
SoftVQ-VAE: Efficient 1-Dimensional Continuous TokenizerComputer Vision and Pattern Recognition (CVPR), 2024
Zeyang Zhang
Zihan Wang
Xianrui Li
Xingwu Sun
Fangyi Chen
Jiang Liu
Jiadong Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
723
32
0
14 Dec 2024
Hierarchical Conditional Tabular GAN for Multi-Tabular Synthetic Data
  Generation
Hierarchical Conditional Tabular GAN for Multi-Tabular Synthetic Data Generation
Wilhelm Ågren
Victorio Úbeda Sosa
257
2
0
11 Nov 2024
Towards Visual Text Design Transfer Across Languages
Towards Visual Text Design Transfer Across LanguagesNeural Information Processing Systems (NeurIPS), 2024
Yejin Choi
Jiwan Chung
Sumin Shim
Giyeong Oh
Youngjae Yu
VLMDiffM
164
1
0
24 Oct 2024
Longitudinal Causal Image Synthesis
Longitudinal Causal Image Synthesis
Yujia Li
Han Li
ans S. Kevin Zhou
DiffMMedIm
244
0
0
23 Oct 2024
Efficient Distribution Matching of Representations via Noise-Injected Deep InfoMax
Efficient Distribution Matching of Representations via Noise-Injected Deep InfoMax
I. Butakov
Alexander Sememenko
Alexander Tolmachev
Andrey Gladkov
Marina Munkhoeva
Alexey Frolov
430
2
0
09 Oct 2024
IceCloudNet: 3D reconstruction of cloud ice from Meteosat SEVIRI
IceCloudNet: 3D reconstruction of cloud ice from Meteosat SEVIRIArtificial Intelligence for the Earth Systems (AI4ES), 2024
K. Jeggle
Mikolaj Czerkawski
F. Serva
B. L. Saux
D. Neubauer
Ulrike Lohmann
127
5
0
05 Oct 2024
Khattat: Enhancing Readability and Concept Representation of Semantic
  Typography
Khattat: Enhancing Readability and Concept Representation of Semantic Typography
Ahmed Hussein
Alaa Elsetohy
Sama Hadhoud
Tameem Bakr
Yasser Rohaim
Badr AlKhamissi
VLM
213
1
0
01 Oct 2024
1234...171819
Next
Page 1 of 19
Pageof 19