Autoencoding beyond pixels using a learned similarity metric

31 December 2015
Anders Boesen Lindbo Larsen
Søren Kaae Sønderby
Hugo Larochelle
Ole Winther
    GAN

Papers citing "Autoencoding beyond pixels using a learned similarity metric"

50 / 930 papers shown
SVS-GAN: Leveraging GANs for Semantic Video Synthesis
Khaled M. Seyam
Julian Wiederer
Markus Braun
Bin Yang
151
0
0
09 Sep 2024
Latent 3D Brain MRI Counterfactual
Wei Peng
Tian Xia
Fabio De Sousa Ribeiro
Tomas Bosschieter
Ehsan Adeli
Qingyu Zhao
Ben Glocker
K. Pohl
CML, MedIm
242
3
0
09 Sep 2024
An Analysis for Image-to-Image Translation and Style Transfer
Xiaoming Yu
Jie Tian
Zhenhua Hu
VLM
259
0
0
12 Aug 2024
Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection
ACM Multimedia (MM), 2024
Yixin Guo
Yu Liu
Jianghao Li
Weimin Wang
Qi Jia
VLM
183
16
0
12 Aug 2024
Counterfactuals and Uncertainty-Based Explainable Paradigm for the Automated Detection and Segmentation of Renal Cysts in Computed Tomography Images: A Multi-Center Study
Zohaib Salahuddin
A. Ibrahim
Sheng Kuang
Y. Widaatalla
R. Miclea
...
Tom Marcelissen
Patricia Zondervan
Auke Jager
Philippe Lambin
Henry C. Woodruff
MedIm
273
1
0
07 Aug 2024
A Non-negative VAE: the Generalized Gamma Belief Network
Zhibin Duan
Tiansheng Wen
Muyao Wang
Bo Chen
Mingyuan Zhou
BDL
303
2
0
06 Aug 2024
Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving
Bernard Lange
Masha Itkina
Jiachen Li
Mykel J. Kochenderfer
361
5
0
30 Jul 2024
Can I trust my anomaly detection system? A case study based on explainable AI
Muhammad Rashid
E. Amparore
Enrico Ferrari
Damiano Verda
182
0
0
29 Jul 2024
Diverse Image Harmonization
Xinhao Tao
Tianyuan Qiu
Junyan Cao
Li Niu
208
0
0
22 Jul 2024
LIMT: Language-Informed Multi-Task Visual World Models
Elie Aljalbout
Nikolaos Sotirakis
Patrick van der Smagt
Maximilian Karl
Nutan Chen
346
5
0
18 Jul 2024
All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models
Charumathi Badrinath
Usha Bhalla
Alexander X. Oesterling
Suraj Srinivas
Himabindu Lakkaraju
DiffM
199
0
0
18 Jul 2024
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma
Wenguan Wang
Yi Yang
Feng Zheng
DiffM
229
1
0
15 Jul 2024
Optimal Video Compression using Pixel Shift Tracking
Hitesh Saai Mananchery Panneerselvam
Smit Anand
138
0
0
28 Jun 2024
Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data
Yu-Hua Chen
Woosung Choi
Wei-Hsiang Liao
Marco A. Martínez-Ramírez
K. Cheuk
Yuki Mitsufuji
J. Jang
Yi-Hsuan Yang
160
6
0
22 Jun 2024
Occam's Razor for Self Supervised Learning: What is Sufficient to Learn Good Representations?
Mark Ibrahim
David Klindt
Randall Balestriero
SSL
270
6
1
15 Jun 2024
DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition
Jiacheng Liu
Hang Zhou
Shida Wei
Rui Ma
253
4
0
12 Jun 2024
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis
Zhijun Liu
Shuai Wang
Sho Inoue
Qibing Bai
Haizhou Li
DiffM
168
31
0
08 Jun 2024
3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Regimes
IEEE International Symposium on Biomedical Imaging (ISBI), 2024
Aghiles Kebaili
J. Lapuyade-Lahorgue
Pierre Vera
S. Ruan
MedIm
137
6
0
08 Jun 2024
Inference Attacks: A Taxonomy, Survey, and Promising Directions
Feng Wu
Lei Cui
Shaowen Yao
Shui Yu
352
3
0
04 Jun 2024
Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories
Tianlong Wang
Xianfeng Jiao
Yifan He
Zhongzhi Chen
Yinghao Zhu
Xu Chu
Junyi Gao
Yasha Wang
Liantao Ma
LLMSV
407
49
0
26 May 2024
MAPL: Memory Augmentation and Pseudo-Labeling for Semi-Supervised Anomaly Detection
Junzhuo Chen
Shitong Kang
296
0
0
10 May 2024
Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI
Minhui Yu
Mengqi Wu
Ling Yue
Andrea Bozoki
Mingxia Liu
DiffM, MedIm
217
2
0
03 May 2024
Tailoring Generative Adversarial Networks for Smooth Airfoil Design
Joyjit Chattoraj
Jian Cheng Wong
Zexuan Zhang
Manna Dai
Yingzhi Xia
...
Xinxing Xu
Chin Chun Ooi
Yang Feng
M. Dao
Yong Liu
AI4CE, GAN
196
3
0
18 Apr 2024
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
Neural Information Processing Systems (NeurIPS), 2024
Leying Zhang
Yao Qian
Long Zhou
Shujie Liu
Dongmei Wang
...
Yanmin Qian
Jinyu Li
Lei He
Sheng Zhao
Michael Zeng
231
15
0
10 Apr 2024
NPB-REC: A Non-parametric Bayesian Deep-learning Approach for Undersampled MRI Reconstruction with Uncertainty Estimation
Samah Khawaled
Moti Freiman
UQCV
166
5
0
06 Apr 2024
Real, fake and synthetic faces -- does the coin have three sides?
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024
Shahzeb Naeem
Ramzi Al-Sharawi
Muhammad Riyyan Khan
Usman Tariq
Abhinav Dhall
H. Al-Nashash
212
2
0
02 Apr 2024
SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder
Dihan Zheng
Yihang Zou
Xiaowen Zhang
Chenglong Bao
DiffM
261
4
0
26 Mar 2024
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
146
1
0
25 Mar 2024
A survey of synthetic data augmentation methods in computer vision
Machine Intelligence Research (MIR), 2024
A. Mumuni
F. Mumuni
N. K. Gerrar
315
70
0
15 Mar 2024
Motifs, Phrases, and Beyond: The Modelling of Structure in Symbolic Music Generation
Keshav Bhandari
Simon Colton
186
13
0
12 Mar 2024
Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution
European Conference on Computer Vision (ECCV), 2024
Junxiong Lin
Yan Wang
Zeng Tao
Boyang Wang
Qing Zhao
...
Yuxuan Lin
Wei Song
Xuan Tong
Shaoqi Yan
Wenqiang Zhang
202
5
0
09 Mar 2024
Gradient-free neural topology optimization
Computational Mechanics (CM), 2024
Gaweł Kuś
Miguel A. Bessa
AI4CE
172
3
0
07 Mar 2024
Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding
Guangyi Liu
Yu Wang
Zeyu Feng
Qiyu Wu
Liping Tang
...
Shuguang Cui
Julian McAuley
Zichao Yang
Eric P. Xing
Zhiting Hu
DiffM
299
7
0
29 Feb 2024
Generative AI for Secure Physical Layer Communications: A Survey
Changyuan Zhao
Hongyang Du
Dusit Niyato
Jiawen Kang
Zehui Xiong
Dong In Kim
Xuemin
X. Shen
K. B. Letaief
178
48
0
21 Feb 2024
Data-Free Generalized Zero-Shot Learning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Bowen Tang
Long Yan
Jing Zhang
Qian Yu
Lu Sheng
Dong Xu
VLM
165
15
0
28 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
296
32
0
25 Jan 2024
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive
International Conference on Learning Representations (ICLR), 2024
Yumeng Li
Margret Keuper
Dan Zhang
Anna Khoreva
DiffM
257
18
0
16 Jan 2024
A Physics-informed machine learning model for time-dependent wave runup prediction
Ocean Engineering (Ocean Eng.), 2024
Saeed Saviz Naeini
Reda Snaiki
AI4CE
98
13
0
12 Jan 2024
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
416
14
0
31 Dec 2023
SEER-ZSL: Semantic Encoder-Enhanced Representations for Generalized Zero-Shot Learning
William Heyden
Habib Ullah
M. Salman Siddiqui
Fadi Al Machot
VLM
230
1
0
20 Dec 2023
Balanced Marginal and Joint Distributional Learning via Mixture Cramer-Wold Distance
SeungHwan An
Sungchul Hong
Jong-June Jeon
200
0
0
06 Dec 2023
ELF: Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis
Jungil Kong
Junmo Lee
Jeongmin Kim
Beomjeong Kim
Jihoon Park
Dohee Kong
Changheon Lee
Sangjin Kim
270
3
0
20 Nov 2023
TURBO: The Swiss Knife of Auto-Encoders
Entropy (Entropy), 2023
Guillaume Quétant
Yury Belousov
Vitaliy Kinakh
Svyatoslav Voloshynovskiy
114
6
0
11 Nov 2023
Proceedings of the 5th International Workshop on Reading Music Systems
Jorge Calvo-Zaragoza
Alexander Pacha
Elona Shatri
107
0
0
07 Nov 2023
inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE
Tawin Jiramahapokee
206
2
0
03 Nov 2023
Medical Image Segmentation with Domain Adaptation: A Survey
Yuemeng Li
Yong-Xian Fan
OOD
290
3
0
03 Nov 2023
Monotone Generative Modeling via a Gromov-Monge Embedding
SIAM Journal on Mathematics of Data Science (SIMODS), 2023
Wonjun Lee
Yifei Yang
Dongmian Zou
Gilad Lerman
GAN
333
2
0
02 Nov 2023
Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN
Neeraj Kumar
Ankur Narang
Brejesh Lall
DiffM
146
0
0
27 Oct 2023
Adversarial Anomaly Detection using Gaussian Priors and Nonlinear Anomaly Scores
Fiete Lüer
Tobias Weber
Maxim Dolgich
Christian Böhm
133
1
0
27 Oct 2023
Vec-Tok Speech: speech vectorization and tokenization for neural speech generation
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2023
Xinfa Zhu
Yuanjun Lv
Yinjiao Lei
Tao Li
Wendi He
Hongbin Zhou
Heng Lu
Lei Xie
345
29
0
11 Oct 2023