Autoencoding beyond pixels using a learned similarity metric

31 December 2015
Anders Boesen Lindbo Larsen
Søren Kaae Sønderby
Hugo Larochelle
Ole Winther
    GAN

Papers citing "Autoencoding beyond pixels using a learned similarity metric"

50 / 931 papers shown
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
ACM Multimedia (ACM MM), 2022
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
206
16
0
01 Sep 2022
Transformers are Sample-Efficient World Models
International Conference on Learning Representations (ICLR), 2022
Vincent Micheli
Eloi Alonso
François Fleuret
VLM, OffRL
479
256
0
01 Sep 2022
Music Separation Enhancement with Generative Modeling
International Society for Music Information Retrieval Conference (ISMIR), 2022
N. Schaffer
Boaz Cogan
Ethan Manilow
Max Morrison
Prem Seetharaman
Bryan Pardo
209
11
0
26 Aug 2022
Deep Structural Causal Shape Models
Rajat Rasal
Daniel Coelho De Castro
Nick Pawlowski
Ben Glocker
3DV, MedIm
240
15
0
23 Aug 2022
Towards Label-efficient Automatic Diagnosis and Analysis: A Comprehensive Survey of Advanced Deep Learning-based Weakly-supervised, Semi-supervised and Self-supervised Techniques in Histopathological Image Analysis
Physics in Medicine and Biology (PMB), 2022
Linhao Qu
Siyu Liu
Xiaoyu Liu
Manning Wang
Zhijian Song
206
71
0
18 Aug 2022
Generating Synthetic Clinical Data that Capture Class Imbalanced Distributions with Generative Adversarial Networks: Example using Antiretroviral Therapy for HIV
Journal of Biomedical Informatics (JBI), 2022
N. Kuo
Federico Garcia
Anders Sönnerborg
Maurizio Zazzi
Michael Böhm
Rolf Kaiser
Mark Polizzotto
Louisa R Jorm
S. Barbieri
GAN
304
37
0
18 Aug 2022
Generative Design of Physical Objects using Modular Framework
Engineering Applications of Artificial Intelligence (EAAI), 2022
Nikita O. Starodubcev
Nikolay O. Nikitin
Konstantin G. Gavaza
Elizaveta A. Andronova
D. O. Sidorenko
Anna V. Kaluzhnaya
99
7
0
29 Jul 2022
Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance
European Conference on Computer Vision (ECCV), 2022
Zhihang Zhong
Xiao Sun
Zhirong Wu
Yinqiang Zheng
Stephen Lin
Imari Sato
3DH
164
28
0
20 Jul 2022
Forget-me-not! Contrastive Critics for Mitigating Posterior Collapse
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Sachit Menon
David M. Blei
Carl Vondrick
DRL
297
8
0
19 Jul 2022
Outpainting by Queries
European Conference on Computer Vision (ECCV), 2022
Kai Yao
Penglei Gao
Xi Yang
Kaizhu Huang
Jie Sun
Rui Zhang
ViT
139
19
0
12 Jul 2022
Bottlenecks CLUB: Unifying Information-Theoretic Trade-offs Among Complexity, Leakage, and Utility
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2022
Behrooz Razeghi
Flavio du Pin Calmon
Deniz Gunduz
Svyatoslav Voloshynovskiy
197
19
0
11 Jul 2022
Hierarchical Latent Structure for Multi-Modal Vehicle Trajectory Forecasting
European Conference on Computer Vision (ECCV), 2022
Dooseop Choi
Kyoung‐Wook Min
196
26
0
11 Jul 2022
Identifying and Mitigating Flaws of Deep Perceptual Similarity Metrics
Oskar Sjogren
G. Pihlgren
Fredrik Sandin
Marcus Liwicki
107
4
0
06 Jul 2022
AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE
Asian Conference on Machine Learning (ACML), 2022
Chang-Tien Lu
Shen Zheng
Zirui Wang
O. Dib
Gaurav Gupta
219
3
0
28 Jun 2022
Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang
Rui Zhao
Ziming Zhang
Yang Gao
217
1
0
22 Jun 2022
Latent Variable Modelling Using Variational Autoencoders: A survey
Vasanth Kalingeri
CML, DRL
160
2
0
20 Jun 2022
Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing
Computer Vision and Pattern Recognition (CVPR), 2022
Gaurav Parmar
Yijun Li
Jingwan Lu
Richard Y. Zhang
Jun-Yan Zhu
Krishna Kumar Singh
DiffM
222
53
0
16 Jun 2022
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case
Neural Information Processing Systems (NeurIPS), 2022
Clément Chadebec
Louis J. Vincent
S. Allassonnière
DRL
183
37
0
16 Jun 2022
A Deep Generative Model of Neonatal Cortical Surface Development
Annual Conference on Medical Image Understanding and Analysis (MIUA), 2022
Abdulah Fawaz
Logan Z. J. Williams
A. Edwards
E. C. Robinson
MedIm
166
3
0
15 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
International Conference on Learning Representations (ICLR), 2022
Sang-gil Lee
Ming-Yu Liu
Boris Ginsburg
Bryan Catanzaro
Sung-Hoon Yoon
307
379
0
09 Jun 2022
Multiple Instance Learning for Digital Pathology: A Review on the State-of-the-Art, Limitations & Future Potential
M. Gadermayr
M. Tschuchnig
253
114
0
09 Jun 2022
Learning Digital Terrain Models from Point Clouds: ALS2DTM Dataset and Rasterization-based GAN
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE JSTARS), 2022
Hoàng-Ân Lê
Florent Guiotte
M. Pham
Sébastien Lefèvre
Thomas Corpetti
3DPC
103
10
0
08 Jun 2022
Causality Learning With Wasserstein Generative Adversarial Networks
H. Petkov
Colin Hanley
Feng Dong
CML, GAN, OOD
111
0
0
03 Jun 2022
SolarGAN: Synthetic Annual Solar Irradiance Time Series on Urban Building Facades via Deep Generative Networks
Energy and AI (EA), 2022
Yufei Zhang
A. Schlüter
C. Waibel
AI4CE
130
35
0
01 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
International Conference on Pattern Recognition (ICPR), 2022
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
190
0
0
01 Jun 2022
Text2Human: Text-Driven Controllable Human Image Generation
ACM Transactions on Graphics (TOG), 2022
Yuming Jiang
Shuai Yang
Haonan Qiu
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
287
64
0
31 May 2022
Variational Transfer Learning using Cross-Domain Latent Modulation
Jinyong Hou
Jeremiah D. Deng
Stephen Cranefield
Xuejie Din
DRL
179
1
0
31 May 2022
Exploring the Trade-off between Plausibility, Change Intensity and Adversarial Power in Counterfactual Explanations using Multi-objective Optimization
Javier Del Ser
Alejandro Barredo Arrieta
Natalia Díaz Rodríguez
Francisco Herrera
Andreas Holzinger
AAML
128
4
0
20 May 2022
Diversity vs. Recognizability: Human-like generalization in one-shot generative models
Neural Information Processing Systems (NeurIPS), 2022
Victor Boutin
Lakshya Singhal
Xavier Thomas
Thomas Serre
199
11
0
20 May 2022
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous Driving Tasks
A. Pak
Hemanth Manjunatha
Dimitar Filev
Panagiotis Tsiotras
114
5
0
18 May 2022
A Unified f-divergence Framework Generalizing VAE and GAN
Jaime Roquero Gimenez
James Zou
124
2
0
11 May 2022
Policy Gradient Stock GAN for Realistic Discrete Order Data Generation in Financial Markets
IIAI International Conference on Advanced Applied Informatics (ICIAAI), 2022
Masanori Hirano
Hiroki Sakaji
Kiyoshi Izumi
GAN
136
6
0
28 Apr 2022
Novel Applications for VAE-based Anomaly Detection Systems
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Luca Bergamin
Tommaso Carraro
Mirko Polato
F. Aiolli
DRL
97
11
0
26 Apr 2022
Synthesizing Informative Training Samples with GAN
Bo Zhao
Hakan Bilen
DD
338
92
0
15 Apr 2022
Practical Digital Disguises: Leveraging Face Swaps to Protect Patient Privacy
Ethan Wilson
Frederick Shic
Jenny Skytta
Eakta Jain
PICV
209
8
0
07 Apr 2022
FFC-SE: Fast Fourier Convolution for Speech Enhancement
Interspeech, 2022
Ivan Shchekotov
Pavel Andreev
Oleg Ivanov
Aibek Alanov
Dmitry Vetrov
143
25
0
06 Apr 2022
Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech
Interspeech, 2022
Hyungchan Yoon
Seyun Um
Changwhan Kim
Hong-Goo Kang
152
0
0
05 Apr 2022
Quantized GAN for Complex Music Generation from Dance Videos
European Conference on Computer Vision (ECCV), 2022
Ye Zhu
Kyle Olszewski
Yuehua Wu
Panos Achlioptas
Menglei Chai
Yan Yan
Sergey Tulyakov
MGen
219
56
0
01 Apr 2022
DAG-WGAN: Causal Structure Learning With Wasserstein Generative Adversarial Networks
Embedded Systems and Applications (ESA), 2022
H. Petkov
Colin Hanley
Feng Dong
GAN, OOD, CML
162
7
0
01 Apr 2022
Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation
Spoken Language Technology Workshop (SLT), 2022
Rendi Chevi
Radityo Eko Prasojo
Alham Fikri Aji
Andros Tjandra
S. Sakti
VLM
152
5
0
29 Mar 2022
Semi-Supervised Image-to-Image Translation using Latent Space Mapping
Pan Zhang
Jianmin Bao
Ting Zhang
Dong Chen
Fang Wen
148
1
0
29 Mar 2022
Fusing Global and Local Features for Generalized AI-Synthesized Image Detection
IEEE International Conference on Image Processing (ICIP), 2022
Yan Ju
Shan Jia
Lipeng Ke
Hongfei Xue
Koki Nagano
Siwei Lyu
305
102
0
26 Mar 2022
Efficient-VDVAE: Less is more
Louay Hazami
Rayhane Mama
Ragavan Thurairatnam
BDL
229
29
0
25 Mar 2022
From MIM-Based GAN to Anomaly Detection: Event Probability Influence on Generative Adversarial Networks
IEEE Internet of Things Journal (IEEE IoT J.), 2022
R. She
Pingyi Fan
GAN
123
8
0
25 Mar 2022
IA-FaceS: A Bidirectional Method for Semantic Face Editing
Neural Networks (NN), 2022
Wenjing Huang
Shikui Tu
Lei Xu
CVBM
291
19
0
24 Mar 2022
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Pavel Andreev
Aibek Alanov
Oleg Ivanov
Dmitry Vetrov
366
66
0
24 Mar 2022
Pixel VQ-VAEs for Improved Pixel Art Representation
Akash Saravanan
Matthew J. Guzdial
153
9
0
23 Mar 2022
AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Bac Nguyen
Fabien Cardinaux
Stefan Uhlich
133
4
0
21 Mar 2022
Practical cognitive speech compression
Reza Lotfidereshgi
P. Gournay
181
2
0
08 Mar 2022
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Takuhiro Kaneko
Kou Tanaka
Hirokazu Kameoka
Shogo Seki
180
87
0
04 Mar 2022