Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

5 November 2021
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
    ViT
arXiv: 2111.03481

Papers citing "Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers"

18 / 18 papers shown
TART: Token-based Architecture Transformer for Neural Network Performance Prediction
Yannis Y. He
293
0
0
02 Jan 2025
Denoising with a Joint-Embedding Predictive Architecture
International Conference on Learning Representations (ICLR), 2024
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
626
6
0
02 Oct 2024
Learning Trimodal Relation for AVQA with Missing Modality
Kyu Ri Park
Hong Joo Lee
Jung Uk Kim
262
5
0
23 Jul 2024
Can SAM Boost Video Super-Resolution?
Zhihe Lu
Zeyu Xiao
Jiawang Bai
Zhiwei Xiong
Xinchao Wang
396
35
0
11 May 2023
Transformer-based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
S. Dubey
Satish Kumar Singh
ViT
319
70
0
17 Feb 2023
Learning Spatiotemporal Frequency-Transformer for Low-Quality Video Super-Resolution
Zhongwei Qiu
Huan Yang
Jianlong Fu
Daochang Liu
Chang Xu
Dongmei Fu
AI4TS, SupR
186
1
0
27 Dec 2022
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Ludan Ruan
Yi Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
B. Guo
DiffM, VGen
519
279
0
19 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
IEEE International Conference on Computer Vision (ICCV), 2022
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
438
290
0
15 Dec 2022
BiViT: Extremely Compressed Binary Vision Transformer
IEEE International Conference on Computer Vision (ICCV), 2022
Yefei He
Zhenyu Lou
Luoming Zhang
Jing Liu
Weijia Wu
Hong Zhou
Bohan Zhuang
ViT, MQ
339
44
0
14 Nov 2022
Fine-Grained Image Style Transfer with Visual Transformers
Asian Conference on Computer Vision (ACCV), 2022
Jianbo Wang
Huan Yang
Jianlong Fu
T. Yamasaki
B. Guo
ViT
266
18
0
11 Oct 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
ACM Multimedia (ACM MM), 2022
Yi Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
DiffM, MLLM
295
12
0
07 Sep 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
210
20
0
29 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
ACM Multimedia (ACM MM), 2022
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
227
31
0
11 Aug 2022
Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
European Conference on Computer Vision (ECCV), 2022
Zhongwei Qiu
Huan Yang
Jianlong Fu
Dongmei Fu
SupR
219
60
0
05 Aug 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Neural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
838
571
0
02 Jun 2022
Learning Trajectory-Aware Transformer for Video Super-Resolution
Computer Vision and Pattern Recognition (CVPR), 2022
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
ViT
384
110
0
08 Apr 2022
ITTR: Unpaired Image-to-Image Translation with Transformers
Wanfeng Zheng
Qiang Li
Guoxin Zhang
Pengfei Wan
Zhong-ming Wang
ViT
268
26
0
30 Mar 2022
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
Computer Vision and Pattern Recognition (CVPR), 2021
Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
B. Guo
AI4TS, VLM
312
261
0
19 Nov 2021