Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2111.03481
Cited By
v1
v2 (latest)
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
5 November 2021
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers"
18 / 18 papers shown
TART: Token-based Architecture Transformer for Neural Network Performance Prediction
Yannis Y. He
293
0
0
02 Jan 2025
Denoising with a Joint-Embedding Predictive Architecture
International Conference on Learning Representations (ICLR), 2024
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
626
6
0
02 Oct 2024
Learning Trimodal Relation for AVQA with Missing Modality
Kyu Ri Park
Hong Joo Lee
Jung Uk Kim
262
5
0
23 Jul 2024
Can SAM Boost Video Super-Resolution?
Zhihe Lu
Zeyu Xiao
Jiawang Bai
Zhiwei Xiong
Xinchao Wang
396
35
0
11 May 2023
Transformer-based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
S. Dubey
Satish Kumar Singh
ViT
319
70
0
17 Feb 2023
Learning Spatiotemporal Frequency-Transformer for Low-Quality Video Super-Resolution
Zhongwei Qiu
Huan Yang
Jianlong Fu
Daochang Liu
Chang Xu
Dongmei Fu
AI4TS
SupR
186
1
0
27 Dec 2022
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Ludan Ruan
Yi Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
B. Guo
DiffM
VGen
519
279
0
19 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
IEEE International Conference on Computer Vision (ICCV), 2022
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
438
290
0
15 Dec 2022
BiViT: Extremely Compressed Binary Vision Transformer
IEEE International Conference on Computer Vision (ICCV), 2022
Yefei He
Zhenyu Lou
Luoming Zhang
Jing Liu
Weijia Wu
Hong Zhou
Bohan Zhuang
ViT
MQ
339
44
0
14 Nov 2022
Fine-Grained Image Style Transfer with Visual Transformers
Asian Conference on Computer Vision (ACCV), 2022
Jianbo Wang
Huan Yang
Jianlong Fu
T. Yamasaki
B. Guo
ViT
266
18
0
11 Oct 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
ACM Multimedia (ACM MM), 2022
Yi Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
DiffM
MLLM
295
12
0
07 Sep 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
210
20
0
29 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
ACM Multimedia (ACM MM), 2022
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
227
31
0
11 Aug 2022
Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
European Conference on Computer Vision (ECCV), 2022
Zhongwei Qiu
Huan Yang
Jianlong Fu
Dongmei Fu
SupR
219
60
0
05 Aug 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Neural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
838
571
0
02 Jun 2022
Learning Trajectory-Aware Transformer for Video Super-Resolution
Computer Vision and Pattern Recognition (CVPR), 2022
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
ViT
384
110
0
08 Apr 2022
ITTR: Unpaired Image-to-Image Translation with Transformers
Wanfeng Zheng
Qiang Li
Guoxin Zhang
Pengfei Wan
Zhong-ming Wang
ViT
268
26
0
30 Mar 2022
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
Computer Vision and Pattern Recognition (CVPR), 2021
Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
B. Guo
AI4TS
VLM
312
261
0
19 Nov 2021
1
Page 1 of 1