2103.03206
Cited By
v1
v2 (latest)
Perceiver: General Perception with Iterative Attention
International Conference on Machine Learning (ICML), 2021
4 March 2021
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
Papers citing
"Perceiver: General Perception with Iterative Attention"
50 / 783 papers shown
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
International Conference on Learning Representations (ICLR), 2022
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
239
13
0
24 Feb 2022
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models
International Conference on Learning Representations (ICLR), 2022
Spyridon Mouselinos
Henryk Michalewski
Mateusz Malinowski
230
4
0
24 Feb 2022
Learning to Merge Tokens in Vision Transformers
Cédric Renggli
André Susano Pinto
N. Houlsby
Basil Mustafa
J. Puigcerver
C. Riquelme
MoMe
177
78
0
24 Feb 2022
Better Modelling Out-of-Distribution Regression on Distributed Acoustic Sensor Data Using Anchored Hidden State Mixup
IEEE Transactions on Industrial Informatics (IEEE TII), 2022
Hasan Asy’ari Arief
P. J. Thomas
T. Wiktorski
OOD
95
6
0
23 Feb 2022
HiP: Hierarchical Perceiver
João Carreira
Skanda Koppula
Daniel Zoran
Adrià Recasens
Catalin Ionescu
...
M. Botvinick
Oriol Vinyals
Karen Simonyan
Andrew Zisserman
Andrew Jaegle
VLM
307
14
0
22 Feb 2022
Transformer Quality in Linear Time
International Conference on Machine Learning (ICML), 2022
Weizhe Hua
Zihang Dai
Hanxiao Liu
Quoc V. Le
389
289
0
21 Feb 2022
General-purpose, long-context autoregressive modeling with Perceiver AR
International Conference on Machine Learning (ICML), 2022
Curtis Hawthorne
Andrew Jaegle
Cătălina Cangea
Sebastian Borgeaud
C. Nash
...
Hannah R. Sheahan
Neil Zeghidour
Jean-Baptiste Alayrac
João Carreira
Jesse Engel
196
74
0
15 Feb 2022
SpeechPainter: Text-conditioned Speech Inpainting
Interspeech, 2022
Zalan Borsos
Matthew Sharifi
Marco Tagliasacchi
166
34
0
15 Feb 2022
Benchmarking Online Sequence-to-Sequence and Character-based Handwriting Recognition from IMU-Enhanced Pens
International Journal on Document Analysis and Recognition (IJDAR), 2022
Felix Ott
David Rügamer
Lucas Heublein
Tim Hamann
Jens Barth
B. Bischl
Christopher Mutschler
326
20
0
14 Feb 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
International Conference on Machine Learning (ICML), 2022
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSL
VLM
ViT
416
1,014
0
07 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
International Conference on Machine Learning (ICML), 2022
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
426
992
0
07 Feb 2022
Webly Supervised Concept Expansion for General Purpose Vision Models
European Conference on Computer Vision (ECCV), 2022
Amita Kamath
Christopher Clark
Tanmay Gupta
Eric Kolve
Derek Hoiem
Aniruddha Kembhavi
VLM
248
65
0
04 Feb 2022
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation
Yi-Fan Zhang
Hanlin Zhang
Zachary Chase Lipton
Li Erran Li
Eric P. Xing
OODD
315
33
0
02 Feb 2022
Learning Super-Features for Image Retrieval
International Conference on Learning Representations (ICLR), 2022
Philippe Weinzaepfel
Thomas Lucas
Diane Larlus
Yannis Kalantidis
SupR
VLM
185
55
0
31 Jan 2022
Deep Learning Methods for Abstract Visual Reasoning: A Survey on Raven's Progressive Matrices
ACM Computing Surveys (ACM CSUR), 2022
Mikołaj Małkiński
Jacek Mańdziuk
406
52
0
28 Jan 2022
From data to functa: Your data point is a function and you can treat it like one
International Conference on Machine Learning (ICML), 2022
Emilien Dupont
Hyunjik Kim
S. M. Ali Eslami
Danilo Jimenez Rezende
Dan Rosenbaum
TDI
3DPC
518
182
0
28 Jan 2022
Density-Aware Hyper-Graph Neural Networks for Graph-based Semi-supervised Node Classification
Jianpeng Liao
Qian Tao
Jun Yan
GNN
132
3
0
27 Jan 2022
Omnivore: A Single Model for Many Visual Modalities
Computer Vision and Pattern Recognition (CVPR), 2022
Rohit Girdhar
Mannat Singh
Nikhil Ravi
Laurens van der Maaten
Armand Joulin
Ishan Misra
485
283
0
20 Jan 2022
Video Transformers: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
374
132
0
16 Jan 2022
Latency Adjustable Transformer Encoder for Language Understanding
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Sajjad Kachuee
M. Sharifkhani
412
1
0
10 Jan 2022
Vision Transformer with Deformable Attention
Computer Vision and Pattern Recognition (CVPR), 2022
Zhuofan Xia
Xuran Pan
Qing Xiao
Li Erran Li
Gao Huang
ViT
370
655
0
03 Jan 2022
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain
Anukriti Singh
Nikita Orlov
Zilong Huang
Jiachen Li
Steven Walton
Humphrey Shi
ViT
241
117
0
23 Dec 2021
Learned Queries for Efficient Local Attention
Computer Vision and Pattern Recognition (CVPR), 2022
Moab Arar
Ariel Shamir
Amit H. Bermano
ViT
205
35
0
21 Dec 2021
High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
1.3K
20,430
0
20 Dec 2021
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds
Ayush Jain
N. Gkanatsios
Ishita Mediratta
Katerina Fragkiadaki
ObjD
389
144
0
16 Dec 2021
Audio-Visual Synchronisation in the wild
Honglie Chen
Weidi Xie
Triantafyllos Afouras
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
179
49
0
08 Dec 2021
Input-level Inductive Biases for 3D Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2022
Yifan Wang
Carl Doersch
Relja Arandjelović
João Carreira
Andrew Zisserman
3DV
324
31
0
06 Dec 2021
Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation
Xiang Li
Jinglu Wang
Xiao Li
Yan Lu
161
19
0
03 Dec 2021
Efficient Self-Ensemble for Semantic Segmentation
British Machine Vision Conference (BMVC), 2021
Walid Bousselham
Guillaume Thibault
Lucas Pagano
Archana Machireddy
Joe W. Gray
Y. Chang
Xubo B. Song
ViT
248
32
0
26 Nov 2021
PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Valerii Likhosherstov
Anurag Arnab
K. Choromanski
Mario Lucic
Yi Tay
Adrian Weller
Mostafa Dehghani
ViT
176
81
0
25 Nov 2021
Conditional Object-Centric Learning from Video
Thomas Kipf
Gamaleldin F. Elsayed
Aravindh Mahendran
Austin Stone
S. Sabour
G. Heigold
Rico Jonschkowski
Alexey Dosovitskiy
Klaus Greff
OCL
303
252
0
24 Nov 2021
Sparse Fusion for Multimodal Transformers
Yi Ding
Alex Rich
Mason Wang
Noah Stier
M. Turk
P. Sen
Tobias Höllerer
ViT
152
9
0
23 Nov 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
320
0
0
22 Nov 2021
Rethinking Query, Key, and Value Embedding in Vision Transformer under Tiny Model Constraints
Jaesin Ahn
Jiuk Hong
Jeongwoo Ju
Heechul Jung
ViT
167
3
0
19 Nov 2021
Edge-Native Intelligence for 6G Communications Driven by Federated Learning: A Survey of Trends and Challenges
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021
Mohammad M. Al-Quraan
Lina S. Mohjazi
Lina Bariah
A. Centeno
A. Zoha
Sami Muhaidat
Mérouane Debbah
Muhammad Ali Imran
156
90
0
14 Nov 2021
Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled Attention
British Machine Vision Conference (BMVC), 2021
S. Tan
Runpei Dong
Kaisheng Ma
291
2
0
03 Nov 2021
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition
British Machine Vision Conference (BMVC), 2021
Evangelos Kazakos
Jaesung Huh
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
256
52
0
01 Nov 2021
Hyper-Representations: Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction
Konstantin Schurholt
Dimche Kostadinov
Damian Borth
SSL
384
15
0
28 Oct 2021
SOFT: Softmax-free Transformer with Linear Complexity
Neural Information Processing Systems (NeurIPS), 2021
Jiachen Lu
Jinghan Yao
Junge Zhang
Martin Danelljan
Hang Xu
Weiguo Gao
Chunjing Xu
Thomas B. Schon
Li Zhang
213
187
0
22 Oct 2021
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Cyril Zhang
262
145
0
19 Oct 2021
BERMo: What can BERT learn from ELMo?
Sangamesh Kodge
Kaushik Roy
146
4
0
18 Oct 2021
EncT5: A Framework for Fine-tuning T5 as Non-autoregressive Models
Frederick Liu
T. Huang
Shihang Lyu
Siamak Shakeri
Hongkun Yu
Jing Li
240
10
0
16 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
804
1,421
0
13 Oct 2021
Two-argument activation functions learn soft XOR operations like cortical neurons
Kijung Yoon
Emin Orhan
Juhyeon Kim
Xaq Pitkow
MLT
177
0
0
13 Oct 2021
Dynamic Inference with Neural Interpreters
Nasim Rahaman
Muhammad Waleed Gondal
S. Joshi
Peter V. Gehler
Yoshua Bengio
Francesco Locatello
Bernhard Schölkopf
235
31
0
12 Oct 2021
Efficient Training of Audio Transformers with Patchout
Interspeech, 2021
Khaled Koutini
Jan Schluter
Hamid Eghbalzadeh
Gerhard Widmer
ViT
466
340
0
11 Oct 2021
Recurrent Attention Models with Object-centric Capsule Representation for Multi-object Recognition
Hossein Adeli
Seoyoung Ahn
G. Zelinsky
OCL
171
3
0
11 Oct 2021
Cross-lingual Transfer of Monolingual Models
Evangelia Gogoulou
Ariel Ekgren
T. Isbister
Magnus Sahlgren
213
20
0
15 Sep 2021
Patch-based Medical Image Segmentation using Matrix Product State Tensor Networks
Raghavendra Selvan
Erik Dam
Soren Alexander Flensborg
Jens Petersen
MedIm
235
2
0
15 Sep 2021
The Sensory Neuron as a Transformer: Permutation-Invariant Neural Networks for Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Yujin Tang
David R Ha
240
82
0
07 Sep 2021