Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1901.10430
Cited By
v1
v2 (latest)
Pay Less Attention with Lightweight and Dynamic Convolutions
International Conference on Learning Representations (ICLR), 2019
29 January 2019
Felix Wu
Angela Fan
Alexei Baevski
Yann N. Dauphin
Michael Auli
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Pay Less Attention with Lightweight and Dynamic Convolutions"
50 / 337 papers shown
Title
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
186
72
0
26 Mar 2023
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Bang-ju Yang
Fenglin Liu
Yuexian Zou
Xian Wu
Yaowei Wang
David Clifton
174
12
0
11 Mar 2023
One Neuron Saved Is One Neuron Earned: On Parametric Efficiency of Quadratic Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fenglei Fan
Hangcheng Dong
Zhongming Wu
Lecheng Ruan
T. Zeng
Yiming Cui
Jing-Xiao Liao
156
11
0
11 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
303
583
0
09 Mar 2023
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning
Computer Vision and Image Understanding (CVIU), 2023
Jingqiang Chen
259
7
0
04 Feb 2023
Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation
Xiang Lin
Prathyusha Jwalapuram
Shafiq Joty
DiffM
144
0
0
31 Jan 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Vilém Zouhar
Shehzaad Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
314
12
0
21 Jan 2023
A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Zinuo Li
Xuhang Chen
Shuqiang Wang
Chi-Man Pun
265
33
0
21 Jan 2023
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Chao-Han Huck Yang
Yue Liu
Yu Zhang
Nanxin Chen
Rohit Prabhavalkar
Tara N. Sainath
Trevor Strohman
178
31
0
19 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context
International Conference on Database Systems for Advanced Applications (DASFAA), 2023
Jian Yang
Yuwei Yin
Shuming Ma
Liqun Yang
Hongcheng Guo
Haoyang Huang
Dongdong Zhang
Yutao Zeng
Zhoujun Li
Furu Wei
173
7
0
17 Jan 2023
Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions
Ido Ben-Shaul
Tomer Galanti
S. Dekel
216
3
0
11 Jan 2023
DynInt: Dynamic Interaction Modeling for Large-scale Click-Through Rate Prediction
Yachen Yan
Liubo Li
175
0
0
03 Jan 2023
Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal
Ronglai Zuo
Brian Mak
SLR
282
27
0
26 Dec 2022
Findings of the WMT 2022 Shared Task on Translation Suggestion
Conference on Machine Translation (WMT), 2022
Zhen Yang
Fandong Meng
Yingxue Zhang
Ernan Li
Jie Zhou
LRM
105
2
0
30 Nov 2022
Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Fenglin Liu
Xian Wu
Chenyu You
Shen Ge
Yuexian Zou
Xu Sun
205
27
0
22 Nov 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Conference on Machine Translation (WMT), 2022
Lekan Raheem
Maab Elrashid
132
1
0
31 Oct 2022
OTSeq2Set: An Optimal Transport Enhanced Sequence-to-Set Model for Extreme Multi-label Text Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jie Cao
Yin Zhang
VLM
199
5
0
26 Oct 2022
MetaFormer Baselines for Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
204
254
0
24 Oct 2022
Discriminatory and orthogonal feature learning for noise robust keyword spotting
IEEE Signal Processing Letters (SPL), 2022
Donghyeon Kim
Kyungdeuk Ko
D. Han
Hanseok Ko
121
5
0
20 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Spoken Language Technology Workshop (SLT), 2022
Zhehuai Chen
Ankur Bapna
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Pedro J. Moreno
Nanxin Chen
202
17
0
18 Oct 2022
LSG Attention: Extrapolation of pretrained Transformers to long sequences
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022
Charles Condevaux
S. Harispe
164
29
0
13 Oct 2022
Improved Data Augmentation for Translation Suggestion
Conference on Machine Translation (WMT), 2022
Hongxiao Zhang
Siyu Lai
Songming Zhang
Hui Huang
Jinan Xu
Jinan Xu
Jian Liu
127
1
0
12 Oct 2022
Mixture of Attention Heads: Selecting Attention Heads Per Token
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Xiaofeng Zhang
Songlin Yang
Zeyu Huang
Jie Zhou
Wenge Rong
Zhang Xiong
MoE
598
65
0
11 Oct 2022
E-Branchformer: Branchformer with Enhanced merging for speech recognition
Spoken Language Technology Workshop (SLT), 2022
Kwangyoun Kim
Felix Wu
Yifan Peng
Jing Pan
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
367
155
0
30 Sep 2022
Transformer-based Models to Deal with Heterogeneous Environments in Human Activity Recognition
Personal and Ubiquitous Computing (PUC), 2022
Sannara Ek
François Portet
P. Lalanda
243
32
0
22 Sep 2022
Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
AAAI Conference on Artificial Intelligence (AAAI), 2022
K. Nguyen
Ali Furkan Biten
Andrés Mafla
Lluís Gómez
Dimosthenis Karatzas
161
15
0
21 Sep 2022
Dynamic Graph Message Passing Networks for Visual Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Li Zhang
Mohan Chen
Anurag Arnab
Xiangyang Xue
Juil Sock
GNN
143
1
0
20 Sep 2022
Relaxed Attention for Transformer Models
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
155
13
0
20 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
190
1
0
08 Sep 2022
Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization
International Conference on Computational Linguistics (COLING), 2022
Seungone Kim
Se June Joo
Hyungjoo Chae
Chaehyeong Kim
Seung-won Hwang
Jinyoung Yeo
135
25
0
02 Sep 2022
Real-time 3D Single Object Tracking with Transformer
IEEE transactions on multimedia (IEEE TMM), 2022
Jiayao Shan
Sifan Zhou
Yubo Cui
Zheng Fang
ViT
189
61
0
02 Sep 2022
Modeling Spatial Trajectories using Coarse-Grained Smartphone Logs
IEEE Transactions on Big Data (TBD), 2022
Vinayak Gupta
Srikanta J. Bedathur
163
4
0
29 Aug 2022
PointConvFormer: Revenge of the Point-based Convolution
Computer Vision and Pattern Recognition (CVPR), 2022
Wenxuan Wu
Li Fuxin
Qi Shan
3DPC
320
36
0
04 Aug 2022
Mitigating Biases in Student Performance Prediction via Attention-Based Personalized Federated Learning
International Conference on Information and Knowledge Management (CIKM), 2022
Yun-Wei Chu
Seyyedali Hosseinalipour
Elizabeth Tenorio
Laura Cruz
K. Douglas
Andrew Lan
Christopher G. Brinton
FedML
AI4Ed
185
29
0
02 Aug 2022
giMLPs: Gate with Inhibition Mechanism in MLPs
Cheng Kang
Jindich Prokop
Lei Tong
Huiyu Zhou
Yong Hu
Daneil Novak
133
0
0
01 Aug 2022
Neural Knowledge Bank for Pretrained Transformers
Natural Language Processing and Chinese Computing (NLPCC), 2022
Damai Dai
Wen-Jie Jiang
Qingxiu Dong
Yajuan Lyu
Qiaoqiao She
Zhifang Sui
KELM
242
22
0
31 Jul 2022
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jian Yang
Yuwei Yin
Liqun Yang
Shuming Ma
Haoyang Huang
Dongdong Zhang
Furu Wei
Zhoujun Li
AI4CE
194
22
0
29 Jul 2022
Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yi Tay
Mostafa Dehghani
Samira Abnar
Hyung Won Chung
W. Fedus
J. Rao
Sharan Narang
Vinh Q. Tran
Dani Yogatama
Donald Metzler
AI4CE
221
120
0
21 Jul 2022
SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer
European Conference on Computer Vision (ECCV), 2022
Hao Zhou
Yun Cao
Wenqing Chu
Junwei Zhu
Tong Lu
Ying Tai
Chengjie Wang
3DPC
ViT
195
167
0
21 Jul 2022
Forming Trees with Treeformers
Recent Advances in Natural Language Processing (RANLP), 2022
Nilay Patel
Jeffrey Flanigan
AI4CE
263
4
0
14 Jul 2022
Attention and Self-Attention in Random Forests
Progress in Artificial Intelligence (PAI), 2022
Lev V. Utkin
A. Konstantinov
164
9
0
09 Jul 2022
Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution
IEEE transactions on multimedia (IEEE TMM), 2022
Wenjie Li
Juncheng Li
Guangwei Gao
Jiantao Zhou
Jian Yang
Guo-Jun Qi
SupR
215
55
0
06 Jul 2022
Wav2Vec-Aug: Improved self-supervised training with limited data
Interspeech (Interspeech), 2022
Anuroop Sriram
Michael Auli
Alexei Baevski
SSL
VLM
165
16
0
27 Jun 2022
Learning Multiscale Transformer Models for Sequence Generation
International Conference on Machine Learning (ICML), 2022
Bei Li
Tong Zheng
Yi Jing
Chengbo Jiao
Tong Xiao
Jingbo Zhu
177
13
0
19 Jun 2022
AGConv: Adaptive Graph Convolution on 3D Point Clouds
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Mingqiang Wei
Zeyong Wei
Hao Zhou
Fei-Jiang Hu
Huajian Si
...
Jingbo Qiu
Xu Yan
Yan Guo
Jun Wang
J. Qin
3DPC
196
65
0
09 Jun 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Neural Information Processing Systems (NeurIPS), 2022
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
791
3,235
0
27 May 2022
A Template-based Method for Constrained Neural Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Shuo Wang
Peng Li
Zhixing Tan
Zhaopeng Tu
Maosong Sun
Yang Liu
BDL
103
4
0
23 May 2022
VNT-Net: Rotational Invariant Vector Neuron Transformers
Hedi Zisling
Andrei Sharf
3DPC
163
1
0
19 May 2022
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
International Conference on Machine Learning (ICML), 2022
Haoran You
Baopu Li
Huihong Shi
Y. Fu
Yingyan Lin
317
18
0
17 May 2022
Efficient dynamic filter for robust and low computational feature extraction
Spoken Language Technology Workshop (SLT), 2022
Donghyeon Kim
Gwantae Kim
Bokyeung Lee
Jeong-gi Kwak
D. Han
Hanseok Ko
147
3
0
03 May 2022
Previous
1
2
3
4
5
6
7
Next