Pay Less Attention with Lightweight and Dynamic Convolutions

International Conference on Learning Representations (ICLR), 2019

29 January 2019

Angela Fan

Papers citing "Pay Less Attention with Lightweight and Dynamic Convolutions"

50 / 337 papers shown

You Only Segment Once: Towards Real-Time Panoptic Segmentation. Computer Vision and Pattern Recognition (CVPR), 2023 Jie Hu Linyan Huang Tianhe Ren Shengchuan Zhang Rongrong Ji Liujuan Cao SSeg 186 72 0 26 Mar 2023
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Bang-ju Yang Fenglin Liu Yuexian Zou Xian Wu Yaowei Wang David Clifton 174 12 0 11 Mar 2023
One Neuron Saved Is One Neuron Earned: On Parametric Efficiency of Quadratic Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Fenglei Fan Hangcheng Dong Zhongming Wu Lecheng Ruan T. Zeng Yiming Cui Jing-Xiao Liao 156 11 0 11 Mar 2023
Scaling up GANs for Text-to-Image Synthesis. Computer Vision and Pattern Recognition (CVPR), 2023 Minguk Kang Jun-Yan Zhu Richard Y. Zhang Jaesik Park Eli Shechtman Sylvain Paris Taesung Park 303 583 0 09 Mar 2023
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning. Computer Vision and Image Understanding (CVIU), 2023 Jingqiang Chen 259 7 0 04 Feb 2023
Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation Xiang Lin Prathyusha Jwalapuram Shafiq Joty DiffM 144 0 0 31 Jan 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference. Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023 Vilém Zouhar Shehzaad Dhuliawala Wangchunshu Zhou Nico Daheim Tom Kocmi Yuchen Eleanor Jiang Mrinmaya Sachan 314 12 0 21 Jan 2023
A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement. International Joint Conference on Artificial Intelligence (IJCAI), 2023 Zinuo Li Xuhang Chen Shuqiang Wang Chi-Man Pun 265 33 0 21 Jan 2023
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Chao-Han Huck Yang Yue Liu Yu Zhang Nanxin Chen Rohit Prabhavalkar Tara N. Sainath Trevor Strohman 178 31 0 19 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context. International Conference on Database Systems for Advanced Applications (DASFAA), 2023 Jian Yang Yuwei Yin Shuming Ma Liqun Yang Hongcheng Guo Haoyang Huang Dongdong Zhang Yutao Zeng Zhoujun Li Furu Wei 173 7 0 17 Jan 2023
Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions Ido Ben-Shaul Tomer Galanti S. Dekel 216 3 0 11 Jan 2023
DynInt: Dynamic Interaction Modeling for Large-scale Click-Through Rate Prediction Yachen Yan Liubo Li 175 0 0 03 Jan 2023
Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal Ronglai Zuo Brian Mak SLR 282 27 0 26 Dec 2022
Findings of the WMT 2022 Shared Task on Translation Suggestion. Conference on Machine Translation (WMT), 2022 Zhen Yang Fandong Meng Yingxue Zhang Ernan Li Jie Zhou LRM 105 2 0 30 Nov 2022
Aligning Source Visual and Target Language Domains for Unpaired Video Captioning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Fenglin Liu Xian Wu Chenyu You Shen Ge Yuexian Zou Xu Sun 205 27 0 22 Nov 2022
Domain Curricula for Code-Switched MT at MixMT 2022. Conference on Machine Translation (WMT), 2022 Lekan Raheem Maab Elrashid 132 1 0 31 Oct 2022
OTSeq2Set: An Optimal Transport Enhanced Sequence-to-Set Model for Extreme Multi-label Text Classification. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Jie Cao Yin Zhang VLM 199 5 0 26 Oct 2022
MetaFormer Baselines for Vision. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 Weihao Yu Chenyang Si Pan Zhou Mi Luo Yichen Zhou Jiashi Feng Shuicheng Yan Xinchao Wang MoE 204 254 0 24 Oct 2022
Discriminatory and orthogonal feature learning for noise robust keyword spotting. IEEE Signal Processing Letters (SPL), 2022 Donghyeon Kim Kyungdeuk Ko D. Han Hanseok Ko 121 5 0 20 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR. Spoken Language Technology Workshop (SLT), 2022 Zhehuai Chen Ankur Bapna Andrew Rosenberg Yu Zhang Bhuvana Ramabhadran Pedro J. Moreno Nanxin Chen 202 17 0 18 Oct 2022
LSG Attention: Extrapolation of pretrained Transformers to long sequences. Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022 Charles Condevaux S. Harispe 164 29 0 13 Oct 2022
Improved Data Augmentation for Translation Suggestion. Conference on Machine Translation (WMT), 2022 Hongxiao Zhang Siyu Lai Songming Zhang Hui Huang Jinan Xu Jinan Xu Jian Liu 127 1 0 12 Oct 2022
Mixture of Attention Heads: Selecting Attention Heads Per Token. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Xiaofeng Zhang Songlin Yang Zeyu Huang Jie Zhou Wenge Rong Zhang Xiong MoE 598 65 0 11 Oct 2022
E-Branchformer: Branchformer with Enhanced merging for speech recognition. Spoken Language Technology Workshop (SLT), 2022 Kwangyoun Kim Felix Wu Yifan Peng Jing Pan Prashant Sridhar Kyu Jeong Han Shinji Watanabe 367 155 0 30 Sep 2022
Transformer-based Models to Deal with Heterogeneous Environments in Human Activity Recognition. Personal and Ubiquitous Computing (PUC), 2022 Sannara Ek François Portet P. Lalanda 243 32 0 22 Sep 2022
Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia. AAAI Conference on Artificial Intelligence (AAAI), 2022 K. Nguyen Ali Furkan Biten Andrés Mafla Lluís Gómez Dimosthenis Karatzas 161 15 0 21 Sep 2022
Dynamic Graph Message Passing Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 Li Zhang Mohan Chen Anurag Arnab Xiangyang Xue Juil Sock GNN 143 1 0 20 Sep 2022
Relaxed Attention for Transformer Models. IEEE International Joint Conference on Neural Network (IJCNN), 2022 Timo Lohrenz Björn Möller Zhengyang Li Tim Fingscheidt KELM 155 13 0 20 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation Yile Wang Linyi Yang Zhiyang Teng M. Zhou Yue Zhang GNN 190 1 0 08 Sep 2022
Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization. International Conference on Computational Linguistics (COLING), 2022 Seungone Kim Se June Joo Hyungjoo Chae Chaehyeong Kim Seung-won Hwang Jinyoung Yeo 135 25 0 02 Sep 2022
Real-time 3D Single Object Tracking with Transformer. IEEE transactions on multimedia (IEEE TMM), 2022 Jiayao Shan Sifan Zhou Yubo Cui Zheng Fang ViT 189 61 0 02 Sep 2022
Modeling Spatial Trajectories using Coarse-Grained Smartphone Logs. IEEE Transactions on Big Data (TBD), 2022 Vinayak Gupta Srikanta J. Bedathur 163 4 0 29 Aug 2022
PointConvFormer: Revenge of the Point-based Convolution. Computer Vision and Pattern Recognition (CVPR), 2022 Wenxuan Wu Li Fuxin Qi Shan 3DPC 320 36 0 04 Aug 2022
Mitigating Biases in Student Performance Prediction via Attention-Based Personalized Federated Learning. International Conference on Information and Knowledge Management (CIKM), 2022 Yun-Wei Chu Seyyedali Hosseinalipour Elizabeth Tenorio Laura Cruz K. Douglas Andrew Lan Christopher G. Brinton FedML AI4Ed 185 29 0 02 Aug 2022
giMLPs: Gate with Inhibition Mechanism in MLPs Cheng Kang Jindich Prokop Lei Tong Huiyu Zhou Yong Hu Daneil Novak 133 0 0 01 Aug 2022
Neural Knowledge Bank for Pretrained Transformers. Natural Language Processing and Chinese Computing (NLPCC), 2022 Damai Dai Wen-Jie Jiang Qingxiu Dong Yajuan Lyu Qiaoqiao She Zhifang Sui KELM 242 22 0 31 Jul 2022
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation. IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022 Jian Yang Yuwei Yin Liqun Yang Shuming Ma Haoyang Huang Dongdong Zhang Furu Wei Zhoujun Li AI4CE 194 22 0 29 Jul 2022
Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling? Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Yi Tay Mostafa Dehghani Samira Abnar Hyung Won Chung W. Fedus J. Rao Sharan Narang Vinh Q. Tran Dani Yogatama Donald Metzler AI4CE 221 120 0 21 Jul 2022
SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer. European Conference on Computer Vision (ECCV), 2022 Hao Zhou Yun Cao Wenqing Chu Junwei Zhu Tong Lu Ying Tai Chengjie Wang 3DPC ViT 195 167 0 21 Jul 2022
Forming Trees with Treeformers. Recent Advances in Natural Language Processing (RANLP), 2022 Nilay Patel Jeffrey Flanigan AI4CE 263 4 0 14 Jul 2022
Attention and Self-Attention in Random Forests. Progress in Artificial Intelligence (PAI), 2022 Lev V. Utkin A. Konstantinov 164 9 0 09 Jul 2022
Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution. IEEE transactions on multimedia (IEEE TMM), 2022 Wenjie Li Juncheng Li Guangwei Gao Jiantao Zhou Jian Yang Guo-Jun Qi SupR 215 55 0 06 Jul 2022
Wav2Vec-Aug: Improved self-supervised training with limited data. Interspeech (Interspeech), 2022 Anuroop Sriram Michael Auli Alexei Baevski SSL VLM 165 16 0 27 Jun 2022
Learning Multiscale Transformer Models for Sequence Generation. International Conference on Machine Learning (ICML), 2022 Bei Li Tong Zheng Yi Jing Chengbo Jiao Tong Xiao Jingbo Zhu 177 13 0 19 Jun 2022
AGConv: Adaptive Graph Convolution on 3D Point Clouds. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 Mingqiang Wei Zeyong Wei Hao Zhou Fei-Jiang Hu Huajian Si ... Jingbo Qiu Xu Yan Yan Guo Jun Wang J. Qin 3DPC 196 65 0 09 Jun 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness. Neural Information Processing Systems (NeurIPS), 2022 Tri Dao Daniel Y. Fu Stefano Ermon Atri Rudra Christopher Ré VLM 791 3,235 0 27 May 2022
A Template-based Method for Constrained Neural Machine Translation. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Shuo Wang Peng Li Zhixing Tan Zhaopeng Tu Maosong Sun Yang Liu BDL 103 4 0 23 May 2022
VNT-Net: Rotational Invariant Vector Neuron Transformers Hedi Zisling Andrei Sharf 3DPC 163 1 0 19 May 2022
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks. International Conference on Machine Learning (ICML), 2022 Haoran You Baopu Li Huihong Shi Y. Fu Yingyan Lin 317 18 0 17 May 2022
Efficient dynamic filter for robust and low computational feature extraction. Spoken Language Technology Workshop (SLT), 2022 Donghyeon Kim Gwantae Kim Bokyeung Lee Jeong-gi Kwak D. Han Hanseok Ko 147 3 0 03 May 2022