DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering

Annual Meeting of the Association for Computational Linguistics (ACL), 2020
2 May 2020
Qingqing Cao, Harsh Trivedi, Aruna Balasubramanian, Niranjan Balasubramanian
arXiv (abs) · PDF · HTML · GitHub (120★)

Papers citing "DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering"

Showing all 27 papers.
Enhancing Speech Emotion Recognition with Multi-Task Learning and Dynamic Feature Fusion
Honghong Wang, Jing Deng, Fanqin Meng, Rong Zheng
25 Aug 2025

Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval
Jonghyun Song, Cheyon Jin, Wenlong Zhao, Jay Yoon Lee
21 May 2024

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Weidong Chen, Xiaofen Xing, Peihao Chen, Xiangmin Xu
20 Jul 2023

A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction
Erica Cai, Brendan O'Connor
24 May 2023

Investigating the Role of Feed-Forward Networks in Transformers Using Parallel Attention and Feed-Forward Net Design
Shashank Sonkar, Richard G. Baraniuk
22 May 2023

AttMEMO: Accelerating Transformers with Memoization on Big Memory Systems
Yuan Feng, Hyeran Jeon, F. Blagojevic, Cyril Guyot, Qing Li, Dong Li
23 Jan 2023

Once is Enough: A Light-Weight Cross-Attention for Fast Sentence Pair Modeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yuanhang Yang, Shiyi Qi, Chuanyi Liu, Qifan Wang, Cuiyun Gao, Zenglin Xu
11 Oct 2022

RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yuan Liang, Zhuoxuan Jiang, Di Yin, Bo Ren
07 Jun 2022

Differentially Private Model Compression
Neural Information Processing Systems (NeurIPS), 2022
Fatemehsadat Mireshghallah, A. Backurs, Huseyin A. Inan, Lukas Wutschitz, Janardhan Kulkarni
03 Jun 2022

Exploring Extreme Parameter Compression for Pre-trained Language Models
International Conference on Learning Representations (ICLR), 2022
Yuxin Ren, Benyou Wang, Lifeng Shang, Xin Jiang, Qun Liu
20 May 2022

Transkimmer: Transformer Learns to Layer-wise Skim
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yue Guan, Zhengyi Li, Jingwen Leng, Zhouhan Lin, Minyi Guo
15 May 2022

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai, Ji Lin, Chengyue Wu, Zhijian Liu, Haotian Tang, Hanrui Wang, Ligeng Zhu, Song Han
25 Apr 2022

Question Generation for Evaluating Cross-Dataset Shifts in Multi-modal Grounding
Arjun Reddy Akula
24 Jan 2022

Block-Skim: Efficient Question Answering for Transformer
Yue Guan, Zhengyi Li, Jingwen Leng, Zhouhan Lin, Minyi Guo, Yuhao Zhu
16 Dec 2021

VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Dan Li, Yang Yang, Hongyin Tang, Jingang Wang, Tong Xu, Wei Wu, Enhong Chen
08 Dec 2021

A Survey on Deep Learning Event Extraction: Approaches and Applications
Qian Li, Jianxin Li, Shuaiyi Nie, Shiyao Cui, Hongzhi Zhang, ..., Hao Peng, Shu Guo, Lihong Wang, Amin Beheshti, Philip S. Yu
05 Jul 2021

TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Deming Ye, Yankai Lin, Yufei Huang, Maosong Sun
25 May 2021

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters
Workshop on Document-grounded Dialogue and Conversational Question Answering (DialDoc), 2021
Yan Xu, Etsuko Ishii, Samuel Cahyawijaya, Zihan Liu, Genta Indra Winata, Andrea Madotto, Jane Polak Scowcroft, Pascale Fung
13 May 2021

Adapting by Pruning: A Case Study on BERT
Yang Gao, Nicolo Colombo, Wen Wang
07 May 2021

Probing Classifiers: Promises, Shortcomings, and Advances
Computational Linguistics (CL), 2021
Yonatan Belinkov
24 Feb 2021

Optimizing Inference Performance of Transformers on CPUs
D. Dice, Alex Kogan
12 Feb 2021

Modeling Context in Answer Sentence Selection Systems on a Latency Budget
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Rujun Han, Luca Soldaini, Alessandro Moschitti
28 Jan 2021

Learning Dense Representations of Phrases at Scale
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Jinhyuk Lee, Mujeen Sung, Jaewoo Kang, Danqi Chen
23 Dec 2020

ReadOnce Transformers: Reusable Representations of Text for Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Shih-Ting Lin, Ashish Sabharwal, Tushar Khot
24 Oct 2020

Which *BERT? A Survey Organizing Contextualized Encoders
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Patrick Xia, Shijie Wu, Benjamin Van Durme
02 Oct 2020

Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis Yoshida, Allyson Ettinger, Kevin Gimpel
16 Aug 2020

Compressing Large-Scale Transformer-Based Models: A Case Study on BERT
Transactions of the Association for Computational Linguistics (TACL), 2021
Prakhar Ganesh, Yao Chen, Xin Lou, Mohammad Ali Khan, Yifan Yang, Hassan Sajjad, Preslav Nakov, Deming Chen, Marianne Winslett
27 Feb 2020