The Cascade Transformer: an Application for Efficient Answer Sentence Selection

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

5 May 2020

Luca Soldaini

Alessandro Moschitti

Papers citing "The Cascade Transformer: an Application for Efficient Answer Sentence Selection"

23 / 23 papers shown

ORXE: Orchestrating Experts for Dynamically Configurable Efficiency

284

07 May 2025

kNN Attention Demystified: A Theoretical Exploration for Scalable Transformers

Themistoklis Haris

383

06 Nov 2024

Mobile Edge Intelligence for Large Language Models: A Contemporary Survey

Guanqiao Qu

Qiyuan Chen

Wei Wei

Zheng Lin

Xianhao Chen

Kaibin Huang

634

204

09 Jul 2024

Ranked List Truncation for Large Language Model-based Re-Ranking

Maarten de Rijke

414

28 Apr 2024

On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance

486

25 Mar 2024

Cross-Lingual Knowledge Distillation for Answer Sentence Selection in Low-Resource Languages

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

341

25 May 2023

Path Independent Equilibrium Models Can Better Exploit Test-Time Computation

Neural Information Processing Systems (NeurIPS), 2022

209

18 Nov 2022

Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Neeraj Varshney

Chitta Baral

211

11 Oct 2022

Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Luca Di Liello

Siddhant Garg

Luca Soldaini

Alessandro Moschitti

200

20 May 2022

Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

237

19 May 2022

Paragraph-based Transformer Pre-training for Multi-Sentence Inference

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

Luca Di Liello

Siddhant Garg

Luca Soldaini

Alessandro Moschitti

177

02 May 2022

ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference

Findings (Findings), 2022

Zhen Qin

...

Cicero Nogueira dos Santos

Yi Tay

Donald Metzler

225

25 Apr 2022

Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yoshitomo Matsubara

Luca Soldaini

Eric Lind

Alessandro Moschitti

279

15 Jan 2022

Will this Question be Answered? Question Filtering via Answer Model Distillation for Efficient Question Answering

Siddhant Garg

Alessandro Moschitti

221

14 Sep 2021

Adaptive Inference through Early-Exit Networks: Design, Challenges and Directions

318

149

09 Jun 2021

Answer Generation for Retrieval-based Question Answering Systems

Findings (Findings), 2021

Chao-Chun Hsu

Eric Lind

Luca Soldaini

Alessandro Moschitti

177

02 Jun 2021

Efficient pre-training objectives for Transformers

Luca Di Liello

Matteo Gabburo

Alessandro Moschitti

137

20 Apr 2021

Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges

ACM Computing Surveys (CSUR), 2021

Yoshitomo Matsubara

Marco Levorato

Francesco Restuccia

511

300

08 Mar 2021

Modeling Context in Answer Sentence Selection Systems on a Latency Budget

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Rujun Han

Luca Soldaini

Alessandro Moschitti

274

28 Jan 2021

The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models

343

198

14 Jan 2021

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Lei Li

Yankai Lin

Deli Chen

Shuhuai Ren

Peng Li

Jie Zhou

Xu Sun

287

29 Dec 2020

Pretrained Transformers for Text Ranking: BERT and Beyond

958

729

13 Oct 2020

A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection

International Conference on Computational Linguistics (COLING), 2020

Daniele Bonadiman

Alessandro Moschitti

267

04 Mar 2020