2005.02534
The Cascade Transformer: an Application for Efficient Answer Sentence Selection
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
5 May 2020
Luca Soldaini
Alessandro Moschitti
Papers citing "The Cascade Transformer: an Application for Efficient Answer Sentence Selection"
23 / 23 papers shown
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
284
0
0
07 May 2025
kNN Attention Demystified: A Theoretical Exploration for Scalable Transformers
Themistoklis Haris
383
0
0
06 Nov 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
634
204
0
09 Jul 2024
Ranked List Truncation for Large Language Model-based Re-Ranking
Chuan Meng
Negar Arabzadeh
Arian Askari
Mohammad Aliannejadi
Maarten de Rijke
LRM
414
36
0
28 Apr 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Emad Fallahzadeh
Bram Adams
Ahmed E. Hassan
MQ
486
5
0
25 Mar 2024
Cross-Lingual Knowledge Distillation for Answer Sentence Selection in Low-Resource Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shivanshu Gupta
Yoshitomo Matsubara
Ankita N. Chadha
Alessandro Moschitti
341
4
0
25 May 2023
Path Independent Equilibrium Models Can Better Exploit Test-Time Computation
Neural Information Processing Systems (NeurIPS), 2022
Cem Anil
Ashwini Pokle
Kaiqu Liang
Johannes Treutlein
Yuhuai Wu
Shaojie Bai
Zico Kolter
Roger C. Grosse
209
25
0
18 Nov 2022
Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Neeraj Varshney
Chitta Baral
211
43
0
11 Oct 2022
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Luca Di Liello
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
200
18
0
20 May 2022
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Minghan Li
Xinyu Crystina Zhang
Ji Xin
Hongyang R. Zhang
Jimmy J. Lin
237
6
0
19 May 2022
Paragraph-based Transformer Pre-training for Multi-Sentence Inference
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Luca Di Liello
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
177
8
0
02 May 2022
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference
Findings (Findings), 2022
Kai Hui
Honglei Zhuang
Tao Chen
Zhen Qin
Jing Lu
...
Ji Ma
Jai Gupta
Cicero Nogueira dos Santos
Yi Tay
Donald Metzler
225
19
0
25 Apr 2022
Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yoshitomo Matsubara
Luca Soldaini
Eric Lind
Alessandro Moschitti
279
7
0
15 Jan 2022
Will this Question be Answered? Question Filtering via Answer Model Distillation for Efficient Question Answering
Siddhant Garg
Alessandro Moschitti
221
29
0
14 Sep 2021
Adaptive Inference through Early-Exit Networks: Design, Challenges and Directions
Stefanos Laskaridis
Alexandros Kouris
Nicholas D. Lane
TPM
318
149
0
09 Jun 2021
Answer Generation for Retrieval-based Question Answering Systems
Findings (Findings), 2021
Chao-Chun Hsu
Eric Lind
Luca Soldaini
Alessandro Moschitti
177
28
0
02 Jun 2021
Efficient pre-training objectives for Transformers
Luca Di Liello
Matteo Gabburo
Alessandro Moschitti
137
16
0
20 Apr 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
ACM Computing Surveys (CSUR), 2021
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
511
300
0
08 Mar 2021
Modeling Context in Answer Sentence Selection Systems on a Latency Budget
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Rujun Han
Luca Soldaini
Alessandro Moschitti
274
14
0
28 Jan 2021
The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models
Ronak Pradeep
Rodrigo Nogueira
Jimmy J. Lin
MoE
343
198
0
14 Jan 2021
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Lei Li
Yankai Lin
Deli Chen
Shuhuai Ren
Peng Li
Jie Zhou
Xu Sun
287
59
0
29 Dec 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
958
729
0
13 Oct 2020
A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection
International Conference on Computational Linguistics (COLING), 2020
Daniele Bonadiman
Alessandro Moschitti
RALM
267
11
0
04 Mar 2020