

Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
3 May 2023
Da Xu, Maha Elbayad, Kenton W. Murray, Jean Maillard, Vedanuj Goswami
Topics: MoE

Papers citing "Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity"

2 papers shown
MultiMUC: Multilingual Template Filling on MUC-4
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
William Gantt, Shabnam Behzad, Hannah YoungEun An, Yunmo Chen, Aaron Steven White, Benjamin Van Durme, M. Yarmohammadi
29 Jan 2024
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haoran Xu, Weiting Tan, Shuyue Stella Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton W. Murray
23 May 2023