v1v2v3v4v5 (latest)

Highway Transformer: Self-Gating Enhanced Self-Attentive Networks

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

17 April 2020

Papers citing "Highway Transformer: Self-Gating Enhanced Self-Attentive Networks"

10 / 10 papers shown

EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs

Zhengge Cai

Haowen Hou

103

20 Sep 2025

IgCraft: A versatile sequence generation framework for antibody discovery and engineering

430

25 Mar 2025

On the Design Space Between Transformers and Recursive Neural Nets

Jishnu Ray Chowdhury

Cornelia Caragea

359

03 Sep 2024

Tokenization Falling Short: The Curse of Tokenization

260

17 Jun 2024

Can Transformers Predict Vibrations?

Fusataka Kuniyoshi

Yoshihide Sawada

177

16 Feb 2024

Investigating Recurrent Transformers with Dynamic Halt

Jishnu Ray Chowdhury

Cornelia Caragea

628

01 Feb 2024

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

311

13 Dec 2022

Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Dian Yu

Heng Ji

324

01 Oct 2022

Leveraging Local Temporal Information for Multimodal Scene Classification

Saurabh Sahu

Palash Goyal

ViT

121

26 Oct 2021

Rewiring the Transformer with Depth-Wise LSTMsInternational Conference on Language Resources and Evaluation (LREC), 2020

338

13 Jul 2020