A Simple and Effective Positional Encoding for Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

18 April 2021

Srinadh Bhojanapalli

Papers citing "A Simple and Effective Positional Encoding for Transformers"

GeoPE: A Unified Geometric Positional Embedding for Structured Tensors

Yupu Yao

Bowen Yang

MDE

351

04 Dec 2025

What is the Best Sequence Length for BABYLM?

Suchir Salhan

Richard Diehl Martinez

Zébulon Goriely

P. Buttery

148

22 Oct 2025

NDLPNet: A Location-Aware Nighttime Deraining Network and a Real-World Benchmark Dataset

123

17 Sep 2025

FusionMAE: large-scale pretrained model to optimize and simplify diagnostic and control of fusion plasma

...

247

16 Sep 2025

An Empirical Study on Prompt Compression for Large Language Models

305

24 Apr 2025

Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation

Manvi Agarwal

Changhong Wang

Gaël Richard

214

07 Apr 2025

Context-aware Biases for Length Extrapolation

Ali Veisi

Hamidreza Amirzadeh

Amir Mansourian

644

11 Mar 2025

LCIRC: A Recurrent Compression Approach for Efficient Long-form Context and Query Dependent Modeling in LLMs

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

695

10 Feb 2025

Learning the RoPEs: Better 2D and 3D Position Encodings with STRING

...

Krzysztof Choromanski

381

04 Feb 2025

Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering

251

22 Nov 2024

DipMe: Haptic Recognition of Granular Media for Tangible Interactive Applications

136

13 Nov 2024

Enhancing High-order Interaction Awareness in LLM-based Recommender Model

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

323

30 Sep 2024

TeXBLEU: Automatic Metric for Evaluate LaTeX Format

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

398

10 Sep 2024

A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships

Gracile Astlin Pereira

Muhammad Hussain

ViT

320

27 Aug 2024

Graph Transformers: A Survey

Karin Verspoor

451

13 Jul 2024

Positional encoding is not the same as context: A study on positional encoding for sequential recommendation

443

16 May 2024

PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval

325

16 May 2024

PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models

Arpit Aggarwal

144

29 Apr 2024

EulerFormer: Sequential User Behavior Modeling with Complex Vector Attention

352

26 Mar 2024

MEP: Multiple Kernel Learning Enhancing Relative Positional Encoding Length Extrapolation

Weiguo Gao

248

26 Mar 2024

PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

257

26 Mar 2024

Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models

381

03 Feb 2024

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

...

509

114

21 Nov 2023

Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis

382

21 Nov 2023

Enhancing Pre-Trained Language Models with Sentence Position Embeddings for Rhetorical Roles Recognition in Legal Opinions

276

08 Oct 2023

Generalized Power Attacks against Crypto Hardware using Long-Range Deep Learning

241

12 Jun 2023

On the Design Fundamentals of Diffusion Models: A Survey

Pattern Recognition (Pattern Recogn.), 2023

Ziyi Chang

George Alex Koulieris

Hyung Jin Chang

Hubert P. H. Shum

DiffM

805

07 Jun 2023

Knowledge Distillation in Vision Transformers: A Critical Review

Gousia Habib

Tausifa Jan Saleem

Brejesh Lall

396

04 Feb 2023

P-Transformer: Towards Better Document-to-Document Neural Machine Translation

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

202

12 Dec 2022

Word Order Matters when you Increase Masking

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Karim Lasri

Alessandro Lenci

Thierry Poibeau

311

08 Nov 2022

KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation

Neural Information Processing Systems (NeurIPS), 2022

Ta-Chung Chi

Ting-Han Fan

Peter J. Ramadge

Alexander I. Rudnicky

433

20 May 2022

Decoupled Side Information Fusion for Sequential Recommendation

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022

Yueqi Xie

Peilin Zhou

Sunghun Kim

438

155

23 Apr 2022