ResearchTrend.AI
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning

16 October 2023
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong

Papers citing "Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning"

5 / 5 papers shown
ResiDual Transformer Alignment with Spectral Decomposition
Lorenzo Basile
Valentino Maiorca
Luca Bortolussi
Emanuele Rodolà
Francesco Locatello
45
1
0
31 Oct 2024
Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors
Weixuan Wang
J. Yang
Wei Peng
LLMSV
16
2
0
16 Oct 2024
Rethinking Attention-Model Explainability through Faithfulness Violation Test
Y. Liu
Haoliang Li
Yangyang Guo
Chen Kong
Jing Li
Shiqi Wang
FAtt
116
42
0
28 Jan 2022
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
Wanying Xie
Yang Feng
Shuhao Gu
Dong Yu
31
32
0
14 Jul 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018