ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11631
  4. Cited By
Towards Opening the Black Box of Neural Machine Translation: Source and
  Target Interpretations of the Transformer

Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer

23 May 2022
Javier Ferrando
Gerard I. Gállego
Belen Alastruey
Carlos Escolano
Marta R. Costa-jussá
ArXivPDFHTML

Papers citing "Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer"

33 / 33 papers shown
Title
Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement
Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement
Yichen Dong
Xinglin Lyu
Junhui Li
Daimeng Wei
Min Zhang
Shimin Tao
Hao Yang
19
0
0
08 Apr 2025
Hallucination Detection using Multi-View Attention Features
Hallucination Detection using Multi-View Attention Features
Yuya Ogasa
Yuki Arase
21
0
0
06 Apr 2025
Analyzing the Attention Heads for Pronoun Disambiguation in
  Context-aware Machine Translation Models
Analyzing the Attention Heads for Pronoun Disambiguation in Context-aware Machine Translation Models
Paweł Mąka
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
69
0
0
15 Dec 2024
Unveiling the Role of Pretraining in Direct Speech Translation
Unveiling the Role of Pretraining in Direct Speech Translation
Belen Alastruey
Gerard I. Gállego
Marta R. Costa-jussá
26
0
0
26 Sep 2024
Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models
Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models
Sepehr Kamahi
Yadollah Yaghoobzadeh
30
0
0
21 Aug 2024
What Have We Achieved on Non-autoregressive Translation?
What Have We Achieved on Non-autoregressive Translation?
Yafu Li
Huajian Zhang
Jianhao Yan
Yongjing Yin
Yue Zhang
21
0
0
21 May 2024
Detecting Hallucination and Coverage Errors in Retrieval Augmented
  Generation for Controversial Topics
Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics
Tyler A. Chang
Katrin Tomanek
Jessica Hoffmann
Nithum Thain
Erin van Liemt
Kathleen Meier-Hellstern
Lucas Dixon
26
7
0
13 Mar 2024
Enhanced Hallucination Detection in Neural Machine Translation through
  Simple Detector Aggregation
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation
Anas Himmi
Guillaume Staerman
Marine Picot
Pierre Colombo
Nuno M. Guerreiro
16
4
0
20 Feb 2024
Isotropy, Clusters, and Classifiers
Isotropy, Clusters, and Classifiers
Timothee Mickus
Stig-Arne Gronroos
Joseph Attieh
8
0
0
05 Feb 2024
On Measuring Context Utilization in Document-Level MT Systems
On Measuring Context Utilization in Document-Level MT Systems
Wafaa Mohammed
Vlad Niculae
6
2
0
02 Feb 2024
On Early Detection of Hallucinations in Factual Question Answering
On Early Detection of Hallucinations in Factual Question Answering
Ben Snyder
Marius Moisescu
Muhammad Bilal Zafar
HILM
39
24
0
19 Dec 2023
Added Toxicity Mitigation at Inference Time for Multimodal and Massively
  Multilingual Translation
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation
Marta R. Costa-jussá
David Dale
Maha Elbayad
Bokai Yu
19
1
0
11 Nov 2023
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for
  Fairer Instruction-Tuned Machine Translation
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation
Giuseppe Attanasio
Flor Miriam Plaza del Arco
Debora Nozza
Anne Lauscher
16
18
0
18 Oct 2023
Why bother with geometry? On the relevance of linear decompositions of
  Transformer embeddings
Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings
Timothee Mickus
Raúl Vázquez
13
2
0
10 Oct 2023
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
Anna Langedijk
Hosein Mohebbi
Gabriele Sarti
Willem H. Zuidema
Jaap Jumelet
4
10
0
05 Oct 2023
Quantifying the Plausibility of Context Reliance in Neural Machine
  Translation
Quantifying the Plausibility of Context Reliance in Neural Machine Translation
Gabriele Sarti
Grzegorz Chrupala
Malvina Nissim
Arianna Bisazza
22
5
0
02 Oct 2023
SpeechAlign: a Framework for Speech Translation Alignment Evaluation
SpeechAlign: a Framework for Speech Translation Alignment Evaluation
Belen Alastruey
Aleix Sant
Gerard I. Gállego
David Dale
Marta R. Costa-jussá
AuLLM
14
3
0
20 Sep 2023
Let the Models Respond: Interpreting Language Model Detoxification
  Through the Lens of Prompt Dependence
Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence
Daniel Scalena
Gabriele Sarti
Malvina Nissim
Elisabetta Fersini
9
0
0
01 Sep 2023
Improving Translation Faithfulness of Large Language Models via
  Augmenting Instructions
Improving Translation Faithfulness of Large Language Models via Augmenting Instructions
Yijie Chen
Yanjun Liu
Fandong Meng
Yufeng Chen
Jinan Xu
Jie Zhou
17
19
0
24 Aug 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil
  Demographic Biases in Languages at Scale
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
Marta R. Costa-jussá
Pierre Yves Andrews
Eric Michael Smith
Prangthip Hansanti
C. Ropers
Elahe Kalbassi
Cynthia Gao
Daniel Licht
Carleigh Wood
27
10
0
22 May 2023
Explaining How Transformers Use Context to Build Predictions
Explaining How Transformers Use Context to Build Predictions
Javier Ferrando
Gerard I. Gállego
Ioannis Tsiamas
Marta R. Costa-jussá
10
31
0
21 May 2023
ReSeTOX: Re-learning attention weights for toxicity mitigation in
  machine translation
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation
Javier García Gilabert
Carlos Escolano
Marta R. Costa-jussá
CLL
MU
6
2
0
19 May 2023
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination
  and Omission Detection in Machine Translation
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
David Dale
Elena Voita
Janice Lam
Prangthip Hansanti
C. Ropers
Elahe Kalbassi
Cynthia Gao
Loïc Barrault
Marta R. Costa-jussá
HILM
16
27
0
19 May 2023
Perturbation-based QE: An Explainable, Unsupervised Word-level Quality
  Estimation Method for Blackbox Machine Translation
Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation
Tu Anh Dinh
J. Niehues
10
5
0
12 May 2023
Computational modeling of semantic change
Computational modeling of semantic change
Nina Tahmasebi
Haim Dubossarsky
26
6
0
13 Apr 2023
Hallucinations in Large Multilingual Translation Models
Hallucinations in Large Multilingual Translation Models
Nuno M. Guerreiro
Duarte M. Alves
Jonas Waldendorf
Barry Haddow
Alexandra Birch
Pierre Colombo
André F.T. Martins
VLM
HILM
LRM
13
139
0
28 Mar 2023
Inseq: An Interpretability Toolkit for Sequence Generation Models
Inseq: An Interpretability Toolkit for Sequence Generation Models
Gabriele Sarti
Nils Feldhus
Ludwig Sickert
Oskar van der Wal
Malvina Nissim
Arianna Bisazza
17
64
0
27 Feb 2023
Optimal Transport for Unsupervised Hallucination Detection in Neural
  Machine Translation
Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation
Nuno M. Guerreiro
Pierre Colombo
Pablo Piantanida
André F.T. Martins
17
10
0
19 Dec 2022
Detecting and Mitigating Hallucinations in Machine Translation: Model
  Internal Workings Alone Do Well, Sentence Similarity Even Better
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better
David Dale
Elena Voita
Loïc Barrault
Marta R. Costa-jussá
HILM
14
55
0
16 Dec 2022
Attention as a Guide for Simultaneous Speech Translation
Attention as a Guide for Simultaneous Speech Translation
Sara Papi
Matteo Negri
Marco Turchi
6
30
0
15 Dec 2022
Toxicity in Multilingual Machine Translation at Scale
Toxicity in Multilingual Machine Translation at Scale
Marta R. Costa-jussá
Eric Michael Smith
C. Ropers
Daniel Licht
Jean Maillard
Javier Ferrando
Carlos Escolano
14
24
0
06 Oct 2022
Incorporating Residual and Normalization Layers into Analysis of Masked
  Language Models
Incorporating Residual and Normalization Layers into Analysis of Masked Language Models
Goro Kobayashi
Tatsuki Kuribayashi
Sho Yokoi
Kentaro Inui
153
45
0
15 Sep 2021
The Bottom-up Evolution of Representations in the Transformer: A Study
  with Machine Translation and Language Modeling Objectives
The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives
Elena Voita
Rico Sennrich
Ivan Titov
179
181
0
03 Sep 2019
1