Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer

23 May 2022

Carlos Escolano

Papers citing "Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer"

33 / 33 papers shown

Title
Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement Yichen Dong Xinglin Lyu Junhui Li Daimeng Wei Min Zhang Shimin Tao Hao Yang 19 0 0 08 Apr 2025
Hallucination Detection using Multi-View Attention Features Yuya Ogasa Yuki Arase 21 0 0 06 Apr 2025
Analyzing the Attention Heads for Pronoun Disambiguation in Context-aware Machine Translation Models Paweł Mąka Yusuf Can Semerci Jan Scholtes Gerasimos Spanakis 69 0 0 15 Dec 2024
Unveiling the Role of Pretraining in Direct Speech Translation Belen Alastruey Gerard I. Gállego Marta R. Costa-jussá 26 0 0 26 Sep 2024
Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models Sepehr Kamahi Yadollah Yaghoobzadeh 30 0 0 21 Aug 2024
What Have We Achieved on Non-autoregressive Translation? Yafu Li Huajian Zhang Jianhao Yan Yongjing Yin Yue Zhang 21 0 0 21 May 2024
Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics Tyler A. Chang Katrin Tomanek Jessica Hoffmann Nithum Thain Erin van Liemt Kathleen Meier-Hellstern Lucas Dixon 26 7 0 13 Mar 2024
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation Anas Himmi Guillaume Staerman Marine Picot Pierre Colombo Nuno M. Guerreiro 16 4 0 20 Feb 2024
Isotropy, Clusters, and Classifiers Timothee Mickus Stig-Arne Gronroos Joseph Attieh 8 0 0 05 Feb 2024
On Measuring Context Utilization in Document-Level MT Systems Wafaa Mohammed Vlad Niculae 6 2 0 02 Feb 2024
On Early Detection of Hallucinations in Factual Question Answering Ben Snyder Marius Moisescu Muhammad Bilal Zafar HILM 39 24 0 19 Dec 2023
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation Marta R. Costa-jussá David Dale Maha Elbayad Bokai Yu 19 1 0 11 Nov 2023
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation Giuseppe Attanasio Flor Miriam Plaza del Arco Debora Nozza Anne Lauscher 16 18 0 18 Oct 2023
Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings Timothee Mickus Raúl Vázquez 13 2 0 10 Oct 2023
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Anna Langedijk Hosein Mohebbi Gabriele Sarti Willem H. Zuidema Jaap Jumelet 4 10 0 05 Oct 2023
Quantifying the Plausibility of Context Reliance in Neural Machine Translation Gabriele Sarti Grzegorz Chrupala Malvina Nissim Arianna Bisazza 22 5 0 02 Oct 2023
SpeechAlign: a Framework for Speech Translation Alignment Evaluation Belen Alastruey Aleix Sant Gerard I. Gállego David Dale Marta R. Costa-jussá AuLLM 14 3 0 20 Sep 2023
Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence Daniel Scalena Gabriele Sarti Malvina Nissim Elisabetta Fersini 9 0 0 01 Sep 2023
Improving Translation Faithfulness of Large Language Models via Augmenting Instructions Yijie Chen Yanjun Liu Fandong Meng Yufeng Chen Jinan Xu Jie Zhou 17 19 0 24 Aug 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale Marta R. Costa-jussá Pierre Yves Andrews Eric Michael Smith Prangthip Hansanti C. Ropers Elahe Kalbassi Cynthia Gao Daniel Licht Carleigh Wood 27 10 0 22 May 2023
Explaining How Transformers Use Context to Build Predictions Javier Ferrando Gerard I. Gállego Ioannis Tsiamas Marta R. Costa-jussá 10 31 0 21 May 2023
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation Javier García Gilabert Carlos Escolano Marta R. Costa-jussá CLL MU 6 2 0 19 May 2023
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation David Dale Elena Voita Janice Lam Prangthip Hansanti C. Ropers Elahe Kalbassi Cynthia Gao Loïc Barrault Marta R. Costa-jussá HILM 16 27 0 19 May 2023
Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation Tu Anh Dinh J. Niehues 10 5 0 12 May 2023
Computational modeling of semantic change Nina Tahmasebi Haim Dubossarsky 26 6 0 13 Apr 2023
Hallucinations in Large Multilingual Translation Models Nuno M. Guerreiro Duarte M. Alves Jonas Waldendorf Barry Haddow Alexandra Birch Pierre Colombo André F.T. Martins VLM HILM LRM 13 139 0 28 Mar 2023
Inseq: An Interpretability Toolkit for Sequence Generation Models Gabriele Sarti Nils Feldhus Ludwig Sickert Oskar van der Wal Malvina Nissim Arianna Bisazza 17 64 0 27 Feb 2023
Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation Nuno M. Guerreiro Pierre Colombo Pablo Piantanida André F.T. Martins 17 10 0 19 Dec 2022
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better David Dale Elena Voita Loïc Barrault Marta R. Costa-jussá HILM 14 55 0 16 Dec 2022
Attention as a Guide for Simultaneous Speech Translation Sara Papi Matteo Negri Marco Turchi 6 30 0 15 Dec 2022
Toxicity in Multilingual Machine Translation at Scale Marta R. Costa-jussá Eric Michael Smith C. Ropers Daniel Licht Jean Maillard Javier Ferrando Carlos Escolano 14 24 0 06 Oct 2022
Incorporating Residual and Normalization Layers into Analysis of Masked Language Models Goro Kobayashi Tatsuki Kuribayashi Sho Yokoi Kentaro Inui 153 45 0 15 Sep 2021
The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives Elena Voita Rico Sennrich Ivan Titov 179 181 0 03 Sep 2019