ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.08771
  4. Cited By
A Call for Clarity in Reporting BLEU Scores
v1v2 (latest)

A Call for Clarity in Reporting BLEU Scores

Conference on Machine Translation (WMT), 2018
23 April 2018
Matt Post
ArXiv (abs)PDFHTML

Papers citing "A Call for Clarity in Reporting BLEU Scores"

50 / 1,823 papers shown
Title
VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models
VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models
Hanling Zhang
Yayu Zhou
Tongcheng Fang
Zhihang Yuan
Guohao Dai
Yu Wang
Yu Wang
56
0
0
07 Jan 2026
Structured Document Translation via Format Reinforcement Learning
Structured Document Translation via Format Reinforcement Learning
Haiyue Song
Johannes Eschbach-Dymanus
Hour Kaing
Sumire Honda
Hideki Tanaka
Bianka Buschbeck
Masao Utiyama
36
0
0
04 Dec 2025
MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages
MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages
Yexing Du
Kaiyuan Liu
Youcheng Pan
B. Yang
Keqi Deng
Xie Chen
Yang Xiang
Ming Liu
Bin Qin
Y. Wang
LRM
48
0
0
01 Dec 2025
Agreement-Constrained Probabilistic Minimum Bayes Risk Decoding
Koki Natsumi
Hiroyuki Deguchi
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
40
0
0
01 Dec 2025
Asm2SrcEval: Evaluating Large Language Models for Assembly-to-Source Code Translation
Asm2SrcEval: Evaluating Large Language Models for Assembly-to-Source Code Translation
Parisa Hamedi
Hamed Jelodar
Samita Bai
Mohammad Meymani
Roozbeh Razavi-Far
Ali Ghorbani
ELM
88
0
0
28 Nov 2025
Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach
Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach
Haruki Sakajo
Hiroshi Takato
Hiroshi Tsutsui
Komei Soda
Hidetaka Kamigaito
Taro Watanabe
MLLM
132
0
0
28 Nov 2025
RosettaSpeech: Zero-Shot Speech-to-Speech Translation from Monolingual Data
RosettaSpeech: Zero-Shot Speech-to-Speech Translation from Monolingual Data
Zhisheng Zheng
Xiaohang Sun
Tuan Dinh
Abhishek Yanamandra
Abhinav Jain
...
Sunil Hadap
Vimal Bhat
Manoj Aggarwal
Gérard Medioni
David Harwath
96
0
0
26 Nov 2025
Don't Learn, Ground: A Case for Natural Language Inference with Visual Grounding
Don't Learn, Ground: A Case for Natural Language Inference with Visual Grounding
Daniil Ignatev
Ayman Santeer
Albert Gatt
Denis Paperno
140
0
0
21 Nov 2025
Evaluating Multimodal Large Language Models on Vertically Written Japanese Text
Evaluating Multimodal Large Language Models on Vertically Written Japanese Text
Keito Sasagawa
Shuhei Kurita
Daisuke Kawahara
68
0
0
19 Nov 2025
Fast Neural Tangent Kernel Alignment, Norm and Effective Rank via Trace Estimation
Fast Neural Tangent Kernel Alignment, Norm and Effective Rank via Trace Estimation
James Hazelden
80
0
0
13 Nov 2025
Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
Kunal Kingkar Das
Manoj Balaji Jagadeeshan
Nallani Chakravartula Sahith
Jivnesh Sandhan
Pawan Goyal
56
0
0
11 Nov 2025
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Yingfeng Luo
Ziqiang Xu
Yuxuan Ouyang
Murun Yang
Dingyang Lin
...
Bei Li
Peinan Feng
Quan Du
Tong Xiao
Jingbo Zhu
LRM
227
0
0
10 Nov 2025
Mind the Gap... or Not? How Translation Errors and Evaluation Details Skew Multilingual Results
Mind the Gap... or Not? How Translation Errors and Evaluation Details Skew Multilingual Results
Jan-Thorsten Peter
David Vilar
Tobias Domhan
Dan Malkin
Markus Freitag
64
0
0
07 Nov 2025
It Takes Two: A Dual Stage Approach for Terminology-Aware Translation
It Takes Two: A Dual Stage Approach for Terminology-Aware Translation
Akshat Singh Jaswal
136
2
0
07 Nov 2025
How to Evaluate Speech Translation with Source-Aware Neural MT Metrics
How to Evaluate Speech Translation with Source-Aware Neural MT Metrics
Mauro Cettolo
Marco Gaido
Matteo Negri
Sara Papi
L. Bentivogli
144
0
0
05 Nov 2025
"Don't Teach Minerva": Guiding LLMs Through Complex Syntax for Faithful Latin Translation with RAG
"Don't Teach Minerva": Guiding LLMs Through Complex Syntax for Faithful Latin Translation with RAG
Sergio Torres Aguilar
61
0
0
03 Nov 2025
Leveraging the Cross-Domain & Cross-Linguistic Corpus for Low Resource NMT: A Case Study On Bhili-Hindi-English Parallel Corpus
Leveraging the Cross-Domain & Cross-Linguistic Corpus for Low Resource NMT: A Case Study On Bhili-Hindi-English Parallel CorpusConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Pooja Singh
Shashwat Bhardwaj
V. Sharma
Sandeep Kumar
96
0
0
01 Nov 2025
A Survey on Unlearning in Large Language Models
A Survey on Unlearning in Large Language Models
Ruichen Qiu
Jiajun Tan
Jiayue Pu
Honglin Wang
Xiao-Shan Gao
Fei Sun
MUAILawPILM
630
0
0
29 Oct 2025
A Critical Study of Automatic Evaluation in Sign Language Translation
A Critical Study of Automatic Evaluation in Sign Language Translation
Shakib Yazdani
Yasser Hamidullah
C. España-Bonet
Eleftherios Avramidis
Josef van Genabith
SLR
309
0
0
29 Oct 2025
Seeing, Signing, and Saying: A Vision-Language Model-Assisted Pipeline for Sign Language Data Acquisition and Curation from Social Media
Seeing, Signing, and Saying: A Vision-Language Model-Assisted Pipeline for Sign Language Data Acquisition and Curation from Social Media
Shakib Yazdani
Yasser Hamidullah
C. España-Bonet
Josef van Genabith
SLR
226
1
0
29 Oct 2025
IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering
IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering
Jieyong Kim
Maryam Amirizaniani
Soojin Yoon
Dongha Lee
116
0
0
27 Oct 2025
Iterative Layer Pruning for Efficient Translation Inference
Iterative Layer Pruning for Efficient Translation Inference
Yasmin Moslem
Muhammad Hazim Al Farouq
John D. Kelleher
101
1
0
26 Oct 2025
SteerX: Disentangled Steering for LLM Personalization
SteerX: Disentangled Steering for LLM Personalization
Xiaoyan Zhao
Ming Yan
Yilun Qiu
Haoting Ni
Y. Zhang
Fuli Feng
Hong Cheng
Tat-Seng Chua
LLMSV
208
1
0
25 Oct 2025
Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset
Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset
Paul Lerner
François Yvon
116
0
0
23 Oct 2025
Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition
Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition
Yuu Jinnai
104
0
0
22 Oct 2025
SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision
SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision
Yasser Hamidullah
Shakib Yazdani
Cennet Oguz
Josef van Genabith
C. España-Bonet
SLRVLM
349
1
0
22 Oct 2025
Sign Language Translation with Sentence Embedding Supervision
Sign Language Translation with Sentence Embedding SupervisionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yasser Hamidullah
Josef van Genabith
C. España-Bonet
SLR
268
11
0
22 Oct 2025
Conditions for Catastrophic Forgetting in Multilingual Translation
Conditions for Catastrophic Forgetting in Multilingual Translation
Danni Liu
Jan Niehues
CLL
156
0
0
22 Oct 2025
Spatio-temporal Sign Language Representation and Translation
Spatio-temporal Sign Language Representation and TranslationConference on Machine Translation (WMT), 2025
Yasser Hamidullah
Josef van Genabith
C. España-Bonet
SLR
264
7
0
22 Oct 2025
Lingua Custodi's participation at the WMT 2025 Terminology shared task
Lingua Custodi's participation at the WMT 2025 Terminology shared task
Jingshu Liu
Raheel Qader
Gaëtan Caillaut
Mariam Nakhlé
137
0
0
20 Oct 2025
Beyond Function-Level Search: Repository-Aware Dual-Encoder Code Retrieval with Adversarial Verification
Beyond Function-Level Search: Repository-Aware Dual-Encoder Code Retrieval with Adversarial Verification
Aofan Liu
Shiyuan Song
Haoxuan Li
Cehao Yang
Yiyan Qi
92
1
0
16 Oct 2025
Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models
Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models
Daniil Gurgurov
Josef van Genabith
Simon Ostermann
MoE
194
0
0
15 Oct 2025
LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
A. Zebaze
Rachel Bawden
Benoît Sagot
LRM
124
1
0
13 Oct 2025
End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs
End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs
Nam Luu
Ondřej Bojar
AuLLM
186
0
0
11 Oct 2025
Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages
Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages
Amir Hossein Yari
Kalmit Kulkarni
Ahmad Raza Khan
Fajri Koto
105
0
0
08 Oct 2025
LASER: An LLM-based ASR Scoring and Evaluation Rubric
LASER: An LLM-based ASR Scoring and Evaluation Rubric
Amruta Parulekar
Preethi Jyothi
100
1
0
08 Oct 2025
TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation
TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation
Adam Filipek
76
0
0
07 Oct 2025
Reward Models are Metrics in a Trench Coat
Reward Models are Metrics in a Trench Coat
Sebastian Gehrmann
136
0
0
03 Oct 2025
Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting?
Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting?
Oriol Pareras
Gerard I. Gállego
Federico Costa
Cristina España-Bonet
Javier Hernando
LRM
100
0
0
03 Oct 2025
A-VERT: Agnostic Verification with Embedding Ranking Targets
A-VERT: Agnostic Verification with Embedding Ranking Targets
Nicolás Aguirre
Ramiro Caso
Ramiro Rodríguez Colmeiro
Mauro Santelli
Joaquín Toranzo Calderón
112
0
0
01 Oct 2025
Self-Speculative Biased Decoding for Faster Re-Translation
Self-Speculative Biased Decoding for Faster Re-Translation
Linxiao Zeng
Haoyun Deng
Kangyuan Shu
Shizhen Wang
96
0
0
26 Sep 2025
Beyond statistical significance: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation
Beyond statistical significance: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation
Jonne Sälevä
Duygu Ataman
Constantine Lignos
128
0
0
26 Sep 2025
Semantic Agreement Enables Efficient Open-Ended LLM Cascades
Semantic Agreement Enables Efficient Open-Ended LLM Cascades
Duncan Soiffer
Steven Kolawole
Virginia Smith
226
0
0
26 Sep 2025
JGU Mainz's Submission to the WMT25 Shared Task on LLMs with Limited Resources for Slavic Languages: MT and QA
JGU Mainz's Submission to the WMT25 Shared Task on LLMs with Limited Resources for Slavic Languages: MT and QA
Hossain Shaikh Saadi
Minh Duc Bui
Mario Sanz-Guerrero
Katharina von der Wense
76
1
0
26 Sep 2025
UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice
UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice
Sitong Cheng
Weizhen Bian
Xinsheng Wang
Ruibin Yuan
Jianyi Chen
Shunshun Yin
Wenhan Luo
Wei Xue
128
0
0
25 Sep 2025
The role of synthetic data in Multilingual, Multi-cultural AI systems: Lessons from Indic Languages
The role of synthetic data in Multilingual, Multi-cultural AI systems: Lessons from Indic Languages
Pranjal A. Chitale
Varun Gumma
Sanchit Ahuja
Prashant Kodali
Manan Uppadhyay
Deepthi Sudharsan
Sunayana Sitaram
SyDa
128
0
0
25 Sep 2025
SiniticMTError: A Machine Translation Dataset with Error Annotations for Sinitic Languages
SiniticMTError: A Machine Translation Dataset with Error Annotations for Sinitic Languages
Hannah Liu
Junghyun Min
Ethan Yue Heng Cheung
Shou-Yi Hung
Syed Mekael Wasti
...
Elsie Chan
Ka Ieng Charlotte Lo
Wing Yu Yip
Richard Tzong-Han Tsai
En-Shiun Annie Lee
91
1
0
24 Sep 2025
DTW-Align: Bridging the Modality Gap in End-to-End Speech Translation with Dynamic Time Warping Alignment
DTW-Align: Bridging the Modality Gap in End-to-End Speech Translation with Dynamic Time Warping Alignment
Abderrahmane Issam
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
84
0
0
23 Sep 2025
Speech Vecalign: an Embedding-based Method for Aligning Parallel Speech Documents
Speech Vecalign: an Embedding-based Method for Aligning Parallel Speech Documents
Chutong Meng
Philipp Koehn
89
0
0
22 Sep 2025
Angular Dispersion Accelerates $k$-Nearest Neighbors Machine Translation
Angular Dispersion Accelerates kkk-Nearest Neighbors Machine Translation
Evgeniia Tokarchuk
S. Troshin
Vlad Niculae
104
0
0
20 Sep 2025
1234...353637
Next