ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.07446
  4. Cited By
Quality Does Matter: A Detailed Look at the Quality and Utility of
  Web-Mined Parallel Corpora
v1v2v3 (latest)

Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
12 February 2024
Surangika Ranathunga
Nisansa de Silva
Menan Velayuthan
Aloka Fernando
Charitha Rathnayake
ArXiv (abs)PDFHTMLGithub (3662★)

Papers citing "Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora"

7 / 7 papers shown
How Do LLMs Persuade? Linear Probes Can Uncover Persuasion Dynamics in Multi-Turn Conversations
How Do LLMs Persuade? Linear Probes Can Uncover Persuasion Dynamics in Multi-Turn Conversations
Brandon Jaipersaud
David M. Krueger
Ekdeep Singh Lubana
163
4
0
07 Aug 2025
Improving the quality of Web-mined Parallel Corpora of Low-Resource Languages using Debiasing Heuristics
Improving the quality of Web-mined Parallel Corpora of Low-Resource Languages using Debiasing Heuristics
Aloka Fernando
Nisansa de Silva
Menan Velyuthan
Charitha Rathnayake
Surangika Ranathunga
418
1
0
26 Feb 2025
Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
Yomal De Mel
Kasun Wickramasinghe
Nisansa de Silva
Surangika Ranathunga
451
6
0
03 Jan 2025
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise in Machine Translation
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise in Machine Translation
Yan Meng
Di Wu
Christof Monz
477
4
0
02 Jul 2024
A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models
A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models
Peiqin Lin
Marcely Zanon Boito
Hinrich Schütze
618
3
0
29 Jun 2024
Machine Translation Models are Zero-Shot Detectors of Translation Direction
Machine Translation Models are Zero-Shot Detectors of Translation DirectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Michelle Wastl
Jannis Vamvas
Rico Sennrich
VLM
671
0
0
12 Jan 2024
MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority
  Languages in China
MC2^22: Towards Transparent and Culturally-Aware NLP for Minority Languages in ChinaAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Chen Zhang
Mingxu Tao
Quzhe Huang
Jiuheng Lin
Zhibin Chen
Yansong Feng
331
9
0
14 Nov 2023
1
Page 1 of 1