Parallel Corpus Filtering via Pre-trained Language Models

13 May 2020
Boliang Zhang
Ajay Nagesh
Kevin Knight

Papers citing "Parallel Corpus Filtering via Pre-trained Language Models"

17 / 17 papers shown
A kinetic-based regularization method for data science applications
Abhisek Ganguly
Alessandro Gabbana
Vybhav Rao
Sauro Succi
Santosh Ansumali
464
7
0
06 Mar 2025
Improving the quality of Web-mined Parallel Corpora of Low-Resource Languages using Debiasing Heuristics
Aloka Fernando
Nisansa de Silva
Menan Velyuthan
Charitha Rathnayake
Surangika Ranathunga
416
1
0
26 Feb 2025
Positive Text Reframing under Multi-strategy Optimization
Shutong Jia
Biwei Cao
Qingqing Gao
Jiuxin Cao
Bo Liu
336
1
0
25 Jul 2024
Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
E. Chimoto
Jay Gala
Orevaoghene Ahia
Julia Kreutzer
Bruce A. Bassett
Sara Hooker
VLM
447
6
0
29 May 2024
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content
Charles O'Neill
Jack Miller
I. Ciucă
Y. Ting 丁
Thang Bui
244
10
0
26 Aug 2023
Discovering Language Model Behaviors with Model-Written Evaluations
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Ethan Perez
Sam Ringer
Kamilė Lukošiūtė
Karina Nguyen
Edwin Chen
...
Danny Hernandez
Deep Ganguli
Evan Hubinger
Nicholas Schiefer
Jared Kaplan
ALM
440
692
0
19 Dec 2022
High-Resource Methodological Bias in Low-Resource Investigations
Maartje ter Hoeve
David Grangier
Natalie Schluter
296
3
0
14 Nov 2022
Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods
Wei Li
Wenhao Wu
Moye Chen
Jiachen Liu
Xinyan Xiao
Hua Wu
HILM
398
38
0
10 Mar 2022
Empirical Analysis of Korean Public AI Hub Parallel Corpora and in-depth Analysis using LIWC
Chanjun Park
Midan Shim
Sugyeong Eo
Seolhwa Lee
Jaehyung Seo
Hyeonseok Moon
Heuiseok Lim
148
8
0
28 Oct 2021
Neural Machine Translation for Low-Resource Languages: A Survey
ACM Computing Surveys (CSUR), 2021
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
383
346
0
29 Jun 2021
Don't Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Rajat Bhatnagar
Ananya Ganesh
Katharina Kann
187
3
0
12 Jun 2021
Prevent the Language Model from being Overconfident in Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Mengqi Miao
Fandong Meng
Yijin Liu
Xiao-Hua Zhou
Jie Zhou
406
46
0
24 May 2021
The Curious Case of Hallucinations in Neural Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Vikas Raunak
Arul Menezes
Marcin Junczys-Dowmunt
640
232
0
14 Apr 2021
Assessing Reference-Free Peer Evaluation for Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Sweta Agrawal
George F. Foster
Markus Freitag
Colin Cherry
LRM
211
11
0
12 Apr 2021
Score Combination for Improved Parallel Corpus Filtering for Low Resource Conditions
Conference on Machine Translation (WMT), 2020
Muhammad N. ElNokrashy
Amr Hendy
M. Abdelghaffar
Mohamed Afify
Ahmed Tawfik
Hany Awadalla
177
3
0
16 Nov 2020
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Chunting Zhou
Graham Neubig
Jiatao Gu
Mona T. Diab
P. Guzmán
Luke Zettlemoyer
Marjan Ghazvininejad
HILM
518
200
0
05 Nov 2020
DiDi's Machine Translation System for WMT2020
Conference on Machine Translation (WMT), 2020
Tianrun Chen
Weiwei Wang
Wenyang Wei
Xing Shi
Xiangang Li
Jieping Ye
Kevin Knight
178
2
0
16 Oct 2020