The IIT Bombay English-Hindi Parallel Corpus

8 October 2017
Anoop Kunchukuttan
Pratik Mehta
P. Bhattacharyya
    AIMat

Papers citing "The IIT Bombay English-Hindi Parallel Corpus"

46 / 46 papers shown
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
Umer Butt
Stalin Veranasi
Günter Neumann
60
0
0
27 Mar 2025
A kinetic-based regularization method for data science applications
Abhisek Ganguly
Alessandro Gabbana
Vybhav Rao
Sauro Succi
Santosh Ansumali
57
0
0
06 Mar 2025
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
43
3
0
26 Sep 2024
SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India
Salam Michael Singh
Shubhmoy Kumar Garg
Amitesh Misra
Aaditeshwar Seth
Tanmoy Chakraborty
31
0
0
03 May 2024
Hindi to English: Transformer-Based Neural Machine Translation
Kavit Gangar
Hardik Ruparel
Shreyas Lele
25
5
0
23 Sep 2023
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Baban Gain
Dibyanayan Bandyopadhyay
Subhabrata Mukherjee
Chandranath Adak
Asif Ekbal
33
2
0
30 Aug 2023
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob Imani
Peiqin Lin
Amir Hossein Kargaran
Silvia Severini
Masoud Jalili Sabet
...
Chunlan Ma
Helmut Schmid
André F. T. Martins
François Yvon
Hinrich Schütze
ALM
LRM
49
96
0
20 May 2023
Accelerating Transformer Inference for Translation via Parallel Decoding
Andrea Santilli
Silvio Severino
Emilian Postolache
Valentino Maiorca
Michele Mancusi
R. Marin
Emanuele Rodolà
44
80
0
17 May 2023
Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages
Chunlan Ma
Ayyoob Imani
Haotian Ye
Renhao Pei
Ehsaneddin Asgari
Hinrich Schütze
40
23
0
15 May 2023
Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages
Sonal Sannigrahi
Rachel Bawden
37
0
0
04 May 2023
MUTANT: A Multi-sentential Code-mixed Hinglish Dataset
Rahul Gupta
Vivek Srivastava
M. Singh
34
1
0
23 Feb 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Yekun Chai
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua Wu
38
36
0
13 Dec 2022
ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation
Bin Shan
Yaqian Han
Weichong Yin
Shuohuan Wang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
MLLM
VLM
24
7
0
09 Nov 2022
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model
Mingqi Li
Fei Ding
Dan Zhang
Long Cheng
Hongxin Hu
Feng Luo
43
6
0
02 Nov 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Lekan Raheem
Maab Elrashid
34
1
0
31 Oct 2022
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data
Akshat Gahoi
Jayant Duneja
Anshul Padhi
Shivam Mangale
Saransh Rajput
Tanvi Kamble
D. Sharma
Vasudeva Varma
35
3
0
21 Oct 2022
Mismatching-Aware Unsupervised Translation Quality Estimation For Low-Resource Languages
Fatemeh Azadi
Heshaam Faili
M. Dousti
24
4
0
31 Jul 2022
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
Ahmet Üstün
Asa Cooper Stickland
42
7
0
23 May 2022
Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Yang Xiang
Zhihua Wu
Weibao Gong
Siyu Ding
Xianjie Mo
...
Yue Yu
Ge Li
Yu Sun
Yanjun Ma
Dianhai Yu
24
5
0
19 May 2022
Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding
Soumya Chatterjee
Sunita Sarawagi
Preethi Jyothi
36
1
0
02 Apr 2022
Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language
Arushi Sharma
Anubha Kabra
Minni Jain
29
52
0
18 Oct 2021
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints
Sahana Ramnath
Melvin Johnson
Abhirut Gupta
A. Raghuveer
48
8
0
09 Sep 2021
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining
Machel Reid
Mikel Artetxe
VLM
50
26
0
04 Aug 2021
Quality Evaluation of the Low-Resource Synthetically Generated Code-Mixed Hinglish Text
Vivek Srivastava
M. Singh
26
12
0
04 Aug 2021
MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation
Ayush Garg
S. S. Kagi
Vivek Srivastava
M. Singh
29
9
0
24 Jul 2021
From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text
Ishan Tarunesh
Syamantak Kumar
Preethi Jyothi
46
45
0
14 Jul 2021
HinGE: A Dataset for Generation and Evaluation of Code-Mixed Hinglish Text
Vivek Srivastava
M. Singh
32
45
0
08 Jul 2021
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
Zewen Chi
Li Dong
Bo Zheng
Shaohan Huang
Xian-Ling Mao
Heyan Huang
Furu Wei
45
67
0
11 Jun 2021
Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Ganesh Jawahar
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
L. Lakshmanan
34
29
0
18 May 2021
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Gowtham Ramesh
Sumanth Doddapaneni
Aravinth Bheemaraj
Mayank Jobanputra
AK Raghavan
...
K. Deepak
Vivek Raghavan
Anoop Kunchukuttan
Pratyush Kumar
Mitesh Khapra
LRM
37
231
0
12 Apr 2021
Robust Experimentation in the Continuous Time Bandit Problem
Pasquale Antonante
27
0
0
31 Mar 2021
Harnessing Multilinguality in Unsupervised Machine Translation for Rare Languages
Xavier Garcia
Aditya Siddhant
Orhan Firat
Ankur P. Parikh
30
31
0
23 Sep 2020
A Multilingual Parallel Corpora Collection Effort for Indian Languages
Shashank Siripragrada
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
VLM
32
47
0
15 Jul 2020
Cross-lingual Retrieval for Iterative Self-Supervised Training
C. Tran
Y. Tang
Xian Li
Jiatao Gu
RALM
28
73
0
16 Jun 2020
An Augmented Translation Technique for low Resource language pair: Sanskrit to Hindi translation
Rashi Kumar
Piyush Jha
V. Sahula
19
14
0
09 Jun 2020
GLUECoS : An Evaluation Benchmark for Code-Switched NLP
Simran Khanuja
Sandipan Dandapat
A. Srinivasan
Sunayana Sitaram
Monojit Choudhury
ELM
29
142
0
26 Apr 2020
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
42
172
0
01 Mar 2020
From English To Foreign Languages: Transferring Pre-trained Language Models
Ke M. Tran
30
49
0
18 Feb 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
55
1,777
0
22 Jan 2020
Self-attention based end-to-end Hindi-English Neural Machine Translation
Siddhant Srivastava
Ritu Tiwari
9
2
0
21 Sep 2019
A Universal Parent Model for Low-Resource Neural Machine Translation Transfer
Mozhdeh Gheini
Jonathan May
19
21
0
14 Sep 2019
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks
Haoyang Huang
Yaobo Liang
Nan Duan
Ming Gong
Linjun Shou
Daxin Jiang
M. Zhou
44
230
0
03 Sep 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,712
0
22 Jan 2019
Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages
V. Rudra Murthy
Anoop Kunchukuttan
P. Bhattacharyya
27
42
0
01 Nov 2018
XNLI: Evaluating Cross-lingual Sentence Representations
Alexis Conneau
Guillaume Lample
Ruty Rinott
Adina Williams
Samuel R. Bowman
Holger Schwenk
Veselin Stoyanov
ELM
23
1,349
0
13 Sep 2018
Neural Machine Translation for Low Resource Languages using Bilingual Lexicon Induced from Comparable Corpora
Sree Harsha Ramesh
Krishnamurthy Sankaranarayanan
22
36
0
25 Jun 2018