PHINC: A Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation

20 April 2020
Vivek Srivastava
M. Singh

Papers citing "PHINC: A Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation"

34 / 34 papers shown
CodeMixBench: Evaluating Code-Mixing Capabilities of LLMs Across 18 Languages
Yilun Yang
Yekun Chai
83
2
0
24 Jul 2025
Creating and Evaluating Code-Mixed Nepali-English and Telugu-English Datasets for Abusive Language Detection Using Traditional and Deep Learning Models
Manish Pandey
Nageshwar Prasad Yadav
Mokshada Adduru
Sawan Rai
103
0
0
23 Apr 2025
COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing
Rajvee Sheth
Himanshu Beniwal
Mayank Singh
144
2
0
27 Mar 2025
A kinetic-based regularization method for data science applications
Abhisek Ganguly
Alessandro Gabbana
Vybhav Rao
Sauro Succi
Santosh Ansumali
230
1
0
06 Mar 2025
LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Mohamed Bayan Kmainasi
Ali Ezzat Shahroor
Maram Hasanain
Sahinur Rahman Laskar
Naeemul Hassan
Firoj Alam
193
4
0
20 Oct 2024
Code-Mixer Ya Nahi: Novel Approaches to Measuring Multilingual LLMs' Code-Mixing Capabilities
Ayushman Gupta
Akhil Bhogal
Kripabandhu Ghosh
105
3
0
14 Oct 2024
MINERS: Multilingual Language Models as Semantic Retrievers
Genta Indra Winata
Ruochen Zhang
David Ifeoluwa Adelani
RALM
264
11
0
11 Jun 2024
Code-mixed Sentiment and Hate-speech Prediction
Anjali Yadav
Tanya Garg
Matej Klemen
Matej Ulčar
Basant Agarwal
Marko Robnik-Šikonja
131
7
0
21 May 2024
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
Kamal Kumar
Yinhan Liu
Parth Patwa
Tanmoy
Mihir Adam Roberts
177
4
0
25 Mar 2024
Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
A. Seza Doğruöz
Sunayana Sitaram
Zheng-Xin Yong
146
17
0
31 Oct 2023
CONFLATOR: Incorporating Switching Point based Rotatory Positional Encodings for Code-Mixed Language Modeling
Mohsin Ali
K. S. Teja
Neeharika Gupta
Parth Patwa
Anubhab Chatterjee
Vinija Jain
Vasu Sharma
Amitava Das
137
1
0
11 Sep 2023
MUTANT: A Multi-sentential Code-mixed Hinglish Dataset
Findings (Findings), 2023
Rahul Gupta
Vivek Srivastava
M. Singh
102
1
0
23 Feb 2023
Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sin-En Lu
Bo-Han Lu
Chaohong Lu
Richard Tzong-Han Tsai
102
9
0
21 Jan 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Genta Indra Winata
Alham Fikri Aji
Zheng-Xin Yong
Thamar Solorio
167
45
0
19 Dec 2022
ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English
Workshop on Arabic Natural Language Processing (WANLP), 2022
Injy Hamed
Farah E. Shamout
Slim Abdennadher
Ngoc Thang Vu
98
17
0
22 Nov 2022
Language Agnostic Code-Mixing Data Augmentation by Predicting Linguistic Patterns
Shuyue Stella Li
Kenton W. Murray
106
4
0
14 Nov 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Conference on Machine Translation (WMT), 2022
Lekan Raheem
Maab Elrashid
92
1
0
31 Oct 2022
EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Chenxi Whitehouse
Fenia Christopoulou
Ignacio Iacobacci
179
12
0
22 Oct 2022
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data
Conference on Machine Translation (WMT), 2022
Akshat Gahoi
Jayant Duneja
Anshul Padhi
Shivam Mangale
Saransh Rajput
Tanvi Kamble
D. Sharma
Vasudeva Varma
155
4
0
21 Oct 2022
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
Conference on Machine Translation (WMT), 2022
A. Khan
Hrishikesh Kanade
G. Budhrani
Preet Jhanglani
Jia Xu
211
3
0
21 Oct 2022
The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT)
Conference on Machine Translation (WMT), 2022
Faheem Kirefu
Vivek Iyer
Pinzhen Chen
Laurie Burchell
MoE
112
1
0
20 Oct 2022
Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Marwa Gaser
Manuel Mager
Injy Hamed
Farah E. Shamout
Slim Abdennadher
Ngoc Thang Vu
147
7
0
11 Oct 2022
Study of Encoder-Decoder Architectures for Code-Mix Search Query Translation
Mandar M. Kulkarni
Soumya Chennabasavaraj
Nikesh Garera
101
3
0
07 Aug 2022
JU_NLP at HinglishEval: Quality Evaluation of the Low-Resource Code-Mixed Hinglish Text
Prantik Guha
Rudra Dhar
Dipankar Das
91
2
0
16 Jun 2022
Am I No Good? Towards Detecting Perceived Burdensomeness and Thwarted Belongingness from Suicide Notes
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Soumitra Ghosh
Asif Ekbal
P. Bhattacharyya
110
8
0
20 May 2022
A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer
IEEE Transactions on Computational Social Systems (IEEE TCSS), 2022
Ayan Sengupta
Tharun Suresh
Md. Shad Akhtar
Tanmoy Chakraborty
124
11
0
27 Apr 2022
CALCS 2021 Shared Task: Machine Translation for Code-Switched Data
Shuguang Chen
Gustavo Aguilar
A. Srinivasan
Mona T. Diab
Thamar Solorio
121
17
0
19 Feb 2022
Quality Evaluation of the Low-Resource Synthetically Generated Code-Mixed Hinglish Text
Vivek Srivastava
M. Singh
120
13
0
04 Aug 2021
MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation
Ayush Garg
S. S. Kagi
Vivek Srivastava
M. Singh
108
9
0
24 Jul 2021
HinGE: A Dataset for Generation and Evaluation of Code-Mixed Hinglish Text
Vivek Srivastava
M. Singh
127
51
0
08 Jul 2021
Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text
Vivek Srivastava
M. Singh
118
21
0
18 Jun 2021
Challenges and Considerations with Code-Mixed NLP for Multilingual Societies
Vivek Srivastava
M. Singh
147
5
0
15 Jun 2021
Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Ganesh Jawahar
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
L. Lakshmanan
153
31
0
18 May 2021
IIT Gandhinagar at SemEval-2020 Task 9: Code-Mixed Sentiment Classification Using Candidate Sentence Generation and Selection
Vivek Srivastava
M. Singh
165
13
0
25 Jun 2020