ResearchTrend.AI
A Survey of Data Augmentation Approaches for NLP

7 May 2021
Steven Y. Feng
Varun Gangal
Jason W. Wei
Sarath Chandar
Soroush Vosoughi
Teruko Mitamura
Eduard H. Hovy
    AIMat

Papers citing "A Survey of Data Augmentation Approaches for NLP"

50 / 67 papers shown
Title
LLM-based Semantic Augmentation for Harmful Content Detection
Elyas Meguellati
Assaad Zeghina
S. Sadiq
Gianluca Demartini
27
0
0
22 Apr 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting
Linqi Yang
Xiongwei Zhao
Qihao Sun
Ke Wang
Ao Chen
Peng Kang
3DGS
65
2
0
07 Mar 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
73
11
0
31 Dec 2024
Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models
Mengze Hong
Yuanfeng Song
Di Jiang
Lu Wang
Zichang Guo
Yuanqin He
Zhiyang Su
Qing Li
35
1
0
16 Oct 2024
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation
Heejoon Koo
33
0
0
28 Jul 2024
Exploration of Masked and Causal Language Modelling for Text Generation
Nicolo Micheletti
Samuel Belkadi
Lifeng Han
Goran Nenadic
22
6
0
21 May 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
85
18
0
15 May 2024
Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks
Haifa Alrdahi
R. Batista-Navarro
30
0
0
10 May 2024
A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Ximing Dong
Dayi Lin
Shaowei Wang
Ahmed E. Hassan
20
1
0
29 Apr 2024
Probing the Robustness of Time-series Forecasting Models with CounterfacTS
Haakon Hanisch Kjaernli
Lluis Mas-Ribas
Aida Ashrafi
Gleb Sizov
Helge Langseth
Odd Erik Gundersen
AI4TS
13
0
0
06 Mar 2024
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
Juhwan Choi
Kyohoon Jin
Junho Lee
Sangmin Song
Youngbin Kim
11
1
0
08 Feb 2024
Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges
Vinay Samuel
Houda Aynaou
Arijit Ghosh Chowdhury
Karthik Venkat Ramanan
Aman Chadha
SyDa
6
7
0
21 Sep 2023
Are Large Language Models Really Robust to Word-Level Perturbations?
Haoyu Wang
Guozheng Ma
Cong Yu
Ning Gui
Linrui Zhang
...
Sen Zhang
Li Shen
Xueqian Wang
Peilin Zhao
Dacheng Tao
KELM
13
22
0
20 Sep 2023
ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation
Jianghao Lin
Rongjie Shan
Chenxu Zhu
Kounianhua Du
Bo Chen
Shigang Quan
Ruiming Tang
Yong Yu
Weinan Zhang
LRM
21
79
0
22 Aug 2023
Unlocking Hardware Security Assurance: The Potential of LLMs
Xingyu Meng
Amisha Srivastava
Ayush Arunachalam
Avik Ray
Pedro Henrique Silva
Rafail Psiakis
Yiorgos Makris
K. Basu
8
29
0
21 Aug 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles O'Neill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
29
1
0
15 Aug 2023
Towards Generalising Neural Topical Representations
Xiaohao Yang
He Zhao
Dinh Q. Phung
Lan Du
BDL
OOD
MedIm
6
1
0
24 Jul 2023
Data Augmentation for Machine Translation via Dependency Subtree Swapping
Attila Nagy
Dorina Lakatos
Botond Barta
Patrick Nanys
Judit Ács
18
1
0
13 Jul 2023
Semi-supervised Relation Extraction via Data Augmentation and Consistency-training
Komal K. Teru
30
5
0
16 Jun 2023
A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models
Le-le Cao
Vilhelm von Ehrenheim
Astrid Berghult
Cecilia Henje
Richard Anselmo Stahl
Joar Wandborg
S. Stan
Armin Catovic
Erik Ferm
Hannes Ingelhag
6
4
0
05 Jun 2023
Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization
Hyeon-Seob Kim
Minsu Kim
Sungsoo Ahn
Jinkyoo Park
OffRL
21
7
0
02 Jun 2023
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
Aoran Xiao
Xiaoqin Zhang
Ling Shao
Shijian Lu
3DPC
27
18
0
31 May 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
Guozheng Ma
Linrui Zhang
Haoyu Wang
Lu Li
Zilin Wang
Zhen Wang
Li Shen
Xueqian Wang
Dacheng Tao
28
9
0
25 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Y. Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
18
2
0
23 May 2023
Rethinking Data Augmentation for Tabular Data in Deep Learning
Soma Onishi
Shoya Meguro
LMTD
10
14
0
17 May 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
11
39
0
07 Apr 2023
REFINER: Reasoning Feedback on Intermediate Representations
Debjit Paul
Mete Ismayilzada
Maxime Peyrard
Beatriz Borges
Antoine Bosselut
Robert West
Boi Faltings
ReLM
LRM
12
168
0
04 Apr 2023
Exploring Data Augmentation Methods on Social Media Corpora
Isabel Garcia Pietri
Kineret Stanley
15
0
0
03 Mar 2023
STA: Self-controlled Text Augmentation for Improving Text Classifications
Congcong Wang
Gonzalo Fiz Pontiveros
Steven Derby
Tri Kurniawan Wijaya
23
3
0
24 Feb 2023
Data Augmentation for Modeling Human Personality: The Dexter Machine
Yair Neuman
Vladyslav Kozhukhov
Dan Vilenchik
SyDa
13
4
0
20 Jan 2023
Data-centric AI: Perspectives and Challenges
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Xia Hu
8
66
0
12 Jan 2023
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data
A. Nik
Ge Zhang
Xingran Chen
Mingyu Li
Jie Fu
14
4
0
04 Nov 2022
Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues
Jiao Ou
Jinchao Zhang
Yang Feng
Jie Zhou
16
13
0
30 Oct 2022
EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation
Sedrick Scott Keh
Rohit K Bharadwaj
Emmy Liu
Simone Tedeschi
Varun Gangal
Roberto Navigli
6
7
0
23 Oct 2022
CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation Models
Steven Y. Feng
Vivek Khetan
Bogdan Sacaleanu
A. Gershman
Eduard H. Hovy
LRM
17
10
0
09 Oct 2022
MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation
Zeming Dong
Qiang Hu
Yuejun Guo
Maxime Cordy
Mike Papadakis
Zhenya Zhang
Yves Le Traon
Jianjun Zhao
10
8
0
06 Oct 2022
An Empirical Study on Cross-X Transfer for Legal Judgment Prediction
Joel Niklaus
Matthias Sturmer
Ilias Chalkidis
ELM
AILaw
24
18
0
25 Sep 2022
PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation
Sedrick Scott Keh
Kevin Lu
Varun Gangal
Steven Y. Feng
Harsh Jhamtani
Malihe Alikhani
Eduard H. Hovy
21
2
0
16 Sep 2022
PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
Sedrick Scott Keh
Steven Y. Feng
Varun Gangal
Malihe Alikhani
Eduard H. Hovy
8
4
0
13 Sep 2022
Augraphy: A Data Augmentation Library for Document Images
Alexander Groleau
Kok Wei Chee
Stefan Larson
Samay Maini
Jonathan Boarman
8
10
0
30 Aug 2022
Formal Algorithms for Transformers
Mary Phuong
Marcus Hutter
6
68
0
19 Jul 2022
FairDistillation: Mitigating Stereotyping in Language Models
Pieter Delobelle
Bettina Berendt
17
8
0
10 Jul 2022
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations
Madeline Chantry Schiappa
Shruti Vyas
Hamid Palangi
Y. S. Rawat
Vibhav Vineet
VLM
101
17
0
05 Jul 2022
Data Augmentation for Dementia Detection in Spoken Language
Anna Hlédiková
Dominika Woszczyk
Alican Acman
Soteris Demetriou
Björn Schuller
9
12
0
26 Jun 2022
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
Le Zhang
Zichao Yang
Diyi Yang
18
24
0
12 May 2022
Detecting the Role of an Entity in Harmful Memes: Techniques and Their Limitations
R. N. Nandi
Firoj Alam
Preslav Nakov
12
6
0
09 May 2022
bitsa_nlp@LT-EDI-ACL2022: Leveraging Pretrained Language Models for Detecting Homophobia and Transphobia in Social Media Comments
Vitthal Bhandari
Poonam Goyal
15
16
0
27 Mar 2022
Contrastive-mixup learning for improved speaker verification
Xin Zhang
Minho Jin
R. Cheng
Ruirui Li
Eunjung Han
A. Stolcke
AAML
SSL
20
10
0
22 Feb 2022
Model-Agnostic Augmentation for Accurate Graph Classification
Jaemin Yoo
Sooyeon Shim
U. Kang
GNN
8
29
0
21 Feb 2022
AugLy: Data Augmentations for Robustness
Zoe Papakipos
Joanna Bitton
AAML
9
52
0
17 Jan 2022