Improving short text classification through global augmentation methods

7 July 2019
Vukosi Marivate
T. Sefara
    VLM

Papers citing "Improving short text classification through global augmentation methods"

32 / 32 papers shown
From N-grams to Pre-trained Multilingual Models For Language Identification
Thapelo Sindane
Vukosi Marivate
24
1
0
11 Oct 2024
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
Camilla Casula
Sara Tonelli
26
0
0
10 Oct 2024
Parallel Corpus Augmentation using Masked Language Models
Vibhuti Kumari
Narayana Murthy Kavi
19
0
0
04 Oct 2024
LeCov: Multi-level Testing Criteria for Large Language Models
Xuan Xie
Jiayang Song
Yuheng Huang
Da Song
Fuyuan Zhang
Felix Juefei-Xu
Lei Ma
ELM
29
0
0
20 Aug 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
97
21
0
15 May 2024
Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies
Himmet Toprak Kesgin
M. Amasyalı
27
6
0
14 Feb 2024
Evaluation Metrics for Text Data Augmentation in NLP
Marcellus Amadeus
William Alberto Cruz Castañeda
30
1
0
09 Feb 2024
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
Juhwan Choi
Kyohoon Jin
Junho Lee
Sangmin Song
Youngbin Kim
22
1
0
08 Feb 2024
Benchmarking Large Multimodal Models against Common Corruptions
Jiawei Zhang
Tianyu Pang
Chao Du
Yi Ren
Bo-wen Li
Min-Bin Lin
MLLM
22
14
0
22 Jan 2024
Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling
Himmet Toprak Kesgin
M. Amasyalı
11
7
0
03 Jan 2024
Augmenty: A Python Library for Structured Text Augmentation
K. Enevoldsen
18
0
0
09 Dec 2023
From Big to Small Without Losing It All: Text Augmentation with ChatGPT for Efficient Sentiment Analysis
Stanislaw Woźniak
Jan Kocoń
38
9
0
07 Dec 2023
Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks
João A. Leite
Carolina Scarton
D. F. Silva
35
1
0
31 Jul 2023
Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili
Catherine Gitau
Vukosi Marivate
19
3
0
12 Jun 2023
Data Augmentation for Conflict and Duplicate Detection in Software Engineering Sentence Pairs
G. Malik
Mucahit Cevik
Ayse Basar
14
2
0
16 May 2023
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Ana Kotarcic
Dominik Hangartner
Fabrizio Gilardi
Selina Kurer
K. Donnay
24
2
0
05 Dec 2022
AugCSE: Contrastive Sentence Embedding with Diverse Augmentations
Zilu Tang
Muhammed Yusuf Kocyigit
Derry Wijaya
35
8
0
20 Oct 2022
Rethinking Textual Adversarial Defense for Pre-trained Language Models
Jiayi Wang
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
AAML
SILM
13
11
0
21 Jul 2022
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
169
86
0
06 Dec 2021
Training Cross-Lingual embeddings for Setswana and Sepedi
Mack Makgatho
Vukosi Marivate
T. Sefara
Valencia Wagner
AI4TS
8
2
0
11 Nov 2021
Semi-Supervised Adversarial Discriminative Domain Adaptation
Thai Nguyen
Anh Nguyen
Nghia Le
H. Le
GAN
16
10
0
27 Sep 2021
A Diversity-Enhanced and Constraints-Relaxed Augmentation for Low-Resource Classification
Guang Liu
Hailong Huang
Yuzhao Mao
Weiguo Gao
Xuan Li
Jianping Shen
33
1
0
24 Sep 2021
Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding
Yingmei Guo
Linjun Shou
J. Pei
Ming Gong
Mingxing Xu
Zhiyong Wu
Daxin Jiang
24
5
0
03 Sep 2021
Is it Fake? News Disinformation Detection on South African News Websites
Harm de Wet
Vukosi Marivate
4
3
0
06 Aug 2021
Sentiment Analysis of the COVID-related r/Depression Posts
Zihan Chen
Marina Sokolova
14
4
0
28 Jul 2021
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
28
334
0
07 Jul 2021
Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability
Venelin Kovatchev
Phillip Smith
Mark G. Lee
R. Devine
16
6
0
03 Jun 2021
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
Dayiheng Liu
Yeyun Gong
Jie Fu
Yu Yan
Jiusheng Chen
Jiancheng Lv
Nan Duan
M. Zhou
10
37
0
04 Oct 2020
Low resource language dataset creation, curation and classification: Setswana and Sepedi -- Extended Abstract
Vukosi Marivate
T. Sefara
Vongani Chabalala
Keamogetswe Makhaya
T. Mokgonyane
Rethabile Mokoena
Abiodun Modupe
6
4
0
30 Mar 2020
Investigating an approach for low resource language dataset creation, curation and classification: Setswana and Sepedi
Vukosi Marivate
T. Sefara
Vongani Chabalala
Keamogetswe Makhaya
T. Mokgonyane
Rethabile Mokoena
Abiodun Modupe
9
28
0
18 Feb 2020
Short Text Language Identification for Under Resourced Languages
B. Duvenhage
17
13
0
18 Nov 2019
Not Enough Data? Deep Learning to the Rescue!
Ateret Anaby-Tavor
Boaz Carmeli
Esther Goldbraich
Amir Kantor
George Kour
Segev Shlomov
N. Tepper
Naama Zwerdling
14
364
0
08 Nov 2019