Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.12239
Cited By
MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
25 April 2020
Jiaao Chen
Zichao Yang
Diyi Yang
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (357★)
Papers citing
"MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification"
50 / 189 papers shown
CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design
Zijun Gao
Mutian He
Shijia Sun
Hanqun Cao
J. Zhang
...
Xiaorui Wang
Xiaojun Yao
Chang-Yu Hsieh
Chunbin Gu
Pheng-Ann Heng
75
0
0
20 Nov 2025
LM-mixup: Text Data Augmentation via Language Model based Mixup
Zhijie Deng
Zhouan Shen
Ling Li
Yao Zhou
Zhaowei Zhu
Yanji He
Wei Wang
Jiaheng Wei
145
0
0
23 Oct 2025
Backtranslation and paraphrasing in the LLM era? Comparing data augmentation methods for emotion classification
International Conference on Conceptual Structures (ICCS), 2025
Łukasz Radliński
Mateusz Guściora
Jan Kocoñ
176
2
0
19 Jul 2025
MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification
Iustin Sîrbu
Robert-Adrian Popovici
Cornelia Caragea
Stefan Trausan-Matu
Traian Rebedea
353
2
0
09 Jun 2025
SMOTExT: SMOTE meets Large Language Models
Mateusz Bystroński
Mikołaj Hołysz
Grzegorz Piotrowski
Nitesh Chawla
Tomasz Kajdanowicz
256
1
0
19 May 2025
The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification
Arezoo Hatefi
Xuan-Son Vu
Monowar Bhuyan
Frank Drewes
VLM
305
0
0
10 May 2025
AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks
Ilyas Oulkadda
Julien Perez
ALM
243
0
0
05 May 2025
CGMatch: A Different Perspective of Semi-supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Bo Cheng
Jueqing Lu
Yuan Tian
Haifeng Zhao
Yi-Ju Chang
Lan Du
390
7
0
04 Mar 2025
MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification
Varun Vashisht
Siyang Song
Mihir Konduskar
Jaskaran Singh Walia
Vukosi Marivate
286
1
0
25 Feb 2025
TCProF: Time-Complexity Prediction SSL Framework
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Joonghyuk Hahn
Hyeseon Ahn
Jungin Kim
Soohan Lim
Yo-Sub Han
361
1
0
10 Feb 2025
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ran Xu
Hejie Cui
Yue Yu
Xuan Kan
Wenqi Shi
Yuchen Zhuang
Wei Jin
Joyce C. Ho
Carl Yang
465
35
0
28 Jan 2025
LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning
Knowledge Discovery and Data Mining (KDD), 2024
Fanshuang Kong
Richong Zhang
Ziqiao Wang
464
1
0
22 Dec 2024
Does VLM Classification Benefit from LLM Description Semantics?
AAAI Conference on Artificial Intelligence (AAAI), 2024
Pingchuan Ma
Lennart Rietdorf
Dmytro Kotovenko
Vincent Tao Hu
Bjorn Ommer
VLM
466
5
0
16 Dec 2024
Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Qianren Mao
Weifeng Jiang
Qingbin Liu
Chenghua Lin
Qian Li
Xianqing Wen
Jianxin Li
Jinhu Lu
370
1
0
01 Dec 2024
Soft-TransFormers for Continual Learning
Haeyong Kang
Chang D. Yoo
CLL
484
0
0
25 Nov 2024
Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide
Márton Szép
Daniel Rueckert
Rüdiger von Eisenhart-Rothe
Florian Hinterwimmer
SyDa
ALM
625
6
0
14 Nov 2024
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
International Conference on Learning Representations (ICLR), 2024
Yiming Wang
Pei Zhang
Baosong Yang
Yang Li
Rui Wang
LRM
449
48
0
17 Oct 2024
ALVIN: Active Learning Via INterpolation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Michalis Korakakis
Andreas Vlachos
Adrian Weller
357
0
0
11 Oct 2024
The Effects of Hallucinations in Synthetic Training Data for Relation Extraction
Steven Rogulsky
Nicholas Popovic
Michael Färber
HILM
344
9
0
10 Oct 2024
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Yuxin Xiao
Shujian Zhang
Wenxuan Zhou
Marzyeh Ghassemi
Sanqiang Zhao
1.2K
2
0
07 Oct 2024
Exploring Empty Spaces: Human-in-the-Loop Data Augmentation
International Conference on Human Factors in Computing Systems (CHI), 2024
Catherine Yeh
Donghao Ren
Yannick Assogba
Dominik Moritz
Fred Hohman
393
8
0
01 Oct 2024
Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification
Guanyi Mou
Yichuan Li
Kyumin Lee
348
3
0
26 Sep 2024
FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection
International Conference on Pattern Recognition (ICPR), 2024
Xinying Lu
Jianli Xiao
135
0
0
12 Sep 2024
Investigating the Impact of Semi-Supervised Methods with Data Augmentation on Offensive Language Detection in Romanian Language
International Conference on Knowledge-Based Intelligent Information & Engineering Systems (KES), 2024
Elena Beatrice Nicola
Dumitru-Clementin Cercel
Florin-Catalin Pop
334
1
0
29 Jul 2024
Scalable Language Model with Generalized Continual Learning
Bohao Peng
Zhuotao Tian
Shu Liu
Mingchang Yang
Jiaya Jia
ALM
CLL
KELM
276
35
0
11 Apr 2024
Heterogeneous Contrastive Learning for Foundation Models and Beyond
Lecheng Zheng
Baoyu Jing
Zihao Li
Hanghang Tong
Jingrui He
VLM
315
42
0
30 Mar 2024
Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization
Zihan Wang
Jiayu Xiao
Mengxian Li
Zhongjiang He
Yongxiang Li
Chao Wang
Shuangyong Song
219
4
0
16 Mar 2024
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks
Minju Seo
Jinheon Baek
James Thorne
Sung Ju Hwang
RALM
314
21
0
21 Feb 2024
Evaluation Metrics for Text Data Augmentation in NLP
Marcellus Amadeus
William Alberto Cruz Castañeda
200
2
0
09 Feb 2024
Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better
Shengchao Liu
Xiaoming Liu
Yichen Wang
Zehua Cheng
Chengzhengxu Li
Zhaohan Zhang
Y. Lan
Chao Shen
DeLMO
284
16
0
01 Feb 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
538
53
0
27 Jan 2024
IndiText Boost: Text Augmentation for Low Resource India Languages
Onkar Litake
Niraj Yagnik
S. Labhsetwar
VLM
197
6
0
23 Jan 2024
Neural Networks Against (and For) Self-Training: Classification with Small Labeled and Large Unlabeled Sets
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Payam Karisani
284
2
0
31 Dec 2023
A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment Analysis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jingyi Zhou
Jie Zhou
Jiabao Zhao
Siyin Wang
Haijun Shan
Gui Tao
Tao Gui
Xuanjing Huang
LLMAG
VLM
245
7
0
16 Dec 2023
Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yoo-Seok Kho
Jaehee Kim
Pilsung Kang
VLM
284
1
0
08 Dec 2023
Summarization-based Data Augmentation for Document Classification
Yueguan Wang
Naoki Yoshinaga
VLM
RALM
183
1
0
01 Dec 2023
Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiahuan Yan
Haojun Gao
Zhang Kai
Weize Liu
Benlin Liu
Jian Wu
Jintai Chen
235
7
0
28 Nov 2023
SCStory: Self-supervised and Continual Online Story Discovery
The Web Conference (WWW), 2023
Susik Yoon
Yu Meng
Dongha Lee
Jiawei Han
CLL
248
14
0
27 Nov 2023
SegMix: A Simple Structure-Aware Data Augmentation Method
Yuxin Pei
Pushkar Bhuse
Zhengzhong Liu
Eric P. Xing
267
1
0
16 Nov 2023
Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation
Jiaqi Wu
Junbiao Pang
Qingming Huang
169
0
0
03 Nov 2023
CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster Tweet Classification
Henry Peng Zou
Yue Zhou
Cornelia Caragea
Doina Caragea
294
3
0
23 Oct 2023
JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Henry Peng Zou
Cornelia Caragea
316
30
0
23 Oct 2023
DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet Classification via Memory Bank
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Henry Peng Zou
Yue Zhou
Weizhi Zhang
Cornelia Caragea
180
14
0
23 Oct 2023
Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jianing Wang
Qiushi Sun
Nuo Chen
Chengyu Wang
Jun Huang
Ming Gao
Xiang Li
UQLM
395
5
0
19 Oct 2023
TK-KNN: A Balanced Distance-Based Pseudo Labeling Approach for Semi-Supervised Intent Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nicholas Botzer
David Vasquez
Tim Weninger
I. Laradji
260
1
0
17 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
309
8
0
11 Oct 2023
AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer
PeerJ Computer Science (PeerJ Comput. Sci.), 2023
Leixin Yang
Yu Xiang
481
3
0
22 Sep 2023
AttentionMix: Data augmentation method that relies on BERT attention mechanism
Dominik Lewy
Jacek Mańdziuk
312
4
0
20 Sep 2023
Dual-Decoder Consistency via Pseudo-Labels Guided Data Augmentation for Semi-Supervised Medical Image Segmentation
Yuanbin Chen
Tao Wang
Hui Tang
Yuanbin Chen
Ruige Zong
Shun Chen
Longxuan Zhao
Xinlin Zhang
Tong Tong
364
4
0
31 Aug 2023
Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Zhengxiang Wang
221
1
0
29 Jun 2023
1
2
3
4
Next
Page 1 of 4