MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

25 April 2020

Jiaao Chen

Zichao Yang

Diyi Yang

VLM

ArXiv (abs)PDF HTML Github (357★)

Papers citing "MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification"

50 / 189 papers shown

CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design

...

20 Nov 2025

LM-mixup: Text Data Augmentation via Language Model based Mixup

145

23 Oct 2025

Backtranslation and paraphrasing in the LLM era? Comparing data augmentation methods for emotion classificationInternational Conference on Conceptual Structures (ICCS), 2025

Łukasz Radliński

Mateusz Guściora

Jan Kocoñ

176

19 Jul 2025

MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification

Iustin Sîrbu

Robert-Adrian Popovici

Cornelia Caragea

Stefan Trausan-Matu

Traian Rebedea

353

09 Jun 2025

SMOTExT: SMOTE meets Large Language Models

256

19 May 2025

The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification

305

10 May 2025

AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks

Ilyas Oulkadda

Julien Perez

ALM

243

05 May 2025

CGMatch: A Different Perspective of Semi-supervised LearningComputer Vision and Pattern Recognition (CVPR), 2025

390

04 Mar 2025

MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification

286

25 Feb 2025

TCProF: Time-Complexity Prediction SSL FrameworkNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

361

10 Feb 2025

Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

465

28 Jan 2025

LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt TuningKnowledge Discovery and Data Mining (KDD), 2024

Fanshuang Kong

Richong Zhang

Ziqiao Wang

464

22 Dec 2024

Does VLM Classification Benefit from LLM Description Semantics?AAAI Conference on Artificial Intelligence (AAAI), 2024

466

16 Dec 2024

Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self TranscendenceNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

370

01 Dec 2024

Soft-TransFormers for Continual Learning

Haeyong Kang

Chang D. Yoo

CLL

484

25 Nov 2024

Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide

Márton Szép

Daniel Rueckert

Rüdiger von Eisenhart-Rothe

Florian Hinterwimmer

SyDa ALM

625

14 Nov 2024

Latent Space Chain-of-Embedding Enables Output-free LLM Self-EvaluationInternational Conference on Learning Representations (ICLR), 2024

Yiming Wang

449

17 Oct 2024

ALVIN: Active Learning Via INterpolationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Michalis Korakakis

Andreas Vlachos

Adrian Weller

357

11 Oct 2024

The Effects of Hallucinations in Synthetic Training Data for Relation Extraction

344

10 Oct 2024

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

1.2K

07 Oct 2024

Exploring Empty Spaces: Human-in-the-Loop Data AugmentationInternational Conference on Human Factors in Computing Systems (CHI), 2024

Dominik Moritz

393

01 Oct 2024

Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification

Guanyi Mou

Yichuan Li

Kyumin Lee

348

26 Sep 2024

FPMT: Enhanced Semi-Supervised Model for Traffic Incident DetectionInternational Conference on Pattern Recognition (ICPR), 2024

Xinying Lu

Jianli Xiao

135

12 Sep 2024

Investigating the Impact of Semi-Supervised Methods with Data Augmentation on Offensive Language Detection in Romanian LanguageInternational Conference on Knowledge-Based Intelligent Information & Engineering Systems (KES), 2024

Elena Beatrice Nicola

Dumitru-Clementin Cercel

Florin-Catalin Pop

334

29 Jul 2024

Scalable Language Model with Generalized Continual Learning

276

11 Apr 2024

Heterogeneous Contrastive Learning for Foundation Models and Beyond

315

30 Mar 2024

Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization

Zhongjiang He

219

16 Mar 2024

Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks

314

21 Feb 2024

Evaluation Metrics for Text Data Augmentation in NLP

Marcellus Amadeus

William Alberto Cruz Castañeda

200

09 Feb 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better

Xiaoming Liu

284

01 Feb 2024

A Survey on Data Augmentation in Large Model Era

538

27 Jan 2024

IndiText Boost: Text Augmentation for Low Resource India Languages

197

23 Jan 2024

Neural Networks Against (and For) Self-Training: Classification with Small Labeled and Large Unlabeled SetsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Payam Karisani

284

31 Dec 2023

A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment AnalysisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Jie Zhou

Xuanjing Huang

245

16 Dec 2023

Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

284

08 Dec 2023

Summarization-based Data Augmentation for Document Classification

Yueguan Wang

Naoki Yoshinaga

VLM RALM

183

01 Dec 2023

Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Jintai Chen

235

28 Nov 2023

SCStory: Self-supervised and Continual Online Story DiscoveryThe Web Conference (WWW), 2023

248

27 Nov 2023

SegMix: A Simple Structure-Aware Data Augmentation Method

267

16 Nov 2023

Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation

Jiaqi Wu

Junbiao Pang

Qingming Huang

169

03 Nov 2023

CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster Tweet Classification

Henry Peng Zou

Yue Zhou

Cornelia Caragea

Doina Caragea

294

23 Oct 2023

JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Henry Peng Zou

Cornelia Caragea

316

23 Oct 2023

DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet Classification via Memory BankConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Henry Peng Zou

Yue Zhou

Weizhi Zhang

Cornelia Caragea

180

23 Oct 2023

Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Chengyu Wang

Xiang Li

395

19 Oct 2023

TK-KNN: A Balanced Distance-Based Pseudo Labeling Approach for Semi-Supervised Intent ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

260

17 Oct 2023

RobustGEC: Robust Grammatical Error Correction Against Subtle Context PerturbationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yue Zhang

Leyang Cui

Enbo Zhao

Wei Bi

Shuming Shi

309

11 Oct 2023

AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in TransformerPeerJ Computer Science (PeerJ Comput. Sci.), 2023

Leixin Yang

Yu Xiang

481

22 Sep 2023

AttentionMix: Data augmentation method that relies on BERT attention mechanism

Dominik Lewy

Jacek Mańdziuk

312

20 Sep 2023

Dual-Decoder Consistency via Pseudo-Labels Guided Data Augmentation for Semi-Supervised Medical Image Segmentation

364

31 Aug 2023

Probabilistic Linguistic Knowledge and Token-level Text Augmentation

Zhengxiang Wang

221

29 Jun 2023