v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,732 papers shown

SoK: Are Watermarks in LLMs Ready for Deployment?

178

24 Dec 2025

PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch

437

24 Dec 2025

MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation

130

02 Dec 2025

Label Forensics: Interpreting Hard Labels in Black-Box Text Classifier

111

01 Dec 2025

Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets

Muhammad Muneeb

David B. Ascher

Ahsan Baidar Bakht

104

29 Nov 2025

Standard Occupation Classifier -- A Natural Language Processing Approach

Sidharth Rony

Jack Patman

130

28 Nov 2025

SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features

Mohammad Zare

26 Nov 2025

Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning

277

26 Nov 2025

Efficient Covariance Estimation for Sparsified Functional Data

Sijie Zheng

Fandong Meng

Jie Zhou

23 Nov 2025

A multi-view contrastive learning framework for spatial embeddings in risk modelling

Freek Holvoet

Christopher Blier-Wong

Katrien Antonio

22 Nov 2025

Spanning Tree Autoregressive Visual Generation

205

21 Nov 2025

Analysis of heart failure patient trajectories using sequence modeling

Christina E. Lundberg

290

20 Nov 2025

Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels

Sourya Dipta Das

Shubham Kumar

Kuldeep Yadav

118

17 Nov 2025

MURPHY: Multi-Turn GRPO for Self Correcting Code Generation

C. Ekbote

Vijay Lingam

Behrooz Omidvar-Tehrani

162

11 Nov 2025

Evaluating Large Language Models for Anxiety, Depression, and Stress Detection: Insights into Prompting Strategies and Synthetic Data

Mihael Arcan

David-Paul Niland

AI4MH

593

10 Nov 2025

Comparing Reconstruction Attacks on Pretrained Versus Full Fine-tuned Large Language Model Embeddings on Homo Sapiens Splice Sites Genomic Data

09 Nov 2025

Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding

Qian Ma

Ruoxiang Xu

Yongqiang Cai

09 Nov 2025

DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization

365

06 Nov 2025

Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction

194

01 Nov 2025

Reversal Invariance in Autoregressive Language Models

Mihir Sahasrabudhe

01 Nov 2025

Enhancing Sentiment Classification with Machine Learning and Combinatorial Fusion

100

30 Oct 2025

MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference

Mădălina Zgreabăn

Tejaswini Deoskar

Lasha Abzianidze

123

28 Oct 2025

SALSA: Single-pass Autoregressive LLM Structured Classification

Ruslan Berdichevsky

Shai Nahum-Gefen

Elad Ben Zaken

147

26 Oct 2025

Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges

...

119

22 Oct 2025

IMB: An Italian Medical Benchmark for Question Answering

239

21 Oct 2025

Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs

Yehor Tereshchenko

Mika Hämäläinen

150

20 Oct 2025

DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning

247

20 Oct 2025

RL makes MLLMs see better than SFT

196

18 Oct 2025

TRI-DEP: A Trimodal Comparative Study for Depression Detection Using Speech, Text, and EEG

Annisaa Fitri Nurfidausi

Eleonora Mancini

Paolo Torroni

16 Oct 2025

Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference

336

15 Oct 2025

ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

165

14 Oct 2025

Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs

213

10 Oct 2025

SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 Tweets

103

09 Oct 2025

Language models for longitudinal analysis of abusive content in Billboard Music Charts

06 Oct 2025

Self-Speculative Masked Diffusions

164

04 Oct 2025

Allocation of Parameters in Transformers

161

04 Oct 2025

Towards Sampling Data Structures for Tensor Products in Turnstile Streams

Zhao Song

Shenghao Xie

Samson Zhou

147

04 Oct 2025

Multimodal Foundation Models for Early Disease Detection

Md Talha Mohsin

Ismail Abdulrashid

147

02 Oct 2025

PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning

Raahul Krishna Durairaju

K. Saruladha

182

02 Oct 2025

GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling

Jose I. Mestre

Alberto Fernández-Hernández

Cristian Pérez-Corral

Manuel F. Dolz

Jose Duato

Enrique S. Quintana-Ortí

181

01 Oct 2025

Evaluating Spatiotemporal Consistency in Automatically Generated Sewing Instructions

106

29 Sep 2025

Text Adversarial Attacks with Dynamic Outputs

108

26 Sep 2025

Understanding and Enhancing Mask-Based Pretraining towards Universal Representations

143

25 Sep 2025

Performance Consistency of Learning Methods for Information Retrieval Tasks

Meng Yuan

Justin Zobel

25 Sep 2025

Confidence Calibration in Large Language Model-Based Entity Matching

Iris Kamsteeg

Juan Cardenas-Cartagena

Floris van Beers

Gineke ten Holt

Tsegaye Misikir Tashu

Matias Valdenegro-Toro

117

23 Sep 2025

Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations

Lekkala Sai Teja

Annepaka Yadagiri

Sangam Sai Anish

Siva Gopala Krishna Nuthakki

Partha Pakray

AAML DeLMO

230

22 Sep 2025

DRES: Fake news detection by dynamic representation and ensemble selection

Faramarz Farhangian

Leandro A. Ensina

George D. C. Cavalcanti

Rafael M. O. Cruz

168

21 Sep 2025

A Multi-Level Benchmark for Causal Language Understanding in Social Media Discourse

148

20 Sep 2025

Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification

Tian Lan

Yiming Zheng

Jianxin Yin

156

19 Sep 2025

Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers

206

19 Sep 2025