1906.08237
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,732 papers shown
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
176
1
0
24 Dec 2025
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch
Abhishek Ghosh
Ajay Nayak
Ashish Panwar
Arkaprava Basu
GNN
436
2
0
24 Dec 2025
MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation
Youxin Pang
Jiajun Liu
L. Tan
Yong Zhang
Feng Gao
Xiang Deng
Zhuoliang Kang
Xiaoming Wei
Y. Liu
VGen
130
0
0
02 Dec 2025
Label Forensics: Interpreting Hard Labels in Black-Box Text Classifier
Mengyao Du
Gang Yang
Han Fang
Quanjun Yin
Ee-Chien Chang
111
0
0
01 Dec 2025
Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Muhammad Muneeb
David B. Ascher
Ahsan Baidar Bakht
104
0
0
29 Nov 2025
Standard Occupation Classifier -- A Natural Language Processing Approach
Sidharth Rony
Jack Patman
130
0
0
28 Nov 2025
SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features
Mohammad Zare
54
0
0
26 Nov 2025
Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Kaifeng Hong
Yinglong Zhang
Xiaoying Hong
Xuewen Xia
Xing Xu
277
0
0
26 Nov 2025
Efficient Covariance Estimation for Sparsified Functional Data
Sijie Zheng
Fandong Meng
Jie Zhou
97
1
0
23 Nov 2025
A multi-view contrastive learning framework for spatial embeddings in risk modelling
Freek Holvoet
Christopher Blier-Wong
Katrien Antonio
49
0
0
22 Nov 2025
Spanning Tree Autoregressive Visual Generation
Sangkyu Lee
Changho Lee
Janghoon Han
Hosung Song
Tackgeun You
Hwasup Lim
Stanley Jungkyu Choi
Honglak Lee
Youngjae Yu
205
0
0
21 Nov 2025
Analysis of heart failure patient trajectories using sequence modeling
Falk Dippela
Yinan Yu
Annika Rosengren
Martin Lindgren
Christina E. Lundberg
Erik Aerts
Martin Adiels
Helen Sjöland
Mamba
289
0
0
20 Nov 2025
Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels
Sourya Dipta Das
Shubham Kumar
Kuldeep Yadav
118
0
0
17 Nov 2025
MURPHY: Multi-Turn GRPO for Self Correcting Code Generation
C. Ekbote
Vijay Lingam
Behrooz Omidvar-Tehrani
Jun Huan
Sujay Sanghavi
Anoop Deoras
Stefano Soatto
LRM
158
0
0
11 Nov 2025
Evaluating Large Language Models for Anxiety, Depression, and Stress Detection: Insights into Prompting Strategies and Synthetic Data
Mihael Arcan
David-Paul Niland
AI4MH
592
0
0
10 Nov 2025
Comparing Reconstruction Attacks on Pretrained Versus Full Fine-tuned Large Language Model Embeddings on Homo Sapiens Splice Sites Genomic Data
Reem Al-Saidi
Erman Ayday
Ziad Kobti
AAML
96
0
0
09 Nov 2025
Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding
Qian Ma
Ruoxiang Xu
Yongqiang Cai
93
0
0
09 Nov 2025
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
Yuantian Shao
Yuanteng Chen
Peisong Wang
Jianlin Yu
Jing Lin
Yiwu Yao
Zhihui Wei
Jian Cheng
MQ
365
1
0
06 Nov 2025
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
Peter Atandoh
Jie Zou
Weikang Guo
Jiwei Wei
Zheng Wang
194
0
0
01 Nov 2025
Reversal Invariance in Autoregressive Language Models
Mihir Sahasrabudhe
62
0
0
01 Nov 2025
Enhancing Sentiment Classification with Machine Learning and Combinatorial Fusion
Sean Patten
Pin-Yu Chen
Christina Schweikert
D. Frank Hsu
100
0
0
30 Oct 2025
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
Mădălina Zgreabăn
Tejaswini Deoskar
Lasha Abzianidze
120
0
0
28 Oct 2025
SALSA: Single-pass Autoregressive LLM Structured Classification
Ruslan Berdichevsky
Shai Nahum-Gefen
Elad Ben Zaken
147
0
0
26 Oct 2025
Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges
Cheng Huang
Nyima Tashi
Fan Gao
Yutong Liu
J. Li
...
Guojie Tang
Xiangxiang Wang
Jia Zhang
Tsengdar J. Lee
Yongbin Yu
119
0
0
22 Oct 2025
IMB: An Italian Medical Benchmark for Question Answering
Antonio Romano
Giuseppe Riccio
Mariano Barone
Marco Postiglione
V. Moscato
AI4MH
239
0
0
21 Oct 2025
Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs
Yehor Tereshchenko
Mika Hämäläinen
150
1
0
20 Oct 2025
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
Yongxin He
Shan Zhang
Yixuan Cao
Lei Ma
Ping Luo
DeLMO
247
1
0
20 Oct 2025
RL makes MLLMs see better than SFT
Junha Song
Sangdoo Yun
Dongyoon Han
Jaegul Choo
Byeongho Heo
OffRL
196
0
0
18 Oct 2025
TRI-DEP: A Trimodal Comparative Study for Depression Detection Using Speech, Text, and EEG
Annisaa Fitri Nurfidausi
Eleonora Mancini
Paolo Torroni
61
0
0
16 Oct 2025
Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference
Nikhil Bhendawade
K. Nishu
Arnav Kundu
Chris Bartels
Minsik Cho
Irina Belousova
LRM
335
0
0
15 Oct 2025
ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification
Utsav Nareti
Suraj Kumar
Soumya Pandey
S. Chattopadhyay
Chandranath Adak
VLM
165
0
0
14 Oct 2025
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Xu Pan
Ely Hahami
Jingxuan Fan
Ziqian Xie
H. Sompolinsky
213
1
0
10 Oct 2025
SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 Tweets
Qiang Yang
Xiuying Chen
Changsheng Ma
Rui Yin
Xin Gao
Xiangliang Zhang
103
0
0
09 Oct 2025
Language models for longitudinal analysis of abusive content in Billboard Music Charts
Rohitash Chandra
Yathin Suresh
Divyansh Raj Sinha
Sanchit Jindal
68
0
0
06 Oct 2025
Self-Speculative Masked Diffusions
Andrew Campbell
Valentin De Bortoli
Jiaxin Shi
Arnaud Doucet
DiffM
164
4
0
04 Oct 2025
Allocation of Parameters in Transformers
Ruoxi Yu
Haotian Jiang
Jingpu Cheng
Penghao Yu
Qianxiao Li
Zhong Li
MoE
161
0
0
04 Oct 2025
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
147
0
0
04 Oct 2025
Multimodal Foundation Models for Early Disease Detection
Md Talha Mohsin
Ismail Abdulrashid
147
1
0
02 Oct 2025
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
Raahul Krishna Durairaju
K. Saruladha
182
0
0
02 Oct 2025
GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling
Jose I. Mestre
Alberto Fernández-Hernández
Cristian Pérez-Corral
Manuel F. Dolz
Jose Duato
Enrique S. Quintana-Ortí
181
0
0
01 Oct 2025
Evaluating Spatiotemporal Consistency in Automatically Generated Sewing Instructions
Luisa Geiger
Mareike Hartmann
Michael Sullivan
Alexander Koller
106
0
0
29 Sep 2025
Text Adversarial Attacks with Dynamic Outputs
Wenqiang Wang
Siyuan Liang
Xiao Yan
Xiaochun Cao
AAML
108
0
0
26 Sep 2025
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Mingze Dong
Leda Wang
Yuval Kluger
SSL
143
1
0
25 Sep 2025
Performance Consistency of Learning Methods for Information Retrieval Tasks
Meng Yuan
Justin Zobel
97
0
0
25 Sep 2025
Confidence Calibration in Large Language Model-Based Entity Matching
Iris Kamsteeg
Juan Cardenas-Cartagena
Floris van Beers
Gineke ten Holt
Tsegaye Misikir Tashu
Matias Valdenegro-Toro
117
0
0
23 Sep 2025
Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations
Lekkala Sai Teja
Annepaka Yadagiri
Sangam Sai Anish
Siva Gopala Krishna Nuthakki
Partha Pakray
AAML
DeLMO
230
1
0
22 Sep 2025
DRES: Fake news detection by dynamic representation and ensemble selection
Faramarz Farhangian
Leandro A. Ensina
George D. C. Cavalcanti
Rafael M. O. Cruz
167
3
0
21 Sep 2025
A Multi-Level Benchmark for Causal Language Understanding in Social Media Discourse
Xiaohan Ding
Kaike Ping
Buse Çarık
Eugenia H Rho
148
2
0
20 Sep 2025
Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification
Tian Lan
Yiming Zheng
Jianxin Yin
156
0
0
19 Sep 2025
Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers
Krati Saxena
Federico Jurado Ruiz
Guido Manzi
Dianbo Liu
Alex Lamb
206
0
0
19 Sep 2025