ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,732 papers shown
SoK: Are Watermarks in LLMs Ready for Deployment?
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
176
1
0
24 Dec 2025
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch
Abhishek Ghosh
Ajay Nayak
Ashish Panwar
Arkaprava Basu
GNN
436
2
0
24 Dec 2025
MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation
MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation
Youxin Pang
Jiajun Liu
L. Tan
Yong Zhang
Feng Gao
Xiang Deng
Zhuoliang Kang
Xiaoming Wei
Y. Liu
VGen
130
0
0
02 Dec 2025
Label Forensics: Interpreting Hard Labels in Black-Box Text Classifier
Label Forensics: Interpreting Hard Labels in Black-Box Text Classifier
Mengyao Du
Gang Yang
Han Fang
Quanjun Yin
Ee-Chien Chang
111
0
0
01 Dec 2025
Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Muhammad Muneeb
David B. Ascher
Ahsan Baidar Bakht
104
0
0
29 Nov 2025
Standard Occupation Classifier -- A Natural Language Processing Approach
Standard Occupation Classifier -- A Natural Language Processing Approach
Sidharth Rony
Jack Patman
130
0
0
28 Nov 2025
SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features
SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features
Mohammad Zare
54
0
0
26 Nov 2025
Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Kaifeng Hong
Yinglong Zhang
Xiaoying Hong
Xuewen Xia
Xing Xu
277
0
0
26 Nov 2025
Efficient Covariance Estimation for Sparsified Functional Data
Efficient Covariance Estimation for Sparsified Functional Data
Sijie Zheng
Fandong Meng
Jie Zhou
97
1
0
23 Nov 2025
A multi-view contrastive learning framework for spatial embeddings in risk modelling
A multi-view contrastive learning framework for spatial embeddings in risk modelling
Freek Holvoet
Christopher Blier-Wong
Katrien Antonio
49
0
0
22 Nov 2025
Spanning Tree Autoregressive Visual Generation
Spanning Tree Autoregressive Visual Generation
Sangkyu Lee
Changho Lee
Janghoon Han
Hosung Song
Tackgeun You
Hwasup Lim
Stanley Jungkyu Choi
Honglak Lee
Youngjae Yu
205
0
0
21 Nov 2025
Analysis of heart failure patient trajectories using sequence modeling
Analysis of heart failure patient trajectories using sequence modeling
Falk Dippela
Yinan Yu
Annika Rosengren
Martin Lindgren
Christina E. Lundberg
Erik Aerts
Martin Adiels
Helen Sjöland
Mamba
289
0
0
20 Nov 2025
Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels
Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels
Sourya Dipta Das
Shubham Kumar
Kuldeep Yadav
118
0
0
17 Nov 2025
MURPHY: Multi-Turn GRPO for Self Correcting Code Generation
MURPHY: Multi-Turn GRPO for Self Correcting Code Generation
C. Ekbote
Vijay Lingam
Behrooz Omidvar-Tehrani
Jun Huan
Sujay Sanghavi
Anoop Deoras
Stefano Soatto
LRM
158
0
0
11 Nov 2025
Evaluating Large Language Models for Anxiety, Depression, and Stress Detection: Insights into Prompting Strategies and Synthetic Data
Evaluating Large Language Models for Anxiety, Depression, and Stress Detection: Insights into Prompting Strategies and Synthetic Data
Mihael Arcan
David-Paul Niland
AI4MH
592
0
0
10 Nov 2025
Comparing Reconstruction Attacks on Pretrained Versus Full Fine-tuned Large Language Model Embeddings on Homo Sapiens Splice Sites Genomic Data
Comparing Reconstruction Attacks on Pretrained Versus Full Fine-tuned Large Language Model Embeddings on Homo Sapiens Splice Sites Genomic Data
Reem Al-Saidi
Erman Ayday
Ziad Kobti
AAML
96
0
0
09 Nov 2025
Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding
Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding
Qian Ma
Ruoxiang Xu
Yongqiang Cai
93
0
0
09 Nov 2025
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
Yuantian Shao
Yuanteng Chen
Peisong Wang
Jianlin Yu
Jing Lin
Yiwu Yao
Zhihui Wei
Jian Cheng
MQ
365
1
0
06 Nov 2025
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
Peter Atandoh
Jie Zou
Weikang Guo
Jiwei Wei
Zheng Wang
194
0
0
01 Nov 2025
Reversal Invariance in Autoregressive Language Models
Reversal Invariance in Autoregressive Language Models
Mihir Sahasrabudhe
62
0
0
01 Nov 2025
Enhancing Sentiment Classification with Machine Learning and Combinatorial Fusion
Enhancing Sentiment Classification with Machine Learning and Combinatorial Fusion
Sean Patten
Pin-Yu Chen
Christina Schweikert
D. Frank Hsu
100
0
0
30 Oct 2025
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
Mădălina Zgreabăn
Tejaswini Deoskar
Lasha Abzianidze
120
0
0
28 Oct 2025
SALSA: Single-pass Autoregressive LLM Structured Classification
SALSA: Single-pass Autoregressive LLM Structured Classification
Ruslan Berdichevsky
Shai Nahum-Gefen
Elad Ben Zaken
147
0
0
26 Oct 2025
Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges
Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges
Cheng Huang
Nyima Tashi
Fan Gao
Yutong Liu
J. Li
...
Guojie Tang
Xiangxiang Wang
Jia Zhang
Tsengdar J. Lee
Yongbin Yu
119
0
0
22 Oct 2025
IMB: An Italian Medical Benchmark for Question Answering
IMB: An Italian Medical Benchmark for Question Answering
Antonio Romano
Giuseppe Riccio
Mariano Barone
Marco Postiglione
V. Moscato
AI4MH
239
0
0
21 Oct 2025
Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs
Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs
Yehor Tereshchenko
Mika Hämäläinen
150
1
0
20 Oct 2025
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
Yongxin He
Shan Zhang
Yixuan Cao
Lei Ma
Ping Luo
DeLMO
247
1
0
20 Oct 2025
RL makes MLLMs see better than SFT
RL makes MLLMs see better than SFT
Junha Song
Sangdoo Yun
Dongyoon Han
Jaegul Choo
Byeongho Heo
OffRL
196
0
0
18 Oct 2025
TRI-DEP: A Trimodal Comparative Study for Depression Detection Using Speech, Text, and EEG
TRI-DEP: A Trimodal Comparative Study for Depression Detection Using Speech, Text, and EEG
Annisaa Fitri Nurfidausi
Eleonora Mancini
Paolo Torroni
61
0
0
16 Oct 2025
Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference
Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference
Nikhil Bhendawade
K. Nishu
Arnav Kundu
Chris Bartels
Minsik Cho
Irina Belousova
LRM
335
0
0
15 Oct 2025
ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification
ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification
Utsav Nareti
Suraj Kumar
Soumya Pandey
S. Chattopadhyay
Chandranath Adak
VLM
165
0
0
14 Oct 2025
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Xu Pan
Ely Hahami
Jingxuan Fan
Ziqian Xie
H. Sompolinsky
213
1
0
10 Oct 2025
SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 Tweets
SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 Tweets
Qiang Yang
Xiuying Chen
Changsheng Ma
Rui Yin
Xin Gao
Xiangliang Zhang
103
0
0
09 Oct 2025
Language models for longitudinal analysis of abusive content in Billboard Music Charts
Language models for longitudinal analysis of abusive content in Billboard Music Charts
Rohitash Chandra
Yathin Suresh
Divyansh Raj Sinha
Sanchit Jindal
68
0
0
06 Oct 2025
Self-Speculative Masked Diffusions
Self-Speculative Masked Diffusions
Andrew Campbell
Valentin De Bortoli
Jiaxin Shi
Arnaud Doucet
DiffM
164
4
0
04 Oct 2025
Allocation of Parameters in Transformers
Allocation of Parameters in Transformers
Ruoxi Yu
Haotian Jiang
Jingpu Cheng
Penghao Yu
Qianxiao Li
Zhong Li
MoE
161
0
0
04 Oct 2025
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
147
0
0
04 Oct 2025
Multimodal Foundation Models for Early Disease Detection
Multimodal Foundation Models for Early Disease Detection
Md Talha Mohsin
Ismail Abdulrashid
147
1
0
02 Oct 2025
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
Raahul Krishna Durairaju
K. Saruladha
182
0
0
02 Oct 2025
GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling
GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling
Jose I. Mestre
Alberto Fernández-Hernández
Cristian Pérez-Corral
Manuel F. Dolz
Jose Duato
Enrique S. Quintana-Ortí
181
0
0
01 Oct 2025
Evaluating Spatiotemporal Consistency in Automatically Generated Sewing Instructions
Evaluating Spatiotemporal Consistency in Automatically Generated Sewing Instructions
Luisa Geiger
Mareike Hartmann
Michael Sullivan
Alexander Koller
106
0
0
29 Sep 2025
Text Adversarial Attacks with Dynamic Outputs
Text Adversarial Attacks with Dynamic Outputs
Wenqiang Wang
Siyuan Liang
Xiao Yan
Xiaochun Cao
AAML
108
0
0
26 Sep 2025
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Mingze Dong
Leda Wang
Yuval Kluger
SSL
143
1
0
25 Sep 2025
Performance Consistency of Learning Methods for Information Retrieval Tasks
Performance Consistency of Learning Methods for Information Retrieval Tasks
Meng Yuan
Justin Zobel
97
0
0
25 Sep 2025
Confidence Calibration in Large Language Model-Based Entity Matching
Confidence Calibration in Large Language Model-Based Entity Matching
Iris Kamsteeg
Juan Cardenas-Cartagena
Floris van Beers
Gineke ten Holt
Tsegaye Misikir Tashu
Matias Valdenegro-Toro
117
0
0
23 Sep 2025
Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations
Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations
Lekkala Sai Teja
Annepaka Yadagiri
Sangam Sai Anish
Siva Gopala Krishna Nuthakki
Partha Pakray
AAMLDeLMO
230
1
0
22 Sep 2025
DRES: Fake news detection by dynamic representation and ensemble selection
DRES: Fake news detection by dynamic representation and ensemble selection
Faramarz Farhangian
Leandro A. Ensina
George D. C. Cavalcanti
Rafael M. O. Cruz
167
3
0
21 Sep 2025
A Multi-Level Benchmark for Causal Language Understanding in Social Media Discourse
A Multi-Level Benchmark for Causal Language Understanding in Social Media Discourse
Xiaohan Ding
Kaike Ping
Buse Çarık
Eugenia H Rho
148
2
0
20 Sep 2025
Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification
Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification
Tian Lan
Yiming Zheng
Jianxin Yin
156
0
0
19 Sep 2025
Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers
Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers
Krati Saxena
Federico Jurado Ruiz
Guido Manzi
Dianbo Liu
Alex Lamb
206
0
0
19 Sep 2025
1234...737475
Next
Page 1 of 75
Pageof 75