ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,732 papers shown
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Christopher Ormerod
248
0
0
28 May 2025
MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection
MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection
Yinuo Xue
Eric Spero
Yun Sing Koh
Giovanni Russello
AAML
272
11
0
26 May 2025
Multi-Party Conversational Agents: A Survey
Multi-Party Conversational Agents: A Survey
Sagar Sapkota
M. Hasan
Mubarak Shah
Santu Karmaker
LLMAG
307
0
0
24 May 2025
A Position Paper on the Automatic Generation of Machine Learning Leaderboards
A Position Paper on the Automatic Generation of Machine Learning Leaderboards
Roelien C Timmer
Yufang Hou
Stephen Wan
459
1
0
23 May 2025
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Muhammad Monjurul Karim
Yan Shi
Shucheng Zhang
Bingzhang Wang
Mehrdad Nasri
Yinhai Wang
198
12
0
19 May 2025
Spatial-LLaVA: Enhancing Large Language Models with Spatial Referring Expressions for Visual Understanding
Spatial-LLaVA: Enhancing Large Language Models with Spatial Referring Expressions for Visual Understanding
Xuefei Sun
Doncey Albin
Cecilia Mauceri
Dusty Woods
Christoffer Heckman
LRM
227
1
0
18 May 2025
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Chenlu Wang
Weimin Lyu
Ritwik Banerjee
219
0
0
17 May 2025
Hierarchical Bracketing Encodings for Dependency Parsing as Tagging
Hierarchical Bracketing Encodings for Dependency Parsing as TaggingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Ana Ezquerro
David Vilares
Anssi Yli-Jyrä
Carlos Gómez-Rodríguez
350
1
0
16 May 2025
An empirical study of task and feature correlations in the reuse of pre-trained models
An empirical study of task and feature correlations in the reuse of pre-trained models
Jama Hussein Mohamud
Willie Brink
172
0
0
15 May 2025
Multi-Token Prediction Needs Registers
Multi-Token Prediction Needs Registers
Anastasios Gerontopoulos
Spyros Gidaris
N. Komodakis
390
3
0
15 May 2025
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Xiaoliang Luo
Xinyi Xu
Michael Ramscar
Bradley C. Love
391
0
0
13 May 2025
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Chang Zong
Yueting Zhuang
Jian Shao
Weiming Lu
334
1
0
13 May 2025
I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference
I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference
Zibo Gao
Junjie Hu
Feng Guo
Yixin Zhang
Yinglong Han
Siyuan Liu
Haiyang Li
Zhiqiang Lv
410
2
0
10 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLMLRM
197
0
0
10 May 2025
Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
Dhruvesh Patel
Aishwarya Sahoo
Avinash Amballa
Tahira Naseem
Tim G. J. Rudner
Andrew McCallum
KELM
520
2
0
09 May 2025
Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification
Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
338
1
0
05 May 2025
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
Yingquan Chen
Qianmu Li
Xiaocong Wu
Huifeng Li
Qing Chang
DiffM
346
1
0
02 May 2025
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language ModelsThe VLDB journal (VLDB J.), 2025
Junxuan Zhang
Jiadong Wang
Haoyang Li
Lidan Shou
Ke Chen
Gang Chen
Qin Xie
Guiming Xie
Xuejian Gong
192
1
0
24 Apr 2025
Bridging Cognition and Emotion: Empathy-Driven Multimodal Misinformation Detection
Bridging Cognition and Emotion: Empathy-Driven Multimodal Misinformation Detection
Zihan Wang
Lu Yuan
Zhengxuan Zhang
Qing Zhao
166
1
0
24 Apr 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
287
1
0
24 Apr 2025
RAGAT-Mind: A Multi-Granular Modeling Approach for Rumor Detection Based on MindSpore
RAGAT-Mind: A Multi-Granular Modeling Approach for Rumor Detection Based on MindSpore
Zhenkai Qin
Guifang Yang
Dongze Wu
MoE
258
0
0
24 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
293
0
0
23 Apr 2025
Sentiment Analysis in Software Engineering: Evaluating Generative Pre-trained Transformers
Sentiment Analysis in Software Engineering: Evaluating Generative Pre-trained Transformers
KM Khalid Saifullah
Faiaz Azmain
Habiba Hye
111
0
0
22 Apr 2025
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform
Xingyu Lu
Tianke Zhang
Chang Meng
Xinyu Wang
Jinpeng Wang
...
Hai-Tao Zheng
Fan Yang
Yan Li
Di Zhang
Kun Gai
OffRL
254
7
0
21 Apr 2025
Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation
Q-FAKER: Query-free Hard Black-box Attack via Controlled GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
CheolWon Na
YunSeok Choi
Jee-Hyong Lee
AAML
189
0
0
18 Apr 2025
Transformers Can Overcome the Curse of Dimensionality: A Theoretical Study from an Approximation Perspective
Transformers Can Overcome the Curse of Dimensionality: A Theoretical Study from an Approximation Perspective
Yuling Jiao
Yanming Lai
Yang Wang
Bokai Yan
249
1
0
18 Apr 2025
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
Shiwei Ding
Lan Zhang
Zhenlin Wang
Giuseppe Ateniese
Xiaoyong Yuan
236
0
0
16 Apr 2025
Looking beyond the next token
Looking beyond the next token
Abitha Thankaraj
Yiding Jiang
J. Zico Kolter
Yonatan Bisk
LRM
380
4
0
15 Apr 2025
C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset
C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection DatasetThe Web Conference (WWW), 2025
Fuqiang Niu
Yue Yang
Xianghua Fu
Genan Dai
Bowen Zhang
350
2
0
14 Apr 2025
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data
Shuai Zhao
Linchao Zhu
Yi Yang
Yi Yang
459
4
0
14 Apr 2025
Confidence Regularized Masked Language Modeling using Text Length
Confidence Regularized Masked Language Modeling using Text Length
Seunghyun Ji
Soowon Lee
382
0
0
08 Apr 2025
SapiensID: Foundation for Human Recognition
SapiensID: Foundation for Human RecognitionComputer Vision and Pattern Recognition (CVPR), 2025
Minchul Kim
Dingqiang Ye
Yiyang Su
Feng Liu
Xiaoming Liu
CVBMVLM
291
8
0
07 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
Jiafeng Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
1.2K
2
0
07 Apr 2025
TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context
TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context
S. Nigam
Balaramamahanthi Deepak Patnaik
Shivam Mishra
Noel Shallum
Kripabandhu Ghosh
Arnab Bhattacharya
AILawELM
713
1
0
07 Apr 2025
Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection
Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection
Nasar Iqbal
Niki Martinel
Mamba
232
1
0
04 Apr 2025
Is Less Really More? Fake News Detection with Limited Information
Is Less Really More? Fake News Detection with Limited InformationSIGKDD Explorations (SIGKDD Explor.), 2025
Zhaoyang Cao
John Nguyen
Reza Zafarani
258
0
0
02 Apr 2025
A thorough benchmark of automatic text classification: From traditional approaches to large language models
A thorough benchmark of automatic text classification: From traditional approaches to large language models
Washington Cunha
Leonardo Rocha
M. A. Gonçalves
VLM
186
6
0
02 Apr 2025
COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking
COST: Contrastive One-Stage Transformer for Vision-Language Small Object TrackingInformation Fusion (Inf. Fusion), 2025
Chunhui Zhang
Li Liu
Jialin Gao
Xin Sun
Hao Wen
Xi Zhou
Shiming Ge
Yucheng Wang
299
4
0
02 Apr 2025
Semantic Adapter for Universal Text Embeddings: Diagnosing and Mitigating Negation Blindness to Enhance Universality
Semantic Adapter for Universal Text Embeddings: Diagnosing and Mitigating Negation Blindness to Enhance Universality
Hongliu Cao
420
1
0
01 Apr 2025
A Retrieval-Based Approach to Medical Procedure Matching in Romanian
A Retrieval-Based Approach to Medical Procedure Matching in Romanian
Andrei Niculae
Adrian Cosma
Emilian Radoi
339
2
0
26 Mar 2025
AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction
AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction
Sadaf Khademi
Mehran Shabanpour
Reza Taleei
A. Oikonomou
Arash Mohammadi
MedIm
231
0
0
26 Mar 2025
Improving User Behavior Prediction: Leveraging Annotator Metadata in Supervised Machine Learning Models
Improving User Behavior Prediction: Leveraging Annotator Metadata in Supervised Machine Learning ModelsProceedings of the ACM on Human-Computer Interaction (PACMHCI), 2025
Lynnette Ng
Kokil Jaidka
Kaiyuan Tay
Hansin Ahuja
Niyati Chhaya
308
2
0
26 Mar 2025
Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content
Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content
Sai Kartheek Reddy Kasu
Shankar Biradar
Sunil Saumya
329
1
0
20 Mar 2025
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-DistillationComputer Vision and Pattern Recognition (CVPR), 2025
Andrea Maracani
Savas Ozkan
Sijun Cho
Hyowon Kim
Eunchung Noh
Jeongwon Min
Cho Jung Min
Dookun Park
Mete Ozay
411
1
0
20 Mar 2025
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Yizhou Sun
Juan Yin
Juan Zhao
Fan Zhang
Yongheng Liu
Hongji Chen
249
0
0
19 Mar 2025
Towards Detecting Persuasion on Social Media: From Model Development to Insights on Persuasion Strategies
Towards Detecting Persuasion on Social Media: From Model Development to Insights on Persuasion Strategies
Elyas Meguellati
Stefano Civelli
Pietro Bernardelle
S. Sadiq
Gianluca Demartini
Gianluca Demartini
236
0
0
18 Mar 2025
Can Large Vision Language Models Read Maps Like a Human?
Can Large Vision Language Models Read Maps Like a Human?
Shuo Xing
Zezhou Sun
Shuangyu Xie
Kaiyuan Chen
Yanjia Huang
Yuping Wang
Jiachen Li
Dezhen Song
Zhengzhong Tu
391
20
0
18 Mar 2025
A Survey on Federated Fine-tuning of Large Language Models
A Survey on Federated Fine-tuning of Large Language Models
Yebo Wu
Chunlin Tian
Jingguang Li
He Sun
Kahou Tam
Zhanting Zhou
Haicheng Liao
Zhijiang Guo
Li Li
Chengzhong Xu
FedML
523
5
0
15 Mar 2025
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More MoreAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Arvid Frydenlund
LRM
560
2
0
13 Mar 2025
How Well Does Your Tabular Generator Learn the Structure of Tabular Data?
Xiangjian Jiang
Nikola Simidjievski
M. Jamnik
LMTD
274
2
0
13 Mar 2025
Previous
123456...737475
Next
Page 3 of 75
Pageof 75