ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXivPDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 1,069 papers shown
Title
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
43
0
0
10 May 2025
Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
Dhruvesh Patel
Aishwarya Sahoo
Avinash Amballa
Tahira Naseem
Tim G. J. Rudner
Andrew McCallum
KELM
47
0
0
09 May 2025
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
20
0
0
05 May 2025
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
Yingquan Chen
Qianmu Li
Xiaocong Wu
Huifeng Li
Qing Chang
DiffM
24
0
0
02 May 2025
Looking beyond the next token
Looking beyond the next token
Abitha Thankaraj
Yiding Jiang
J. Zico Kolter
Yonatan Bisk
LRM
57
1
0
15 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
J. Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
123
0
0
07 Apr 2025
Is Less Really More? Fake News Detection with Limited Information
Is Less Really More? Fake News Detection with Limited Information
Zhaoyang Cao
John Nguyen
Reza Zafarani
54
0
0
02 Apr 2025
How Well Does Your Tabular Generator Learn the Structure of Tabular Data?
Xiangjian Jiang
Nikola Simidjievski
M. Jamnik
LMTD
80
0
0
13 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches
Bousselham EL HADDAOUI
R. Chiheb
R. Faizi
A. E. Afia
44
0
0
13 Mar 2025
eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference
Suraiya Tairin
Shohaib Mahmud
Haiying Shen
Anand Iyer
MoE
132
0
0
10 Mar 2025
Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs
Gonzalo Mancera
Daniel DeAlcala
Julian Fierrez
Ruben Tolosana
Aythami Morales
46
1
0
10 Mar 2025
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition
Yifei Duan
Raphael Shang
Deng Liang
Yongqiang Cai
80
0
0
28 Feb 2025
Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation
Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation
Y. Wang
Xinnan Dai
Wenqi Fan
Yao Ma
74
1
0
26 Feb 2025
CAMEx: Curvature-aware Merging of Experts
CAMEx: Curvature-aware Merging of Experts
Dung V. Nguyen
Minh H. Nguyen
Luc Q. Nguyen
R. Teo
T. Nguyen
Linh Duy Tran
MoMe
81
2
0
26 Feb 2025
Detecting Code Vulnerabilities with Heterogeneous GNN Training
Detecting Code Vulnerabilities with Heterogeneous GNN Training
Yu Luo
Weifeng Xu
Dianxiang Xu
41
0
0
24 Feb 2025
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification
Yubo Wang
Haoyang Li
Fei Teng
Lei Chen
91
1
0
17 Feb 2025
Lexical Substitution is not Synonym Substitution: On the Importance of Producing Contextually Relevant Word Substitutes
Lexical Substitution is not Synonym Substitution: On the Importance of Producing Contextually Relevant Word Substitutes
Juraj Vladika
Stephen Meisenbacher
Florian Matthes
130
0
0
06 Feb 2025
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
A. K. Kadhim
Lei Jiao
R. Shafik
Ole-Christoffer Granmo
DeLMO
72
0
0
31 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Peiling Yi
A. Zubiaga
Yunfei Long
85
0
0
28 Jan 2025
Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks
Ziwei Liu
Qi Zhang
Lifu Gao
34
0
0
28 Jan 2025
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
P. Tiwald
Ivona Krchova
Andrey Sidorenko
Mariana Vargas-Vieyra
Mario Scriminaci
Michael Platzer
47
1
0
21 Jan 2025
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters
Ziyue Luo
Jia-Wei Liu
Myungjin Lee
Ness B. Shroff
39
0
0
09 Jan 2025
Efficient support ticket resolution using Knowledge Graphs
Sherwin Varghese
James Tian
25
0
0
03 Jan 2025
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu
Yixin Ji
Z. Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
92
0
0
02 Dec 2024
Hysteresis Activation Function for Efficient Inference
Hysteresis Activation Function for Efficient Inference
Moshe Kimhi
Idan Kashani
A. Mendelson
Chaim Baskin
LLMSV
33
0
0
15 Nov 2024
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
41
2
0
28 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye
Jiahui Gao
Shansan Gong
Lin Zheng
Xin Jiang
Z. Li
Lingpeng Kong
DiffM
LRM
46
15
0
18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
SeongYeub Chu
JongWoo Kim
Bryan Wong
MunYong Yi
LRM
19
1
0
18 Oct 2024
CMAL: A Novel Cross-Modal Associative Learning Framework for
  Vision-Language Pre-Training
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training
Zhiyuan Ma
Jianjun Li
Guohui Li
Kaiyan Huang
VLM
54
9
0
16 Oct 2024
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks
Yijiong Yu
Ma Xiufa
Fang Jianwei
Zhi-liang Xu
Su Guangyao
...
Zhixiao Qi
Wei Wang
W. Liu
Ran Chen
Ji Pei
LRM
RALM
27
0
0
06 Oct 2024
Variational Language Concepts for Interpreting Foundation Language
  Models
Variational Language Concepts for Interpreting Foundation Language Models
Hengyi Wang
Shiwei Tan
Zhiqing Hong
Desheng Zhang
Hao Wang
32
3
0
04 Oct 2024
The Roles of Generative Artificial Intelligence in Internet of Electric
  Vehicles
The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles
Hanwen Zhang
Dusit Niyato
Wei Zhang
Changyuan Zhao
Hongyang Du
Abbas Jamalipour
Sumei Sun
Yiyang Pei
AI4CE
42
2
0
24 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
21
1
0
18 Sep 2024
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion
  Generation
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation
Seyed Rohollah Hosseyni
Ali Ahmad Rahmani
S. J. Seyedmohammadi
Sanaz Seyedin
Arash Mohammadi
DiffM
38
5
0
17 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using
  LLMs
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
29
7
0
15 Sep 2024
Layerwise Change of Knowledge in Neural Networks
Layerwise Change of Knowledge in Neural Networks
Xu Cheng
Lei Cheng
Zhaoran Peng
Yang Xu
Tian Han
Quanshi Zhang
KELM
FAtt
33
6
0
13 Sep 2024
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item Prediction
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item Prediction
Elisabeth Fischer
Albin Zehe
Andreas Hotho
Daniel Schlor
HAI
26
0
0
28 Aug 2024
End-to-end Semantic-centric Video-based Multimodal Affective Computing
End-to-end Semantic-centric Video-based Multimodal Affective Computing
Ronghao Lin
Ying Zeng
Sijie Mai
Haifeng Hu
VGen
40
0
0
14 Aug 2024
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Yonghui Wang
Shaokai Liu
Li Li
Wengang Zhou
Houqiang Li
ViT
40
1
0
07 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
30
11
0
01 Aug 2024
Informed Correctors for Discrete Diffusion Models
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
Lester W. Mackey
Scott W. Linderman
Lester Mackey
Scott Linderman
42
9
0
30 Jul 2024
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction
  with Phonetic Analysis
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
S. Dashti
A. K. Bardsiri
M. J. Shahbazzadeh
34
3
0
20 Jul 2024
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek
  Language based on Textually Represented Environments
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments
D. Papadopoulos
Katerina Metropoulou
N. Matsatsinis
N. Papadakis
LRM
25
3
0
13 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
40
2
0
12 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
42
43
0
09 Jul 2024
MST5 -- Multilingual Question Answering over Knowledge Graphs
MST5 -- Multilingual Question Answering over Knowledge Graphs
Nikit Srivastava
Mengshi Ma
Daniel Vollmers
Hamada M. Zahera
Diego Moussallem
A. N. Ngomo
29
0
0
08 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between
  Autoregressive and Masked Pretraining
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
34
3
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
60
3
0
01 Jul 2024
When Search Engine Services meet Large Language Models: Visions and
  Challenges
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Mengnan Du
Shuaiqiang Wang
Dawei Yin
Sumi Helal
53
28
0
28 Jun 2024
Deepfake tweets automatic detection
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
19
0
0
24 Jun 2024
1234...202122
Next