ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXivPDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 1,094 papers shown
Title
Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable
  Reward Function
Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function
A. B. Siddique
M. H. Maqbool
Kshitija Taywade
H. Foroosh
24
12
0
24 Mar 2023
Human Behavior in the Time of COVID-19: Learning from Big Data
Human Behavior in the Time of COVID-19: Learning from Big Data
Hanjia Lyu
Arsal Imtiaz
Yufei Zhao
Jiebo Luo
20
6
0
23 Mar 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
Learning for Amalgamation: A Multi-Source Transfer Learning Framework
  For Sentiment Classification
Learning for Amalgamation: A Multi-Source Transfer Learning Framework For Sentiment Classification
Cuong V. Nguyen
Khiem H. Le
Anh Tran
Quang-Cuong Pham
Binh T. Nguyen
15
14
0
16 Mar 2023
Transformer-based approaches to Sentiment Detection
O. E. Ojo
Hoang Thang Ta
Alexander Gelbukh
Hiram Calvo
O. O. Adebanji
Grigori Sidorov
6
7
0
13 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A. B. Siddique
23
4
0
12 Mar 2023
Generating Query Focused Summaries without Fine-tuning the
  Transformer-based Pre-trained Models
Generating Query Focused Summaries without Fine-tuning the Transformer-based Pre-trained Models
D. Abdullah
Shamanth Nayak
Gandharv Suri
Yllias Chali
22
2
0
10 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
20
41
0
10 Mar 2023
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Ning Liao
Bowen Shi
Xiaopeng Zhang
Min Cao
Junchi Yan
Qi Tian
VLM
34
7
0
09 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
29
504
0
07 Mar 2023
WADER at SemEval-2023 Task 9: A Weak-labelling framework for Data
  augmentation in tExt Regression Tasks
WADER at SemEval-2023 Task 9: A Weak-labelling framework for Data augmentation in tExt Regression Tasks
Manan Suri
Aaryak Garg
Divya Chaudhary
I. Gorton
B. Kumar
18
1
0
05 Mar 2023
HULAT at SemEval-2023 Task 10: Data augmentation for pre-trained transformers applied to the detection of sexism in social media
Isabel Segura-Bedmar
ViT
20
1
0
24 Feb 2023
Hiding Data Helps: On the Benefits of Masking for Sparse Coding
Hiding Data Helps: On the Benefits of Masking for Sparse Coding
Muthuraman Chidambaram
Chenwei Wu
Yu Cheng
Rong Ge
18
0
0
24 Feb 2023
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Jaesung Huh
A. Brown
Jee-weon Jung
Joon Son Chung
Arsha Nagrani
D. Garcia-Romero
Andrew Zisserman
18
26
0
20 Feb 2023
Cluster-based Deep Ensemble Learning for Emotion Classification in
  Internet Memes
Cluster-based Deep Ensemble Learning for Emotion Classification in Internet Memes
Xiaoyu Guo
Jing Ma
A. Zubiaga
27
0
0
16 Feb 2023
Platform-Independent and Curriculum-Oriented Intelligent Assistant for
  Higher Education
Platform-Independent and Curriculum-Oriented Intelligent Assistant for Higher Education
Ramteja Sajja
Y. Sermet
David M. Cwiertny
Ibrahim Demir
16
62
0
15 Feb 2023
An Extended Sequence Tagging Vocabulary for Grammatical Error Correction
An Extended Sequence Tagging Vocabulary for Grammatical Error Correction
Stuart Mesham
Christopher Bryant
Marek Rei
Zheng Yuan
22
7
0
12 Feb 2023
TextDefense: Adversarial Text Detection based on Word Importance Entropy
TextDefense: Adversarial Text Detection based on Word Importance Entropy
Lujia Shen
Xuhong Zhang
S. Ji
Yuwen Pu
Chunpeng Ge
Xing Yang
Yanghe Feng
AAML
13
8
0
12 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
36
57
0
11 Feb 2023
Revisiting Offline Compression: Going Beyond Factorization-based Methods
  for Transformer Language Models
Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models
Mohammadreza Banaei
Klaudia Bałazy
Artur Kasymov
R. Lebret
Jacek Tabor
Karl Aberer
OffRL
19
0
0
08 Feb 2023
Findings of the TSAR-2022 Shared Task on Multilingual Lexical
  Simplification
Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification
Horacio Saggion
S. vStajner
Daniel Ferrés
Kim Cheng Sheang
Matthew Shardlow
Kai North
Marcos Zampieri
25
48
0
06 Feb 2023
Bioformer: an efficient transformer language model for biomedical text
  mining
Bioformer: an efficient transformer language model for biomedical text mining
Li Fang
Qingyu Chen
Chih-Hsuan Wei
Zhiyong Lu
Kai Wang
MedIm
AI4CE
24
18
0
03 Feb 2023
Detecting Reddit Users with Depression Using a Hybrid Neural Network
  SBERT-CNN
Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN
Ziyi Chen
Ren Yang
S. Fu
Nansu Zong
Hongfang Liu
Ming Huang
AI4MH
18
14
0
03 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT
MedIm
AI4TS
AI4CE
46
9
0
01 Feb 2023
Towards Personalized Review Summarization by Modeling Historical Reviews
  from Customer and Product Separately
Towards Personalized Review Summarization by Modeling Historical Reviews from Customer and Product Separately
Xin Cheng
Shen Gao
Yuchi Zhang
Yongliang Wang
Xiuying Chen
Mingzhe Li
Dongyan Zhao
Rui Yan
18
10
0
27 Jan 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
34
2
0
26 Jan 2023
Characterizing the Entities in Harmful Memes: Who is the Hero, the
  Villain, the Victim?
Characterizing the Entities in Harmful Memes: Who is the Hero, the Villain, the Victim?
Shivam Sharma
Atharva Kulkarni
Tharun Suresh
Himanshi Mathur
Preslav Nakov
Md. Shad Akhtar
Tanmoy Chakraborty
30
15
0
26 Jan 2023
A benchmark for toxic comment classification on Civil Comments dataset
A benchmark for toxic comment classification on Civil Comments dataset
Corentin Duchene
Henri Jamet
Pierre Guillaume
Reda Dehak
18
8
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
32
2
0
25 Jan 2023
BDMMT: Backdoor Sample Detection for Language Models through Model
  Mutation Testing
BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing
Jiali Wei
Ming Fan
Wenjing Jiao
Wuxia Jin
Ting Liu
AAML
29
10
0
25 Jan 2023
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
Imtiaz Karim
Kazi Samin Mubasshir
Mirza Masfiqur Rahman
Elisa Bertino
17
22
0
22 Jan 2023
REDAffectiveLM: Leveraging Affect Enriched Embedding and
  Transformer-based Neural Language Model for Readers' Emotion Detection
REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection
Anoop Kadan
Deepak P
Manjary P.Gangan
Savitha Sam Abraham
L. LajishV.
13
1
0
21 Jan 2023
Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling
Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling
Ahmed Elnaggar
Hazem Essam
Wafaa Salah-Eldin
Walid Moustafa
Mohamed Elkerdawy
Charlotte Rochereau
B. Rost
153
86
0
16 Jan 2023
CHRONOS: Time-Aware Zero-Shot Identification of Libraries from
  Vulnerability Reports
CHRONOS: Time-Aware Zero-Shot Identification of Libraries from Vulnerability Reports
Yu-zeng Lyu
Thanh Le-Cong
Hong Jin Kang
Ratnadira Widyasari
Zhipeng Zhao
X. Le
Ming Li
David Lo
13
16
0
10 Jan 2023
Understanding the Complexity and Its Impact on Testing in ML-Enabled
  Systems
Understanding the Complexity and Its Impact on Testing in ML-Enabled Systems
Junming Cao
Bihuan Chen
Longjie Hu
Jie Ying Gao
Kaifeng Huang
Xin Peng
13
3
0
10 Jan 2023
Does compressing activations help model parallel training?
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
19
4
0
06 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
31
59
0
04 Jan 2023
PIE-QG: Paraphrased Information Extraction for Unsupervised Question
  Generation from Small Corpora
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora
D. Nagumothu
B. Ofoghi
G. Huang
Peter W. Eklund
RALM
16
5
0
03 Jan 2023
Semi-Structured Object Sequence Encoders
Semi-Structured Object Sequence Encoders
V. Rudramurthy
Riyaz Ahmad Bhat
Chulaka Gunasekara
Siva Sankalp Patel
H. Wan
Tejas I. Dhamecha
Danish Contractor
Marina Danilevsky
59
0
0
03 Jan 2023
Relevance Classification of Flood-related Twitter Posts via Multiple
  Transformers
Relevance Classification of Flood-related Twitter Posts via Multiple Transformers
Wisal Mukhtiar
Waliiya Rizwan
A. Habib
Y. Afridi
Laiq Hasan
Kashif Ahmad
9
3
0
01 Jan 2023
Text classification in shipping industry using unsupervised models and
  Transformer based supervised models
Text classification in shipping industry using unsupervised models and Transformer based supervised models
Yingyi Xie
Dongping Song
29
1
0
21 Dec 2022
A Length-Extrapolatable Transformer
A Length-Extrapolatable Transformer
Yutao Sun
Li Dong
Barun Patra
Shuming Ma
Shaohan Huang
Alon Benhaim
Vishrav Chaudhary
Xia Song
Furu Wei
30
115
0
20 Dec 2022
Is GPT-3 a Good Data Annotator?
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq R. Joty
Boyang Albert Li
Lidong Bing
24
231
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
32
25
0
20 Dec 2022
DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization
DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization
Yu Li
Baolin Peng
Pengcheng He
Michel Galley
Zhou Yu
Jianfeng Gao
21
7
0
20 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a
  Massively Multilingual Machine Translation Model
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
30
29
0
19 Dec 2022
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue
  Systems
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems
Denis Emelin
Daniele Bonadiman
Sawsan Alqahtani
Yi Zhang
Saab Mansour
13
17
0
15 Dec 2022
The Effects of In-domain Corpus Size on pre-training BERT
The Effects of In-domain Corpus Size on pre-training BERT
Chris Sanchez
Zheyu Zhang
AI4CE
6
4
0
15 Dec 2022
Towards mapping the contemporary art world with ArtLM: an art-specific
  NLP model
Towards mapping the contemporary art world with ArtLM: an art-specific NLP model
Qinkai Chen
Mohamed El-Mennaoui
Antoine Fosset
Amine Rebei
Haoyang Cao
Philine Bouscasse
Christy Eóin O'Beirne
Sasha Shevchenko
Mathieu Rosenbaum
KELM
11
1
0
14 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and
  Methods
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
29
25
0
13 Dec 2022
Previous
123456...202122
Next