1906.08237
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,094 papers shown
Title
Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function
A. B. Siddique
M. H. Maqbool
Kshitija Taywade
H. Foroosh
24
12
0
24 Mar 2023
Human Behavior in the Time of COVID-19: Learning from Big Data
Hanjia Lyu
Arsal Imtiaz
Yufei Zhao
Jiebo Luo
20
6
0
23 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
Learning for Amalgamation: A Multi-Source Transfer Learning Framework For Sentiment Classification
Cuong V. Nguyen
Khiem H. Le
Anh Tran
Quang-Cuong Pham
Binh T. Nguyen
15
14
0
16 Mar 2023
Transformer-based approaches to Sentiment Detection
O. E. Ojo
Hoang Thang Ta
Alexander Gelbukh
Hiram Calvo
O. O. Adebanji
Grigori Sidorov
6
7
0
13 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A. B. Siddique
23
4
0
12 Mar 2023
Generating Query Focused Summaries without Fine-tuning the Transformer-based Pre-trained Models
D. Abdullah
Shamanth Nayak
Gandharv Suri
Yllias Chali
22
2
0
10 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
20
41
0
10 Mar 2023
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Ning Liao
Bowen Shi
Xiaopeng Zhang
Min Cao
Junchi Yan
Qi Tian
VLM
34
7
0
09 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
29
504
0
07 Mar 2023
WADER at SemEval-2023 Task 9: A Weak-labelling framework for Data augmentation in tExt Regression Tasks
Manan Suri
Aaryak Garg
Divya Chaudhary
I. Gorton
B. Kumar
18
1
0
05 Mar 2023
HULAT at SemEval-2023 Task 10: Data augmentation for pre-trained transformers applied to the detection of sexism in social media
Isabel Segura-Bedmar
ViT
20
1
0
24 Feb 2023
Hiding Data Helps: On the Benefits of Masking for Sparse Coding
Muthuraman Chidambaram
Chenwei Wu
Yu Cheng
Rong Ge
18
0
0
24 Feb 2023
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Jaesung Huh
A. Brown
Jee-weon Jung
Joon Son Chung
Arsha Nagrani
D. Garcia-Romero
Andrew Zisserman
18
26
0
20 Feb 2023
Cluster-based Deep Ensemble Learning for Emotion Classification in Internet Memes
Xiaoyu Guo
Jing Ma
A. Zubiaga
27
0
0
16 Feb 2023
Platform-Independent and Curriculum-Oriented Intelligent Assistant for Higher Education
Ramteja Sajja
Y. Sermet
David M. Cwiertny
Ibrahim Demir
16
62
0
15 Feb 2023
An Extended Sequence Tagging Vocabulary for Grammatical Error Correction
Stuart Mesham
Christopher Bryant
Marek Rei
Zheng Yuan
22
7
0
12 Feb 2023
TextDefense: Adversarial Text Detection based on Word Importance Entropy
Lujia Shen
Xuhong Zhang
S. Ji
Yuwen Pu
Chunpeng Ge
Xing Yang
Yanghe Feng
AAML
13
8
0
12 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
36
57
0
11 Feb 2023
Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models
Mohammadreza Banaei
Klaudia Bałazy
Artur Kasymov
R. Lebret
Jacek Tabor
Karl Aberer
OffRL
19
0
0
08 Feb 2023
Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification
Horacio Saggion
S. Štajner
Daniel Ferrés
Kim Cheng Sheang
Matthew Shardlow
Kai North
Marcos Zampieri
25
48
0
06 Feb 2023
Bioformer: an efficient transformer language model for biomedical text mining
Li Fang
Qingyu Chen
Chih-Hsuan Wei
Zhiyong Lu
Kai Wang
MedIm
AI4CE
24
18
0
03 Feb 2023
Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN
Ziyi Chen
Ren Yang
S. Fu
Nansu Zong
Hongfang Liu
Ming Huang
AI4MH
18
14
0
03 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT
MedIm
AI4TS
AI4CE
46
9
0
01 Feb 2023
Towards Personalized Review Summarization by Modeling Historical Reviews from Customer and Product Separately
Xin Cheng
Shen Gao
Yuchi Zhang
Yongliang Wang
Xiuying Chen
Mingzhe Li
Dongyan Zhao
Rui Yan
18
10
0
27 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
34
2
0
26 Jan 2023
Characterizing the Entities in Harmful Memes: Who is the Hero, the Villain, the Victim?
Shivam Sharma
Atharva Kulkarni
Tharun Suresh
Himanshi Mathur
Preslav Nakov
Md. Shad Akhtar
Tanmoy Chakraborty
30
15
0
26 Jan 2023
A benchmark for toxic comment classification on Civil Comments dataset
Corentin Duchene
Henri Jamet
Pierre Guillaume
Reda Dehak
18
8
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
32
2
0
25 Jan 2023
BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing
Jiali Wei
Ming Fan
Wenjing Jiao
Wuxia Jin
Ting Liu
AAML
29
10
0
25 Jan 2023
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
Imtiaz Karim
Kazi Samin Mubasshir
Mirza Masfiqur Rahman
Elisa Bertino
17
22
0
22 Jan 2023
REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection
Anoop Kadan
Deepak P
Manjary P. Gangan
Savitha Sam Abraham
Lajish V. L.
13
1
0
21 Jan 2023
Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling
Ahmed Elnaggar
Hazem Essam
Wafaa Salah-Eldin
Walid Moustafa
Mohamed Elkerdawy
Charlotte Rochereau
B. Rost
153
86
0
16 Jan 2023
CHRONOS: Time-Aware Zero-Shot Identification of Libraries from Vulnerability Reports
Yu-zeng Lyu
Thanh Le-Cong
Hong Jin Kang
Ratnadira Widyasari
Zhipeng Zhao
X. Le
Ming Li
David Lo
13
16
0
10 Jan 2023
Understanding the Complexity and Its Impact on Testing in ML-Enabled Systems
Junming Cao
Bihuan Chen
Longjie Hu
Jie Ying Gao
Kaifeng Huang
Xin Peng
13
3
0
10 Jan 2023
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
19
4
0
06 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
31
59
0
04 Jan 2023
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora
D. Nagumothu
B. Ofoghi
G. Huang
Peter W. Eklund
RALM
16
5
0
03 Jan 2023
Semi-Structured Object Sequence Encoders
V. Rudramurthy
Riyaz Ahmad Bhat
Chulaka Gunasekara
Siva Sankalp Patel
H. Wan
Tejas I. Dhamecha
Danish Contractor
Marina Danilevsky
59
0
0
03 Jan 2023
Relevance Classification of Flood-related Twitter Posts via Multiple Transformers
Wisal Mukhtiar
Waliiya Rizwan
A. Habib
Y. Afridi
Laiq Hasan
Kashif Ahmad
9
3
0
01 Jan 2023
Text classification in shipping industry using unsupervised models and Transformer based supervised models
Yingyi Xie
Dongping Song
29
1
0
21 Dec 2022
A Length-Extrapolatable Transformer
Yutao Sun
Li Dong
Barun Patra
Shuming Ma
Shaohan Huang
Alon Benhaim
Vishrav Chaudhary
Xia Song
Furu Wei
30
115
0
20 Dec 2022
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq R. Joty
Boyang Albert Li
Lidong Bing
24
231
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
32
25
0
20 Dec 2022
DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization
Yu Li
Baolin Peng
Pengcheng He
Michel Galley
Zhou Yu
Jianfeng Gao
21
7
0
20 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
30
29
0
19 Dec 2022
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems
Denis Emelin
Daniele Bonadiman
Sawsan Alqahtani
Yi Zhang
Saab Mansour
13
17
0
15 Dec 2022
The Effects of In-domain Corpus Size on pre-training BERT
Chris Sanchez
Zheyu Zhang
AI4CE
6
4
0
15 Dec 2022
Towards mapping the contemporary art world with ArtLM: an art-specific NLP model
Qinkai Chen
Mohamed El-Mennaoui
Antoine Fosset
Amine Rebei
Haoyang Cao
Philine Bouscasse
Christy Eóin O'Beirne
Sasha Shevchenko
Mathieu Rosenbaum
KELM
11
1
0
14 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
29
25
0
13 Dec 2022