ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXivPDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 1,069 papers shown
Title
Evaluating ChatGPT as a Question Answering System: A Comprehensive
  Analysis and Comparison with Existing Models
Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models
Hossein Bahak
Farzaneh Taheri
Zahra Zojaji
Arefeh Kazemi
ELM
AI4MH
34
17
0
11 Dec 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for
  Histopathology Whole Slide Image Analysis
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
27
4
0
21 Nov 2023
Argumentation Element Annotation Modeling using XLNet
Argumentation Element Annotation Modeling using XLNet
Christopher M. Ormerod
Amy Burkhardt
Mackenzie Young
Susan Lottridge
28
2
0
10 Nov 2023
Making LLMs Worth Every Penny: Resource-Limited Text Classification in
  Banking
Making LLMs Worth Every Penny: Resource-Limited Text Classification in Banking
Lefteris Loukas
Ilias Stogiannidis
Odysseas Diamantopoulos
Prodromos Malakasiotis
Stavros Vassos
10
43
0
10 Nov 2023
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training
  Regime and Better Alignment to Human Preferences
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences
Yuanhe Tian
Ruyi Gan
Yan Song
Jiaxing Zhang
Yongdong Zhang
AI4MH
AI4CE
LM&MA
27
30
0
10 Nov 2023
Explained anomaly detection in text reviews: Can subjective scenarios be
  correctly evaluated?
Explained anomaly detection in text reviews: Can subjective scenarios be correctly evaluated?
David Novoa-Paradela
O. Fontenla-Romero
B. Guijarro-Berdiñas
20
0
0
08 Nov 2023
Evaluating multiple large language models in pediatric ophthalmology
Evaluating multiple large language models in pediatric ophthalmology
J. Holmes
Rui Peng
Yiwei Li
Jinyu Hu
Zheng Liu
...
Wei Liu
Hong Wei
Jie Zou
Tianming Liu
Yi Shao
AI4Ed
ELM
LM&MA
21
0
0
07 Nov 2023
OmniVec: Learning robust representations with cross modal sharing
OmniVec: Learning robust representations with cross modal sharing
Siddharth Srivastava
Gaurav Sharma
SSL
21
64
0
07 Nov 2023
Towards Concept-Aware Large Language Models
Towards Concept-Aware Large Language Models
Chen Shani
Jilles Vreeken
Dafna Shahaf
LRM
19
6
0
03 Nov 2023
Discourse Relations Classification and Cross-Framework Discourse
  Relation Classification Through the Lens of Cognitive Dimensions: An
  Empirical Investigation
Discourse Relations Classification and Cross-Framework Discourse Relation Classification Through the Lens of Cognitive Dimensions: An Empirical Investigation
Yingxue Fu
16
0
0
01 Nov 2023
XAI-CLASS: Explanation-Enhanced Text Classification with Extremely Weak
  Supervision
XAI-CLASS: Explanation-Enhanced Text Classification with Extremely Weak Supervision
Daniel Hajialigol
Hanwen Liu
Xuan Wang
VLM
21
5
0
31 Oct 2023
Unveiling Black-boxes: Explainable Deep Learning Models for Patent
  Classification
Unveiling Black-boxes: Explainable Deep Learning Models for Patent Classification
Md. Shajalal
Sebastian Denef
Md. Rezaul Karim
Alexander Boden
Gunnar Stevens
XAI
11
5
0
31 Oct 2023
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Jiaao Chen
Diyi Yang
MU
22
135
0
31 Oct 2023
An Ensemble Method Based on the Combination of Transformers with
  Convolutional Neural Networks to Detect Artificially Generated Text
An Ensemble Method Based on the Combination of Transformers with Convolutional Neural Networks to Detect Artificially Generated Text
Vijini Liyanage
Davide Buscaldi
DeLMO
21
1
0
26 Oct 2023
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation
S. Nigam
Aniket Deroy
Noel Shallum
Ayush Kumar Mishra
Anup Roy
Shubham Kumar Mishra
Arnab Bhattacharya
Saptarshi Ghosh
Kripabandhu Ghosh
AILaw
ELM
15
10
0
17 Oct 2023
DropMix: Better Graph Contrastive Learning with Harder Negative Samples
DropMix: Better Graph Contrastive Learning with Harder Negative Samples
Yueqi Ma
Minjie Chen
Xiang Li
SSL
15
1
0
15 Oct 2023
The Temporal Structure of Language Processing in the Human Brain
  Corresponds to The Layered Hierarchy of Deep Language Models
The Temporal Structure of Language Processing in the Human Brain Corresponds to The Layered Hierarchy of Deep Language Models
Ariel Goldstein
Eric Ham
Mariano Schain
Samuel A. Nastase
Zaid Zada
...
Avinatan Hassidim
O. Devinsky
A. Flinker
Omer Levy
Uri Hasson
AI4CE
15
10
0
11 Oct 2023
Argumentative Stance Prediction: An Exploratory Study on Multimodality
  and Few-Shot Learning
Argumentative Stance Prediction: An Exploratory Study on Multimodality and Few-Shot Learning
Arushi Sharma
Abhibha Gupta
Maneesh Bilalpur
14
4
0
11 Oct 2023
GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using
  Large Language Models
GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using Large Language Models
B. Silva
Leonardo Nunes
Roberto Estevão
Vijay Aski
Ranveer Chandra
ELM
LM&MA
27
12
0
10 Oct 2023
LLMLingua: Compressing Prompts for Accelerated Inference of Large
  Language Models
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang
Qianhui Wu
Chin-Yew Lin
Yuqing Yang
Lili Qiu
24
100
0
09 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale
  Pre-Trained Models
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Chenzhuang Du
Yue Zhao
Chonghua Liao
Jiacheng You
Jie Fu
Hang Zhao
30
2
0
08 Oct 2023
Knowledgeable In-Context Tuning: Exploring and Exploiting Factual
  Knowledge for In-Context Learning
Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning
J. Wang
Chengyu Wang
Chuanqi Tan
Jun Huang
Ming Gao
KELM
26
4
0
26 Sep 2023
Word Embedding with Neural Probabilistic Prior
Word Embedding with Neural Probabilistic Prior
Shaogang Ren
Dingcheng Li
P. Li
BDL
17
0
0
21 Sep 2023
UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media
UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media
Yan Jiang
Ruihong Qiu
Yi Zhang
Zi Huang
LM&MA
22
2
0
08 Sep 2023
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Dario Di Palma
Giovanni Maria Biancofiore
V. W. Anelli
F. Narducci
T. D. Noia
E. Sciascio
ALM
42
27
0
07 Sep 2023
A Multi-Task Semantic Decomposition Framework with Task-specific
  Pre-training for Few-Shot NER
A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER
Guanting Dong
Zechen Wang
Jinxu Zhao
Gang Zhao
Daichi Guo
...
Keqing He
Xuefeng Li
Liwen Wang
Xinyue Cui
Weiran Xu
32
19
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
73
31
0
27 Aug 2023
Learning Representations on Logs for AIOps
Learning Representations on Logs for AIOps
Pranjal Gupta
Harshit Kumar
Debanjana Kar
Karan Bhukar
Pooja Aggarwal
P. Mohapatra
30
11
0
18 Aug 2023
Lip Reading for Low-resource Languages by Learning and Combining General
  Speech Knowledge and Language-specific Knowledge
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge
Minsu Kim
Jeong Hun Yeo
J. Choi
Y. Ro
34
16
0
18 Aug 2023
SPM: Structured Pretraining and Matching Architectures for Relevance
  Modeling in Meituan Search
SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search
Wen-xin Zan
Yaopeng Han
Xiaotian Jiang
Yao Xiao
Yang Yang
Dayao Chen
Sheng Chen
24
3
0
15 Aug 2023
ERNetCL: A novel emotion recognition network in textual conversation
  based on curriculum learning strategy
ERNetCL: A novel emotion recognition network in textual conversation based on curriculum learning strategy
Jiang Li
Xiaoping Wang
Yingjian Liu
Zhigang Zeng
33
6
0
12 Aug 2023
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
J. Puentes
Angela Castillo
Wilmar Osejo
Yuly Calderón
Viviana Quintero
L. Saldarriaga
D. Agudelo
Pablo Arbelaez
13
2
0
07 Aug 2023
Detecting Spells in Fantasy Literature with a Transformer Based
  Artificial Intelligence
Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence
Marcel Moravek
Alexander Zender
Andreas Müller
10
0
0
07 Aug 2023
Text Analysis Using Deep Neural Networks in Digital Humanities and
  Information Science
Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science
Omri Suissa
Avshalom Elmalech
M. Zhitomirsky-Geffet
AI4CE
9
45
0
30 Jul 2023
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context
  Information for Expressive Speech Synthesis
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Xixin Wu
Shiyin Kang
H. Meng
22
7
0
29 Jul 2023
A Hybrid Machine Learning Model for Classifying Gene Mutations in Cancer
  using LSTM, BiLSTM, CNN, GRU, and GloVe
A Hybrid Machine Learning Model for Classifying Gene Mutations in Cancer using LSTM, BiLSTM, CNN, GRU, and GloVe
Sanad Aburass
O. Dorgham
Jamil Al Shaqsi
17
21
0
24 Jul 2023
Pseudo Outlier Exposure for Out-of-Distribution Detection using
  Pretrained Transformers
Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers
Jaeyoung Kim
Kyuheon Jung
Dongbin Na
Sion Jang
Eunbin Park
Sungchul Choi
OODD
22
6
0
18 Jul 2023
Attention over pre-trained Sentence Embeddings for Long Document
  Classification
Attention over pre-trained Sentence Embeddings for Long Document Classification
Amine Abdaoui
Sourav Dutta
6
1
0
18 Jul 2023
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning?
  Insights from Cross-Lingual Language Understanding
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding
Bolei Ma
Ercong Nie
Helmut Schmid
Hinrich Schütze
AAML
VLM
LRM
29
8
0
15 Jul 2023
Unsupervised Calibration through Prior Adaptation for Text
  Classification using Large Language Models
Unsupervised Calibration through Prior Adaptation for Text Classification using Large Language Models
Lautaro Estienne
Luciana Ferrer
Matías Vera
Pablo Piantanida
VLM
23
1
0
13 Jul 2023
A Side-by-side Comparison of Transformers for English Implicit Discourse
  Relation Classification
A Side-by-side Comparison of Transformers for English Implicit Discourse Relation Classification
Bruce W. Lee
Bongseok Yang
J. Lee
16
0
0
07 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
27
5
0
04 Jul 2023
A Dual-Stream Recurrence-Attention Network With Global-Local Awareness
  for Emotion Recognition in Textual Dialog
A Dual-Stream Recurrence-Attention Network With Global-Local Awareness for Emotion Recognition in Textual Dialog
Jiang Li
Xiaoping Wang
Zhigang Zeng
13
4
0
02 Jul 2023
SpATr: MoCap 3D Human Action Recognition based on Spiral Auto-encoder
  and Transformer Network
SpATr: MoCap 3D Human Action Recognition based on Spiral Auto-encoder and Transformer Network
Hamza Bouzid
Lahoucine Ballihi
ViT
3DH
22
2
0
30 Jun 2023
Quantizable Transformers: Removing Outliers by Helping Attention Heads
  Do Nothing
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
13
88
0
22 Jun 2023
Lexical Speaker Error Correction: Leveraging Language Models for Speaker
  Diarization Error Correction
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
Rohit Paturi
S. Srinivasan
Xiang Li
16
13
0
15 Jun 2023
Relational Temporal Graph Reasoning for Dual-task Dialogue Language
  Understanding
Relational Temporal Graph Reasoning for Dual-task Dialogue Language Understanding
Bowen Xing
Ivor W. Tsang
37
13
0
15 Jun 2023
Research on Named Entity Recognition in Improved transformer with R-Drop
  structure
Research on Named Entity Recognition in Improved transformer with R-Drop structure
Weidong Ji
Yousheng Zhang
Guohui Zhou
Xu Wang
26
0
0
14 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
38
2
0
12 Jun 2023
QUERT: Continual Pre-training of Language Model for Query Understanding
  in Travel Domain Search
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search
Jian Xie
Yidan Liang
Jingping Liu
Yanghua Xiao
Baohua Wu
Shenghua Ni
VLM
LRM
30
8
0
11 Jun 2023
Previous
123456...202122
Next