XLNet: Generalized Autoregressive Pretraining for Language Understanding
Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,732 papers shown
DEHYDRATOR: Enhancing Provenance Graph Storage via Hierarchical Encoding and Sequence Generation
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
J. Ying
Tiantian Zhu
Mingqi Lv
Tieming Chen
158
0
0
03 Jan 2025
Efficient support ticket resolution using Knowledge Graphs
Sherwin Varghese
James Tian
79
0
0
03 Jan 2025
TED: Turn Emphasis with Dialogue Feature Attention for Emotion Recognition in Conversation
Junya Ono
Hiromi Wakaki
290
0
0
03 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Information Fusion (Inf. Fusion), 2024
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
458
82
0
31 Dec 2024
Context-Aware Deep Learning for Multi Modal Depression Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Genevieve Lam
Huang Dongyan
Weisi Lin
302
4
0
26 Dec 2024
Invisible Textual Backdoor Attacks based on Dual-Trigger
Yang Hou
Qiuling Yue
Lujia Chai
Guozhao Liao
Wenbao Han
Wei Ou
351
0
0
23 Dec 2024
ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2024
Seungdong Yoa
Seungjun Lee
Hyeseung Cho
Bumsoo Kim
Woohyung Lim
ViT
219
1
0
21 Dec 2024
Automated CVE Analysis: Harnessing Machine Learning In Designing Question-Answering Models For Cybersecurity Information Extraction
Tanjim Bin Faruk
133
1
0
21 Dec 2024
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health
Vivek Kumar
Eirini Ntoutsi
Pushpraj Singh Rajawat
Giacomo Medda
Diego Reforgiato Recupero
AI4MH
270
6
0
17 Dec 2024
Multi-Head Encoding for Extreme Label Classification
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Daojun Liang
Haixia Zhang
Dongfeng Yuan
Minggao Zhang
273
0
0
13 Dec 2024
TECO: Improving Multimodal Intent Recognition with Text Enhancement through Commonsense Knowledge Extraction
Pacific Asia Conference on Language, Information and Computation (PACLIC), 2024
Quynh-Mai Thi Nguyen
Lan-Nhi Thi Nguyen
Cam-Van Thi Nguyen
231
1
0
11 Dec 2024
Comateformer: Combined Attention Transformer for Semantic Sentence Matching
European Conference on Artificial Intelligence (ECAI), 2024
Bo Li
Di Liang
Zixin Zhang
265
9
0
10 Dec 2024
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yujiao Shi
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
297
8
0
10 Dec 2024
Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
Rongfeng Su
Changqing Xu
Xinyi Wu
Feng Xu
Xie Chen
Lan Wang
Nan Yan
204
1
0
09 Dec 2024
Impromptu Cybercrime Euphemism Detection
International Conference on Computational Linguistics (COLING), 2024
Xiang Li
Yimiao Zhou
Laiping Zhao
Jing Li
Fengyuan Liu
350
2
0
02 Dec 2024
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hai Ye
Yixin Ji
Ziyang Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
638
1
0
02 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Computer Vision and Pattern Recognition (CVPR), 2024
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
397
61
0
02 Dec 2024
Generative Language Models Potential for Requirement Engineering Applications: Insights into Current Strengths and Limitations
Summra Saleem
Muhammad Nabeel Asim
L. V. Elst
Andreas Dengel
285
1
0
01 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
300
1
0
27 Nov 2024
What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics
Jordan J. Bird
412
0
0
26 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yujiao Shi
Min Li
372
3
0
23 Nov 2024
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Daehoon Gwak
Junwoo Park
Minho Park
C. Park
Hyunchan Lee
E. Choi
Jaegul Choo
260
1
0
21 Nov 2024
LEADRE: Multi-Faceted Knowledge Enhanced LLM Empowered Display Advertisement Recommender System
Proceedings of the VLDB Endowment (PVLDB), 2024
Fengxin Li
Yi Li
Yue Liu
Chao Zhou
Yuan Wang
...
Haijie Gu
Jie Jiang
Hongyan Liu
Biao Qin
Jun He
267
2
0
21 Nov 2024
Hysteresis Activation Function for Efficient Inference
Moshe Kimhi
Idan Kashani
A. Mendelson
Chaim Baskin
LLMSV
471
2
0
15 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
354
19
0
14 Nov 2024
Multi-head Span-based Detector for AI-generated Fragments in Scientific Papers
German Gritsai
Ildar Khabutdinov
Andrey Grabovoy
DeLMO
269
4
0
11 Nov 2024
A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics
Neural Information Processing Systems (NeurIPS), 2024
Puze Liu
Jonas Günster
Niklas Funk
Simon Gröger
Dong Chen
...
Thomas Bonenfant
Marcello Restelli
Davide Tateo
Z. Liu
Jan Peters
193
1
0
08 Nov 2024
TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask Transformer-Based Spatiotemporal Model
Shang-Ling Hsu
Emmanuel Tung
John Krumm
Cyrus Shahabi
Khurram Hassan-Shafique
197
24
0
07 Nov 2024
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Flavio Di Palo
Prateek Singhi
Bilal Fadlallah
135
17
0
07 Nov 2024
Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning
Pattern Recognition (Pattern Recogn.), 2024
Ping Li
Tao Wang
Xinkui Zhao
Xianghua Xu
Mingli Song
215
9
0
06 Nov 2024
A Library Perspective on Supervised Text Processing in Digital Libraries: An Investigation in the Biomedical Domain
ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2024
H. Kroll
Pascal Sackhoff
Bill Matthias Thang
Maha Ksouri
Wolf-Tilo Balke
221
0
0
06 Nov 2024
Trustworthy Federated Learning: Privacy, Security, and Beyond
Knowledge and Information Systems (KAIS), 2024
Chunlu Chen
Ji Liu
Haowen Tan
Xingjian Li
Kevin I-Kai Wang
Peng Li
Kouichi Sakurai
Dejing Dou
FedML
294
48
0
03 Nov 2024
Randomized Autoregressive Visual Generation
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VGen
DiffM
329
87
1
01 Nov 2024
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
312
4
0
31 Oct 2024
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution
Shikha Bordia
AILaw
168
0
0
30 Oct 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Neural Information Processing Systems (NeurIPS), 2024
Xun Guo
Shan Zhang
Yongxin He
Ting Zhang
Wanquan Feng
Haibin Huang
Chongyang Ma
DeLMO
317
51
0
28 Oct 2024
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
International Conference on Learning Representations (ICLR), 2024
Justin Deschenaux
Çağlar Gülçehre
497
25
0
28 Oct 2024
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
ACM Multimedia (MM), 2024
Wu Junran
Chen Xueyuan
Li Shangzhe
260
2
0
27 Oct 2024
Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch
Donglin Di
Weinan Zhang
Yue Zhang
Fanglin Wang
303
2
0
24 Oct 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
Applied Soft Computing (Appl. Soft Comput.), 2024
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
242
3
0
24 Oct 2024
Dependency Graph Parsing as Sequence Labeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ana Ezquerro
David Vilares
Carlos Gómez-Rodríguez
170
2
0
23 Oct 2024
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
Nicholas Walker
161
0
0
23 Oct 2024
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jiaqi Xue
Qian Lou
Mengxin Zheng
217
3
0
23 Oct 2024
Multi-head Sequence Tagging Model for Grammatical Error Correction
Engineering Applications of Artificial Intelligence (EAAI), 2024
Kamal Al-Sabahi
Kang Yang
Wangwang Liu
Guanyu Jiang
Xian Li
Ming Yang
181
3
0
21 Oct 2024
Evaluation Of P300 Speller Performance Using Large Language Models Along With Cross-Subject Training
Nithin Parthasarathy
J. Soetedjo
S. Panchavati
Nitya Parthasarathy
C. Arnold
N. Pouratian
W. Speier
68
0
0
19 Oct 2024
Controllable Discovery of Intents: Incremental Deep Clustering Using Semi-Supervised Contrastive Learning
International Joint Conference on Natural Language Processing (IJCNLP), 2024
Mrinal Rawat
Hithesh Sankararaman
Victor Barrès
303
0
0
18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
SeongYeub Chu
JongWoo Kim
Bryan Wong
MunYong Yi
LRM
471
13
0
18 Oct 2024
On the Regularization of Learnable Embeddings for Time Series Forecasting
L. Butera
G. Felice
Andrea Cini
Cesare Alippi
AI4TS
368
0
0
18 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
International Conference on Learning Representations (ICLR), 2024
Jiacheng Ye
Lei Li
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
LRM
644
74
0
18 Oct 2024
Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Amit Gupta
Petros Karypis
Daniel S. Karls
Mingjian Wen
Saurav Manchanda
E. Tadmor
George Karypis
128
4
0
17 Oct 2024