ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,731 papers shown
CMAL: A Novel Cross-Modal Associative Learning Framework for
  Vision-Language Pre-Training
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-TrainingACM Multimedia (ACM MM), 2022
Zhiyuan Ma
Jianjun Li
Guohui Li
Kaiyan Huang
VLM
377
9
0
16 Oct 2024
NSmark: Null Space Based Black-box Watermarking Defense Framework for Language Models
NSmark: Null Space Based Black-box Watermarking Defense Framework for Language Models
Haodong Zhao
Jinming Hu
Peixuan Li
Fangqi Li
Jinrui Sha
Peixuan Chen
Zhuosheng Zhang
Gongshen Liu
Gongshen Liu
AAML
184
0
0
16 Oct 2024
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era
  of Large Language Models
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models
S. Nigam
Aniket Deroy
Subhankar Maity
Arnab Bhattacharya
ELMAILaw
181
12
0
14 Oct 2024
Customize Your Visual Autoregressive Recipe with Set Autoregressive
  Modeling
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Wenze Liu
Le Zhuo
Yi Xin
Sheng Xia
Peng Gao
Xiangyu Yue
230
17
0
14 Oct 2024
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping
Taozhe Li
Wei Sun
302
1
0
14 Oct 2024
COrAL: Order-Agnostic Language Modeling for Efficient Iterative
  Refinement
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Yuxi Xie
Anirudh Goyal
Xiaobao Wu
Xunjian Yin
Xiao Xu
Min-Yen Kan
Liangming Pan
William Yang Wang
LRM
895
1
0
12 Oct 2024
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal
  Multi-scale Context Modeling
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling
Rui Liu
Zhenqi Jia
Jie Yang
Yifan Hu
Hong Li
318
5
0
12 Oct 2024
Text Classification using Graph Convolutional Networks: A Comprehensive
  Survey
Text Classification using Graph Convolutional Networks: A Comprehensive SurveyACM Computing Surveys (ACM CSUR), 2024
Syed Mustafa Haider Rizvi
Ramsha Imran
Arif Mahmood
GNNOODFaML
207
10
0
12 Oct 2024
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific
  Citation Prediction
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation PredictionNeural Information Processing Systems (NeurIPS), 2024
Qianyue Hao
Jingyang Fan
Fengli Xu
Jian Yuan
Yong Li
201
16
0
10 Oct 2024
Chain and Causal Attention for Efficient Entity Tracking
Chain and Causal Attention for Efficient Entity TrackingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Erwan Fagnou
Paul Caillon
Blaise Delattre
Alexandre Allauzen
245
7
0
07 Oct 2024
Investigating large language models for their competence in extracting
  grammatically sound sentences from transcribed noisy utterances
Investigating large language models for their competence in extracting grammatically sound sentences from transcribed noisy utterancesConference on Computational Natural Language Learning (CoNLL), 2024
Alina Wróblewska
169
0
0
07 Oct 2024
Computational design of target-specific linear peptide binders with
  TransformerBeta
Computational design of target-specific linear peptide binders with TransformerBeta
Haowen Zhao
Francesco A. Aprile
Barbara Bravi
262
0
0
07 Oct 2024
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks
Yijiong Yu
Ma Xiufa
Fang Jianwei
Zhi-liang Xu
Su Guangyao
...
Zhixiao Qi
Wei Wang
Wen Liu
Ran Chen
Ji Pei
LRMRALM
347
6
0
06 Oct 2024
Fundamental Limitations on Subquadratic Alternatives to Transformers
Fundamental Limitations on Subquadratic Alternatives to TransformersInternational Conference on Learning Representations (ICLR), 2024
Josh Alman
Hantao Yu
436
6
0
05 Oct 2024
Variational Language Concepts for Interpreting Foundation Language
  Models
Variational Language Concepts for Interpreting Foundation Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hengyi Wang
Shiwei Tan
Zhiqing Hong
Desheng Zhang
Hao Wang
397
4
0
04 Oct 2024
Linear Transformer Topological Masking with Graph Random Features
Linear Transformer Topological Masking with Graph Random FeaturesInternational Conference on Learning Representations (ICLR), 2024
Isaac Reid
Kumar Avinava Dubey
Deepali Jain
Will Whitney
Amr Ahmed
...
Connor Schenck
Richard E. Turner
René Wagner
Adrian Weller
Krzysztof Choromanski
293
4
0
04 Oct 2024
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Wei Wu
Chao Wang
L. Chen
Mingze Yin
Yiheng Zhu
Kun Fu
Jieping Ye
Hui Xiong
Zheng Wang
392
3
0
04 Oct 2024
Graph-tree Fusion Model with Bidirectional Information Propagation for
  Long Document Classification
Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sudipta Singha Roy
Xindi Wang
Robert E. Mercer
Frank Rudzicz
174
0
0
03 Oct 2024
On The Adaptation of Unlimiformer for Decoder-Only Transformers
On The Adaptation of Unlimiformer for Decoder-Only TransformersInternational Conference on Language Resources and Evaluation (LREC), 2024
Kian Ahrabian
Alon Benhaim
Barun Patra
Jay Pujara
Saksham Singhal
Xia Song
211
0
0
02 Oct 2024
Preserving Generalization of Language models in Few-shot Continual
  Relation Extraction
Preserving Generalization of Language models in Few-shot Continual Relation ExtractionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Quyen Tran
Nguyen Xuan Thanh
Nguyen Hoang Anh
Nam Le Hai
Trung Le
Linh Van Ngo
Thien Huu Nguyen
CLLKELM
287
9
0
01 Oct 2024
Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios
Perception Compressor: A Training-Free Prompt Compression Framework in Long Context ScenariosNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jiwei Tang
Jin Xu
Tingwei Lu
Hai Lin
Yiming Zhao
Lin Hai
Hai-Tao Zheng
VLM
335
0
0
28 Sep 2024
Leveraging Long-Context Large Language Models for Multi-Document
  Understanding and Summarization in Enterprise Applications
Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications
Aditi Godbole
Jabin Geevarghese George
Smita Shandilya
258
11
0
27 Sep 2024
Trustworthy AI: Securing Sensitive Data in Large Language Models
Trustworthy AI: Securing Sensitive Data in Large Language ModelsApplied Informatics (AI), 2024
G. Feretzakis
V. Verykios
230
37
0
26 Sep 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical
  Impacts, Constraints, and Emerging Questions
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
129
0
0
25 Sep 2024
The Roles of Generative Artificial Intelligence in Internet of Electric
  Vehicles
The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles
Hanwen Zhang
Dusit Niyato
Wei Zhang
Changyuan Zhao
Hongyang Du
Abbas Jamalipour
Sumei Sun
Yiyang Pei
AI4CE
186
3
0
24 Sep 2024
Improving Academic Skills Assessment with NLP and Ensemble Learning
Improving Academic Skills Assessment with NLP and Ensemble LearningInternational Conference on Information Systems and Computer Aided Education (ICISCAE), 2024
Xinyi Huang
Yingyi Wu
Danyang Zhang
Jiacheng Hu
Yujian Long
223
11
0
23 Sep 2024
"I Never Said That": A dataset, taxonomy and baselines on response
  clarity classification
"I Never Said That": A dataset, taxonomy and baselines on response clarity classificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Konstantinos Thomas
Giorgos Filandrianos
Maria Lymperaiou
Chrysoula Zerva
Giorgos Stamou
178
0
0
20 Sep 2024
GAProtoNet: A Multi-head Graph Attention-based Prototypical Network for
  Interpretable Text Classification
GAProtoNet: A Multi-head Graph Attention-based Prototypical Network for Interpretable Text ClassificationInternational Conference on Computational Linguistics (COLING), 2024
Ximing Wen
Wenjuan Tan
Rosina O. Weber
220
5
0
20 Sep 2024
Incremental and Data-Efficient Concept Formation to Support Masked Word
  Prediction
Incremental and Data-Efficient Concept Formation to Support Masked Word Prediction
Xin Lian
Nishant Baglodi
Christopher J. MacLellan
152
1
0
19 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text RecognizerACM Multimedia (MM), 2024
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
256
5
0
18 Sep 2024
Evaluation of pretrained language models on music understanding
Evaluation of pretrained language models on music understanding
Yannis Vasilakis
Rachel M. Bittner
Johan Pauwels
261
4
0
17 Sep 2024
OneEncoder: A Lightweight Framework for Progressive Alignment of
  Modalities
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities
Hanane Azzag
Hanane Azzag
M. Lebbah
ObjD
350
2
0
17 Sep 2024
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion
  Generation
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation
Seyed Rohollah Hosseyni
Ali Ahmad Rahmani
S. J. Seyedmohammadi
Sanaz Seyedin
Arash Mohammadi
DiffM
206
10
0
17 Sep 2024
Language Models Learn Metadata: Political Stance Detection Case Study
Language Models Learn Metadata: Political Stance Detection Case Study
Stanley Cao
Felix Drinkall
170
0
0
15 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using
  LLMs
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
250
15
0
15 Sep 2024
Synthetic4Health: Generating Annotated Synthetic Clinical Letters
Synthetic4Health: Generating Annotated Synthetic Clinical LettersFrontiers in Digital Health (Front. Digit. Health), 2024
Libo Ren
Samuel Belkadi
Lifeng Han
Warren Del-Pinto
Goran Nenadic
SyDa
170
5
0
14 Sep 2024
Layerwise Change of Knowledge in Neural Networks
Layerwise Change of Knowledge in Neural NetworksInternational Conference on Machine Learning (ICML), 2024
Xu Cheng
Lei Cheng
Zhaoran Peng
Yang Xu
Tian Han
Quanshi Zhang
KELMFAtt
223
7
0
13 Sep 2024
TheraGen: Therapy for Every Generation
TheraGen: Therapy for Every Generation
Kartikey Doshi
Jimit Shah
Narendra Shekokar
AI4MH
175
0
0
12 Sep 2024
Multimodal Emotion Recognition with Vision-language Prompting and
  Modality Dropout
Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
Anbin QI
Zhongliang Liu
Xinyong Zhou
Jinba Xiao
Fengrun Zhang
Qi Gan
Ming Tao
Gaozheng Zhang
Lu Zhang
VLM
152
10
0
11 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
225
6
0
10 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
227
1
0
08 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
An overview of domain-specific foundation model: key technologies, applications and challengesScience China Information Sciences (Sci. China Inf. Sci.), 2024
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALMVLM
492
19
0
06 Sep 2024
Revolutionizing Database Q&A with Large Language Models: Comprehensive
  Benchmark and Evaluation
Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation
Yihang Zheng
Yue Liu
Zhenghao Lin
Yi Luo
Xuanhe Zhou
Chen Lin
Jinsong Su
Guoliang Li
Shifu Li
ELM
279
4
0
05 Sep 2024
Dreaming is All You Need
Dreaming is All You Need
Mingze Ni
Wei Liu
131
0
0
03 Sep 2024
Pre-Trained Language Models for Keyphrase Prediction: A Review
Pre-Trained Language Models for Keyphrase Prediction: A ReviewICT express (IE), 2024
Muhammad Umair
Tangina Sultana
Young-Koo Lee
316
8
0
02 Sep 2024
Hound: Hunting Supervision Signals for Few and Zero Shot Node
  Classification on Text-attributed Graph
Hound: Hunting Supervision Signals for Few and Zero Shot Node Classification on Text-attributed Graph
Yuxiang Wang
Xiao Yan
Shiyu Jin
Quanqing Xu
Chuanhui Yang
Yuanyuan Zhu
Chuang Hu
Bo Du
Jiawei Jiang
VLM
157
2
0
01 Sep 2024
How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models
How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jiyue Jiang
Pengan Chen
Xiaoou Liu
Sheng Wang
Qinghang Bao
Lingpeng Kong
Yu Li
Chuan Wu
ELMALM
148
1
0
29 Aug 2024
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item Prediction
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item PredictionACM Transactions on Recommender Systems (TRS), 2024
Elisabeth Fischer
Albin Zehe
Andreas Hotho
Daniel Schlor
HAI
460
0
0
28 Aug 2024
EMP: Enhance Memory in Data Pruning
EMP: Enhance Memory in Data Pruning
Jinying Xiao
Ping Li
Jie Nie
Zhe Tang
Shasha Li
Xiaodong Liu
Jun Ma
Qingbo Wu
Jie Yu
VLM
370
0
0
28 Aug 2024
A Survey of Large Language Models for European Languages
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
385
6
0
27 Aug 2024
Previous
123...567...737475
Next
Page 6 of 75
Pageof 75