Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,732 papers shown
DEHYDRATOR: Enhancing Provenance Graph Storage via Hierarchical Encoding and Sequence Generation
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
J. Ying
Tiantian Zhu
Mingqi Lv
Tieming Chen
158
0
0
03 Jan 2025
Efficient support ticket resolution using Knowledge Graphs
Sherwin Varghese
James Tian
79
0
0
03 Jan 2025
TED: Turn Emphasis with Dialogue Feature Attention for Emotion Recognition in Conversation
Junya Ono
Hiromi Wakaki
290
0
0
03 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Information Fusion (Inf. Fusion), 2024
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
458
82
0
31 Dec 2024
Context-Aware Deep Learning for Multi Modal Depression Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Genevieve Lam
Huang Dongyan
Weisi Lin
302
4
0
26 Dec 2024
Invisible Textual Backdoor Attacks based on Dual-Trigger
Yang Hou
Qiuling Yue
Lujia Chai
Guozhao Liao
Wenbao Han
Wei Ou
351
0
0
23 Dec 2024
ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2024
Seungdong Yoa
Seungjun Lee
Hyeseung Cho
Bumsoo Kim
Woohyung Lim
ViT
219
1
0
21 Dec 2024
Automated CVE Analysis: Harnessing Machine Learning In Designing Question-Answering Models For Cybersecurity Information Extraction
Tanjim Bin Faruk
133
1
0
21 Dec 2024
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health
Vivek Kumar
Eirini Ntoutsi
Pushpraj Singh Rajawat
Giacomo Medda
Diego Reforgiato Recupero
AI4MH
270
6
0
17 Dec 2024
Multi-Head Encoding for Extreme Label Classification
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Daojun Liang
Haixia Zhang
Dongfeng Yuan
Minggao Zhang
273
0
0
13 Dec 2024
TECO: Improving Multimodal Intent Recognition with Text Enhancement through Commonsense Knowledge Extraction
Pacific Asia Conference on Language, Information and Computation (PACLIC), 2024
Quynh-Mai Thi Nguyen
Lan-Nhi Thi Nguyen
Cam-Van Thi Nguyen
231
1
0
11 Dec 2024
Comateformer: Combined Attention Transformer for Semantic Sentence Matching
European Conference on Artificial Intelligence (ECAI), 2024
Bo Li
Di Liang
Zixin Zhang
265
9
0
10 Dec 2024
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yujiao Shi
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
297
8
0
10 Dec 2024
Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
Rongfeng Su
Changqing Xu
Xinyi Wu
Feng Xu
Xie Chen
Lan Wangt
Nan Yan
204
1
0
09 Dec 2024
Impromptu Cybercrime Euphemism Detection
International Conference on Computational Linguistics (COLING), 2024
Xiang Li
Yimiao Zhou
Laiping Zhao
Jing Li
Fengyuan Liu
350
2
0
02 Dec 2024
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hai Ye
Yixin Ji
Ziyang Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
638
1
0
02 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Computer Vision and Pattern Recognition (CVPR), 2024
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
397
61
0
02 Dec 2024
Generative Language Models Potential for Requirement Engineering Applications: Insights into Current Strengths and Limitations
Summra Saleem
Muhammad Nabeel Asim
L. V. Elst
Andreas Dengel
285
1
0
01 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
300
1
0
27 Nov 2024
What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics
Jordan J. Bird
412
0
0
26 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yujiao Shi
Min Li
372
3
0
23 Nov 2024
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Daehoon Gwak
Junwoo Park
Minho Park
C. Park
Hyunchan Lee
E. Choi
Jaegul Choo
260
1
0
21 Nov 2024
LEADRE: Multi-Faceted Knowledge Enhanced LLM Empowered Display Advertisement Recommender System
Proceedings of the VLDB Endowment (PVLDB), 2024
Fengxin Li
Yi Li
Yue Liu
Chao Zhou
Yuan Wang
...
Haijie Gu
Jie Jiang
Hongyan Liu
Biao Qin
Jun He
267
2
0
21 Nov 2024
Hysteresis Activation Function for Efficient Inference
Moshe Kimhi
Idan Kashani
A. Mendelson
Chaim Baskin
LLMSV
471
2
0
15 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
354
19
0
14 Nov 2024
Multi-head Span-based Detector for AI-generated Fragments in Scientific Papers
German Gritsai
Ildar Khabutdinov
Andrey Grabovoy
DeLMO
269
4
0
11 Nov 2024
A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics
Neural Information Processing Systems (NeurIPS), 2024
Puze Liu
Jonas Günster
Niklas Funk
Simon Gröger
Dong Chen
...
Thomas Bonenfant
Marcello Restelli
Davide Tateo
Z. Liu
Jan Peters
193
1
0
08 Nov 2024
TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask Transformer-Based Spatiotemporal Model
Shang-Ling Hsu
Emmanuel Tung
John Krumm
Cyrus Shahabi
Khurram Hassan-Shafique
197
24
0
07 Nov 2024
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Flavio Di Palo
Prateek Singhi
Bilal Fadlallah
135
17
0
07 Nov 2024
Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning
Pattern Recognition (Pattern Recogn.), 2024
Ping Li
Tao Wang
Xinkui Zhao
Xianghua Xu
Mingli Song
215
9
0
06 Nov 2024
A Library Perspective on Supervised Text Processing in Digital Libraries: An Investigation in the Biomedical Domain
ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2024
H. Kroll
Pascal Sackhoff
Bill Matthias Thang
Maha Ksouri
Wolf-Tilo Balke
221
0
0
06 Nov 2024
Trustworthy Federated Learning: Privacy, Security, and Beyond
Knowledge and Information Systems (KAIS), 2024
Chunlu Chen
Ji Liu
Haowen Tan
Xingjian Li
Kevin I-Kai Wang
Peng Li
Kouichi Sakurai
Dejing Dou
FedML
294
48
0
03 Nov 2024
Randomized Autoregressive Visual Generation
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VGen
DiffM
329
87
1
01 Nov 2024
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
312
4
0
31 Oct 2024
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution
Shikha Bordia
AILaw
168
0
0
30 Oct 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Neural Information Processing Systems (NeurIPS), 2024
Xun Guo
Shan Zhang
Yongxin He
Ting Zhang
Wanquan Feng
Haibin Huang
Chongyang Ma
DeLMO
317
51
0
28 Oct 2024
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
International Conference on Learning Representations (ICLR), 2024
Justin Deschenaux
Çağlar Gülçehre
497
25
0
28 Oct 2024
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
ACM Multimedia (MM), 2024
Wu Junran
Chen Xueyuan
Li Shangzhe
260
2
0
27 Oct 2024
Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch
Donglin Di
Weinan Zhang
Yue Zhang
Fanglin Wang
303
2
0
24 Oct 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
Applied Soft Computing (Appl. Soft Comput.), 2024
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
242
3
0
24 Oct 2024
Dependency Graph Parsing as Sequence Labeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ana Ezquerro
David Vilares
Carlos Gómez-Rodríguez
170
2
0
23 Oct 2024
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
Nicholas Walker
161
0
0
23 Oct 2024
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jiaqi Xue
Qian Lou
Mengxin Zheng
217
3
0
23 Oct 2024
Multi-head Sequence Tagging Model for Grammatical Error Correction
Engineering applications of artificial intelligence (EAAI), 2024
Kamal Al-Sabahi
Kang Yang
Wangwang Liu
Guanyu Jiang
Xian Li
Ming Yang
181
3
0
21 Oct 2024
Evaluation Of P300 Speller Performance Using Large Language Models Along With Cross-Subject Training
Nithin Parthasarathy
J. Soetedjo
S. Panchavati
Nitya Parthasarathy
C. Arnold
N. Pouratian
W. Speier
68
0
0
19 Oct 2024
Controllable Discovery of Intents: Incremental Deep Clustering Using Semi-Supervised Contrastive Learning
International Joint Conference on Natural Language Processing (IJCNLP), 2024
Mrinal Rawat
Hithesh Sankararaman
Victor Barrès
303
0
0
18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
SeongYeub Chu
JongWoo Kim
Bryan Wong
MunYong Yi
LRM
471
13
0
18 Oct 2024
On the Regularization of Learnable Embeddings for Time Series Forecasting
L. Butera
G. Felice
Andrea Cini
Cesare Alippi
AI4TS
368
0
0
18 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
International Conference on Learning Representations (ICLR), 2024
Jiacheng Ye
Lei Li
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
LRM
644
74
0
18 Oct 2024
Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Amit Gupta
Petros Karypis
Daniel S. Karls
Mingjian Wen
Saurav Manchanda
E. Tadmor
George Karypis
128
4
0
17 Oct 2024
Previous
1
2
3
4
5
6
...
73
74
75
Next
Page 5 of 75
Page
of 75
Go