v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,732 papers shown

Autoregressive Image Generation with Randomized Parallel Decoding

277

13 Mar 2025

KV-Distill: Nearly Lossless Learnable Context Compression for LLMs

271

13 Mar 2025

Sentiment Analysis in SemEval: A Review of Sentiment Identification ApproachesInternational Journal of Electrical and Computer Engineering (IJECE) (IJECE), 2023

Bousselham EL HADDAOUI

R. Chiheb

R. Faizi

A. E. Afia

264

13 Mar 2025

ARLED: Leveraging LED-based ARMAN Model for Abstractive Summarization of Persian Long Documents

Samira Zangooei

Amirhossein Darmani

Hossein Farahmand Nezhad

Laya Mahmoudi

232

13 Mar 2025

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

339

13 Mar 2025

Towards Graph Foundation Models: A Transferability Perspective

260

13 Mar 2025

CALLM: Understanding Cancer Survivors' Emotions and Intervention Opportunities via Mobile Diaries and Context-Aware Language Models

186

12 Mar 2025

LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence RerankingJournal of Artificial Intelligence Research (JAIR), 2025

Yan Yan

Junyuan Liu

Bo Zhang

181

11 Mar 2025

LiSu: A Dataset and Method for LiDAR Surface Normal EstimationComputer Vision and Pattern Recognition (CVPR), 2025

Dušan Malić

Christian Fruhwirth-Reisinger

Samuel Schulter

Horst Possegger

3DV

292

11 Mar 2025

eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference

864

10 Mar 2025

Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

348

10 Mar 2025

Learning-Order Autoregressive Models with Application to Molecular Graph Generation

361

07 Mar 2025

UniNet: A Unified Multi-granular Traffic Modeling Framework for Network SecurityIEEE Transactions on Cognitive Communications and Networking (TCCN), 2025

Binghui Wu

D. Divakaran

M. Gurusamy

342

06 Mar 2025

An Optimization Algorithm for Multimodal Data Alignment

149

05 Mar 2025

Zero-Shot Complex Question-Answering on Long Scientific Documents

Wanting Wang

RALM

134

04 Mar 2025

Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual RepresentationAAAI Conference on Artificial Intelligence (AAAI), 2025

250

02 Mar 2025

Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition

347

28 Feb 2025

Revisiting Kernel Attention with Correlated Gaussian Process RepresentationConference on Uncertainty in Artificial Intelligence (UAI), 2025

366

27 Feb 2025

CAMEx: Curvature-aware Merging of ExpertsInternational Conference on Learning Representations (ICLR), 2025

356

26 Feb 2025

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

478

26 Feb 2025

Exploring Graph Learning Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation

326

26 Feb 2025

LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation

368

25 Feb 2025

How Vital is the Jurisprudential Relevance: Law Article Intervened Legal Case Retrieval and Matching

293

25 Feb 2025

Predicting Through Generation: Why Generation Is Better for PredictionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Md. Kowsher

Nusrat Jahan Prottasha

553

25 Feb 2025

Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT

308

25 Feb 2025

Detecting Code Vulnerabilities with Heterogeneous GNN Training

Yu Luo

Weifeng Xu

Dianxiang Xu

324

24 Feb 2025

Streaming Looking Ahead with Token-level Self-reward

Han Zhang

Ruixin Hong

Dong Yu

230

24 Feb 2025

Predictive Modeling: BIM Command Recommendation Based on Large-scale Usage LogsAdvanced Engineering Informatics (AEI), 2025

221

23 Feb 2025

IPAD: Inverse Prompt for AI Detection - A Robust and Interpretable LLM-Generated Text Detector

268

21 Feb 2025

Model Privacy: A Unified Framework to Understand Model Stealing Attacks and Defenses

G. Wang

Yuhong Yang

Jie Ding

177

21 Feb 2025

CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Mst. Fahmida Sultana Naznin

Adnan Ibney Faruq

Mostafa Rifat Tazwar

Md Jobayer

Md. Mehedi Hasan Shawon

Md Rakibul Hasan

MedIm

259

21 Feb 2025

Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models

Ranjan Sapkota

Shaina Raza

Manoj Karkee

277

21 Feb 2025

What Are They Filtering Out? An Experimental Benchmark of Filtering Strategies for Harm Reduction in Pretraining Datasets

Marco Antonio Stranisci

Christian Hardmeier

411

17 Feb 2025

The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training

Matteo Saponati

Pascal Sager

Pau Vilimelis Aceituno

Thilo Stadelmann

Benjamin Grewe

211

15 Feb 2025

Handwritten Text Recognition: A Survey

Carlos Garrido-Munoz

Antonio Ríos-Vila

Jorge Calvo-Zaragoza

318

12 Feb 2025

Context information can be more important than reasoning for time series forecasting with a large language modelInternational Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2025

Janghoon Yang

AI4TS LRM

202

08 Feb 2025

Lexical Substitution is not Synonym Substitution: On the Importance of Producing Contextually Relevant Word SubstitutesInternational Conference on Agents and Artificial Intelligence (ICAART), 2025

Juraj Vladika

Stephen Meisenbacher

Florian Matthes

555

06 Feb 2025

Lowering the Barrier of Machine Learning: Achieving Zero Manual Labeling in Review Classification Using LLMsInternational Conference on Computing and Artificial Intelligence (ICCAI), 2025

Yejian Zhang

Shingo Takada

211

05 Feb 2025

A Framework for Double-Blind Federated Adaptation of Foundation Models

Nurbek Tastan

Karthik Nandakumar

FedML

322

03 Feb 2025

DEUCE: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active LearningTransactions of the Association for Computational Linguistics (TACL), 2024

421

01 Feb 2025

Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings

Ahmed K. Kadhim

Lei Jiao

Rishad Shafik

Ole-Christoffer Granmo

DeLMO

418

31 Jan 2025

Detecting harassment and defamation in cyberbullying with emotion-adaptive trainingInternational Conference on Web and Social Media (ICWSM), 2025

Peiling Yi

A. Zubiaga

Yunfei Long

395

28 Jan 2025

Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks

Ziwei Liu

Qi Zhang

Lifu Gao

165

28 Jan 2025

TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data

P. Tiwald

Ivona Krchova

Andrey Sidorenko

Mariana Vargas-Vieyra

Mario Scriminaci

Michael Platzer

461

21 Jan 2025

Generative AI in Cybersecurity: A Comprehensive Review of LLM Applications and Vulnerabilities

131

20 Jan 2025

Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU ClustersIEEE Conference on Computer Communications (IEEE INFOCOM), 2025

183

09 Jan 2025

AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Ten Modalities via Language as a Reference FrameworkIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023

...

487

08 Jan 2025

Trust Modeling in Counseling Conversations: A Benchmark Study

187

06 Jan 2025

GORAG: Graph-based Online Retrieval Augmented Generation for Dynamic Few-shot Social Media Text Classification

497

06 Jan 2025

Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language UnderstandingInternational Conference on Computational Linguistics (COLING), 2025

Binh-Nguyen Nguyen

Yang He

260

05 Jan 2025