v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,732 papers shown

Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems

Christopher Ormerod

248

28 May 2025

MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection

276

26 May 2025

Multi-Party Conversational Agents: A Survey

307

24 May 2025

A Position Paper on the Automatic Generation of Machine Learning Leaderboards

Roelien C Timmer

Yufang Hou

Stephen Wan

459

23 May 2025

Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review

Muhammad Monjurul Karim

204

19 May 2025

Spatial-LLaVA: Enhancing Large Language Models with Spatial Referring Expressions for Visual Understanding

228

18 May 2025

Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Chenlu Wang

Weimin Lyu

Ritwik Banerjee

219

17 May 2025

Hierarchical Bracketing Encodings for Dependency Parsing as TaggingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Ana Ezquerro

David Vilares

Anssi Yli-Jyrä

Carlos Gómez-Rodríguez

353

16 May 2025

An empirical study of task and feature correlations in the reuse of pre-trained models

Jama Hussein Mohamud

Willie Brink

172

15 May 2025

Multi-Token Prediction Needs Registers

Anastasios Gerontopoulos

Spyros Gidaris

N. Komodakis

390

15 May 2025

Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies

392

13 May 2025

Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer

334

13 May 2025

I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference

410

10 May 2025

Boosting Neural Language Inference via Cascaded Interactive Reasoning

Min Li

Chun Yuan

ReLM LRM

197

10 May 2025

Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions

520

09 May 2025

Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification

338

05 May 2025

A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts

346

02 May 2025

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language ModelsThe VLDB journal (VLDB J.), 2025

192

24 Apr 2025

Bridging Cognition and Emotion: Empathy-Driven Multimodal Misinformation Detection

166

24 Apr 2025

The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes

Wencong You

Daniel Lowd

287

24 Apr 2025

RAGAT-Mind: A Multi-Granular Modeling Approach for Rumor Detection Based on MindSpore

258

24 Apr 2025

Distilling semantically aware orders for autoregressive image generation

293

23 Apr 2025

Sentiment Analysis in Software Engineering: Evaluating Generative Pre-trained Transformers

KM Khalid Saifullah

Faiaz Azmain

Habiba Hye

111

22 Apr 2025

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

...

254

21 Apr 2025

Q-FAKER: Query-free Hard Black-box Attack via Controlled GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

189

18 Apr 2025

Transformers Can Overcome the Curse of Dimensionality: A Theoretical Study from an Approximation Perspective

249

18 Apr 2025

You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models

236

16 Apr 2025

Looking beyond the next token

380

15 Apr 2025

C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection DatasetThe Web Conference (WWW), 2025

350

14 Apr 2025

Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data

Shuai Zhao

Linchao Zhu

Yi Yang

459

14 Apr 2025

Confidence Regularized Masked Language Modeling using Text Length

Seunghyun Ji

Soowon Lee

382

08 Apr 2025

SapiensID: Foundation for Human RecognitionComputer Vision and Pattern Recognition (CVPR), 2025

293

07 Apr 2025

Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling

1.2K

07 Apr 2025

TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context

S. Nigam

Balaramamahanthi Deepak Patnaik

714

07 Apr 2025

Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection

Nasar Iqbal

Niki Martinel

Mamba

232

04 Apr 2025

Is Less Really More? Fake News Detection with Limited InformationSIGKDD Explorations (SIGKDD Explor.), 2025

Zhaoyang Cao

John Nguyen

Reza Zafarani

258

02 Apr 2025

A thorough benchmark of automatic text classification: From traditional approaches to large language models

186

02 Apr 2025

COST: Contrastive One-Stage Transformer for Vision-Language Small Object TrackingInformation Fusion (Inf. Fusion), 2025

299

02 Apr 2025

Semantic Adapter for Universal Text Embeddings: Diagnosing and Mitigating Negation Blindness to Enhance Universality

Hongliu Cao

420

01 Apr 2025

A Retrieval-Based Approach to Medical Procedure Matching in Romanian

Andrei Niculae

Adrian Cosma

Emilian Radoi

339

26 Mar 2025

AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction

231

26 Mar 2025

Improving User Behavior Prediction: Leveraging Annotator Metadata in Supervised Machine Learning ModelsProceedings of the ACM on Human-Computer Interaction (PACMHCI), 2025

308

26 Mar 2025

Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content

Sai Kartheek Reddy Kasu

Shankar Biradar

Sunil Saumya

329

20 Mar 2025

Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-DistillationComputer Vision and Pattern Recognition (CVPR), 2025

412

20 Mar 2025

Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization

249

19 Mar 2025

Towards Detecting Persuasion on Social Media: From Model Development to Insights on Persuasion Strategies

236

18 Mar 2025

Can Large Vision Language Models Read Maps Like a Human?

391

18 Mar 2025

A Survey on Federated Fine-tuning of Large Language Models

525

15 Mar 2025

Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More MoreAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Arvid Frydenlund

LRM

560

13 Mar 2025

How Well Does Your Tabular Generator Learn the Structure of Tabular Data?

275

13 Mar 2025