v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,731 papers shown

CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-TrainingACM Multimedia (ACM MM), 2022

377

16 Oct 2024

NSmark: Null Space Based Black-box Watermarking Defense Framework for Language Models

184

16 Oct 2024

Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models

181

14 Oct 2024

Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling

Xiangyu Yue

230

14 Oct 2024

MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping

Taozhe Li

Wei Sun

302

14 Oct 2024

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

895

12 Oct 2024

Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling

318

12 Oct 2024

Text Classification using Graph Convolutional Networks: A Comprehensive SurveyACM Computing Surveys (ACM CSUR), 2024

Syed Mustafa Haider Rizvi

Ramsha Imran

Arif Mahmood

GNN OOD FaML

207

12 Oct 2024

HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation PredictionNeural Information Processing Systems (NeurIPS), 2024

Yong Li

201

10 Oct 2024

Chain and Causal Attention for Efficient Entity TrackingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Erwan Fagnou

Paul Caillon

Blaise Delattre

Alexandre Allauzen

245

07 Oct 2024

Investigating large language models for their competence in extracting grammatically sound sentences from transcribed noisy utterancesConference on Computational Natural Language Learning (CoNLL), 2024

Alina Wróblewska

169

07 Oct 2024

Computational design of target-specific linear peptide binders with TransformerBeta

Haowen Zhao

Francesco A. Aprile

Barbara Bravi

262

07 Oct 2024

Hyper-multi-step: The Truth Behind Difficult Long-context Tasks

Yijiong Yu

...

347

06 Oct 2024

Fundamental Limitations on Subquadratic Alternatives to TransformersInternational Conference on Learning Representations (ICLR), 2024

Josh Alman

Hantao Yu

436

05 Oct 2024

Variational Language Concepts for Interpreting Foundation Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Hao Wang

397

04 Oct 2024

Linear Transformer Topological Masking with Graph Random FeaturesInternational Conference on Learning Representations (ICLR), 2024

...

Richard E. Turner

Adrian Weller

Krzysztof Choromanski

293

04 Oct 2024

Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs

392

04 Oct 2024

Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

174

03 Oct 2024

On The Adaptation of Unlimiformer for Decoder-Only TransformersInternational Conference on Language Resources and Evaluation (LREC), 2024

Xia Song

211

02 Oct 2024

Preserving Generalization of Language models in Few-shot Continual Relation ExtractionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Quyen Tran

Nguyen Xuan Thanh

Nguyen Hoang Anh

Nam Le Hai

Trung Le

Linh Van Ngo

Thien Huu Nguyen

CLL KELM

287

01 Oct 2024

Perception Compressor: A Training-Free Prompt Compression Framework in Long Context ScenariosNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Lin Hai

Hai-Tao Zheng

VLM

335

28 Sep 2024

Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications

Aditi Godbole

Jabin Geevarghese George

Smita Shandilya

258

27 Sep 2024

Trustworthy AI: Securing Sensitive Data in Large Language ModelsApplied Informatics (AI), 2024

G. Feretzakis

V. Verykios

230

26 Sep 2024

Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

Zeyneb N. Kaya

Souvick Ghosh

129

25 Sep 2024

The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles

Dusit Niyato

Hongyang Du

186

24 Sep 2024

Improving Academic Skills Assessment with NLP and Ensemble LearningInternational Conference on Information Systems and Computer Aided Education (ICISCAE), 2024

Xinyi Huang

223

23 Sep 2024

"I Never Said That": A dataset, taxonomy and baselines on response clarity classificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Konstantinos Thomas

Giorgos Filandrianos

Maria Lymperaiou

Chrysoula Zerva

Giorgos Stamou

178

20 Sep 2024

GAProtoNet: A Multi-head Graph Attention-based Prototypical Network for Interpretable Text ClassificationInternational Conference on Computational Linguistics (COLING), 2024

Ximing Wen

Wenjuan Tan

Rosina O. Weber

220

20 Sep 2024

Incremental and Data-Efficient Concept Formation to Support Masked Word Prediction

Xin Lian

Nishant Baglodi

Christopher J. MacLellan

152

19 Sep 2024

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text RecognizerACM Multimedia (MM), 2024

Humen Zhong

Zhibo Yang

Zhaohai Li

Peng Wang

Jun Tang

Wenqing Cheng

Cong Yao

256

18 Sep 2024

Evaluation of pretrained language models on music understanding

Yannis Vasilakis

Rachel M. Bittner

Johan Pauwels

261

17 Sep 2024

OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities

Hanane Azzag

M. Lebbah

ObjD

350

17 Sep 2024

BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation

Seyed Rohollah Hosseyni

Ali Ahmad Rahmani

S. J. Seyedmohammadi

Sanaz Seyedin

Arash Mohammadi

DiffM

206

17 Sep 2024

Language Models Learn Metadata: Political Stance Detection Case Study

Stanley Cao

Felix Drinkall

170

15 Sep 2024

AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs

250

15 Sep 2024

Synthetic4Health: Generating Annotated Synthetic Clinical LettersFrontiers in Digital Health (Front. Digit. Health), 2024

Libo Ren

170

14 Sep 2024

Layerwise Change of Knowledge in Neural NetworksInternational Conference on Machine Learning (ICML), 2024

Xu Cheng

Tian Han

223

13 Sep 2024

TheraGen: Therapy for Every Generation

175

12 Sep 2024

Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout

Zhongliang Liu

152

11 Sep 2024

DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models

Maryam Akhavan Aghdam

Hongpeng Jin

Yanzhao Wu

MoE

225

10 Sep 2024

Expanding Expressivity in Transformer Models with MöbiusAttention

Anna-Maria Halacheva

M. Nayyeri

Steffen Staab

227

08 Sep 2024

An overview of domain-specific foundation model: key technologies, applications and challengesScience China Information Sciences (Sci. China Inf. Sci.), 2024

492

06 Sep 2024

Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation

Guoliang Li

279

05 Sep 2024

Dreaming is All You Need

Mingze Ni

Wei Liu

131

03 Sep 2024

Pre-Trained Language Models for Keyphrase Prediction: A ReviewICT express (IE), 2024

Muhammad Umair

Tangina Sultana

Young-Koo Lee

316

02 Sep 2024

Hound: Hunting Supervision Signals for Few and Zero Shot Node Classification on Text-attributed Graph

Xiao Yan

Chuang Hu

Jiawei Jiang

157

01 Sep 2024

How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

148

29 Aug 2024

Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item PredictionACM Transactions on Recommender Systems (TRS), 2024

460

28 Aug 2024

EMP: Enhance Memory in Data Pruning

Shasha Li

Xiaodong Liu

Jun Ma

Qingbo Wu

Jie Yu

VLM

370

28 Aug 2024

A Survey of Large Language Models for European Languages

Wazir Ali

S. Pyysalo

385

27 Aug 2024