ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown

Milestones in Bengali Sentiment Analysis leveraging Transformer-models: Fundamentals, Challenges and Future Directions

Saptarshi Sengupta

Shreya Ghosh

Prasenjit Mitra

Tarikul Islam Tamiti

360

15 Jan 2024

Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering

Biophysics Reports (BR), 2024

450

15 Jan 2024

Harnessing Large Language Models Over Transformer Models for Detecting Bengali Depressive Social Media Text: A Comprehensive Study

Natural Language Processing Journal (JNLP), 2024

Ahmadul Karim Chowdhury

Saidur Rahman Sujon

Md. Shirajus Salekin Shafi

203

14 Jan 2024

Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection

Muhammad Tayyab Zamir

107

12 Jan 2024

Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text

International Conference on Web and Social Media (ICWSM), 2024

164

12 Jan 2024

LLMRS: Unlocking Potentials of LLM-Based Recommender Systems for Software Purchase

301

12 Jan 2024

Multi-Task Learning for Front-End Text Processing in TTS

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

178

12 Jan 2024

Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models

BigData Congress [Services Society] (BSS), 2023

173

11 Jan 2024

Phishing Website Detection through Multi-Model Analysis of HTML Content

202

09 Jan 2024

Setting the Record Straight on Transformer Oversmoothing

G. Dovonon

M. Bronstein

Matt J. Kusner

406

09 Jan 2024

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

338

08 Jan 2024

TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment

239

08 Jan 2024

An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks

Science of Computer Programming (SCP), 2024

Chen Yang

Peng Liang

Zinan Ma

229

08 Jan 2024

Building Efficient and Effective OpenQA Systems for Low-Resource Languages

269

07 Jan 2024

MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition

Zheng Lian

Guoying Zhao

Yong Ren

Hao Gu

429

07 Jan 2024

Enhancing Context Through Contrast

232

06 Jan 2024

SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations

168

06 Jan 2024

Lotto: Secure Participant Selection against Adversarial Servers in Federated Learning

329

05 Jan 2024

Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

273

05 Jan 2024

Understanding LLMs: A Comprehensive Overview from Training to Inference

...

Tuo Zhang

Tianming Liu

464

124

04 Jan 2024

Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences

International Conference on AI in Finance (ICAF), 2023

223

03 Jan 2024

Evaluating Fairness in Self-supervised and Supervised Models for Sequential Data

Sofia Yfantidou

Dimitris Spathis

Marios Constantinides

Athena Vakali

Daniele Quercia

F. Kawsar

325

03 Jan 2024

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

S.M. Towhidul Islam Tonmoy

Vinija Jain

461

356

02 Jan 2024

Unifying Structured Data as Graph for Data-to-Text Pre-Training

Transactions of the Association for Computational Linguistics (TACL), 2024

Min Yang

...

Fei Huang

293

02 Jan 2024

Masked Modeling for Self-supervised Representation Learning on Vision and Beyond

Siyuan Li

Luyuan Zhang

Zedong Wang

Di Wu

Lirong Wu

...

Jun Xia

Cheng Tan

Yang Liu

Baigui Sun

Stan Z. Li

SSL

301

31 Dec 2023

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Computer Vision and Pattern Recognition (CVPR), 2023

891

31 Dec 2023

Research on the Laws of Multimodal Perception and Cognition from a Cross-cultural Perspective -- Taking Overseas Chinese Gardens as an Example

110

29 Dec 2023

Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination

207

25 Dec 2023

Multi-level biomedical NER through multi-granularity embeddings and enhanced labeling

211

24 Dec 2023

Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference

Yixiao Du

235

23 Dec 2023

Characterizing and Classifying Developer Forum Posts with their Intentions

Xingfang Wu

Eric Thibodeau-Laufer

140

21 Dec 2023

DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization

Rahul Chand

Yashoteja Prabhu

Pratyush Kumar

190

20 Dec 2023

Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment

Haoran Xie

300

272

19 Dec 2023

Assessing Logical Reasoning Capabilities of Encoder-Only Transformer Models

Paulo Pirozelli

M. M. José

Paulo de Tarso P. Filho

A. Brandão

Fabio Gagliardi Cozman

LRM ELM

330

18 Dec 2023

A mathematical perspective on Transformers

Bulletin of the American Mathematical Society (BAMS), 2023

671

104

17 Dec 2023

RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding

243

15 Dec 2023

BinGo: Identifying Security Patches in Binary Code with Graph Representation Learning

ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2023

Qi Li

Kun Sun

147

13 Dec 2023

One-Step Diffusion Distillation via Deep Equilibrium Models

Neural Information Processing Systems (NeurIPS), 2023

Zhengyang Geng

Ashwini Pokle

Trevor Killeen

294

12 Dec 2023

Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models

196

11 Dec 2023

Why "classic" Transformers are shallow and how to make them go deep

Yueyao Yu

Yin Zhang

ViT

277

11 Dec 2023

Transformer as Linear Expansion of Learngene

AAAI Conference on Artificial Intelligence (AAAI), 2023

198

09 Dec 2023

Sim-GPT: Text Similarity via GPT Annotated Data

Jiwei Li

218

09 Dec 2023

Enhanced E-Commerce Attribute Extraction: Innovating with Decorative Relation Correction and LLAMA 2.0-Based Annotation

104

09 Dec 2023

Graph Convolutions Enrich the Self-Attention in Transformers!

Jeongwhan Choi

411

07 Dec 2023

RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training

Madian Khabsa

241

07 Dec 2023

Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification

Navid Mohammadi Foumani

297

07 Dec 2023

Detecting Rumor Veracity with Only Textual Information by Double-Channel Structure

International Workshop on Natural Language Processing for Social Media (SocialNLP), 2023

Alex G. Kim

Sangwon Yoon

173

06 Dec 2023

Large Language Models on Graphs: A Comprehensive Survey

IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

Heng Ji

342

248

05 Dec 2023

Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment

ACM Multimedia (ACM MM), 2023

338

04 Dec 2023

Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?

171

03 Dec 2023