Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,048 papers shown
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
Applied Soft Computing (Appl. Soft Comput.), 2024
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
242
3
0
24 Oct 2024
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
International Conference on Computer Aided Design (ICCAD), 2024
Zebin Yang
Renze Chen
Taiqiang Wu
Ngai Wong
Yun Liang
Runsheng Wang
R. Huang
Meng Li
MQ
258
2
0
23 Oct 2024
Quantifying the Risks of Tool-assisted Rephrasing to Linguistic Diversity
Mengying Wang
Andreas Spitz
113
0
0
23 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Chen Zhang
Conghui Tan
Rongzhong Lian
MoMe
297
1
0
21 Oct 2024
Causality for Large Language Models
Anpeng Wu
Kun Kuang
Minqin Zhu
Yingrong Wang
Yujia Zheng
Kairong Han
Yangqiu Song
Guangyi Chen
Leilei Gan
Kun Zhang
LRM
325
19
0
20 Oct 2024
Pseudo-label Refinement for Improving Self-Supervised Learning Systems
Zia-ur-Rehman
Arif Mahmood
Wenxiong Kang
239
3
0
18 Oct 2024
Attuned to Change: Causal Fine-Tuning under Latent-Confounded Shifts
Jialin Yu
Yuxiang Zhou
Yulan He
Nevin L. Zhang
Ricardo Silva
Philip Torr
Ricardo M. A. Silva
386
0
0
18 Oct 2024
From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition
Qiyuan Yang
Pengda Wang
Luke D. Plonsky
Frederick L. Oswald
Hanjie Chen
ELM
233
2
0
17 Oct 2024
Unitary Multi-Margin BERT for Robust Natural Language Processing
Hao-Yuan Chang
Kang L. Wang
AAML
173
0
0
16 Oct 2024
FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression
Zhenheng Tang
Xueze Kang
Yiming Yin
Xinglin Pan
Yuxin Wang
...
Shaohuai Shi
Amelie Chi Zhou
Bo Li
Bingsheng He
Xiaowen Chu
AI4CE
231
10
0
16 Oct 2024
Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kai Yao
P. Gao
Lichun Li
Yuan Zhao
Xiaofeng Wang
Wei Wang
Jianke Zhu
152
7
0
15 Oct 2024
TSDS: Data Selection for Task-Specific Model Finetuning
Neural Information Processing Systems (NeurIPS), 2024
Zifan Liu
Amin Karbasi
Theodoros Rekatsinas
309
15
0
15 Oct 2024
Arrhythmia Classification Using Graph Neural Networks Based on Correlation Matrix
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024
Seungwoo Han
376
12
0
14 Oct 2024
Lambda-Skip Connections: the architectural component that prevents Rank Collapse
International Conference on Learning Representations (ICLR), 2024
Federico Arangath Joseph
Jerome Sieber
Melanie Zeilinger
Carmen Amo Alonso
465
2
0
14 Oct 2024
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
International Conference on Learning Representations (ICLR), 2024
Tong Wu
Shujian Zhang
Kaiqiang Song
Silei Xu
Sanqiang Zhao
Ravi Agrawal
Sathish Indurthi
Chong Xiang
Prateek Mittal
Wenxuan Zhou
406
32
0
09 Oct 2024
Exploring Large Language Models for Detecting Mental Disorders
Gleb Kuzmin
Petr Strepetov
Maksim Stankevich
Natalia Chudova
Artem Shelmanov
Ivan Smirnov
230
2
0
09 Oct 2024
Towards the generation of hierarchical attack models from cybersecurity vulnerabilities using language models
Applied Soft Computing (Appl. Soft Comput.), 2024
Kacper Sowka
Vasile Palade
Xiaorui Jiang
Hesam Jadidbonab
212
2
0
07 Oct 2024
Computational design of target-specific linear peptide binders with TransformerBeta
Haowen Zhao
Francesco A. Aprile
Barbara Bravi
262
0
0
07 Oct 2024
Regularized Neural Ensemblers
Sebastian Pineda Arango
Maciej Janowski
Lennart Purucker
Arber Zela
Frank Hutter
Josif Grabocka
UQCV
301
0
0
06 Oct 2024
Variational Language Concepts for Interpreting Foundation Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hengyi Wang
Shiwei Tan
Zhiqing Hong
Desheng Zhang
Hao Wang
397
4
0
04 Oct 2024
Demystifying the Token Dynamics of Deep Selective State Space Models
International Conference on Learning Representations (ICLR), 2024
Thieu N. Vo
Tung D. Pham
Xin T. Tong
Tan Minh Nguyen
Mamba
318
1
0
04 Oct 2024
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Wei Wu
Chao Wang
L. Chen
Mingze Yin
Yiheng Zhu
Kun Fu
Jieping Ye
Hui Xiong
Zheng Wang
389
3
0
04 Oct 2024
Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models
Mingxue Xu
Sadia Sharmin
Danilo Mandic
262
3
0
03 Oct 2024
Morphological evaluation of subwords vocabulary used by BETO language model
Óscar García-Sierra
Ana Fernández-Pampillón Cesteros
Miguel Ortega-Martín
216
0
0
03 Oct 2024
DeIDClinic: A Multi-Layered Framework for De-identification of Clinical Free-text Data
Angel Paul
Dhivin Shaji
Lifeng Han
Warren Del-Pinto
Goran Nenadic
OOD
225
3
0
02 Oct 2024
DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models
Yuxuan Zhang
Ruizhe Li
MoMe
487
2
0
02 Oct 2024
On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding
Kevin Xu
Issei Sato
761
7
0
02 Oct 2024
Depression detection in social media posts using transformer-based models and auxiliary features
Social Network Analysis and Mining (SNAM), 2024
Marios Kerasiotis
Loukas Ilias
D. Askounis
184
20
0
30 Sep 2024
FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models
Yucheng Xie
Fu Feng
Ruixiao Shi
Jing Wang
Xin Geng
AI4CE
200
5
0
28 Sep 2024
On the Inductive Bias of Stacking Towards Improving Reasoning
Neural Information Processing Systems (NeurIPS), 2024
Nikunj Saunshi
Stefani Karp
Shankar Krishnan
Sobhan Miryoosefi
Sashank J. Reddi
Sanjiv Kumar
LRM
AI4CE
280
13
0
27 Sep 2024
Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning
Yu Fu
Jie He
Yifan Yang
Qun Liu
Deyi Xiong
OffRL
LRM
418
0
0
27 Sep 2024
DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Devrim Cavusoglu
Secil Sen
Ulas Sert
153
7
0
26 Sep 2024
Integrating Hierarchical Semantic into Iterative Generation Model for Entailment Tree Explanation
Qin Wang
Jianzhou Feng
Yiming Xu
192
0
0
26 Sep 2024
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
Neural Information Processing Systems (NeurIPS), 2024
Ming Dai
Lingfeng Yang
Yihao Xu
Zhenhua Feng
Wankou Yang
ObjD
452
39
0
26 Sep 2024
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Taehun Cha
Donghun Lee
HILM
219
1
0
25 Sep 2024
dnaGrinder: a lightweight and high-capacity genomic foundation model
Qihang Zhao
Chi Zhang
Weixiong Zhang
183
3
0
24 Sep 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
383
17
0
23 Sep 2024
Data-centric NLP Backdoor Defense from the Lens of Memorization
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zhenting Wang
Zhizhi Wang
Haoyang Ling
Mengnan Du
Juan Zhai
Shiqing Ma
266
5
0
21 Sep 2024
Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Amrit Diggavi Seshadri
160
1
0
21 Sep 2024
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs
International Conference on Field-Programmable Technology (ICFPT), 2024
Ehsan Kabir
Md. Arafat Kabir
Austin R. J. Downey
Jason D. Bakos
David Andrews
Miaoqing Huang
GNN
276
2
0
21 Sep 2024
Profiling Patient Transcript Using Large Language Model Reasoning Augmentation for Alzheimer's Disease Detection
Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2024
Chin-Po Chen
Jeng-Lin Li
LM&MA
90
2
0
19 Sep 2024
Evaluation of pretrained language models on music understanding
Yannis Vasilakis
Rachel M. Bittner
Johan Pauwels
261
4
0
17 Sep 2024
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities
Hanane Azzag
Hanane Azzag
M. Lebbah
ObjD
350
2
0
17 Sep 2024
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison
Judy Hanwen Shen
Archit Sharma
Jun Qin
182
13
0
15 Sep 2024
Deep Fast Machine Learning Utils: A Python Library for Streamlined Machine Learning Prototyping
Fabi Prezja
AI4CE
132
0
0
14 Sep 2024
Multi-intent Aware Contrastive Learning for Sequential Recommendation
International Conference on Artificial Neural Networks (ICANN), 2024
Junshu Huang
Zi Long
Xianghua Fu
Yin Chen
HAI
147
2
0
13 Sep 2024
A BERT-Based Summarization approach for depression detection
Hossein Salahshoor Gavalan
Mohmmad Naim Rastgoo
Bahareh Nakisa
136
6
0
13 Sep 2024
TheraGen: Therapy for Every Generation
Kartikey Doshi
Jimit Shah
Narendra Shekokar
AI4MH
175
0
0
12 Sep 2024
Enhancing adversarial robustness in Natural Language Inference using explanations
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Alexandros Koulakos
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
SILM
AAML
393
3
0
11 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
225
6
0
10 Sep 2024
Previous
1
2
3
...
5
6
7
...
59
60
61
Next
Page 6 of 61
Page
of 61
Go