1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,050 papers shown
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer
Boan Liu
Liang Ding
Li Shen
Keqin Peng
Yu Cao
Dazhao Cheng
Dacheng Tao
MoE
212
19
0
15 Oct 2023
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering
Md. Rony
Christian Suess
Sinchana Ramakanth Bhat
Viju Sudhi
Julia Schneider
Maximilian Vogel
Roman Teucher
Ken E. Friedl
S. Sahoo
230
15
0
14 Oct 2023
Low-Resource Clickbait Spoiling for Indonesian via Question Answering
Ni Putu Intan Maharani
Ayu Purwarianti
Alham Fikri Aji
167
5
0
12 Oct 2023
To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer
Md. Mushfiqur Rahman
Fardin Ahsan Sakib
Fahim Faisal
Antonios Anastasopoulos
219
4
0
12 Oct 2023
Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Huiyin Xue
Nikolaos Aletras
330
1
0
11 Oct 2023
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models
Findings (Findings), 2023
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Aishwarya N. Reganti
Vinija Jain
Vasu Sharma
Amit P. Sheth
Amitava Das
287
2
0
11 Oct 2023
The Temporal Structure of Language Processing in the Human Brain Corresponds to The Layered Hierarchy of Deep Language Models
bioRxiv, 2023
Ariel Goldstein
Eric Ham
Mariano Schain
Samuel A. Nastase
Zaid Zada
...
Avinatan Hassidim
O. Devinsky
A. Flinker
Omer Levy
Uri Hasson
AI4CE
263
16
0
11 Oct 2023
Sparse Universal Transformer
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Shawn Tan
Songlin Yang
Zhenfang Chen
Aaron Courville
Chuang Gan
MoE
266
25
0
11 Oct 2023
A Comparative Study of Transformer-based Neural Text Representation Techniques on Bug Triaging
International Conference on Automated Software Engineering (ASE), 2023
Atish Kumar Dipongkor
Kevin Moran
60
12
0
10 Oct 2023
P5: Plug-and-Play Persona Prompting for Personalized Response Selection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Joosung Lee
Min Sik Oh
Donghun Lee
205
6
0
10 Oct 2023
Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction
Journal of Biomedical Informatics (JBI), 2023
C.A.I. Peng
Xi Yang
Kaleb E. Smith
Zehao Yu
Aokun Chen
Jiang Bian
Yonghui Wu
VLM
LRM
206
64
0
10 Oct 2023
Evolution of Natural Language Processing Technology: Not Just Language Processing Towards General Purpose AI
Masahiro Yamamoto
187
1
0
10 Oct 2023
LLM for SoC Security: A Paradigm Shift
IEEE Access, 2023
Dipayan Saha
Shams Tarek
Katayoon Yahyaei
S. Saha
Jingbo Zhou
M. Tehranipoor
Farimah Farahmandi
360
85
0
09 Oct 2023
IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort Advertisements
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
V. Saxena
Benjamin Bashpole
Gijs Van Dijck
Gerasimos Spanakis
287
5
0
09 Oct 2023
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongqiu Wu
Linfeng Liu
Haizhen Zhao
Min Zhang
LRM
AI4CE
NAI
ELM
236
8
0
09 Oct 2023
On the Zero-Shot Generalization of Machine-Generated Text Detectors
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiao Pu
Jingyu Zhang
Xiaochuang Han
Yulia Tsvetkov
Tianxing He
DeLMO
169
23
0
08 Oct 2023
ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning
Wang Lu
Hao Yu
Yongfeng Zhang
Damien Teney
Haohan Wang
Yiqiang Chen
Qiang Yang
Xing Xie
Xiangyang Ji
238
10
0
08 Oct 2023
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Megha Chakraborty
S.M. Towhidul Islam Tonmoy
S. M. Mehedi
Krish Sharma
Niyar R. Barman
...
Tanay Kumar
Vinija Jain
Vasu Sharma
Amit P. Sheth
Amitava Das
DeLMO
203
27
0
08 Oct 2023
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models
Song Guo
Jiahang Xu
Li Zhang
Mao Yang
272
18
0
08 Oct 2023
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Xiusi Chen
Jyun-Yu Jiang
Wei-Cheng Chang
Cho-Jui Hsieh
Hsiang-Fu Yu
Wei Wang
321
18
0
08 Oct 2023
The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vipula Rawte
Swagata Chakraborty
Agnibh Pathak
Anubhav Sarkar
S.M. Towhidul Islam Tonmoy
Vasu Sharma
Mikel Artetxe
Punit Daniel Simig
HILM
325
186
0
08 Oct 2023
A New Dataset for End-to-End Sign Language Translation: The Greek Elementary School Dataset
Andreas Voskou
Konstantinos P. Panousis
Harris Partaourides
Kyriakos Tolias
S. Chatzis
SLR
254
7
0
07 Oct 2023
A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks
Fangshuo Liao
Md Tahmid Rahman Laskar
Cruz Barnum
Jimmy Xiangji Huang
AI4MH
LM&MA
367
119
0
06 Oct 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
International Conference on Learning Representations (ICLR), 2023
Jeonghye Kim
Suyoung Lee
Woojun Kim
Young-Jin Sung
OffRL
311
28
0
04 Oct 2023
ResidualTransformer: Residual Low-Rank Learning with Weight-Sharing for Transformer Layers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yiming Wang
Jinyu Li
224
11
0
03 Oct 2023
ScaleNet: An Unsupervised Representation Learning Method for Limited Information
German Conference on Pattern Recognition (GCPR), 2023
Huili Huang
M. M. Roozbahani
SSL
328
786
0
03 Oct 2023
Selective Feature Adapter for Dense Vision Transformers
XueQing Deng
Qi Fan
Xiaojie Jin
Linjie Yang
Peng Wang
226
1
0
03 Oct 2023
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models
International Conference on Learning Representations (ICLR), 2023
Zijun Wu
Yongkang Wu
Lili Mou
VLM
216
9
0
02 Oct 2023
From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication
International Conference on Learning Representations (ICLR), 2023
Irene Cannistraci
Luca Moschella
Marco Fumero
Valentino Maiorca
Emanuele Rodolà
253
19
0
02 Oct 2023
Improving Length-Generalization in Transformers via Task Hinting
Pranjal Awasthi
Anupam Gupta
192
14
0
01 Oct 2023
RelBERT: Embedding Relations with Language Models
Artificial Intelligence (AIJ), 2023
Asahi Ushio
Jose Camacho-Collados
Steven Schockaert
KELM
324
3
0
30 Sep 2023
KLoB: a Benchmark for Assessing Knowledge Locating Methods in Language Models
Yiming Ju
Zheng Zhang
KELM
168
9
0
28 Sep 2023
ELIP: Efficient Discriminative Language-Image Pre-training with Fewer Vision Tokens
Yangyang Guo
Haoyu Zhang
Yongkang Wong
Liqiang Nie
Mohan Kankanhalli
VLM
276
5
0
28 Sep 2023
Question answering using deep learning in low resource Indian language Marathi
Dhiraj Amin
S. Govilkar
Sagar Kulkarni
105
5
0
27 Sep 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey
Victoria Smith
Ali Shahin Shamsabadi
Carolyn Ashurst
Adrian Weller
PILM
503
41
0
27 Sep 2023
Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Automatic Speech Recognition & Understanding (ASRU), 2023
Chao-Han Huck Yang
Yile Gu
Yi-Chieh Liu
Shalini Ghosh
I. Bulyko
A. Stolcke
KELM
LRM
432
80
0
27 Sep 2023
Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning
Jiadong Wang
Chengyu Wang
Chuanqi Tan
Jun Huang
Ming Gao
KELM
317
8
0
26 Sep 2023
LORD: Low Rank Decomposition Of Monolingual Code LLMs For One-Shot Compression
Ayush Kaushal
Tejas Vaidhya
Irina Rish
363
27
0
25 Sep 2023
Investigating Large Language Models and Control Mechanisms to Improve Text Readability of Biomedical Abstracts
IEEE International Conference on Healthcare Informatics (ICHI), 2023
Z. Li
Samuel Belkadi
Nicolo Micheletti
Lifeng Han
Matthew Shardlow
Goran Nenadic
254
12
0
22 Sep 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
IEEE International Conference on Computer Vision (ICCV), 2023
Kan Wu
Houwen Peng
Zhenghong Zhou
Bin Xiao
Xiyang Dai
...
Xi Chen
Xinggang Wang
Hongyang Chao
Han Hu
VLM
OODD
260
97
0
21 Sep 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
International Conference on Language Resources and Evaluation (LREC), 2023
Deepak Gupta
Kush Attal
Dina Demner-Fushman
LM&MA
161
4
0
21 Sep 2023
BELT: Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision
Jinzhao Zhou
Yiqun Duan
Yu-Cheng Chang
Yu-Kai Wang
Chin-Teng Lin
222
6
0
21 Sep 2023
DimCL: Dimensional Contrastive Learning For Improving Self-Supervised Learning
IEEE Access, 2023
Thanh Nguyen
T. Pham
Chaoning Zhang
Tung M. Luu
Thang Vu
Chang D. Yoo
315
10
0
21 Sep 2023
Long-tail Augmented Graph Contrastive Learning for Recommendation
Qian Zhao
Zhengwei Wu
Qing Cui
Jun Zhou
136
11
0
20 Sep 2023
Heterogeneous Entity Matching with Complex Attribute Associations using BERT and Neural Networks
Shitao Wang
Jiamin Lu
129
1
0
20 Sep 2023
A Family of Pretrained Transformer Language Models for Russian
International Conference on Language Resources and Evaluation (LREC), 2023
Dmitry Zmitrovich
Alexander Abramov
Andrey Kalmykov
Maria Tikhonova
Ekaterina Taktasheva
...
Vitalii Kadulin
Sergey Markov
Tatiana Shavrina
Vladislav Mikhailov
Alena Fenogenova
320
51
0
19 Sep 2023
Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education
Ramteja Sajja
Y. Sermet
Muhammed Cikmaz
David M. Cwiertny
Ibrahim Demir
266
328
0
19 Sep 2023
A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Danushka Bollegala
Shuichi Otake
T. Machide
Ken-ichi Kawarabayashi
364
5
0
19 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
200
25
0
19 Sep 2023
Generative modeling, design and analysis of spider silk protein sequences for enhanced mechanical properties
Advanced Functional Materials (Adv. Funct. Mater.), 2023
Wei Lu
David L. Kaplan
Markus J. Buehler
173
39
0
18 Sep 2023