Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,048 papers shown
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection
Joymallya Chakraborty
Wei Xia
Anirban Majumder
Dan Ma
Walid Chaabene
Naveed Janvekar
144
8
0
09 Sep 2024
Application Specific Compression of Deep Learning Models
Rohit Raj Rai
Angana Borah
Amit Awekar
182
0
0
09 Sep 2024
PriorDrive: Enhancing Online HD Mapping with Unified Vector Priors
Shuang Zeng
Xinyuan Chang
Xinran Liu
Yujian Yuan
Shiyi Liang
Zheng Pan
Mu Xu
Xing Wei
390
17
0
09 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
227
1
0
08 Sep 2024
Achieving Peak Performance for Large Language Models: A Systematic Review
IEEE Access (IEEE Access), 2024
Z. R. K. Rostam
Sándor Szénási
Gábor Kertész
321
18
0
07 Sep 2024
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhuowei Chen
Lianxi Wang
Yuben Wu
Xinfeng Liao
Yujia Tian
Junyang Zhong
DiffM
359
7
0
05 Sep 2024
Pre-Trained Language Models for Keyphrase Prediction: A Review
ICT express (IE), 2024
Muhammad Umair
Tangina Sultana
Young-Koo Lee
313
8
0
02 Sep 2024
From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education
Unggi Lee
Jiyeong Bae
Yeonji Jung
Minji Kang
Gyuri Byun
...
Sookbun Lee
Jaekwon Park
Taekyung Ahn
Gunho Lee
Hyeoncheol Kim
AI4Ed
KELM
252
2
0
31 Aug 2024
Speaker Tagging Correction With Non-Autoregressive Language Models
Grigor Kirakosyan
Davit Karamyan
3DV
239
1
0
30 Aug 2024
Is Personality Prediction Possible Based on Reddit Comments?
Robert Deimann
Till Preidt
Shaptarshi Roy
Jan Stanicki
148
1
0
28 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
385
6
0
27 Aug 2024
Shifted Window Fourier Transform And Retention For Image Captioning
International Conference on Neural Information Processing (ICONIP), 2024
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
VLM
312
2
0
25 Aug 2024
Genetic Approach to Mitigate Hallucination in Generative IR
Hrishikesh Kulkarni
Nazli Goharian
O. Frieder
Sean MacAvaney
HILM
150
4
0
25 Aug 2024
Domain-specific long text classification from sparse relevant information
European Conference on Artificial Intelligence (ECAI), 2024
Célia DĆruz
J. Bereder
Frédéric Precioso
Michel Riveill
202
2
0
23 Aug 2024
Instruct-DeBERTa: A Hybrid Approach for Aspect-based Sentiment Analysis on Textual Reviews
Dineth Jayakody
A. V. A. Malkith
Koshila Isuranda
Vishal Thenuwara
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
175
8
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
278
3
0
23 Aug 2024
MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Mohamed Elgaar
Jiali Cheng
Nidhi Vakil
Hadi Amiri
Leo Anthony Celi
218
2
0
23 Aug 2024
Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering
Haowei Du
Dongyan Zhao
KELM
182
0
0
23 Aug 2024
Large Language Models are Good Attackers: Efficient and Stealthy Textual Backdoor Attacks
Wandi Qiao
Yueqi Zeng
Pengfei Xia
Lei Liu
Zhangjie Fu
Bin Li
SILM
AAML
279
4
0
21 Aug 2024
BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Yuxuan Chen
Haoyan Yang
Hengkai Pan
Fardeen Siddiqui
Antonio Verdone
Qingyang Zhang
S. Chopra
Chen Zhao
Yiqiu Shen
113
4
0
21 Aug 2024
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
European Conference on Artificial Intelligence (ECAI), 2024
Yuan Xin
Hui Yuan
Ning Yu
Dingfan Chen
Mario Fritz
Michael Backes
Yang Zhang
PILM
MIACV
343
2
0
20 Aug 2024
Uniting contrastive and generative learning for event sequences models
International Joint Conference on the Analysis of Images, Social Networks and Texts (AISNT), 2024
Aleksandr Yugay
Alexey Zaytsev
AI4TS
213
2
0
19 Aug 2024
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models
Lionel Z. Wang
Yiming Ma
Renfei Gao
Beichen Guo
Han Zhu
Wenqi Fan
Zexin Lu
Ka Chung Ng
SyDa
245
10
0
19 Aug 2024
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
194
0
0
09 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Computational Linguistics (CL), 2024
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
232
1
0
09 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
225
6
0
08 Aug 2024
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
ACM Multimedia (MM), 2024
Xiaofeng Mao
Zhengkai Jiang
Qilin Wang
Chencan Fu
Jiangning Zhang
Jiafu Wu
Yabiao Wang
Chengjie Wang
Wei Li
Mingmin Chi
336
10
0
06 Aug 2024
Dopamin: Transformer-based Comment Classifiers through Domain Post-Training and Multi-level Layer Aggregation
Nam Le Hai
Nghi D. Q. Bui
249
6
0
06 Aug 2024
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection
Sajal Aggarwal
Ananya Pandey
Dinesh Kumar Vishwakarma
193
4
0
05 Aug 2024
Large Language Model Aided QoS Prediction for Service Recommendation
Huiying Liu
Zekun Zhang
Honghao Li
Qilin Wu
Yiwen Zhang
215
4
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
300
0
0
04 Aug 2024
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Peng Wang
Xiaobin Wang
Chao Lou
Shengyu Mao
Pengjun Xie
Yong Jiang
254
7
0
04 Aug 2024
Cross-layer Attention Sharing for Pre-trained Large Language Models
Yongyu Mu
Yuzhang Wu
Yuchun Fan
Chenglong Wang
Hengyu Li
...
Murun Yang
Fandong Meng
Jie Zhou
Tong Xiao
Jingbo Zhu
285
6
0
04 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
464
16
0
02 Aug 2024
Pathway to Secure and Trustworthy ZSM for LLMs: Attacks, Defense, and Opportunities
Yangzhen Wu
P. Khuwaja
Kapal Dev
H. A. Hamadi
Yiming Yang
324
0
0
01 Aug 2024
Big Cooperative Learning
Yulai Cong
AI4CE
197
0
0
31 Jul 2024
A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy
Meng Jiang
Qing Zhao
Jianqiang Li
Fan Wang
Tianyu He
Xinyan Cheng
Bing Xiang Yang
Grace W.K. Ho
Guanghui Fu
221
19
0
28 Jul 2024
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification
Vivi Nastase
Paola Merlo
193
6
0
25 Jul 2024
Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo
E. Hauptmann
AIFin
213
12
0
25 Jul 2024
Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning
Hongwei Jin
George Papadimitriou
Krishnan Raghavan
Pawel Zuk
Dali Wang
Cong Wang
A. Mandal
Ewa Deelman
172
8
0
24 Jul 2024
Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs
Huan-jing Zhao
Beining Yang
Yukuo Cen
Junyu Ren
Chenhui Zhang
Yuxiao Dong
Evgeny Kharlamov
Shu Zhao
Jie Tang
VLM
221
14
0
22 Jul 2024
Token-Picker: Accelerating Attention in Text Generation with Minimized Memory Transfer via Probability Estimation
Junyoung Park
Myeonggu Kang
Yunki Han
Yang-Gon Kim
Jaekang Shin
Lee-Sup Kim
139
7
0
21 Jul 2024
Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Haiquan Lu
Xiaotian Liu
Yefan Zhou
Qunli Li
Kurt Keutzer
Michael W. Mahoney
Yujun Yan
Huanrui Yang
Yaoqing Yang
201
2
0
17 Jul 2024
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks
Salma Afifi
Ishan G. Thakkar
S. Pasricha
GNN
190
2
0
17 Jul 2024
Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot Learning
Mustafa Dogan
.Ilker Kesen
Iacer Calixto
Aykut Erdem
Erkut Erdem
LRM
254
2
0
17 Jul 2024
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations
Seyedeh Fatemeh Ebrahimi
Karim Akhavan Azari
Amirmasoud Iravani
Hadi Alizadeh
Zeinab Taghavi
Hossein Sameti
185
4
0
17 Jul 2024
InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification
Yujia Hu
Zhiqiang Hu
C. Seah
Roy Ka-wei Lee
174
2
0
16 Jul 2024
TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction
Xingzhi Zhou
Xin Dong
Chunhao Li
Yuning Bai
Yulong Xu
...
Simon See
Xinpeng Song
Runshun Zhang
Xuezhong Zhou
Nevin L. Zhang
LM&MA
184
11
0
15 Jul 2024
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
Zhuocheng Gong
Ang Lv
Jian Guan
Junxi Yan
Wei Wu
Huishuai Zhang
Minlie Huang
Dongyan Zhao
Rui Yan
MoE
174
8
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
544
157
0
09 Jul 2024
Previous
1
2
3
...
6
7
8
...
59
60
61
Next
Page 7 of 61
Page
of 61
Go