Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,911 papers shown
Title
One-Step Diffusion Distillation via Deep Equilibrium Models
Zhengyang Geng
Ashwini Pokle
Trevor Killeen
28
28
0
12 Dec 2023
Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models
Hossein Bahak
Farzaneh Taheri
Zahra Zojaji
Arefeh Kazemi
ELM
AI4MH
34
17
0
11 Dec 2023
Why "classic" Transformers are shallow and how to make them go deep
Yueyao Yu
Yin Zhang
ViT
16
0
0
11 Dec 2023
Transformer as Linear Expansion of Learngene
Shiyu Xia
Miaosen Zhang
Xu Yang
Ruiming Chen
Haokun Chen
Xin Geng
38
6
0
09 Dec 2023
Sim-GPT: Text Similarity via GPT Annotated Data
Shuhe Wang
Beiming Cao
Shengyu Zhang
Xiaoya Li
Jiwei Li
Fei Wu
Guoyin Wang
Eduard Hovy
43
2
0
09 Dec 2023
Enhanced E-Commerce Attribute Extraction: Innovating with Decorative Relation Correction and LLAMA 2.0-Based Annotation
Jianghong Zhou
Weizhi Du
Md Omar Faruk Rokon
Zhaodong Wang
Jiaxuan Xu
Isha Shah
Kuang-chih Lee
Musen Wen
14
1
0
09 Dec 2023
Graph Convolutions Enrich the Self-Attention in Transformers!
Jeongwhan Choi
Hyowon Wi
Jayoung Kim
Yehjin Shin
Kookjin Lee
Nathaniel Trask
Noseong Park
25
4
0
07 Dec 2023
RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Jaehyung Kim
Yuning Mao
Rui Hou
Hanchao Yu
Davis Liang
Pascale Fung
Qifan Wang
Fuli Feng
Lifu Huang
Madian Khabsa
AAML
23
2
0
07 Dec 2023
Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification
Navid Mohammadi Foumani
Chang Wei Tan
Geoffrey I. Webb
Hamid Rezatofighi
Mahsa Salehi
SSL
AI4TS
31
5
0
07 Dec 2023
Detecting Rumor Veracity with Only Textual Information by Double-Channel Structure
Alex G. Kim
Sangwon Yoon
12
4
0
06 Dec 2023
Large Language Models on Graphs: A Comprehensive Survey
Bowen Jin
Gang Liu
Chi Han
Meng-Long Jiang
Heng Ji
Jiawei Han
AI4CE
28
137
0
05 Dec 2023
Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Cong-Duy Nguyen
The-Anh Vu-Le
Thong Nguyen
Tho Quan
A. Luu
23
5
0
04 Dec 2023
Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?
Gopichand Kanumolu
Lokesh Madasu
Pavan Baswani
Ananya Mukherjee
Manish Shrivastava
14
1
0
03 Dec 2023
Learning to Compose SuperWeights for Neural Parameter Allocation Search
Piotr Teterwak
Soren Nelson
Nikoli Dryden
D. Bashkirova
Kate Saenko
Bryan A. Plummer
25
1
0
03 Dec 2023
Adaptive Resource Allocation for Semantic Communication Networks
Lingyi Wang
Wei Wu
Fuhui Zhou
Zhaohui Yang
Zhijing Qin
19
17
0
02 Dec 2023
The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language Models
Srinath Namburi
Makesh Narsimhan Sreedhar
Srinath Srinivasan
Frederic Sala
MQ
26
8
0
01 Dec 2023
The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
Tianyu Ding
Tianyi Chen
Haidong Zhu
Jiachen Jiang
Yiqi Zhong
Jinxin Zhou
Guangzhi Wang
Zhihui Zhu
Ilya Zharkov
Luming Liang
27
22
0
01 Dec 2023
Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting
Haotian Gao
Renhe Jiang
Zheng Dong
Jinliang Deng
Yuxin Ma
Xuan Song
AI4TS
46
15
0
01 Dec 2023
SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection
Anku Rani
Dwip Dalal
Shreya Gautam
Pankaj Gupta
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
17
0
0
01 Dec 2023
Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language Models for Violence Inciting Text Detection
Saurabh Page
Sudeep Mangalvedhekar
Kshitij Deshpande
Tanmay Chavan
S. Sonawane
18
1
0
30 Nov 2023
DisCGen: A Framework for Discourse-Informed Counterspeech Generation
Sabit Hassan
Malihe Alikhani
38
13
0
29 Nov 2023
RACE-IT: A Reconfigurable Analog CAM-Crossbar Engine for In-Memory Transformer Acceleration
Lei Zhao
Luca Buonanno
Ron M. Roth
Sergey Serebryakov
Archit Gajjar
John Moon
Jim Ignowski
Giacomo Pedretti
28
3
0
29 Nov 2023
TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4
Zihao Tan
Qingliang Chen
Yongjian Huang
Chen Liang
SILM
AAML
34
3
0
29 Nov 2023
LayerCollapse: Adaptive compression of neural networks
Soheil Zibakhsh Shabgahi
Mohammad Soheil Shariff
F. Koushanfar
AI4CE
18
1
0
29 Nov 2023
A Survey on Prompting Techniques in LLMs
Prabin Bhandari
24
7
0
28 Nov 2023
Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained Sentiment Analysis
Dan Ma
Jun Xu
Zongyu Wang
Xuezhi Cao
Yunsen Xian
11
0
0
28 Nov 2023
Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions
Xinhong Chen
Zongxi Li
Yaowei Wang
Haoran Xie
Jianping Wang
Qing Li
16
0
0
28 Nov 2023
Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes
Tuan-Dung Le
Zhuqi Miao
Samuel Alvarado
Brittany Smith
William Paiva
Thanh Thieu
18
1
0
27 Nov 2023
C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing
Avigyan Bhattacharya
Mainak Singha
Ankit Jha
Biplab Banerjee
SSL
VLM
19
6
0
27 Nov 2023
A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling
Shashidhar Reddy Javaji
Haoran Hu
Sai Sameer Vennam
Vijaya Gajanan Buddhavarapu
11
0
0
27 Nov 2023
Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation
Haoyi Wu
Kewei Tu
132
3
0
26 Nov 2023
General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level
Bingkang Shi
Xiaodan Zhang
Dehan Kong
Yulei Wu
Zongzhen Liu
Honglei Lyu
Longtao Huang
AI4CE
25
2
0
23 Nov 2023
A Multi-solution Study on GDPR AI-enabled Completeness Checking of DPAs
Muhammad Ilyas Azeem
Sallam Abualhaija
40
5
0
23 Nov 2023
Transformer-based Named Entity Recognition in Construction Supply Chain Risk Management in Australia
Milad Baghalzadeh Shishehgarkhaneh
R. Moehler
Yihai Fang
Amer A. Hijazi
Hamed Aboutorab
28
6
0
23 Nov 2023
Efficient Transformer Knowledge Distillation: A Performance Review
Nathan Brown
Ashton Williamson
Tahj Anderson
Logan Lawrence
VLM
19
5
0
22 Nov 2023
Looped Transformers are Better at Learning Learning Algorithms
Liu Yang
Kangwook Lee
Robert D. Nowak
Dimitris Papailiopoulos
24
24
0
21 Nov 2023
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAG
KELM
31
54
0
21 Nov 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
30
4
0
21 Nov 2023
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
16
3
0
19 Nov 2023
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
Lihan Zha
Yuchen Cui
Li-Heng Lin
Minae Kwon
Montse Gonzalez Arenas
Andy Zeng
Fei Xia
Dorsa Sadigh
35
36
0
17 Nov 2023
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
25
11
0
16 Nov 2023
Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach
Pritom Saha Akash
Kashob Kumar Roy
Lucian Popa
Kevin Chen-Chuan Chang
16
3
0
15 Nov 2023
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Ziyang Chen
Dongfang Li
Xiang Zhao
Baotian Hu
Min Zhang
LRM
22
15
0
15 Nov 2023
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
Yihong Liu
Peiqin Lin
Mingyang Wang
Hinrich Schütze
24
21
0
15 Nov 2023
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games
Kokil Jaidka
Hansin Ahuja
Lynnette Ng
43
7
0
15 Nov 2023
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
Urchade Zaratiana
Nadi Tomeh
Pierre Holat
Thierry Charnois
29
31
0
14 Nov 2023
AI-generated text boundary detection with RoFT
Laida Kushnareva
T. Gaintseva
German Magai
S. Barannikov
Dmitry Abulkhanov
Kristian Kuznetsov
Eduard Tulchinskii
Irina Piontkovskaya
Sergey I. Nikolenko
DeLMO
21
4
0
14 Nov 2023
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models
.Ilker Kesen
Andrea Pedrotti
Mustafa Dogan
Michele Cafagna
Emre Can Acikgoz
...
Iacer Calixto
Anette Frank
Albert Gatt
Aykut Erdem
Erkut Erdem
33
15
0
13 Nov 2023
Training A Multi-stage Deep Classifier with Feedback Signals
Chao Xu
Yu Yang
Rong Wang
Guan Wang
Bojia Lin
11
0
0
12 Nov 2023
Tunable Soft Prompts are Messengers in Federated Learning
Chenhe Dong
Yuexiang Xie
Bolin Ding
Ying Shen
Yaliang Li
FedML
38
7
0
12 Nov 2023
Previous
1
2
3
...
10
11
12
...
57
58
59
Next