1909.11942
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,050 papers shown
Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?
Gopichand Kanumolu
Lokesh Madasu
Pavan Baswani
Ananya Mukherjee
Manish Shrivastava
172
2
0
03 Dec 2023
Learning to Compose SuperWeights for Neural Parameter Allocation Search
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Piotr Teterwak
Soren Nelson
Nikoli Dryden
D. Bashkirova
Kate Saenko
Bryan A. Plummer
287
3
0
03 Dec 2023
Adaptive Resource Allocation for Semantic Communication Networks
IEEE Transactions on Communications (IEEE Trans. Commun.), 2023
Lingyi Wang
Wei Wu
Fuhui Zhou
Zhaohui Yang
Zhijing Qin
353
64
0
02 Dec 2023
The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Srinath Namburi
Makesh Narsimhan Sreedhar
Srinath Srinivasan
Frederic Sala
MQ
224
12
0
01 Dec 2023
The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
Tianyu Ding
Tianyi Chen
Haidong Zhu
Jiachen Jiang
Yiqi Zhong
Jinxin Zhou
Guangzhi Wang
Zhihui Zhu
Ilya Zharkov
Luming Liang
410
33
0
01 Dec 2023
Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Haotian Gao
Renhe Jiang
Zheng Dong
Jinliang Deng
Yuxin Ma
Xuan Song
AI4TS
429
55
0
01 Dec 2023
SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection
Anku Rani
Dwip Dalal
Shreya Gautam
Pankaj Gupta
Vinija Jain
Vasu Sharma
Amit P. Sheth
Amitava Das
224
3
0
01 Dec 2023
Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language Models for Violence Inciting Text Detection
Saurabh Page
Sudeep Mangalvedhekar
Kshitij Deshpande
Tanmay Chavan
S. Sonawane
123
2
0
30 Nov 2023
DisCGen: A Framework for Discourse-Informed Counterspeech Generation
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Sabit Hassan
Malihe Alikhani
251
18
0
29 Nov 2023
TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4
Natural Language Processing and Chinese Computing (NLPCC), 2023
Zihao Tan
Qingliang Chen
Yongjian Huang
Chen Liang
SILM
AAML
253
5
0
29 Nov 2023
LayerCollapse: Adaptive compression of neural networks
Soheil Zibakhsh Shabgahi
Mohammad Soheil Shariff
F. Koushanfar
AI4CE
225
1
0
29 Nov 2023
RACE-IT: A Reconfigurable Analog Computing Engine for In-Memory Transformer Acceleration
Lei Zhao
Aishwarya Natarajan
Luca Buonanno
Archit Gajjar
Ron M. Roth
Sergey Serebryakov
John Moon
Jim Ignowski
Giacomo Pedretti
312
5
0
29 Nov 2023
A Survey on Prompting Techniques in LLMs
Prabin Bhandari
192
13
0
28 Nov 2023
Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained Sentiment Analysis
Dan Ma
Jun Xu
Zongyu Wang
Xuezhi Cao
Yunsen Xian
162
0
0
28 Nov 2023
Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions
Xinhong Chen
Zongxi Li
Yaowei Wang
Haoran Xie
Jianping Wang
Qing Li
121
0
0
28 Nov 2023
Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes
Tuan-Dung Le
Zhuqi Miao
Samuel Alvarado
Brittany Smith
William Paiva
Thanh Thieu
159
1
0
27 Nov 2023
C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing
Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023
Avigyan Bhattacharya
Mainak Singha
Ankit Jha
Biplab Banerjee
SSL
VLM
196
10
0
27 Nov 2023
A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling
Shashidhar Reddy Javaji
Haoran Hu
Sai Sameer Vennam
Vijaya Gajanan Buddhavarapu
110
0
0
27 Nov 2023
Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haoyi Wu
Kewei Tu
839
4
0
26 Nov 2023
General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Bingkang Shi
Xiaodan Zhang
Dehan Kong
Yulei Wu
Zongzhen Liu
Honglei Lyu
Longtao Huang
AI4CE
317
4
0
23 Nov 2023
A Multi-solution Study on GDPR AI-enabled Completeness Checking of DPAs
Empirical Software Engineering (EMSE), 2023
Muhammad Ilyas Azeem
Sallam Abualhaija
192
16
0
23 Nov 2023
Transformer-based Named Entity Recognition in Construction Supply Chain Risk Management in Australia
IEEE Access (IEEE Access), 2023
Milad Baghalzadeh Shishehgarkhaneh
R. Moehler
Yihai Fang
Amer A. Hijazi
Hamed Aboutorab
267
31
0
23 Nov 2023
Efficient Transformer Knowledge Distillation: A Performance Review
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nathan Brown
Ashton Williamson
Tahj Anderson
Logan Lawrence
VLM
146
9
0
22 Nov 2023
Looped Transformers are Better at Learning Learning Algorithms
International Conference on Learning Representations (ICLR), 2023
Liu Yang
Kangwook Lee
Robert D. Nowak
Dimitris Papailiopoulos
460
55
0
21 Nov 2023
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAG
KELM
383
102
0
21 Nov 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
286
6
0
21 Nov 2023
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
154
5
0
19 Nov 2023
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
Lihan Zha
Yuchen Cui
Li-Heng Lin
Minae Kwon
Montse Gonzalez Arenas
Andy Zeng
Fei Xia
Dorsa Sadigh
337
65
0
17 Nov 2023
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
173
15
0
16 Nov 2023
Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach
Cheng Wang
Kashob Kumar Roy
Yatin Nandwani
Kevin Chen-Chuan Chang
207
3
0
15 Nov 2023
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ziyang Chen
Dongfang Li
Xiang Zhao
Baotian Hu
Min Zhang
LRM
302
30
0
15 Nov 2023
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
Yihong Liu
Peiqin Lin
Mingyang Wang
Hinrich Schütze
233
36
0
15 Nov 2023
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games
Kokil Jaidka
Hansin Ahuja
Lynnette Ng
297
13
0
15 Nov 2023
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Urchade Zaratiana
Nadi Tomeh
Pierre Holat
Thierry Charnois
160
98
0
14 Nov 2023
AI-generated text boundary detection with RoFT
Laida Kushnareva
T. Gaintseva
German Magai
S. Barannikov
Dmitry Abulkhanov
Kristian Kuznetsov
Eduard Tulchinskii
Irina Piontkovskaya
Sergey I. Nikolenko
DeLMO
296
17
0
14 Nov 2023
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models
International Conference on Learning Representations (ICLR), 2023
İlker Kesen
Andrea Pedrotti
Mustafa Dogan
Michele Cafagna
Emre Can Acikgoz
...
Iacer Calixto
Anette Frank
Albert Gatt
Aykut Erdem
Erkut Erdem
276
21
0
13 Nov 2023
Training A Multi-stage Deep Classifier with Feedback Signals
Chao Xu
Yu Yang
Rong Wang
Guan Wang
Bojia Lin
141
0
0
12 Nov 2023
Tunable Soft Prompts are Messengers in Federated Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chenhe Dong
Yuexiang Xie
Bolin Ding
Ying Shen
Yaliang Li
FedML
196
10
0
12 Nov 2023
Early-Exit Neural Networks with Nested Prediction Sets
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Metod Jazbec
Patrick Forré
Stephan Mandt
Dan Zhang
Eric T. Nalisnick
UQCV
195
2
0
10 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Findings (Findings), 2023
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan Oseledets
Denis Dimitrov
Andrey Kuznetsov
323
22
0
10 Nov 2023
Hallucination-minimized Data-to-answer Framework for Financial Decision-makers
BigData Congress [Services Society] (BSS), 2023
Sohini Roychowdhury
Andres Alvarez
Brian Moore
Marko Krema
Maria Paz Gelpi
...
Angel Rodriguez
Jose Ramon Cabrejas
Pablo Martinez Serrano
Punit Agrawal
Arijit Mukherjee
173
13
0
09 Nov 2023
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge
Hongjian Zhou
Fenglin Liu
Boyang Gu
Xinyu Zou
Jinfa Huang
...
Yefeng Zheng
Lei A. Clifton
Zheng Li
Fenglin Liu
David Clifton
LM&MA
736
191
0
09 Nov 2023
Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform
Daniele Giofré
Sneha Ghantasala
AILaw
150
0
0
09 Nov 2023
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining
Martin Kuo
Jianyi Zhang
Yiran Chen
89
2
0
08 Nov 2023
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
Yiyuan Li
Rakesh R Menon
Sayan Ghosh
Shashank Srivastava
LRM
190
2
0
08 Nov 2023
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding
Kehinde E. Ajayi
Xin Wei
Martin Gryder
Winston Shields
Jian Wu
Shawn M. Jones
Michal Kucer
Diane Oyen
3DV
160
5
0
07 Nov 2023
mahaNLP: A Marathi Natural Language Processing Library
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Vidula Magdum
Omkar Dhekane
Sharayu Hiwarkhedkar
Saloni Mittal
Raviraj Joshi
248
5
0
05 Nov 2023
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Leilei Gan
Jiwei Li
Tianwei Zhang
Guoyin Wang
200
30
0
03 Nov 2023
TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine
Computer Methods and Programs in Biomedicine Update (CMPB), 2023
Guoxing Yang
Jianyu Shi
Zan Wang
Xiaohong Liu
Guangyu Wang
90
38
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
157
1
0
03 Nov 2023