ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSL
    AIMat
ArXivPDFHTML

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,911 papers shown
Title
Pathway to Secure and Trustworthy ZSM for LLMs: Attacks, Defense, and Opportunities
Pathway to Secure and Trustworthy ZSM for LLMs: Attacks, Defense, and Opportunities
Yangzhen Wu
P. Khuwaja
K. Dev
H. A. Hamadi
Yiming Yang
33
0
0
01 Aug 2024
Big Cooperative Learning
Big Cooperative Learning
Yulai Cong
AI4CE
36
0
0
31 Jul 2024
A Generic Review of Integrating Artificial Intelligence in Cognitive
  Behavioral Therapy
A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy
Meng Jiang
Qing Zhao
Jianqiang Li
Fan Wang
Tianyu He
Xinyan Cheng
Bing Xiang Yang
Grace W.K. Ho
Guanghui Fu
31
6
0
28 Jul 2024
Tracking linguistic information in transformer-based sentence embeddings
  through targeted sparsification
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification
Vivi Nastase
Paola Merlo
28
2
0
25 Jul 2024
Fine-Tuning Large Language Models for Stock Return Prediction Using
  Newsflow
Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo
E. Hauptmann
AIFin
39
3
0
25 Jul 2024
Large Language Models for Anomaly Detection in Computational Workflows:
  from Supervised Fine-Tuning to In-Context Learning
Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning
Hongwei Jin
George Papadimitriou
Krishnan Raghavan
Pawel Zuk
Prasanna Balaprakash
Cong Wang
A. Mandal
Ewa Deelman
33
1
0
24 Jul 2024
Pre-Training and Prompting for Few-Shot Node Classification on
  Text-Attributed Graphs
Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs
Huan-jing Zhao
Beining Yang
Yukuo Cen
Junyu Ren
Chenhui Zhang
Yuxiao Dong
Evgeny Kharlamov
Shu Zhao
Jie Tang
VLM
49
7
0
22 Jul 2024
Token-Picker: Accelerating Attention in Text Generation with Minimized
  Memory Transfer via Probability Estimation
Token-Picker: Accelerating Attention in Text Generation with Minimized Memory Transfer via Probability Estimation
Junyoung Park
Myeonggu Kang
Yunki Han
Yang-Gon Kim
Jaekang Shin
Lee-Sup Kim
17
0
0
21 Jul 2024
Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Haiquan Lu
Xiaotian Liu
Yefan Zhou
Qunli Li
Kurt Keutzer
Michael W. Mahoney
Yujun Yan
Huanrui Yang
Yaoqing Yang
28
1
0
17 Jul 2024
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer
  Neural Networks
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks
Salma Afifi
Ishan G. Thakkar
S. Pasricha
GNN
25
0
0
17 Jul 2024
Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of
  Few-Shot Learning
Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot Learning
Mustafa Dogan
.Ilker Kesen
Iacer Calixto
Aykut Erdem
Erkut Erdem
LRM
29
1
0
17 Jul 2024
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for
  Fine-Grained Scoring of Textual Semantic Relations
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations
Seyedeh Fatemeh Ebrahimi
Karim Akhavan Azari
Amirmasoud Iravani
Hadi Alizadeh
Zeinab Taghavi
Hossein Sameti
20
4
0
17 Jul 2024
InstructAV: Instruction Fine-tuning Large Language Models for Authorship
  Verification
InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification
Yujia Hu
Zhiqiang Hu
C. Seah
Roy Ka-wei Lee
32
0
0
16 Jul 2024
TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription
  Prediction
TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction
Xingzhi Zhou
Xin Dong
Chunhao Li
Yuning Bai
Yulong Xu
...
Simon See
Xinpeng Song
Runshun Zhang
Xuezhong Zhou
Nevin L. Zhang
LM&MA
27
3
0
15 Jul 2024
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of
  Modules
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
Zhuocheng Gong
Ang Lv
Jian-Yu Guan
Junxi Yan
Wei Yu Wu
Huishuai Zhang
Minlie Huang
Dongyan Zhao
Rui Yan
MoE
50
6
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
40
43
0
09 Jul 2024
Noise-Free Explanation for Driving Action Prediction
Noise-Free Explanation for Driving Action Prediction
Hongbo Zhu
Theodor Wulff
R. S. Maharjan
Jinpei Han
Angelo Cangelosi
AAML
FAtt
22
0
0
08 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
34
12
0
06 Jul 2024
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM
  Compression
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression
Zhichao Xu
Ashim Gupta
Tao Li
Oliver Bentham
Vivek Srikumar
40
8
0
06 Jul 2024
Not (yet) the whole story: Evaluating Visual Storytelling Requires More
  than Measuring Coherence, Grounding, and Repetition
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition
Aditya K Surikuchi
Raquel Fernández
Sandro Pezzelle
18
3
0
05 Jul 2024
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation
  Learning
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning
Saeed Shurrab
Alejandro Guerra-Manzanares
Farah E. Shamout
18
1
0
05 Jul 2024
ESQA: Event Sequences Question Answering
ESQA: Event Sequences Question Answering
Irina Abdullaeva
Andrei Filatov
Mikhail Orlov
Ivan Karpukhin
Viacheslav Vasilev
Denis Dimitrov
Andrey Kuznetsov
Ivan A Kireev
Andrey Savchenko
44
0
0
03 Jul 2024
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Dineth Jayakody
Koshila Isuranda
A. V. A. Malkith
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
24
0
0
03 Jul 2024
Increasing Model Capacity for Free: A Simple Strategy for Parameter
  Efficient Fine-tuning
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
Haobo Song
Hao Zhao
Soumajit Majumder
Tao Lin
23
3
0
01 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between
  Autoregressive and Masked Pretraining
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
34
3
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
60
3
0
01 Jul 2024
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech
  Synthesis
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis
Yinlin Guo
Yening Lv
Jinqiao Dou
Yan Zhang
Yuehai Wang
18
0
0
30 Jun 2024
"I understand why I got this grade": Automatic Short Answer Grading with
  Feedback
"I understand why I got this grade": Automatic Short Answer Grading with Feedback
Dishank Aggarwal
Pushpak Bhattacharyya
Bhaskaran Raman
18
3
0
30 Jun 2024
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
Farnaz Zeidi
Mehmet Fatih Amasyali
Çiğdem Erol
VLM
28
1
0
30 Jun 2024
BioMNER: A Dataset for Biomedical Method Entity Recognition
BioMNER: A Dataset for Biomedical Method Entity Recognition
Chen Tang
Bohao Yang
Kun Zhao
Bo Lv
Chenghao Xiao
Frank Guerin
Chenghua Lin
37
0
0
28 Jun 2024
Protein Representation Learning with Sequence Information Embedding:
  Does it Always Lead to a Better Performance?
Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?
Y. Tan
Lirong Zheng
Bozitao Zhong
Liang Hong
Bingxin Zhou
35
4
0
28 Jun 2024
When Search Engine Services meet Large Language Models: Visions and
  Challenges
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Mengnan Du
Shuaiqiang Wang
Dawei Yin
Sumi Helal
47
28
0
28 Jun 2024
Fibottention: Inceptive Visual Representation Learning with Diverse
  Attention Across Heads
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
38
1
0
27 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to
  Cutting-Edge Reasoning
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML
LRM
37
4
0
27 Jun 2024
Clustering in pure-attention hardmax transformers and its role in
  sentiment analysis
Clustering in pure-attention hardmax transformers and its role in sentiment analysis
Albert Alcalde
Giovanni Fantuzzi
Enrique Zuazua
27
3
0
26 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in
  Transformers
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
36
1
0
26 Jun 2024
ViANLI: Adversarial Natural Language Inference for Vietnamese
ViANLI: Adversarial Natural Language Inference for Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
22
0
0
25 Jun 2024
Are there identifiable structural parts in the sentence embedding whole?
Are there identifiable structural parts in the sentence embedding whole?
Vivi Nastase
Paola Merlo
32
3
0
24 Jun 2024
Large Vocabulary Size Improves Large Language Models
Large Vocabulary Size Improves Large Language Models
Sho Takase
Ryokan Ri
Shun Kiyono
Takuya Kato
37
3
0
24 Jun 2024
Evaluating the Effectiveness of the Foundational Models for Q&A
  Classification in Mental Health care
Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care
Hassan Alhuzali
Ashwag Alasmari
AI4MH
26
2
0
23 Jun 2024
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Lorenzo Basile
Santiago Acevedo
Luca Bortolussi
Fabio Anselmi
Alex Rodriguez
34
4
0
22 Jun 2024
Brain-Like Language Processing via a Shallow Untrained Multihead
  Attention Network
Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network
Badr AlKhamissi
Greta Tuckute
Antoine Bosselut
Martin Schrimpf
71
6
0
21 Jun 2024
Text Serialization and Their Relationship with the Conventional
  Paradigms of Tabular Machine Learning
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning
Kyoka Ono
Simon A. Lee
LMTD
22
7
0
19 Jun 2024
Fighting Randomness with Randomness: Mitigating Optimisation Instability
  of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher
Ján Cegin
Róbert Belanec
Jakub Simko
Ivan Srba
M. Bieliková
37
1
0
18 Jun 2024
QueerBench: Quantifying Discrimination in Language Models Toward Queer
  Identities
QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities
Mae Sosto
Alberto Barrón-Cedeño
32
3
0
18 Jun 2024
TroL: Traversal of Layers for Large Language and Vision Models
TroL: Traversal of Layers for Large Language and Vision Models
Byung-Kwan Lee
Sangyun Chung
Chae Won Kim
Beomchan Park
Yong Man Ro
24
6
0
18 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
62
6
0
18 Jun 2024
A Systematic Survey of Text Summarization: From Statistical Methods to
  Large Language Models
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models
Haopeng Zhang
Philip S. Yu
Jiawei Zhang
35
14
0
17 Jun 2024
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive
  Declarative Grammars
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars
Damien Sileo
LRM
ReLM
23
2
0
16 Jun 2024
Improving Large Models with Small models: Lower Costs and Better
  Performance
Improving Large Models with Small models: Lower Costs and Better Performance
Dong Chen
Shuo Zhang
Yueting Zhuang
Siliang Tang
Qidong Liu
Hua Wang
Mingliang Xu
37
4
0
15 Jun 2024
Previous
123456...575859
Next