arXiv:1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,911 papers shown
Socially Aware Synthetic Data Generation for Suicidal Ideation Detection Using Large Language Models
Hamideh Ghanadian
I. Nejadgholi
Hussein Al Osman
SyDa
32
18
0
25 Jan 2024
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Md. Tanvir Rouf Shawon
G. M. Shahariar
20
1
0
25 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
28
14
0
25 Jan 2024
SEDAC: A CVAE-Based Data Augmentation Method for Security Bug Report Identification
Y. Liao
T. Zhang
11
0
0
22 Jan 2024
Freely Long-Thinking Transformer (FraiLT)
Akbay Tabak
15
0
0
21 Jan 2024
Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models
Yang Liu
23
2
0
21 Jan 2024
Instructional Fingerprinting of Large Language Models
Jiashu Xu
Fei Wang
Mingyu Derek Ma
Pang Wei Koh
Chaowei Xiao
Muhao Chen
WaLM
22
29
0
21 Jan 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Aly M. Kassem
Sherif Saad
AAML
23
1
0
21 Jan 2024
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection
Atanu Mandal
Gargi Roy
Amit Barman
Indranil Dutta
S. Naskar
15
3
0
19 Jan 2024
Learning High-Quality and General-Purpose Phrase Representations
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
30
3
0
18 Jan 2024
Preparing Lessons for Progressive Training on Language Models
Yu Pan
Ye Yuan
Yichun Yin
Jiaxin Shi
Zenglin Xu
Ming Zhang
Lifeng Shang
Xin Jiang
Qun Liu
16
9
0
17 Jan 2024
Fixed Point Diffusion Models
Xingjian Bai
Luke Melas-Kyriazi
18
3
0
16 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Aman Chadha
Amitava Das
35
27
0
15 Jan 2024
Milestones in Bengali Sentiment Analysis leveraging Transformer-models: Fundamentals, Challenges and Future Directions
Saptarshi Sengupta
Shreya Ghosh
Prasenjit Mitra
Tarikul Islam Tamiti
35
0
0
15 Jan 2024
Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering
Qing Li
Lei Li
Yu Li
LM&MA
AI4MH
33
6
0
15 Jan 2024
Harnessing Large Language Models Over Transformer Models for Detecting Bengali Depressive Social Media Text: A Comprehensive Study
Ahmadul Karim Chowdhury
Saidur Rahman Sujon
Md. Shirajus Salekin Shafi
Tasin Ahmmad
Sifat Ahmed
Khan Md. Hasib
Faisal Muhammad Shah
AI4MH
39
9
0
14 Jan 2024
Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection
Muhammad Tayyab Zamir
Muhammad Asif Ayub
Asma Gul
Nasir Ahmad
Kashif Ahmad
13
2
0
12 Jan 2024
Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text
Muskan Garg
Msvpj Sathvik
Amrit Chadha
Shaina Raza
Sunghwan Sohn
AI4MH
24
1
0
12 Jan 2024
LLMRS: Unlocking Potentials of LLM-Based Recommender Systems for Software Purchase
Angela John
Theophilus Aidoo
Hamayoon Behmanush
Irem B. Gunduz
Hewan Shrestha
M. R. Rahman
Wolfgang Maass
41
2
0
12 Jan 2024
Multi-Task Learning for Front-End Text Processing in TTS
Wonjune Kang
Yun Wang
Shun Zhang
Arthur Hinsvark
Qing He
6
2
0
12 Jan 2024
Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models
K. M. S. Islam
A. S. Nipu
Praveen Madiraju
Priya Deshpande
LM&MA
29
7
0
11 Jan 2024
Phishing Website Detection through Multi-Model Analysis of HTML Content
Furkan Çolhak
Mert İlhan Ecevit
Bilal Emir Uçar
Reiner Creutzburg
Hasan Dag
18
7
0
09 Jan 2024
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
20
5
0
09 Jan 2024
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Maciej Pióro
Kamil Ciebiera
Krystian Król
Jan Ludziejewski
Michał Krutul
Jakub Krajewski
Szymon Antoniak
Piotr Miłoś
Marek Cygan
Sebastian Jaszczur
MoE
Mamba
20
54
0
08 Jan 2024
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
16
1
0
08 Jan 2024
An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks
Chen Yang
Peng Liang
Zinan Ma
22
0
0
08 Jan 2024
Building Efficient and Effective OpenQA Systems for Low-Resource Languages
Emrah Budur
Riza Özçelik
Dilara Soylu
Omar Khattab
Tunga Güngör
Christopher Potts
30
1
0
07 Jan 2024
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition
Zheng Lian
Licai Sun
Yong Ren
Hao Gu
Haiyang Sun
Lan Chen
Bin Liu
Jianhua Tao
15
12
0
07 Jan 2024
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
26
0
0
06 Jan 2024
SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations
Furkan Çolhak
Mert İlhan Ecevit
Hasan Dag
Reiner Creutzburg
11
0
0
06 Jan 2024
Lotto: Secure Participant Selection against Adversarial Servers in Federated Learning
Zhifeng Jiang
Peng Ye
Shiqi He
Wei Wang
Ruichuan Chen
Bo Li
23
2
0
05 Jan 2024
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
Haoyuan Wu
Haisheng Zheng
Zhuolun He
Bei Yu
MoE
ALM
27
14
0
05 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
19
65
0
04 Jan 2024
Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences
Piotr Skalski
David Sutton
Stuart Burrell
Iker Perez
Jason Wong
AI4TS
34
2
0
03 Jan 2024
Evaluating Fairness in Self-supervised and Supervised Models for Sequential Data
Sofia Yfantidou
Dimitris Spathis
Marios Constantinides
Athena Vakali
Daniele Quercia
F. Kawsar
50
2
0
03 Jan 2024
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Anku Rani
Vipula Rawte
Aman Chadha
Amitava Das
HILM
32
179
0
02 Jan 2024
Unifying Structured Data as Graph for Data-to-Text Pre-Training
Shujie Li
Liang Li
Ruiying Geng
Min Yang
Binhua Li
...
Wanwei He
Shao Yuan
Can Ma
Fei Huang
Yongbin Li
LMTD
28
13
0
02 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun-Xiong Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
33
14
0
31 Dec 2023
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
Haiyang Liu
Zihao Zhu
Giorgio Becherini
Yichen Peng
Mingyang Su
You Zhou
Xuefei Zhe
Naoya Iwamoto
Bo Zheng
Michael J. Black
SLR
31
29
0
31 Dec 2023
Research on the Laws of Multimodal Perception and Cognition from a Cross-cultural Perspective -- Taking Overseas Chinese Gardens as an Example
Ran Chen
Xueqi Yao
Jing Zhao
Shuhan Xu
Sirui Zhang
Yijun Mao
21
0
0
29 Dec 2023
Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination
Jiawei Wang
Jian Zhao
Zhengtao Cao
Ruili Feng
Rongjun Qin
Yang Yu
27
1
0
25 Dec 2023
Multi-level biomedical NER through multi-granularity embeddings and enhanced labeling
Fahime Shahrokh
Nasser Ghadiri
Rasoul Samani
M. Moradi
17
0
0
24 Dec 2023
Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Hongzheng Chen
Jiahao Zhang
Yixiao Du
Shaojie Xiang
Zichao Yue
Niansong Zhang
Yaohui Cai
Zhiru Zhang
48
34
0
23 Dec 2023
Characterizing and Classifying Developer Forum Posts with their Intentions
Xingfang Wu
Eric Thibodeau-Laufer
Heng Li
Foutse Khomh
Santhosh Srinivasan
Jayden Luo
20
0
0
21 Dec 2023
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization
Rahul Chand
Yashoteja Prabhu
Pratyush Kumar
20
3
0
20 Dec 2023
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment
Lingling Xu
Haoran Xie
S. J. Qin
Xiaohui Tao
F. Wang
46
132
0
19 Dec 2023
Assessing Logical Reasoning Capabilities of Encoder-Only Transformer Models
Paulo Pirozelli
M. M. José
Paulo de Tarso P. Filho
A. Brandão
Fabio Gagliardi Cozman
LRM
ELM
29
2
0
18 Dec 2023
A mathematical perspective on Transformers
Borjan Geshkovski
Cyril Letrouit
Yury Polyanskiy
Philippe Rigollet
EDL
AI4CE
40
36
0
17 Dec 2023
RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding
Yuxin Zi
Hariram Veeramani
Kaushik Roy
Amit P. Sheth
AI4TS
28
2
0
15 Dec 2023
BinGo: Identifying Security Patches in Binary Code with Graph Representation Learning
Xu He
Shu Wang
Pengbin Feng
Xinda Wang
Shiyu Sun
Qi Li
Kun Sun
21
1
0
13 Dec 2023