Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 14,226 papers shown
Title
Linguistic Complexity and Socio-cultural Patterns in Hip-Hop Lyrics
Aayam Bansal
Raghav Agarwal
Kaashvi Jain
22
0
0
29 Apr 2025
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification
Foteini Papadopoulou
Osman Mutlu
Neris Özen
Bas H. M. van der Velden
I. Hendrickx
Ali Hürriyetoǧlu
ViT
39
0
0
29 Apr 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
62
0
0
29 Apr 2025
WenyanGPT: A Large Language Model for Classical Chinese Tasks
Xinyu Yao
Mengdi Wang
Bo Chen
Xiaobing Zhao
67
0
0
29 Apr 2025
From Attention to Atoms: Spectral Dictionary Learning for Fast, Interpretable Language Models
Andrew Kiruluta
24
0
0
29 Apr 2025
Turing Machine Evaluation for Large Language Model
Haitao Wu
Zongbo Han
Huaxi Huang
Changqing Zhang
ELM
LRM
62
0
0
29 Apr 2025
SVD Based Least Squares for X-Ray Pneumonia Classification Using Deep Features
Mete Erdogan
Sebnem Demirtas
44
0
0
29 Apr 2025
Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records
Jesus Lovon
Thouria Ben-Haddi
Jules Di Scala
José G. Moreno
L. Tamine
60
2
0
29 Apr 2025
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare
Lovedeep Gondara
Jonathan Simkin
Graham Sayle
Shebnum Devji
Gregory Arbour
Raymond Ng
LM&MA
41
0
0
29 Apr 2025
X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation
Guy Hadad
Haggai Roitman
Yotam Eshel
Bracha Shapira
L. Rokach
BDL
VLM
LRM
47
0
0
29 Apr 2025
On the Potential of Large Language Models to Solve Semantics-Aware Process Mining Tasks
Adrian Rebmann
Fabian David Schmidt
Goran Glavaš
Han van der Aa
LRM
31
0
0
29 Apr 2025
Can Differentially Private Fine-tuning LLMs Protect Against Privacy Attacks?
Hao Du
Shang Liu
Yang Cao
AAML
47
0
0
28 Apr 2025
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
Sehyeong Jo
Gangjae Jang
Haesol Park
32
0
0
28 Apr 2025
LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation
Beizhe Hu
Qiang Sheng
Juan Cao
Yang Li
Danding Wang
127
0
0
28 Apr 2025
Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets
Lorenz Brehme
Thomas Ströhle
Ruth Breu
59
0
0
28 Apr 2025
Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI
Hugo Georgenthum
Cristian Cosentino
Fabrizio Marozzo
Pietro Liò
MedIm
132
0
0
28 Apr 2025
Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
Kang Yang
Xinjun Mao
Shangwen Wang
Y. Wang
Tanghaoran Zhang
Bo Lin
Yihao Qin
Zhang Zhang
Yao Lu
Kamal Al-Sabahi
ALM
132
1
0
28 Apr 2025
Generative AI in Education: Student Skills and Lecturer Roles
Stefanie Krause
Ashish Dalvi
Syed Khubaib Zaidi
136
0
0
28 Apr 2025
What's Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift
Jiamin Chang
H. Li
Hammond Pearce
Ruoxi Sun
Bo-wen Li
Minhui Xue
38
0
0
28 Apr 2025
Towards Robust Multimodal Physiological Foundation Models: Handling Arbitrary Missing Modalities
Xi Fu
Wei-Bang Jiang
Yi Ding
Cuntai Guan
46
0
0
28 Apr 2025
Towards Long Context Hallucination Detection
Siyi Liu
Kishaloy Halder
Zheng Qi
Wei Xiao
Nikolaos Pappas
Phu Mon Htut
Neha Anna John
Yassine Benajiba
Dan Roth
HILM
73
0
0
28 Apr 2025
Magnifier: A Multi-grained Neural Network-based Architecture for Burned Area Delineation
Daniele Rege Cambrin
Luca Colomba
Paolo Garza
44
0
0
28 Apr 2025
Multimodal Conditioned Diffusive Time Series Forecasting
Chen Su
Yuanhe Tian
Yan Song
DiffM
AI4TS
57
0
0
28 Apr 2025
Coreference Resolution for Vietnamese Narrative Texts
Hieu-Dai Tran
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
44
0
0
28 Apr 2025
Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models
Anindya Bijoy Das
Shibbir Ahmed
Shahnewaz Karim Sakib
HILM
LM&MA
57
0
0
27 Apr 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRL
AI4TS
43
0
0
27 Apr 2025
Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation
Qianren Mao
Qili Zhang
Hanwen Hao
Zhentao Han
Runhua Xu
...
Bo Li
Y. Song
Jin Dong
Jianxin Li
Philip S. Yu
71
0
0
27 Apr 2025
Versatile Framework for Song Generation with Prompt-based Control
Y. Zhang
Wenxiang Guo
Changhao Pan
Z. Zhu
Ruiqi Li
...
Rongjie Huang
Ruiyuan Zhang
Zhiqing Hong
Ziyue Jiang
Zhou Zhao
74
1
0
27 Apr 2025
CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
Alexander Baumann
Leonardo Ayala
S.
Jan Sellner
Alexander Studier-Fischer
Berkin Özdemir
Lena Maier-Hein
Slobodan Ilic
51
0
0
27 Apr 2025
AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings
Guoqing Hu
An Zhang
Shuo Liu
Zhibo Cai
Xun Yang
X. Wang
34
0
0
27 Apr 2025
HoloDx: Knowledge- and Data-Driven Multimodal Diagnosis of Alzheimer's Disease
Qiuhui Chen
Jintao Wang
Gang Wang
Yi Hong
47
0
0
27 Apr 2025
Towards Robust Dialogue Breakdown Detection: Addressing Disruptors in Large Language Models with Self-Guided Reasoning
Abdellah Ghassel
Xianzhi Li
Xiaodan Zhu
49
0
0
26 Apr 2025
CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval
Hang Yu
Jiahao Wen
Zhedong Zheng
46
0
0
26 Apr 2025
Video CLIP Model for Multi-View Echocardiography Interpretation
Ryo Takizawa
Satoshi Kodera
Tempei Kabayama
Ryo Matsuoka
Yuta Ando
Yuto Nakamura
Haruki Settai
Norihiko Takeda
37
0
0
26 Apr 2025
MTCSC: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction
Junhong Liang
Yu Zhou
LRM
118
0
0
26 Apr 2025
Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Shahad Albastaki
Anabia Sohail
I. I. Ganapathi
B. Alawode
Asim Khan
Sajid Javed
N. Werghi
Mohammed Bennamoun
Arif Mahmood
66
0
0
26 Apr 2025
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Sanwoo Lee
Jiahao Liu
Qifan Wang
J. Wang
Xunliang Cai
Yunfang Wu
MoMe
125
0
0
26 Apr 2025
The Influence of Text Variation on User Engagement in Cross-Platform Content Sharing
Yibo Hu
Yiqiao Jin
Meng Ye
Ajay Divakaran
Srijan Kumar
22
0
0
26 Apr 2025
Generative Product Recommendations for Implicit Superlative Queries
Kaustubh D. Dhole
Nikhita Vedula
Saar Kuzi
Giuseppe Castellucci
Eugene Agichtein
S. Malmasi
47
0
0
26 Apr 2025
A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification
Junichiro Niimi
56
0
0
26 Apr 2025
A Langevin sampling algorithm inspired by the Adam optimizer
B. Leimkuhler
René Lohmann
P. Whalley
76
0
0
26 Apr 2025
Extracting Abstraction Dimensions by Identifying Syntax Pattern from Texts
Jian Zhou
J. Li
Sirui Zhuge
Hai Zhuge
14
0
0
26 Apr 2025
TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation
Robert Leppich
Michael Stenger
Daniel Grillmeyer
Vanessa Borst
Samuel Kounev
AI4TS
AI4CE
62
0
0
26 Apr 2025
Improved Molecular Generation through Attribute-Driven Integrative Embeddings and GAN Selectivity
Nandan Joshi
Erhan Guven
26
0
0
26 Apr 2025
Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting
Z. R. K. Rostam
Gábor Kertész
24
0
0
26 Apr 2025
A model and package for German ColBERT
Thuong Dang
Qiqi Chen
VLM
73
0
0
25 Apr 2025
Building UD Cairo for Old English in the Classroom
Lauren Levine
Junghyun Min
Amir Zeldes
45
0
0
25 Apr 2025
Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions
James D. Finch
Yasasvi Josyula
Jinho D. Choi
38
0
0
25 Apr 2025
Multimodal graph representation learning for website generation based on visual sketch
Tung D. Vu
Chung Hoang
Truong-Son Hy
3DV
56
0
0
25 Apr 2025
Pushing the boundary on Natural Language Inference
Pablo Miralles-González
Javier Huertas-Tato
Alejandro Martín
David Camacho
LRM
44
0
0
25 Apr 2025
Previous
1
2
3
4
5
6
...
283
284
285
Next