ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
arXiv:1810.04805

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 14,226 papers shown
Title
Linguistic Complexity and Socio-cultural Patterns in Hip-Hop Lyrics
Aayam Bansal
Raghav Agarwal
Kaashvi Jain
22
0
0
29 Apr 2025
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification
Foteini Papadopoulou
Osman Mutlu
Neris Özen
Bas H. M. van der Velden
I. Hendrickx
Ali Hürriyetoǧlu
ViT
39
0
0
29 Apr 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
62
0
0
29 Apr 2025
WenyanGPT: A Large Language Model for Classical Chinese Tasks
Xinyu Yao
Mengdi Wang
Bo Chen
Xiaobing Zhao
67
0
0
29 Apr 2025
From Attention to Atoms: Spectral Dictionary Learning for Fast, Interpretable Language Models
Andrew Kiruluta
24
0
0
29 Apr 2025
Turing Machine Evaluation for Large Language Model
Haitao Wu
Zongbo Han
Huaxi Huang
Changqing Zhang
ELM
LRM
62
0
0
29 Apr 2025
SVD Based Least Squares for X-Ray Pneumonia Classification Using Deep Features
Mete Erdogan
Sebnem Demirtas
44
0
0
29 Apr 2025
Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records
Jesus Lovon
Thouria Ben-Haddi
Jules Di Scala
José G. Moreno
L. Tamine
60
2
0
29 Apr 2025
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare
Lovedeep Gondara
Jonathan Simkin
Graham Sayle
Shebnum Devji
Gregory Arbour
Raymond Ng
LM&MA
41
0
0
29 Apr 2025
X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation
Guy Hadad
Haggai Roitman
Yotam Eshel
Bracha Shapira
L. Rokach
BDL
VLM
LRM
47
0
0
29 Apr 2025
On the Potential of Large Language Models to Solve Semantics-Aware Process Mining Tasks
Adrian Rebmann
Fabian David Schmidt
Goran Glavaš
Han van der Aa
LRM
31
0
0
29 Apr 2025
Can Differentially Private Fine-tuning LLMs Protect Against Privacy Attacks?
Hao Du
Shang Liu
Yang Cao
AAML
47
0
0
28 Apr 2025
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
Sehyeong Jo
Gangjae Jang
Haesol Park
32
0
0
28 Apr 2025
LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation
Beizhe Hu
Qiang Sheng
Juan Cao
Yang Li
Danding Wang
127
0
0
28 Apr 2025
Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets
Lorenz Brehme
Thomas Ströhle
Ruth Breu
59
0
0
28 Apr 2025
Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI
Hugo Georgenthum
Cristian Cosentino
Fabrizio Marozzo
Pietro Liò
MedIm
132
0
0
28 Apr 2025
Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
Kang Yang
Xinjun Mao
Shangwen Wang
Y. Wang
Tanghaoran Zhang
Bo Lin
Yihao Qin
Zhang Zhang
Yao Lu
Kamal Al-Sabahi
ALM
132
1
0
28 Apr 2025
Generative AI in Education: Student Skills and Lecturer Roles
Stefanie Krause
Ashish Dalvi
Syed Khubaib Zaidi
136
0
0
28 Apr 2025
What's Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift
Jiamin Chang
H. Li
Hammond Pearce
Ruoxi Sun
Bo-wen Li
Minhui Xue
38
0
0
28 Apr 2025
Towards Robust Multimodal Physiological Foundation Models: Handling Arbitrary Missing Modalities
Xi Fu
Wei-Bang Jiang
Yi Ding
Cuntai Guan
46
0
0
28 Apr 2025
Towards Long Context Hallucination Detection
Siyi Liu
Kishaloy Halder
Zheng Qi
Wei Xiao
Nikolaos Pappas
Phu Mon Htut
Neha Anna John
Yassine Benajiba
Dan Roth
HILM
73
0
0
28 Apr 2025
Magnifier: A Multi-grained Neural Network-based Architecture for Burned Area Delineation
Daniele Rege Cambrin
Luca Colomba
Paolo Garza
44
0
0
28 Apr 2025
Multimodal Conditioned Diffusive Time Series Forecasting
Chen Su
Yuanhe Tian
Yan Song
DiffM
AI4TS
57
0
0
28 Apr 2025
Coreference Resolution for Vietnamese Narrative Texts
Hieu-Dai Tran
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
44
0
0
28 Apr 2025
Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models
Anindya Bijoy Das
Shibbir Ahmed
Shahnewaz Karim Sakib
HILM
LM&MA
57
0
0
27 Apr 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRL
AI4TS
43
0
0
27 Apr 2025
Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation
Qianren Mao
Qili Zhang
Hanwen Hao
Zhentao Han
Runhua Xu
...
Bo Li
Y. Song
Jin Dong
Jianxin Li
Philip S. Yu
71
0
0
27 Apr 2025
Versatile Framework for Song Generation with Prompt-based Control
Y. Zhang
Wenxiang Guo
Changhao Pan
Z. Zhu
Ruiqi Li
...
Rongjie Huang
Ruiyuan Zhang
Zhiqing Hong
Ziyue Jiang
Zhou Zhao
74
1
0
27 Apr 2025
CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
Alexander Baumann
Leonardo Ayala
S.
Jan Sellner
Alexander Studier-Fischer
Berkin Özdemir
Lena Maier-Hein
Slobodan Ilic
51
0
0
27 Apr 2025
AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings
Guoqing Hu
An Zhang
Shuo Liu
Zhibo Cai
Xun Yang
X. Wang
34
0
0
27 Apr 2025
HoloDx: Knowledge- and Data-Driven Multimodal Diagnosis of Alzheimer's Disease
Qiuhui Chen
Jintao Wang
Gang Wang
Yi Hong
47
0
0
27 Apr 2025
Towards Robust Dialogue Breakdown Detection: Addressing Disruptors in Large Language Models with Self-Guided Reasoning
Abdellah Ghassel
Xianzhi Li
Xiaodan Zhu
49
0
0
26 Apr 2025
CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval
Hang Yu
Jiahao Wen
Zhedong Zheng
46
0
0
26 Apr 2025
Video CLIP Model for Multi-View Echocardiography Interpretation
Ryo Takizawa
Satoshi Kodera
Tempei Kabayama
Ryo Matsuoka
Yuta Ando
Yuto Nakamura
Haruki Settai
Norihiko Takeda
37
0
0
26 Apr 2025
MTCSC: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction
Junhong Liang
Yu Zhou
LRM
118
0
0
26 Apr 2025
Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Shahad Albastaki
Anabia Sohail
I. I. Ganapathi
B. Alawode
Asim Khan
Sajid Javed
N. Werghi
Mohammed Bennamoun
Arif Mahmood
66
0
0
26 Apr 2025
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Sanwoo Lee
Jiahao Liu
Qifan Wang
J. Wang
Xunliang Cai
Yunfang Wu
MoMe
125
0
0
26 Apr 2025
The Influence of Text Variation on User Engagement in Cross-Platform Content Sharing
Yibo Hu
Yiqiao Jin
Meng Ye
Ajay Divakaran
Srijan Kumar
22
0
0
26 Apr 2025
Generative Product Recommendations for Implicit Superlative Queries
Kaustubh D. Dhole
Nikhita Vedula
Saar Kuzi
Giuseppe Castellucci
Eugene Agichtein
S. Malmasi
47
0
0
26 Apr 2025
A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification
Junichiro Niimi
56
0
0
26 Apr 2025
A Langevin sampling algorithm inspired by the Adam optimizer
B. Leimkuhler
René Lohmann
P. Whalley
76
0
0
26 Apr 2025
Extracting Abstraction Dimensions by Identifying Syntax Pattern from Texts
Jian Zhou
J. Li
Sirui Zhuge
Hai Zhuge
14
0
0
26 Apr 2025
TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation
Robert Leppich
Michael Stenger
Daniel Grillmeyer
Vanessa Borst
Samuel Kounev
AI4TS
AI4CE
62
0
0
26 Apr 2025
Improved Molecular Generation through Attribute-Driven Integrative Embeddings and GAN Selectivity
Nandan Joshi
Erhan Guven
26
0
0
26 Apr 2025
Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting
Z. R. K. Rostam
Gábor Kertész
24
0
0
26 Apr 2025
A model and package for German ColBERT
Thuong Dang
Qiqi Chen
VLM
73
0
0
25 Apr 2025
Building UD Cairo for Old English in the Classroom
Lauren Levine
Junghyun Min
Amir Zeldes
45
0
0
25 Apr 2025
Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions
James D. Finch
Yasasvi Josyula
Jinho D. Choi
38
0
0
25 Apr 2025
Multimodal graph representation learning for website generation based on visual sketch
Tung D. Vu
Chung Hoang
Truong-Son Hy
3DV
56
0
0
25 Apr 2025
Pushing the boundary on Natural Language Inference
Pablo Miralles-González
Javier Huertas-Tato
Alejandro Martín
David Camacho
LRM
44
0
0
25 Apr 2025