BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM SSL SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 33,017 papers shown
Title
Standard Occupation Classifier -- A Natural Language Processing Approach
Sidharth Rony
Jack Patman
80
0
0
28 Nov 2025
Decoding the Past: Explainable Machine Learning Models for Dating Historical Texts
Paulo J. N. Pinto
A. Pinho
Diogo Pratas
AI4CE
203
0
0
28 Nov 2025
CodeFlowLM: Incremental Just-In-Time Defect Prediction with Pretrained Language Models and Exploratory Insights into Defect Localization
Monique Louise Monteiro
George G. Cabral
Adriano L. I. Oliveira
20
0
0
28 Nov 2025
Pooling Attention: Evaluating Pretrained Transformer Embeddings for Deception Classification
Sumit Mamtani
Abhijeet Bhure
84
0
0
28 Nov 2025
A Trainable Centrality Framework for Modern Data
Minh Duc Vu
M. Liu
Doudou Zhou
FedML
120
0
0
28 Nov 2025
Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models
Praveen Gatla
Anushka
Nikita Kanwar
Gouri Sahoo
Rajesh Kumar Mundotiya
68
1
0
28 Nov 2025
BanglaSentNet: An Explainable Hybrid Deep Learning Framework for Multi-Aspect Sentiment Analysis with Cross-Domain Transfer Learning
Ariful Islam
Md Rifat Hossen
Tanvir Mahmud
88
0
0
28 Nov 2025
PULSE-ICU: A Pretrained Unified Long-Sequence Encoder for Multi-task Prediction in Intensive Care Units
Sejeong Jang
Joo Heung Yoon
Hyo Kyung Lee
32
0
0
27 Nov 2025
Beyond Real versus Fake Towards Intent-Aware Video Analysis
Saurabh Atreya
Nabyl Quignon
Baptiste Chopin
Abhijit Das
A. Dantcheva
AAML
40
0
0
27 Nov 2025
PISA: Prioritized Invariant Subgraph Aggregation
Ali Ghasemi
F. Wani
Maria Sofia Bucarelli
Fabrizio Silvestri
OOD
80
0
0
27 Nov 2025
The Collapse of Patches
Wei Guo
Shunqi Mao
Zhuonan Liang
Heng Wang
Weidong Cai
16
0
0
27 Nov 2025
Shoe Style-Invariant and Ground-Aware Learning for Dense Foot Contact Estimation
Daniel Sungho Jung
Kyoung Mu Lee
76
0
0
27 Nov 2025
Contextual Gating within the Transformer Stack: Synergistic Feature Modulation for Enhanced Lyrical Classification and Calibration
M.A. Gameiro
8
0
0
27 Nov 2025
Efficiency and Effectiveness of SPLADE Models on Billion-Scale Web Document Title
Taeryun Won
Tae Kwan Lee
Hiun Kim
Hyemin Lee
28
0
0
27 Nov 2025
PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark
Róbert Belanec
Branislav Pecher
Ivan Srba
Maria Bielikova
103
1
0
26 Nov 2025
DialBench: Towards Accurate Reading Recognition of Pointer Meter using Large Foundation Models
Futian Wang
Chaoliu Weng
Xiao Wang
Zhen Chen
Zhicheng Zhao
Jin Tang
44
0
0
26 Nov 2025
Context-Aware Pragmatic Metacognitive Prompting for Sarcasm Detection
Michael Iskandardinata
William Christian
Derwin Suhartono
RALM
474
0
0
26 Nov 2025
$\mathcal{E}_0$: Enhancing Generalization and Fine-Grained Control in VLA Models via Continuized Discrete Diffusion
Zhihao Zhan
Jiaying Zhou
Likui Zhang
Qinhan Lv
Hao Liu
...
Ziliang Chen
Tianshui Chen
Keze Wang
Liang Lin
Guangrun Wang
VGen VLM
144
0
0
26 Nov 2025
SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features
Mohammad Zare
28
0
0
26 Nov 2025
FITRep: Attention-Guided Item Representation via MLLMs
Guoxiao Zhang
Ao Li
Tan Qu
Qianlong Xie
Xingxing Wang
79
0
0
26 Nov 2025
Enhancing Burmese News Classification with Kolmogorov-Arnold Network Head Fine-tuning
Thura Aung
Eaint Kay Khaing Kyaw
Ye Kyaw Thu
Thazin Myint Oo
Thepchai Supnithi
298
0
0
26 Nov 2025
HTTM: Head-wise Temporal Token Merging for Faster VGGT
Weitian Wang
Lukas Meiner
Rai Shubham
Cecilia De La Parra
Akash Kumar
132
0
0
26 Nov 2025
Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Kaifeng Hong
Yinglong Zhang
Xiaoying Hong
Xuewen Xia
Xing Xu
174
0
0
26 Nov 2025
A Probabilistic Framework for Temporal Distribution Generalization in Industry-Scale Recommender Systems
Yuxuan Zhu
Cong Fu
Yabo Ni
Anxiang Zeng
Yuan Fang
OOD
340
0
0
26 Nov 2025
Towards Audio Token Compression in Large Audio Language Models
Saurabhchand Bhati
Samuel Thomas
Hilde Kuehne
Rogerio Feris
James R. Glass
AuLLM
233
0
0
26 Nov 2025
BanglaMM-Disaster: A Multimodal Transformer-Based Deep Learning Framework for Multiclass Disaster Classification in Bangla
Ariful Islam
Md Rifat Hossen
Md. Mahmudul Arif
Abdullah Al Noman
Md Arifur Rahman
128
0
0
26 Nov 2025
Going with the Speed of Sound: Pushing Neural Surrogates into Highly-turbulent Transonic Regimes
Fabian Paischer
Leo Cotteleer
Yann Dreze
Richard Kurle
Dylan Rubini
Maurits Bleeker
Tobias Kronlachner
Johannes Brandstetter
AI4CE
188
1
0
26 Nov 2025
Chatty-KG: A Multi-Agent AI System for On-Demand Conversational Question Answering over Knowledge Graphs
Reham Omar
Abdelghny Orogat
Ibrahim Abdelaziz
Omij Mangukiya
Panos Kalnis
Essam Mansour
150
0
0
26 Nov 2025
On the Origin of Algorithmic Progress in AI
Hans Gundlach
Alex Fogelson
Jayson Lynch
Ana Trisovic
Jonathan Rosenfeld
Anmol Sandhu
Neil Thompson
68
0
0
26 Nov 2025
A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit
Leander Girrbach
Zeynep Akata
MoMe
261
0
0
26 Nov 2025
Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium
Akbar Anbar Jafari
G. Anbarjafari
32
0
0
26 Nov 2025
Harmonic-Percussive Disentangled Neural Audio Codec for Bandwidth Extension
Benoît Giniès
Xiaoyu Bie
Olivier Fercoq
Gaël Richard
152
0
0
26 Nov 2025
HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents
Anyang Tong
Xiang Niu
ZhiPing Liu
Chang Tian
Yanyan Wei
Zenglin Shi
Meng Wang
93
1
0
25 Nov 2025
DUO-TOK: Dual-Track Semantic Music Tokenizer for Vocal-Accompaniment Generation
Rui Lin
Zhiyue Wu
Jiahe Le
Kangdi Wang
Weixiong Chen
Junyu Dai
Tao Jiang
140
0
0
25 Nov 2025
APT-CGLP: Advanced Persistent Threat Hunting via Contrastive Graph-Language Pre-Training
Xuebo Qiu
Mingqi Lv
Yimei Zhang
Tieming Chen
Tiantian Zhu
Qijie Song
Shouling Ji
189
0
0
25 Nov 2025
"When Data is Scarce, Prompt Smarter"... Approaches to Grammatical Error Correction in Low-Resource Settings
Somsubhra De
Harsh Kumar
Arun Prakash A
56
0
0
25 Nov 2025
Stragglers Can Contribute More: Uncertainty-Aware Distillation for Asynchronous Federated Learning
Yujia Wang
Fenglong Ma
Jinghui Chen
FedML
262
0
0
25 Nov 2025
Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries
Sree Bhattacharyya
Yaman Kumar Singla
Sudhir Yarram
Somesh Singh
Harini S I
James Z. Wang
92
0
0
25 Nov 2025
Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model
Rio Fear
Payel Mukhopadhyay
Michael McCabe
Alberto Bietti
M. Cranmer
LLMSV AI4CE
405
1
0
25 Nov 2025
Hybrid Convolution and Frequency State Space Network for Image Compression
Haodong Pan
Hao Wei
Yusong Wang
Nanning Zheng
Caigui Jiang
52
0
0
25 Nov 2025
From Words to Wisdom: Discourse Annotation and Baseline Models for Student Dialogue Understanding
Farjana Sultana Mim
Shuchin Aeron
Eric Miller
Kristen Wendell
92
1
0
25 Nov 2025
Revisiting KRISP: A Lightweight Reproduction and Analysis of Knowledge-Enhanced Vision-Language Models
Souradeep Dutta
Keshav Bulia
Neena S Nair
VLM
115
0
0
25 Nov 2025
Winning with Less for Low Resource Languages: Advantage of Cross-Lingual English_Persian Argument Mining Model over LLM Augmentation
Ali Jahan
Masood Ghayoomi
Annette Hautli-Janisz
44
0
0
25 Nov 2025
Distilling Cross-Modal Knowledge via Feature Disentanglement
Junhong Liu
Yuan Zhang
Tao Huang
Wenchao Xu
Renyu Yang
93
0
0
25 Nov 2025
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
Shilei Cao
Ziyang Gong
Hehai Lin
Yang Liu
Jiashun Cheng
...
C. Qin
Hong Cheng
Xue Yang
Juepeng Zheng
Haohuan Fu
212
0
0
25 Nov 2025
GHR-VQA: Graph-guided Hierarchical Relational Reasoning for Video Question Answering
Dionysia Danai Brilli
Dimitrios Mallis
Vassilis Pitsikalis
Petros Maragos
144
0
0
25 Nov 2025
Learning to Clean: Reinforcement Learning for Noisy Label Correction
Marzi Heidari
Hanping Zhang
Yuhong Guo
NoLa OffRL OnRL
361
0
0
25 Nov 2025
Foundry: Distilling 3D Foundation Models for the Edge
Guillaume Letellier
Siddharth Srivastava
F. Jurie
Gaurav Sharma
64
0
0
25 Nov 2025
A Machine Learning Approach for Detection of Mental Health Conditions and Cyberbullying from Social Media
Edward Ajayi
Martha Kachweka
Mawuli Deku
Emily Aiken
32
0
0
25 Nov 2025
DinoLizer: Learning from the Best for Generative Inpainting Localization
Minh Thong Doi
Jan Butora
V. Itier
Jérémie Boulanger
Patrick Bas
DiffM
239
0
0
25 Nov 2025