1810.04805
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 33,017 papers shown
Title
Learning Treatment Policies From Multimodal Electronic Health Records
Henri Arno
Thomas Demeester
CML
151
0
0
24 Dec 2025
AI-based Traffic Modeling for Network Security and Privacy: Challenges Ahead
Dinil Mon Divakaran
AAML
260
2
0
24 Dec 2025
Meta-Router: Bridging Gold-standard and Preference-based Evaluations in Large Language Model Routing
Yichi Zhang
Fangzheng Xie
Shu Yang
Chong Wu
96
0
0
24 Dec 2025
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
143
1
0
24 Dec 2025
Improving Speech Emotion Recognition with Mutual Information Regularized Generative Model
Chung-Soo Ahn
R. Rana
Sunil Sivadas
Carlos Busso
Jagath Rajapakse
97
0
0
24 Dec 2025
Q-BERT4Rec: Quantized Semantic-ID Representation Learning for Multimodal Recommendation
Haofeng Huang
Ling Gai
MQ
VLM
212
0
0
02 Dec 2025
Reasoning-Aware Multimodal Fusion for Hateful Video Detection
Shuonan Yang
Tailin Chen
Jiangbei Yue
Guangliang Cheng
Jianbo Jiao
Zeyu Fu
188
0
0
02 Dec 2025
Bangla Hate Speech Classification with Fine-tuned Transformer Models
Yalda Keivan Jafari
Krishno Dey
16
0
0
02 Dec 2025
CryptoQA: A Large-scale Question-answering Dataset for AI-assisted Cryptography
Mayar Elfares
Pascal Reisert
Tilman Dietz
Manpa Barman
Ahmed Zaki
Ralf Küsters
Andreas Bulling
ELM
104
0
0
02 Dec 2025
ADORE: Autonomous Domain-Oriented Relevance Engine for E-commerce
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Zheng Fang
Donghao Xie
Ming Pang
Chunyuan Yuan
Xue Jiang
Changping Peng
Zhangang Lin
Zheng Luo
56
1
0
02 Dec 2025
Flexible Gravitational-Wave Parameter Estimation with Transformers
Annalena Kofler
Maximilian Dax
Stephen R. Green
J. Wildberger
N. Gupte
Jakob H. Macke
J. Gair
A. Buonanno
Bernhard Schölkopf
16
0
0
02 Dec 2025
Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding
Hyeongseop Rha
Jeong Hun Yeo
Junil Won
Se Jin Park
Yong Man Ro
LRM
48
0
0
02 Dec 2025
CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer
Lavish Bansal
Naman Mishra
16
0
0
02 Dec 2025
Revisiting Theory of Contrastive Learning for Domain Generalization
Ali Alvandi
Mina Rezaei
OOD
SSL
145
0
0
02 Dec 2025
Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention
Wenyi Xiong
Jian Chen
84
0
0
02 Dec 2025
Empathy Level Prediction in Multi-Modal Scenario with Supervisory Documentation Assistance
Yufei Xiao
Shangfei Wang
16
0
0
02 Dec 2025
TokenPowerBench: Benchmarking the Power Consumption of LLM Inference
Chenxu Niu
Wei Zhang
Jie Li
Yongjian Zhao
Tongyang Wang
Xi-Zhao Wang
Yong Chen
20
0
0
02 Dec 2025
Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?
Manuel Benavent-Lledo
Konstantinos Bacharidis
Victoria Manousaki
K. Papoutsakis
Antonis Argyros
José García Rodríguez
44
0
0
02 Dec 2025
Low-Rank Prehab: Preparing Neural Networks for SVD Compression
Haoran Qin
Shansita D. Sharma
Ali Abbasi
Chayne Thrash
Soheil Kolouri
96
0
0
01 Dec 2025
Learned-Rule-Augmented Large Language Model Evaluators
Jie Meng
Jin Mao
ALM
ELM
LRM
116
0
0
01 Dec 2025
Scaling and context steer LLMs along the same computational path as the human brain
Joséphine Raugel
Stéphane d'Ascoli
Jérémy Rapin
Valentin Wyart
J. King
92
0
0
01 Dec 2025
Enhancing BERT Fine-Tuning for Sentiment Analysis in Lower-Resourced Languages
Jozef Kubík
Marek Suppa
Martin Takáč
104
0
0
01 Dec 2025
MDiff4STR: Mask Diffusion Model for Scene Text Recognition
Yongkun Du
Miaomiao Zhao
S. Fan
Z. Chen
Caiyan Jia
Yu-Gang Jiang
DiffM
16
0
0
01 Dec 2025
Evaluating SAM2 for Video Semantic Segmentation
Syed Hesham Syed Ariff
Yun Liu
Guolei Sun
Jing Yang
Henghui Ding
Xue Geng
Xudong Jiang
VLM
155
0
0
01 Dec 2025
MARSAD: A Multi-Functional Tool for Real-Time Social Media Analysis
Md. Rafiul Biswas
Firoj Alam
Wajdi Zaghouani
8
0
0
01 Dec 2025
M4-BLIP: Advancing Multi-Modal Media Manipulation Detection through Face-Enhanced Local Analysis
Hang Wu
Ke Sun
Jiayi Ji
Xiaoshuai Sun
Rongrong Ji
96
0
0
01 Dec 2025
On the Unreasonable Effectiveness of Last-layer Retraining
John C. Hill
Tyler LaBonte
Xinchen Zhang
Vidya Muthukumar
52
0
0
01 Dec 2025
Handwritten Text Recognition for Low Resource Languages
Sayantan Dey
Alireza Alaei
P. Roy
VLM
76
0
0
01 Dec 2025
DyFuLM: An Advanced Multimodal Framework for Sentiment Analysis
Ruohan Zhou
Jiachen Yuan
Churui Yang
Wenzheng Huang
Guoyan Zhang
Shiyao Wei
Jiazhen Hu
Ning Xin
Md Maruf Hasan
20
0
0
01 Dec 2025
Feature Selection Empowered BERT for Detection of Hate Speech with Vocabulary Augmentation
Pritish N. Desai
Tanay Kewalramani
Srimanta Mandal
20
0
0
01 Dec 2025
Reasoning About the Unsaid: Misinformation Detection with Omission-Aware Graph Inference
Zhengjia Wang
Danding Wang
Qiang Sheng
Jiaying Wu
Juan Cao
92
0
0
01 Dec 2025
WhAM: Towards A Translative Model of Sperm Whale Vocalization
Orr Paradise
Pranav Muralikrishnan
Liangyuan Chen
Hugo Flores Garcia
Bryan Pardo
R. Diamant
David F. Gruber
Shane Gero
S. Goldwasser
20
1
0
01 Dec 2025
Masked Symbol Modeling for Demodulation of Oversampled Baseband Communication Signals in Impulsive Noise-Dominated Channels
Oguz Bedir
Nurullah Sevim
Mostafa Ibrahim
S. Ekin
32
0
0
01 Dec 2025
Testing Transformer Learnability on the Arithmetic Sequence of Rooted Trees
Alessandro Breccia
Federica Gerace
Marco Lippi
Gabriele Sicuro
Pierluigi Contucci
24
0
0
01 Dec 2025
Reconstructing Multi-Scale Physical Fields from Extremely Sparse Measurements with an Autoencoder-Diffusion Cascade
Letian Yi
Tingpeng Zhang
Mingyuan Zhou
Guannan Wang
Quanke Su
Zhilu Lai
DiffM
36
0
0
01 Dec 2025
Label Forensics: Interpreting Hard Labels in Black-Box Text Classifier
Mengyao Du
Gang Yang
Han Fang
Quanjun Yin
Ee-Chien Chang
56
0
0
01 Dec 2025
FastPOS: Language-Agnostic Scalable POS Tagging Framework Low-Resource Use Case
Md Abdullah Al Kafi
Sumit Kumar Banshal
12
0
0
30 Nov 2025
Advancing Academic Chatbots: Evaluation of Non Traditional Outputs
Nicole Favero
Francesca Salute
Daniel Hardt
12
0
0
30 Nov 2025
Accelerating Bangla NLP Tasks with Automatic Mixed Precision: Resource-Efficient Training Preserving Model Efficacy
Md Mehrab Hossain Opi
Sumaiya Khan
Moshammad Farzana Rahman
8
0
0
30 Nov 2025
FiCoTS: Fine-to-Coarse LLM-Enhanced Hierarchical Cross-Modality Interaction for Time Series Forecasting
Yafei Lyu
Hao Zhou
Lu Zhang
X. Yang
Zhiyong Liu
AI4TS
28
0
0
29 Nov 2025
LAP: Fast LAtent Diffusion Planner with Fine-Grained Feature Distillation for Autonomous Driving
Jinhao Zhang
Wenlong Xia
Zhexuan Zhou
Youmin Gong
Jie Mei
28
0
0
29 Nov 2025
Large Language Model based Smart Contract Auditing with LLMBugScanner
Yining Yuan
Yifei Wang
Yichang Xu
Zachary Yahn
Sihao Hu
Ling Liu
48
0
0
29 Nov 2025
Melody or Machine: Detecting Synthetic Music with Dual-Stream Contrastive Learning
Arnesh Batra
Dev Sharma
Krish Thukral
Ruhani Bhatia
Naman Batra
Aditya Gautam
VLM
64
0
0
29 Nov 2025
Financial Text Classification Based On rLoRA Finetuning On Qwen3-8B model
Zhiming Lian
20
0
0
29 Nov 2025
BioArc: Discovering Optimal Neural Architectures for Biological Foundation Models
Yi Fang
H. Xu
Jiaxin Han
Sirui Ding
Y. Wang
Yue Wang
Xuan Wang
LM&Ro
AI4CE
249
0
0
29 Nov 2025
Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models
Praveen Gatla
Anushka
Nikita Kanwar
Gouri Sahoo
Rajesh Kumar Mundotiya
68
1
0
28 Nov 2025
BanglaSentNet: An Explainable Hybrid Deep Learning Framework for Multi-Aspect Sentiment Analysis with Cross-Domain Transfer Learning
Ariful Islam
Md Rifat Hossen
Tanvir Mahmud
88
0
0
28 Nov 2025
A Trainable Centrality Framework for Modern Data
Minh Duc Vu
M. Liu
Doudou Zhou
FedML
120
0
0
28 Nov 2025
Pooling Attention: Evaluating Pretrained Transformer Embeddings for Deception Classification
Sumit Mamtani
Abhijeet Bhure
84
0
0
28 Nov 2025
Decoding the Past: Explainable Machine Learning Models for Dating Historical Texts
Paulo J. N. Pinto
A. Pinho
Diogo Pratas
AI4CE
203
0
0
28 Nov 2025