Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 33,342 papers shown
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
Haotian Xu
Yuning You
Tengfei Ma
112
0
0
20 Nov 2025
Boosting Medical Visual Understanding From Multi-Granular Language Learning
Zihan Li
Yiqing Wang
Sina Farsiu
P. E. Kinahan
VLM
224
0
0
20 Nov 2025
Progressive Supernet Training for Efficient Visual Autoregressive Modeling
Xiaoyue Chen
Yuling Shi
Kaiyuan Li
Huandong Wang
Yong Li
Xiaodong Gu
Xinlei Chen
Mingbao Lin
100
0
0
20 Nov 2025
Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models
Yijun Yang
L. Wang
Jianping Zhang
Chi Harold Liu
Lanqing Hong
Q. Xu
AAML
125
0
0
20 Nov 2025
Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation
Kexin Zhao
Ken Forbus
40
0
0
20 Nov 2025
LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving
Pei Liu
Songtao Wang
Lang Zhang
Xingyue Peng
Yuandong Lyu
...
Weiliang Ma
Xueyang Zhang
Yifei Zhan
Xianpeng Lang
Jun Ma
SyDa
400
0
0
20 Nov 2025
Pharos-ESG: A Framework for Multimodal Parsing, Contextual Narration, and Hierarchical Labeling of ESG Report
Yan Chen
Yu Zou
Jialei Zeng
Haoran You
Xiaorui Zhou
Aixi Zhong
94
0
0
20 Nov 2025
Toward Artificial Palpation: Representation Learning of Touch on Soft Bodies
Zohar Rimon
Elisei Shafer
Tal Tepper
Efrat Shimron
Aviv Tamar
OOD
SSL
296
0
0
20 Nov 2025
Incorporating Token Importance in Multi-Vector Retrieval
Archish S
Ankit Garg
Kirankumar Shiragur
N. Kayal
164
0
0
20 Nov 2025
C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models
Nayoung Oh
Dohyun Kim
Junhyeong Bang
Rohan Paul
Daehyung Park
131
0
0
19 Nov 2025
Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Michael McCabe
Payel Mukhopadhyay
Tanya Marwah
Bruno Régaldo-Saint Blancard
François Rozet
...
Mariel Pettee
Jeff Shen
Kyunghyun Cho
M. Cranmer
S. Ho
AI4CE
228
1
0
19 Nov 2025
Eq.Bot: Enhance Robotic Manipulation Learning via Group Equivariant Canonicalization
Jian Deng
Yuandong Wang
Yangfu Zhu
Tao Feng
Tianyu Wo
Zhenzhou Shao
148
0
0
19 Nov 2025
HEAD-QA v2: Expanding a Healthcare Benchmark for Reasoning
Alexis Correa-Guillén
Carlos Gómez-Rodríguez
David Vilares
LRM
279
0
0
19 Nov 2025
TiCAL:Typicality-Based Consistency-Aware Learning for Multimodal Emotion Recognition
Wen Yin
Siyu Zhan
Cencen Liu
Xin Hu
Guiduo Duan
Xiurui Xie
Yuan-Fang Li
Tao He
177
0
0
19 Nov 2025
An Optimized Machine Learning Classifier for Detecting Fake Reviews Using Extracted Features
Shabbir Anees
Anshuman
Ayush Chaurasia
Prathmesh Bogar
20
0
0
19 Nov 2025
ProPL: Universal Semi-Supervised Ultrasound Image Segmentation via Prompt-Guided Pseudo-Labeling
Yaxiong Chen
Q. Wang
Chunlei Li
Jingliang Hu
Yilei Shi
Shengwu Xiong
Xiao Xiang Zhu
Lichao Mou
VLM
108
0
0
19 Nov 2025
TB or Not TB: Coverage-Driven Direct Preference Optimization for Verilog Stimulus Generation
Bardia Nadimi
Khashayar Filom
Deming Chen
Hao Zheng
84
0
0
19 Nov 2025
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Yuan Jiang
Zehao Li
Shan Huang
Christoph Treude
Xiaohong Su
Tiantian Wang
AAML
257
0
0
19 Nov 2025
Opinion Dynamics Models for Sentiment Evolution in Weibo Blogs
Yulong He
Anton V. Proskurnikov
Artem Sedakov
48
0
0
19 Nov 2025
Standardising the NLP Workflow: A Framework for Reproducible Linguistic Analysis
Yves Pauli
Jan-Bernard Marsman
Finn Rabe
Victoria Edkins
Roya M. Hüppi
...
Akhil Ratan Misra
Nils Lang
Wolfram Hinzen
Iris Sommer
Philipp Homan
90
0
0
19 Nov 2025
SteganoBackdoor: Stealthy and Data-Efficient Backdoor Attacks on Language Models
Eric Xue
Ruiyi Zhang
Zijun Zhang
AAML
149
0
0
18 Nov 2025
AutoTool: Efficient Tool Selection for Large Language Model Agents
Jingyi Jia
Qinbin Li
LLMAG
144
0
0
18 Nov 2025
Hierarchical Token Prepending: Enhancing Information Flow in Decoder-based LLM Embeddings
Xueying Ding
Xingyue Huang
Mingxuan Ju
Liam Collins
Yozen Liu
Leman Akoglu
Neil Shah
Tong Zhao
91
1
0
18 Nov 2025
It's LIT! Reliability-Optimized LLMs with Inspectable Tools
Ruixin Zhang
J. Donnelly
Zhicheng Guo
Ghazal Khalighinejad
Haiyang Huang
A. Barnett
Cynthia Rudin
104
0
0
18 Nov 2025
LogPurge: Log Data Purification for Anomaly Detection via Rule-Enhanced Filtering
Shenglin Zhang
Z. Chen
Zijing Que
Yilun Liu
Yongqian Sun
Sicheng Wei
Dan Pei
Hailin Li
AI4TS
73
0
0
18 Nov 2025
10Cache: Heterogeneous Resource-Aware Tensor Caching and Migration for LLM Training
Sabiha Afroz
Redwan Ibne Seraj Khan
Hadeel Albahar
Jingoo Han
A. R. Butt
140
0
0
18 Nov 2025
Generalizable and Efficient Automated Scoring with a Knowledge-Distilled Multi-Task Mixture-of-Experts
Luyang Fang
Tao Wang
Ping Ma
Xiaoming Zhai
MoE
168
0
0
18 Nov 2025
RAG-Driven Data Quality Governance for Enterprise ERP Systems
Sedat Bin Vedat
Enes Kutay Yarkan
Meftun Akarsu
Recep Kaan Karaman
Arda Sar
Çağrı Çelikbilek
Savaş Saygılı
108
0
0
18 Nov 2025
Reconstruction-Driven Multimodal Representation Learning for Automated Media Understanding
Yassir Benhammou
Suman Kalyan
Sujay Kumar
114
0
0
17 Nov 2025
Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification
Rifen Lin
Alex Jinpeng Wang
Jiawei Mo
Min Li
151
0
0
17 Nov 2025
ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive Generation
Kaixin Zhang
Ruiqing Yang
Yuan Zhang
Shan You
Tao Huang
VLM
131
0
0
17 Nov 2025
MedGEN-Bench: Contextually entangled benchmark for open-ended multimodal medical generation
Junjie Yang
Yuhao Yan
Gang Wu
Y Samuel Wang
Ruoyu Liang
...
Xiang Wan
Fenglei Fan
Yongquan Zhang
Feiwei Qin
Changmiao Wang
MedIm
LM&MA
VLM
471
0
0
17 Nov 2025
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Jihun Park
Kyoungmin Lee
Jongmin Gim
Hyeonseo Jo
Minseok Oh
Wonhyeok Choi
K. Hwang
Jaeyeul Kim
Minwoo Choi
S. Im
111
0
1
17 Nov 2025
RegionMarker: A Region-Triggered Semantic Watermarking Framework for Embedding-as-a-Service Copyright Protection
Shufan Yang
Zifeng Cheng
Zhiwei Jiang
Yafeng Yin
Cong Wang
Shiping Ge
Yuchen Fu
Qing Gu
WaLM
289
0
0
17 Nov 2025
Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework
Diego Ortego
Marlon Rodríguez
Mario Almagro
Kunal Dahiya
David Jiménez
Juan C. Sanmiguel
VLM
156
0
0
17 Nov 2025
BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models
Chuyuan Li
Giuseppe Carenini
108
0
0
17 Nov 2025
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation
Henry Herzog
Favyen Bastani
Yawen Zhang
Gabriel Tseng
Joseph Redmon
...
Hannah Kerner
Evan Shelhamer
Ali Farhadi
Ranjay Krishna
Patrick Beukema
VGen
184
0
0
17 Nov 2025
Learning Skill-Attributes for Transferable Assessment in Video
Kumar Ashutosh
Kristen Grauman
183
0
0
17 Nov 2025
Whistledown: Combining User-Level Privacy with Conversational Coherence in LLMs
Chelsea McMurray
Hayder Tirmazi
52
0
0
17 Nov 2025
TacEleven: generative tactic discovery for football open play
Siyao Zhao
Hao Ma
Zhiqiang Pu
J. Huang
Yi Pan
Shijie Wang
Zhi Ming
166
1
0
17 Nov 2025
TaoSearchEmb: A Multi-Objective Reinforcement Learning Framework for Dense Retrieval in Taobao Search
Xingxian Liu
Dongshuai Li
Tao Wen
Jiahui Wan
Gui Ling
Fuyu Lv
Dan Ou
Haihong Tang
RALM
245
0
0
17 Nov 2025
How Good is BLI as an Alignment Measure: A Study in Word Embedding Paradigm
Kasun Wickramasinghe
Nisansa de Silva
113
0
0
17 Nov 2025
MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
Siyuan Li
Kai Yu
Anna Wang
Zicheng Liu
Chang Yu
Jingbo Zhou
Qirong Yang
Yucheng Guo
Xiaoming Zhang
Stan Z. Li
96
0
0
17 Nov 2025
CoS: Towards Optimal Event Scheduling via Chain-of-Scheduling
Yiming Zhao
Jiwei Tang
Shimin Di
Libin Zheng
Jianxing Yu
Jian Yin
76
0
0
17 Nov 2025
3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale
Yijia Fan
Jusheng Zhang
Kaitong Cai
Jing Yang
Jian Wang
Keze Wang
96
12
0
17 Nov 2025
Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels
Sourya Dipta Das
Shubham Kumar
Kuldeep Yadav
109
0
0
17 Nov 2025
Medical Knowledge Intervention Prompt Tuning for Medical Image Classification
IEEE Transactions on Medical Imaging (IEEE TMI), 2025
Ye Du
Nanxi Yu
Shujun Wang
LM&MA
VLM
196
1
0
16 Nov 2025
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
Chenglong Wang
Yifu Huo
Yang Gan
Yongyu Mu
Qiaozhi He
...
Tongran Liu
Anxiang Ma
Zhengtao Yu
Jingbo Zhu
Tong Xiao
104
0
0
16 Nov 2025
Enhancing Conversational Recommender Systems with Tree-Structured Knowledge and Pretrained Language Models
Yongwen Ren
Chao Wang
Peng Du
Chuan Qin
Dazhong Shen
H. Xiong
KELM
140
0
0
16 Nov 2025
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
Sushant Gautam
Michael A. Riegler
Pål Halvorsen
VLM
194
1
0
16 Nov 2025
Previous
1
2
3
4
5
6
...
665
666
667
Next