Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 14,161 papers shown
Title
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Guang Yan
Yuhui Zhang
Zimu Guo
Lutan Zhao
Xiaojun Chen
Chen Wang
Wenhao Wang
Dan Meng
Rui Hou
31
0
0
12 May 2025
Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent
E. Wilcox
Cui Ding
Giovanni Acampa
Tiago Pimentel
Alex Warstadt
Tamar I. Regev
31
0
0
12 May 2025
Must Read: A Systematic Survey of Computational Persuasion
Nimet Beyza Bozdag
Shuhaib Mehri
Xiaocheng Yang
Hyeonjeong Ha
Zirui Cheng
Esin Durmus
Jiaxuan You
Heng Ji
Gökhan Tür
Dilek Hakkani-Tür
39
0
0
12 May 2025
Tagging fully hadronic exotic decays of the vectorlike
B
\mathbf{B}
B
quark using a graph neural network
Jai Bardhan
Tanumoy Mandal
Subhadip Mitra
Cyrin Neeraj
Mihir Rawat
28
0
0
12 May 2025
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
Hajar Sakai
Sarah Lam
VLM
38
0
0
12 May 2025
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş
Çağatay Yıldız
29
0
0
12 May 2025
Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented Generation
Linus Stuhlmann
Michael Alexander Saxer
Jonathan Fürst
RALM
31
0
0
12 May 2025
AI-Enabled Accurate Non-Invasive Assessment of Pulmonary Hypertension Progression via Multi-Modal Echocardiography
Jiewen Yang
Taoran Huang
Shangwei Ding
Xiaowei Xu
Qinhua Zhao
...
Bin Pu
Jiexuan Zheng
Caojin Zhang
Hongwen Fei
X. Li
16
0
0
12 May 2025
Fine-Grained Bias Exploration and Mitigation for Group-Robust Classification
Miaoyun Zhao
Qiang Zhang
C. Li
26
0
0
11 May 2025
IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method
Mihyeon Kim
Juhyoung Park
Youngbin Kim
29
0
0
11 May 2025
Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures
Francesco Cagnetta
Alessandro Favero
Antonio Sclocchi
M. Wyart
26
0
0
11 May 2025
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
Shunyao Wang
Ming Cheng
Christina Dan Wang
AIFin
20
0
0
11 May 2025
Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models
Hongwei Shang
Nguyen Vo
Nitin Yadav
Tian Zhang
Ajit Puthenputhussery
Xunfan Cai
Shuyi Chen
Prijith Chandran
Changsung Kang
RALM
43
0
0
11 May 2025
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting
Lhuqita Fazry
VLM
25
0
0
11 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
43
0
0
10 May 2025
Enhancing BERTopic with Intermediate Layer Representations
Dominik Koterwa
Maciej Świtała
24
0
0
10 May 2025
Dynamic Domain Information Modulation Algorithm for Multi-domain Sentiment Analysis
Chunyi Yue
Ang Li
21
0
0
10 May 2025
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs
Tianhao Cai
Liang Wang
Limin Xiao
Meng Han
Zeyu Wang
L. Sun
Xiaojian Liao
31
0
0
10 May 2025
I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference
Zibo Gao
J. Hu
Feng Guo
Yixin Zhang
Yinglong Han
Siyuan Liu
Haiyang Li
Zhiqiang Lv
26
0
0
10 May 2025
Using External knowledge to Enhanced PLM for Semantic Matching
Min Li
Chun Yuan
24
0
0
10 May 2025
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
Yu Wang
Runxi Yu
Zhongyuan Wang
Jing He
18
0
0
10 May 2025
The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification
Arezoo Hatefi
Xuan-Son Vu
Monowar Bhuyan
Frank Drewes
VLM
30
0
0
10 May 2025
A Short Overview of Multi-Modal Wi-Fi Sensing
Zijian Zhao
31
0
0
10 May 2025
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities
Haoyang Xie
Feng Ju
21
0
0
10 May 2025
GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Chengfeng Wang
Wei Zhai
Yuhang Yang
Yang Cao
Zhengjun Zha
3DH
29
0
0
10 May 2025
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
Doyoung Kim
Youngjun Lee
Joeun Kim
Jihwan Bang
Hwanjun Song
Susik Yoon
Jae-Gil Lee
29
0
0
10 May 2025
Semantic-Space-Intervened Diffusive Alignment for Visual Classification
Zixuan Li
Lei Meng
Guoqing Chao
Wei Wu
Xiaoshuo Yan
Yimeng Yang
Zhuang Qi
X. Meng
DiffM
34
0
0
09 May 2025
Graph Laplacian Wavelet Transformer via Learnable Spectral Decomposition
Andrew Kiruluta
Eric Lundy
Priscilla Burity
24
0
0
09 May 2025
An empathic GPT-based chatbot to talk about mental disorders with Spanish teenagers
Alba María Mármol-Romero
Manuel García-Vega
Miguel Ángel García-Cumbreras
Arturo Montejo-Ráez
35
2
0
09 May 2025
Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
Dawid Wi'sniewski
Antoni Solarski
Artur Nowakowski
LRM
29
0
0
09 May 2025
Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
Christos Plachouras
Julien Guinot
George Fazekas
Elio Quinton
Emmanouil Benetos
Johan Pauwels
110
1
0
09 May 2025
UniSymNet: A Unified Symbolic Network Guided by Transformer
Xinxin Li
Juan Zhang
Da Li
Xingyu Liu
Jin Xu
Junping Yin
29
0
0
09 May 2025
Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax
Iuliia Zaitova
Vitalii Hirak
Badr M. Abdullah
Dietrich Klakow
Bernd Möbius
T. Avgustinova
29
0
0
09 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
29
0
0
09 May 2025
Evaluating Financial Sentiment Analysis with Annotators Instruction Assisted Prompting: Enhancing Contextual Interpretation and Stock Prediction Accuracy
A M Muntasir Rahman
Ajim Uddin
Guiling Wang
16
0
0
09 May 2025
ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding
Shuai Wang
Ivona Najdenkoska
Hongyi Zhu
S. Rudinac
Monika Kackovic
N. Wijnberg
M. Worring
156
0
0
09 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
M. Asim
A. Rehman
Andreas Dengel
31
0
0
08 May 2025
Divide (Text) and Conquer (Sentiment): Improved Sentiment Classification by Constituent Conflict Resolution
Jan Kościałkowski
Paweł Marcinkowski
21
0
0
08 May 2025
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
Kun Peng
Chaodong Tong
Cong Cao
Hao Peng
Q. Li
Guanlin Wu
Lei Jiang
Yanbing Liu
Philip S. Yu
LMTD
48
0
0
08 May 2025
A Multi-Agent AI Framework for Immersive Audiobook Production through Spatial Audio and Neural Narration
Shaja Arul Selvamani
Nia D'Souza Ganapathy
AI4CE
43
0
0
08 May 2025
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
Hao Zhen
Jidong J. Yang
33
0
0
08 May 2025
A Benchmark Dataset and a Framework for Urdu Multimodal Named Entity Recognition
Hussain Ahmad
Qingyang Zeng
Jing Wan
49
0
0
08 May 2025
Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models
Aishwarya Venkataramanan
P. Bodesheim
Joachim Denzler
BDL
VLM
64
0
0
08 May 2025
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models
F. Khan
Jun Chen
Youssef Mohamed
Chun-Mei Feng
Mohamed Elhoseiny
VLM
33
0
0
08 May 2025
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
Qianbo Zang
Christophe Zgrzendek
Igor Tchappi
Afshin Khadangi
Johannes Sedlmeir
VLM
35
2
0
08 May 2025
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
73
0
0
08 May 2025
Large Language Model-driven Security Assistant for Internet of Things via Chain-of-Thought
Mingfei Zeng
Ming Xie
Xixi Zheng
Chunhai Li
Chuan Zhang
Liehuang Zhu
29
0
0
08 May 2025
Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization
Yuntai Bao
Xuhong Zhang
Tianyu Du
Xinkui Zhao
Jiang Zong
Hao Peng
Jianwei Yin
TDI
48
0
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
46
0
0
08 May 2025
Previous
1
2
3
4
5
...
282
283
284
Next