ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 14,161 papers shown
Title
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Guang Yan
Yuhui Zhang
Zimu Guo
Lutan Zhao
Xiaojun Chen
Chen Wang
Wenhao Wang
Dan Meng
Rui Hou
31
0
0
12 May 2025
Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent
E. Wilcox
Cui Ding
Giovanni Acampa
Tiago Pimentel
Alex Warstadt
Tamar I. Regev
31
0
0
12 May 2025
Must Read: A Systematic Survey of Computational Persuasion
Nimet Beyza Bozdag
Shuhaib Mehri
Xiaocheng Yang
Hyeonjeong Ha
Zirui Cheng
Esin Durmus
Jiaxuan You
Heng Ji
Gökhan Tür
Dilek Hakkani-Tür
39
0
0
12 May 2025
Tagging fully hadronic exotic decays of the vectorlike $\mathbf{B}$ quark using a graph neural network
Jai Bardhan
Tanumoy Mandal
Subhadip Mitra
Cyrin Neeraj
Mihir Rawat
28
0
0
12 May 2025
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
Hajar Sakai
Sarah Lam
VLM
38
0
0
12 May 2025
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş
Çağatay Yıldız
29
0
0
12 May 2025
Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented Generation
Linus Stuhlmann
Michael Alexander Saxer
Jonathan Fürst
RALM
31
0
0
12 May 2025
AI-Enabled Accurate Non-Invasive Assessment of Pulmonary Hypertension Progression via Multi-Modal Echocardiography
Jiewen Yang
Taoran Huang
Shangwei Ding
Xiaowei Xu
Qinhua Zhao
...
Bin Pu
Jiexuan Zheng
Caojin Zhang
Hongwen Fei
X. Li
16
0
0
12 May 2025
Fine-Grained Bias Exploration and Mitigation for Group-Robust Classification
Miaoyun Zhao
Qiang Zhang
C. Li
26
0
0
11 May 2025
IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method
Mihyeon Kim
Juhyoung Park
Youngbin Kim
29
0
0
11 May 2025
Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures
Francesco Cagnetta
Alessandro Favero
Antonio Sclocchi
M. Wyart
26
0
0
11 May 2025
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
Shunyao Wang
Ming Cheng
Christina Dan Wang
AIFin
20
0
0
11 May 2025
Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models
Hongwei Shang
Nguyen Vo
Nitin Yadav
Tian Zhang
Ajit Puthenputhussery
Xunfan Cai
Shuyi Chen
Prijith Chandran
Changsung Kang
RALM
43
0
0
11 May 2025
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting
Lhuqita Fazry
VLM
25
0
0
11 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
43
0
0
10 May 2025
Enhancing BERTopic with Intermediate Layer Representations
Dominik Koterwa
Maciej Świtała
24
0
0
10 May 2025
Dynamic Domain Information Modulation Algorithm for Multi-domain Sentiment Analysis
Chunyi Yue
Ang Li
21
0
0
10 May 2025
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs
Tianhao Cai
Liang Wang
Limin Xiao
Meng Han
Zeyu Wang
L. Sun
Xiaojian Liao
31
0
0
10 May 2025
I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference
Zibo Gao
J. Hu
Feng Guo
Yixin Zhang
Yinglong Han
Siyuan Liu
Haiyang Li
Zhiqiang Lv
26
0
0
10 May 2025
Using External knowledge to Enhanced PLM for Semantic Matching
Min Li
Chun Yuan
24
0
0
10 May 2025
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
Yu Wang
Runxi Yu
Zhongyuan Wang
Jing He
18
0
0
10 May 2025
The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification
Arezoo Hatefi
Xuan-Son Vu
Monowar Bhuyan
Frank Drewes
VLM
30
0
0
10 May 2025
A Short Overview of Multi-Modal Wi-Fi Sensing
Zijian Zhao
31
0
0
10 May 2025
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities
Haoyang Xie
Feng Ju
21
0
0
10 May 2025
GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Chengfeng Wang
Wei Zhai
Yuhang Yang
Yang Cao
Zhengjun Zha
3DH
29
0
0
10 May 2025
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
Doyoung Kim
Youngjun Lee
Joeun Kim
Jihwan Bang
Hwanjun Song
Susik Yoon
Jae-Gil Lee
29
0
0
10 May 2025
Semantic-Space-Intervened Diffusive Alignment for Visual Classification
Zixuan Li
Lei Meng
Guoqing Chao
Wei Wu
Xiaoshuo Yan
Yimeng Yang
Zhuang Qi
X. Meng
DiffM
34
0
0
09 May 2025
Graph Laplacian Wavelet Transformer via Learnable Spectral Decomposition
Andrew Kiruluta
Eric Lundy
Priscilla Burity
24
0
0
09 May 2025
An empathic GPT-based chatbot to talk about mental disorders with Spanish teenagers
Alba María Mármol-Romero
Manuel García-Vega
Miguel Ángel García-Cumbreras
Arturo Montejo-Ráez
35
2
0
09 May 2025
Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
Dawid Wiśniewski
Antoni Solarski
Artur Nowakowski
LRM
29
0
0
09 May 2025
Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
Christos Plachouras
Julien Guinot
George Fazekas
Elio Quinton
Emmanouil Benetos
Johan Pauwels
110
1
0
09 May 2025
UniSymNet: A Unified Symbolic Network Guided by Transformer
Xinxin Li
Juan Zhang
Da Li
Xingyu Liu
Jin Xu
Junping Yin
29
0
0
09 May 2025
Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax
Iuliia Zaitova
Vitalii Hirak
Badr M. Abdullah
Dietrich Klakow
Bernd Möbius
T. Avgustinova
29
0
0
09 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
29
0
0
09 May 2025
Evaluating Financial Sentiment Analysis with Annotators Instruction Assisted Prompting: Enhancing Contextual Interpretation and Stock Prediction Accuracy
A M Muntasir Rahman
Ajim Uddin
Guiling Wang
16
0
0
09 May 2025
ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding
Shuai Wang
Ivona Najdenkoska
Hongyi Zhu
S. Rudinac
Monika Kackovic
N. Wijnberg
M. Worring
156
0
0
09 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
M. Asim
A. Rehman
Andreas Dengel
31
0
0
08 May 2025
Divide (Text) and Conquer (Sentiment): Improved Sentiment Classification by Constituent Conflict Resolution
Jan Kościałkowski
Paweł Marcinkowski
21
0
0
08 May 2025
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
Kun Peng
Chaodong Tong
Cong Cao
Hao Peng
Q. Li
Guanlin Wu
Lei Jiang
Yanbing Liu
Philip S. Yu
LMTD
48
0
0
08 May 2025
A Multi-Agent AI Framework for Immersive Audiobook Production through Spatial Audio and Neural Narration
Shaja Arul Selvamani
Nia D'Souza Ganapathy
AI4CE
43
0
0
08 May 2025
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
Hao Zhen
Jidong J. Yang
33
0
0
08 May 2025
A Benchmark Dataset and a Framework for Urdu Multimodal Named Entity Recognition
Hussain Ahmad
Qingyang Zeng
Jing Wan
49
0
0
08 May 2025
Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models
Aishwarya Venkataramanan
P. Bodesheim
Joachim Denzler
BDL
VLM
64
0
0
08 May 2025
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models
F. Khan
Jun Chen
Youssef Mohamed
Chun-Mei Feng
Mohamed Elhoseiny
VLM
33
0
0
08 May 2025
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
Qianbo Zang
Christophe Zgrzendek
Igor Tchappi
Afshin Khadangi
Johannes Sedlmeir
VLM
35
2
0
08 May 2025
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
73
0
0
08 May 2025
Large Language Model-driven Security Assistant for Internet of Things via Chain-of-Thought
Mingfei Zeng
Ming Xie
Xixi Zheng
Chunhai Li
Chuan Zhang
Liehuang Zhu
29
0
0
08 May 2025
Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization
Yuntai Bao
Xuhong Zhang
Tianyu Du
Xinkui Zhao
Jiang Zong
Hao Peng
Jianwei Yin
TDI
48
0
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
46
0
0
08 May 2025