ResearchTrend.AI

arXiv:1810.04805
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 14,229 papers shown
Streaming, Fast and Slow: Cognitive Load-Aware Streaming for Efficient LLM Serving
Chang Xiao
Brenda Z. Yang
29
0
0
25 Apr 2025
NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation
Rob Romijnders
Stefanos Laskaridis
Ali Shahin Shamsabadi
Hamed Haddadi
57
0
0
25 Apr 2025
CORG: Generating Answers from Complex, Interrelated Contexts
Hyunji Lee
Franck Dernoncourt
Trung H. Bui
Seunghyun Yoon
21
0
0
25 Apr 2025
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Zhuang Yu
Shiliang Sun
Jing Zhao
Tengfei Song
Hao-Yu Yang
48
0
0
25 Apr 2025
Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation
Yinglong Yu
Hao Shen
Zhengyi Lyu
Qi He
130
0
0
25 Apr 2025
Temporal Entailment Pretraining for Clinical Language Models over EHR Data
Tatsunori Tanaka
Fi Zheng
Kai Sato
Zhifeng Li
Yuanyun Zhang
Shi Li
24
0
0
25 Apr 2025
Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation
Qidong Liu
Xiangyu Zhao
Yejing Wang
Zijian Zhang
Howard Zhong
Chong Chen
X. Li
Wei Huang
Feng Tian
AI4TS
19
0
0
25 Apr 2025
Building UD Cairo for Old English in the Classroom
Lauren Levine
Junghyun Min
Amir Zeldes
45
0
0
25 Apr 2025
Bandit on the Hunt: Dynamic Crawling for Cyber Threat Intelligence
Philipp Kuehn
Dilara Nadermahmoodi
Markus Bayer
Christian A. Reuter
21
0
0
25 Apr 2025
A BERT-Style Self-Supervised Learning CNN for Disease Identification from Retinal Images
Xin Li
Wenhui Zhu
Peijie Qiu
Oana Dumitrascu
Amal Youssef
Y. Wang
SSL
MedIm
92
0
0
25 Apr 2025
Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions
James D. Finch
Yasasvi Josyula
Jinho D. Choi
38
0
0
25 Apr 2025
Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation
Guojia An
Jie Zou
Jiwei Wei
Chaoning Zhang
Fuming Sun
Yang Yang
109
1
0
24 Apr 2025
Lessons from Deploying Learning-based CSI Localization on a Large-Scale ISAC Platform
Tianyu Zhang
Dongheng Zhang
Ruixu Geng
Xuecheng Xie
Shuai Yang
Yan Chen
39
0
0
24 Apr 2025
JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning
Zhaolu Kang
Hongtian Cai
Xiangyang Ji
Jinzhe Li
Nanfei Gu
AILaw
ELM
50
0
0
24 Apr 2025
Towards Robust LLMs: an Adversarial Robustness Measurement Framework
Natan Levy
Adiel Ashrov
Guy Katz
AAML
20
0
0
24 Apr 2025
Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
Ocheme Anthony Ekle
Biswarup Das
29
0
0
24 Apr 2025
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models
J. Zhang
J. Wang
H. Li
Lidan Shou
Ke Chen
Gang Chen
Qin Xie
Guiming Xie
Xuejian Gong
33
0
0
24 Apr 2025
L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference
Qingyuan Liu
Liyan Chen
Yanning Yang
H. Wang
Dong Du
Zhigang Mao
Naifeng Jing
Yubin Xia
Haibo Chen
29
0
0
24 Apr 2025
Ustnlp16 at SemEval-2025 Task 9: Improving Model Performance through Imbalance Handling and Focal Loss
Zhuoang Cai
Z. Li
Y. Liu
Liyuan Guo
Yangqiu Song
24
0
0
24 Apr 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
34
0
0
24 Apr 2025
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Haokai Zhang
Shengtao Zhang
Zijian Cai
Heng Wang
Ruixuan Zhu
Zinan Zeng
Minnan Luo
49
0
0
24 Apr 2025
Aleph-Alpha-GermanWeb: Improving German-language LLM pre-training with model-based data curation and synthetic data generation
Thomas F Burns
Letitia Parcalabescu
Stephan Wäldchen
Michael Barlow
Gregor Ziegltrum
Volker Stampa
Bastian Harren
Björn Deiseroth
SyDa
36
0
0
24 Apr 2025
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Yulia Otmakhova
Hung Thinh Truong
Rahmad Mahendra
Zenan Zhai
Rongxin Zhu
Daniel Beck
Jey Han Lau
ELM
65
0
0
24 Apr 2025
Contrastive Learning for Continuous Touch-Based Authentication
Mengyu Qiao
Yunpeng Zhai
Yang Wang
AAML
37
0
0
24 Apr 2025
An Empirical Study on Prompt Compression for Large Language Models
Z. Zhang
Jinyi Li
Yihuai Lan
X. Wang
Hao Wang
MQ
42
0
0
24 Apr 2025
Tokenization Matters: Improving Zero-Shot NER for Indic Languages
Priyaranjan Pattnayak
Hitesh Laxmichand Patel
Amit Agarwal
30
0
0
23 Apr 2025
MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores
Fengwei Zhou
Jiafei Song
Wenjin Jason Li
Gengjian Xue
Zhikang Zhao
Yichao Lu
Bailin Na
17
0
0
23 Apr 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Y. Zhang
Chuan Qin
Bo Li
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
49
0
0
23 Apr 2025
A Novel Hybrid Approach Using an Attention-Based Transformer + GRU Model for Predicting Cryptocurrency Prices
Esam Mahdi
C. Martin-Barreiro
X. Cabezas
AI4TS
29
0
0
23 Apr 2025
Transformer-Based Extraction of Statutory Definitions from the U.S. Code
Arpana Hosabettu
Harsh Shah
AILaw
ELM
34
0
0
23 Apr 2025
Do Large Language Models know who did what to whom?
Joseph M. Denning
Xiaohan
Bryor Snefjella
Idan A. Blank
52
1
0
23 Apr 2025
Detecting and Understanding Hateful Contents in Memes Through Captioning and Visual Question-Answering
Ali Anaissi
Junaid Akram
Kunal Chaturvedi
Ali Braytee
22
0
0
23 Apr 2025
From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code Repositories
Ye Tian
Yanqiu Yu
Jianguo Sun
Yanbin Wang
AAML
36
0
0
23 Apr 2025
T-VEC: A Telecom-Specific Vectorization Model with Enhanced Semantic Understanding via Deep Triplet Loss Fine-Tuning
Vignesh Ethiraj
Sidhanth Menon
Divya Vijay
30
0
0
23 Apr 2025
A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms
Chengkai Huang
Hongtao Huang
Tong Yu
Kaige Xie
Junda Wu
Shuai Zhang
Julian McAuley
Dietmar Jannach
Lina Yao
LRM
AI4CE
24
0
0
23 Apr 2025
FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing
Hariseetharam Gunduboina
Muhammad Haris Khan
Biplab Banerjee
VLM
47
0
0
23 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
31
0
0
23 Apr 2025
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Lizhe Chen
Binjia Zhou
Yuyao Ge
Jiayi Chen
Shiguang NI
123
0
0
23 Apr 2025
Out-of-the-Box Conditional Text Embeddings from Large Language Models
Kosuke Yamada
Peinan Zhang
22
0
0
23 Apr 2025
How Effective are Generative Large Language Models in Performing Requirements Classification?
Waad Alhoshan
Alessio Ferrari
Liping Zhao
20
0
0
23 Apr 2025
V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Zhiyuan Fan
Yumeng Wang
Sandeep Polisetty
Yi Ren Fung
50
0
0
23 Apr 2025
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification
Alexander Shvets
21
0
0
23 Apr 2025
In-Context Learning can distort the relationship between sequence likelihoods and biological fitness
Pranav Kantroo
Günter P. Wagner
Benjamin B. Machta
45
0
0
23 Apr 2025
Cost-Effective Text Clustering with Large Language Models
Hongtao Wang
Taiyan Zhang
Renchi Yang
Jianliang Xu
24
0
0
22 Apr 2025
Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Haohe Liu
Thomas Deacon
Wenwu Wang
Matt Paradis
Mark D. Plumbley
26
0
0
22 Apr 2025
Exploring Cognitive and Aesthetic Causality for Multimodal Aspect-Based Sentiment Analysis
Luwei Xiao
Rui Mao
Shuai Zhao
Qika Lin
Yanhao Jia
Liang He
Erik Cambria
22
0
0
22 Apr 2025
CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction
Harsh Maheshwari
Srikanth Tenneti
Alwarappan Nakkiran
3DV
29
0
0
22 Apr 2025
Performance Evaluation of Emotion Classification in Japanese Using RoBERTa and DeBERTa
Yoichi Takenaka
27
0
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
53
0
0
22 Apr 2025
Methods for Recognizing Nested Terms
I. Rozhkov
Natalia V. Loukachevitch
36
0
0
22 Apr 2025