1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 14,229 papers shown
Title
Streaming, Fast and Slow: Cognitive Load-Aware Streaming for Efficient LLM Serving
Chang Xiao
Brenda Z. Yang
29
0
0
25 Apr 2025
NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation
Rob Romijnders
Stefanos Laskaridis
Ali Shahin Shamsabadi
Hamed Haddadi
57
0
0
25 Apr 2025
CORG: Generating Answers from Complex, Interrelated Contexts
Hyunji Lee
Franck Dernoncourt
Trung H. Bui
Seunghyun Yoon
21
0
0
25 Apr 2025
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Zhuang Yu
Shiliang Sun
Jing Zhao
Tengfei Song
Hao-Yu Yang
48
0
0
25 Apr 2025
Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation
Yinglong Yu
Hao Shen
Zhengyi Lyu
Qi He
130
0
0
25 Apr 2025
Temporal Entailment Pretraining for Clinical Language Models over EHR Data
Tatsunori Tanaka
Fi Zheng
Kai Sato
Zhifeng Li
Yuanyun Zhang
Shi Li
24
0
0
25 Apr 2025
Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation
Qidong Liu
Xiangyu Zhao
Yejing Wang
Zijian Zhang
Howard Zhong
Chong Chen
X. Li
Wei Huang
Feng Tian
AI4TS
19
0
0
25 Apr 2025
Building UD Cairo for Old English in the Classroom
Lauren Levine
Junghyun Min
Amir Zeldes
45
0
0
25 Apr 2025
Bandit on the Hunt: Dynamic Crawling for Cyber Threat Intelligence
Philipp Kuehn
Dilara Nadermahmoodi
Markus Bayer
Christian A. Reuter
21
0
0
25 Apr 2025
A BERT-Style Self-Supervised Learning CNN for Disease Identification from Retinal Images
Xin Li
Wenhui Zhu
Peijie Qiu
Oana Dumitrascu
Amal Youssef
Y. Wang
SSL
MedIm
92
0
0
25 Apr 2025
Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions
James D. Finch
Yasasvi Josyula
Jinho D. Choi
38
0
0
25 Apr 2025
Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation
Guojia An
Jie Zou
Jiwei Wei
Chaoning Zhang
Fuming Sun
Yang Yang
109
1
0
24 Apr 2025
Lessons from Deploying Learning-based CSI Localization on a Large-Scale ISAC Platform
Tianyu Zhang
Dongheng Zhang
Ruixu Geng
Xuecheng Xie
Shuai Yang
Yan Chen
39
0
0
24 Apr 2025
JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning
Zhaolu Kang
Hongtian Cai
Xiangyang Ji
Jinzhe Li
Nanfei Gu
AILaw
ELM
50
0
0
24 Apr 2025
Towards Robust LLMs: an Adversarial Robustness Measurement Framework
Natan Levy
Adiel Ashrov
Guy Katz
AAML
20
0
0
24 Apr 2025
Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
Ocheme Anthony Ekle
Biswarup Das
29
0
0
24 Apr 2025
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models
J. Zhang
J. Wang
H. Li
Lidan Shou
Ke Chen
Gang Chen
Qin Xie
Guiming Xie
Xuejian Gong
33
0
0
24 Apr 2025
L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference
Qingyuan Liu
Liyan Chen
Yanning Yang
H. Wang
Dong Du
Zhigang Mao
Naifeng Jing
Yubin Xia
Haibo Chen
29
0
0
24 Apr 2025
Ustnlp16 at SemEval-2025 Task 9: Improving Model Performance through Imbalance Handling and Focal Loss
Zhuoang Cai
Z. Li
Y. Liu
Liyuan Guo
Yangqiu Song
24
0
0
24 Apr 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
34
0
0
24 Apr 2025
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Haokai Zhang
Shengtao Zhang
Zijian Cai
Heng Wang
Ruixuan Zhu
Zinan Zeng
Minnan Luo
49
0
0
24 Apr 2025
Aleph-Alpha-GermanWeb: Improving German-language LLM pre-training with model-based data curation and synthetic data generation
Thomas F Burns
Letitia Parcalabescu
Stephan Wäldchen
Michael Barlow
Gregor Ziegltrum
Volker Stampa
Bastian Harren
Björn Deiseroth
SyDa
36
0
0
24 Apr 2025
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Yulia Otmakhova
Hung Thinh Truong
Rahmad Mahendra
Zenan Zhai
Rongxin Zhu
Daniel Beck
Jey Han Lau
ELM
65
0
0
24 Apr 2025
Contrastive Learning for Continuous Touch-Based Authentication
Mengyu Qiao
Yunpeng Zhai
Yang Wang
AAML
37
0
0
24 Apr 2025
An Empirical Study on Prompt Compression for Large Language Models
Z. Zhang
Jinyi Li
Yihuai Lan
X. Wang
Hao Wang
MQ
42
0
0
24 Apr 2025
Tokenization Matters: Improving Zero-Shot NER for Indic Languages
Priyaranjan Pattnayak
Hitesh Laxmichand Patel
Amit Agarwal
30
0
0
23 Apr 2025
MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores
Fengwei Zhou
Jiafei Song
Wenjin Jason Li
Gengjian Xue
Zhikang Zhao
Yichao Lu
Bailin Na
17
0
0
23 Apr 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Y. Zhang
Chuan Qin
Bo Li
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
49
0
0
23 Apr 2025
A Novel Hybrid Approach Using an Attention-Based Transformer + GRU Model for Predicting Cryptocurrency Prices
Esam Mahdi
C. Martin-Barreiro
X. Cabezas
AI4TS
29
0
0
23 Apr 2025
Transformer-Based Extraction of Statutory Definitions from the U.S. Code
Arpana Hosabettu
Harsh Shah
AILaw
ELM
34
0
0
23 Apr 2025
Do Large Language Models know who did what to whom?
Joseph M. Denning
Xiaohan
Bryor Snefjella
Idan A. Blank
52
1
0
23 Apr 2025
Detecting and Understanding Hateful Contents in Memes Through Captioning and Visual Question-Answering
Ali Anaissi
Junaid Akram
Kunal Chaturvedi
Ali Braytee
22
0
0
23 Apr 2025
From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code Repositories
Ye Tian
Yanqiu Yu
Jianguo Sun
Yanbin Wang
AAML
36
0
0
23 Apr 2025
T-VEC: A Telecom-Specific Vectorization Model with Enhanced Semantic Understanding via Deep Triplet Loss Fine-Tuning
Vignesh Ethiraj
Sidhanth Menon
Divya Vijay
30
0
0
23 Apr 2025
A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms
Chengkai Huang
Hongtao Huang
Tong Yu
Kaige Xie
Junda Wu
Shuai Zhang
Julian McAuley
Dietmar Jannach
Lina Yao
LRM
AI4CE
24
0
0
23 Apr 2025
FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing
Hariseetharam Gunduboina
Muhammad Haris Khan
Biplab Banerjee
VLM
47
0
0
23 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
31
0
0
23 Apr 2025
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Lizhe Chen
Binjia Zhou
Yuyao Ge
Jiayi Chen
Shiguang NI
123
0
0
23 Apr 2025
Out-of-the-Box Conditional Text Embeddings from Large Language Models
Kosuke Yamada
Peinan Zhang
22
0
0
23 Apr 2025
How Effective are Generative Large Language Models in Performing Requirements Classification?
Waad Alhoshan
Alessio Ferrari
Liping Zhao
20
0
0
23 Apr 2025
V²R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Zhiyuan Fan
Yumeng Wang
Sandeep Polisetty
Yi Ren Fung
50
0
0
23 Apr 2025
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification
Alexander Shvets
21
0
0
23 Apr 2025
In-Context Learning can distort the relationship between sequence likelihoods and biological fitness
Pranav Kantroo
Günter P. Wagner
Benjamin B. Machta
45
0
0
23 Apr 2025
Cost-Effective Text Clustering with Large Language Models
Hongtao Wang
Taiyan Zhang
Renchi Yang
Jianliang Xu
24
0
0
22 Apr 2025
Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Haohe Liu
Thomas Deacon
Wenwu Wang
Matt Paradis
Mark D. Plumbley
26
0
0
22 Apr 2025
Exploring Cognitive and Aesthetic Causality for Multimodal Aspect-Based Sentiment Analysis
Luwei Xiao
Rui Mao
Shuai Zhao
Qika Lin
Yanhao Jia
Liang He
Erik Cambria
22
0
0
22 Apr 2025
CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction
Harsh Maheshwari
Srikanth Tenneti
Alwarappan Nakkiran
3DV
29
0
0
22 Apr 2025
Performance Evaluation of Emotion Classification in Japanese Using RoBERTa and DeBERTa
Yoichi Takenaka
27
0
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
53
0
0
22 Apr 2025
Methods for Recognizing Nested Terms
I. Rozhkov
Natalia V. Loukachevitch
36
0
0
22 Apr 2025