1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 2,766 papers shown
Title
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition
Yifei Duan
Raphael Shang
Deng Liang
Yongqiang Cai
80
0
0
28 Feb 2025
FedMentalCare: Towards Privacy-Preserving Fine-Tuned LLMs to Analyze Mental Health Status Using Federated Learning Framework
S M Sarwar
AI4MH
39
0
0
27 Feb 2025
Lotus at SemEval-2025 Task 11: RoBERTa with Llama-3 Generated Explanations for Multi-Label Emotion Classification
Niloofar Ranjbar
Hamed Baghbani
38
1
0
27 Feb 2025
DreamNet: A Multimodal Framework for Semantic and Emotional Analysis of Sleep Narratives
Tapasvi Panchagnula
37
0
0
26 Feb 2025
CAMEx: Curvature-aware Merging of Experts
Dung V. Nguyen
Minh H. Nguyen
Luc Q. Nguyen
R. Teo
T. Nguyen
Linh Duy Tran
MoMe
73
2
0
26 Feb 2025
From Small to Large Language Models: Revisiting the Federalist Papers
So Won Jeong
Veronika Rockova
37
0
0
25 Feb 2025
The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text
Matthieu Meeus
Lukas Wutschitz
Santiago Zanella Béguelin
Shruti Tople
Reza Shokri
75
0
0
24 Feb 2025
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Chenghao Fan
Zhenyi Lu
Sichen Liu
Xiaoye Qu
Wei Wei
Chengfeng Gu
Yu-Xi Cheng
MoE
85
0
0
24 Feb 2025
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
Jonathan Bourne
75
0
0
24 Feb 2025
Extraction multi-étiquettes de relations en utilisant des couches de Transformer (Multi-label relation extraction using Transformer layers)
Ngoc Luyen Le
Gildas Tagny Ngompé
58
0
0
24 Feb 2025
NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models
Yibo Zhong
Haoxiang Jiang
Lincan Li
Ryumei Nakada
Tianci Liu
Linjun Zhang
Huaxiu Yao
Haoyu Wang
75
2
0
24 Feb 2025
Data-Constrained Synthesis of Training Data for De-Identification
Thomas Vakili
Aron Henriksson
Hercules Dalianis
SyDa
44
0
0
24 Feb 2025
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Taiyi Wang
Zhihao Wu
Jianheng Liu
Jianye Hao
J. Wang
Kun Shao
OffRL
34
13
0
24 Feb 2025
Encryption-Friendly LLM Architecture
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
52
1
0
24 Feb 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
63
8
0
24 Feb 2025
Exploring Sentiment Manipulation by LLM-Enabled Intelligent Trading Agents
David Byrd
LLMAG
LM&Ro
AIFin
45
0
0
22 Feb 2025
Question Answering with Texts and Tables through Deep Reinforcement Learning
M. M. José
Flávio Nakasato Cação
Maria F. Ribeiro
Rafael M. Cheang
Paulo Pirozelli
Fabio Gagliardi Cozman
LMTD
RALM
160
0
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
48
2
0
21 Feb 2025
SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks
Yue Gao
Ilia Shumailov
Kassem Fawaz
AAML
126
0
0
21 Feb 2025
Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting
Gyanendra Shrestha
Chutain Jiang
Sai Akula
Vivek Yannam
Anna Pyayt
Michael Gubanov
LMTD
89
0
0
20 Feb 2025
MoVer: Motion Verification for Motion Graphics Animations
Jiaju Ma
Maneesh Agrawala
VGen
51
0
0
20 Feb 2025
Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models
Peter Carragher
Abhinand Jha
R Raghav
Kathleen M. Carley
RALM
73
0
0
20 Feb 2025
LegalCore: A Dataset for Event Coreference Resolution in Legal Documents
Kangda Wei
Xi Shi
Jonathan Tong
Sai Ramana Reddy
Anandhavelu Natarajan
R. Jain
Aparna Garimella
Ruihong Huang
AILaw
52
0
0
18 Feb 2025
Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey
Ruiyao Xu
Kaize Ding
53
5
0
17 Feb 2025
Prompt-based Depth Pruning of Large Language Models
Juyun Wee
Minjae Park
Jaeho Lee
VLM
86
0
0
17 Feb 2025
Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices
Mohamed Aboelenien Ahmed
Kilian Pfeiffer
R. Khalili
Heba Khdr
J. Henkel
FedML
82
0
0
17 Feb 2025
Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models
Jiexin Ding
Bowen Zhao
Yuntao Wang
Xinyun Liu
Rui Hao
Ishan Chatterjee
Yuanchun Shi
44
0
0
17 Feb 2025
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
Xiaoying Song
Sharon Lisseth Perez
Xinchen Yu
Eduardo Blanco
Lingzi Hong
92
0
0
17 Feb 2025
A distributional simplicity bias in the learning dynamics of transformers
Riccardo Rende
Federica Gerace
A. Laio
Sebastian Goldt
68
8
0
17 Feb 2025
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Xin Gu
Yaojie Shen
Chenxi Luo
Tiejian Luo
Yan Huang
Yuewei Lin
Heng Fan
L. Zhang
55
1
0
16 Feb 2025
FinMTEB: Finance Massive Text Embedding Benchmark
Yixuan Tang
Yi Yang
AIFin
63
0
0
16 Feb 2025
LLM-Enhanced Multiple Instance Learning for Joint Rumor and Stance Detection with Social Context Information
Ruichao Yang
Jing Ma
Wei Gao
Hongzhan Lin
63
0
0
13 Feb 2025
Measuring Diversity in Synthetic Datasets
Yuchang Zhu
Huizhe Zhang
Bingzhe Wu
Jintang Li
Zibin Zheng
Peilin Zhao
Liang Chen
Yatao Bian
95
0
0
12 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
40
0
0
11 Feb 2025
Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement
Junyu Lu
Kai Ma
Kaichun Wang
Kelaiti Xiao
Roy Ka-Wei Lee
Bo Xu
Liang Yang
Hongfei Lin
44
0
0
10 Feb 2025
ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification
Y. Meena
Vaibhav Singh
Ayush Maheshwari
Amrith Krishna
Ganesh Ramakrishnan
AI4TS
67
0
0
09 Feb 2025
SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
Yixian Shen
Qi Bi
Jia-Hong Huang
Hongyi Zhu
Andy D. Pimentel
Anuj Pathania
46
0
0
08 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
53
0
0
07 Feb 2025
Lexical Substitution is not Synonym Substitution: On the Importance of Producing Contextually Relevant Word Substitutes
Juraj Vladika
Stephen Meisenbacher
Florian Matthes
126
0
0
06 Feb 2025
LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning
Zhekai Du
Yinjie Min
Jingjing Li
Ke Lu
Changliang Zou
Liuhua Peng
Tingjin Chu
M. Gong
122
1
0
05 Feb 2025
Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study
Calvin Cheng
Scott A. Hale
102
0
0
04 Feb 2025
RandLoRA: Full-rank parameter-efficient fine-tuning of large models
Paul Albert
Frederic Z. Zhang
Hemanth Saratchandran
Cristian Rodriguez-Opazo
Anton van den Hengel
Ehsan Abbasnejad
94
0
0
03 Feb 2025
Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA
Shuangyi Chen
Yuanxin Guo
Yue Ju
Harik Dalal
Ashish Khisti
48
1
0
03 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study
K. K.
Meladel Mistica
Timothy Baldwin
Jey Han Lau
62
0
0
03 Feb 2025
Towards Making Flowchart Images Machine Interpretable
S. Kamath S
Prajwal Gatti
Yogesh Kumar
Vikash Yadav
Anand Mishra
53
5
0
29 Jan 2025
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Kei Katsumata
Motonari Kambara
Daichi Yashima
Ryosuke Korekata
Komei Sugiura
56
0
0
28 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Peiling Yi
A. Zubiaga
Yunfei Long
78
0
0
28 Jan 2025
Irony Detection, Reasoning and Understanding in Zero-shot Learning
Peiling Yi
Yuhan Xia
54
0
0
28 Jan 2025
Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Wenjie Zheng
Qiming Xie
Zengzhi Wang
Jianfei Yu
Rui Xia
60
0
0
28 Jan 2025
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models
Yibin Wang
H. Shi
Ligong Han
Dimitris N. Metaxas
Hao Wang
BDL
UQLM
102
6
0
28 Jan 2025