Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 2,766 papers shown
Title
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification
Alexander Shvets
18
0
0
23 Apr 2025
LLM-based Semantic Augmentation for Harmful Content Detection
Elyas Meguellati
Assaad Zeghina
S. Sadiq
Gianluca Demartini
32
0
0
22 Apr 2025
llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length
Issa Sugiura
Kouta Nakayama
Yusuke Oda
29
0
0
22 Apr 2025
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
31
0
0
21 Apr 2025
ViQA-COVID: COVID-19 Machine Reading Comprehension Dataset for Vietnamese
H. Phung
Ngoc C. Lê
Van-Chien Nguyen
Hang Thi Nguyen
Thuy Phuong Thi Nguyen
68
1
0
21 Apr 2025
WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada
Braeden Sherritt
Isar Nejadgholi
Marzieh Amini
VLM
44
0
0
17 Apr 2025
Transferrable Surrogates in Expressive Neural Architecture Search Spaces
Shiwen Qin
Gabriela Kadlecová
Martin Pilát
Shay B. Cohen
Roman Neruda
Elliot J. Crowley
Jovita Lukasik
Linus Ericsson
AI4CE
53
0
0
17 Apr 2025
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
S. Bhagat
Ibne Farabi Shihab
Anuj Sharma
27
0
0
17 Apr 2025
C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset
Fuqiang Niu
Y. Yang
Xianghua Fu
Genan Dai
Bowen Zhang
17
0
0
14 Apr 2025
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Ryan Cotterell
38
106
0
10 Apr 2025
LLM-based Automated Grading with Human-in-the-Loop
Hang Li
Yucheng Chu
Kaiqi Yang
Yasemin Copur-Gencturk
Jiliang Tang
AI4Ed
ELM
59
0
0
07 Apr 2025
REFORMER: A ChatGPT-Driven Data Synthesis Framework Elevating Text-to-SQL Models
Shenyang Liu
Saleh Almohaimeed
Liqiang Wang
30
0
0
06 Apr 2025
Concept-based Rubrics Improve LLM Formative Assessment and Data Synthesis
Yuchen Wei
Dennis Pearl
Matthew Beckman
Rebecca J. Passonneau
28
0
0
04 Apr 2025
Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles
Chen Wei Kuo
Kevin Chu
Nouar Aldahoul
Hazem Ibrahim
Talal Rahwan
Yasir Zaki
SyDa
54
0
0
04 Apr 2025
Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations
Mahjabin Nahar
Eun-Ju Lee
Jin Won Park
Dongwon Lee
HILM
71
0
0
01 Apr 2025
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
J. Lin
Tian Wang
Kun Qian
LRM
35
2
0
31 Mar 2025
CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation
Tongke Ni
Yang Fan
Junru Zhou
Xiangping Wu
Qingcai Chen
43
0
0
31 Mar 2025
Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation
Y. Li
Bo Liu
Sheng Huang
Z. Zhang
Xiaotong Yuan
Richang Hong
41
0
0
31 Mar 2025
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning
Hang Guo
Yawei Li
Taolin Zhang
J. Wang
Tao Dai
Shu-Tao Xia
Luca Benini
67
1
0
30 Mar 2025
Measuring Online Hate on 4chan using Pre-trained Deep Learning Models
Adrian Bermudez-Villalva
M. Mehrnezhad
Ehsan Toreini
40
0
0
30 Mar 2025
Towards Symmetric Low-Rank Adapters
Tales Panoutsos
Rodrygo L. T. Santos
Flavio Figueiredo
26
0
0
29 Mar 2025
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Jiakai Tang
Sunhao Dai
Teng Shi
Jun Xu
X. Chen
Wen Chen
Wu Jian
Yuning Jiang
LRM
63
5
0
28 Mar 2025
From Deep Learning to LLMs: A survey of AI in Quantitative Investment
Bokai Cao
Saizhuo Wang
Xinyi Lin
Xiaojun Wu
Haohan Zhang
L. Ni
Jian Guo
AIFin
52
0
0
27 Mar 2025
Retrieving Time-Series Differences Using Natural Language Queries
Kota Dohi
Tomoya Nishida
Harsh Purohit
Takashi Endo
Y. Kawaguchi
AI4TS
38
0
0
27 Mar 2025
EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues
Yuhan Liu
Yunbo Long
LLMAG
57
0
0
27 Mar 2025
Explainable ICD Coding via Entity Linking
Leonor Barreiros
I. Coutinho
Gonçalo M. Correia
Bruno Martins
55
0
0
26 Mar 2025
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Rongyu Zhang
Menghang Dong
Yuan Zhang
Liang Heng
Xiaowei Chi
Gaole Dai
Li Du
Dan Wang
Yuan Du
MoE
81
0
0
26 Mar 2025
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
Gaifan Zhang
Yi Zhou
Danushka Bollegala
91
0
0
21 Mar 2025
SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation
Thomas Pickard
Aline Villavicencio
Maggie Mi
Wei He
Dylan Phelps
Carolina Scarton
78
1
0
19 Mar 2025
Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation
Pritam Kadasi
Sriman Reddy
Srivathsa Vamsi Chaturvedula
Rudranshu Sen
Agnish Saha
Soumavo Sikdar
Sayani Sarkar
Suhani Mittal
Rohit Jindal
Mayank Singh
48
0
0
19 Mar 2025
ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models
Alexey Karev
Dong Xu
48
0
0
18 Mar 2025
Progressive Human Motion Generation Based on Text and Few Motion Frames
Ling-an Zeng
Gaojie Wu
Ancong Wu
Jian-Fang Hu
Wei-Shi Zheng
53
1
0
17 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
X. Zhang
Yue Shang
Ge Zhang
AI4CE
58
0
0
17 Mar 2025
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
Vrushank Ahire
Kunal Shah
Mudasir Nazir Khan
Nikhil Pakhale
L. Sookha
M. A. Ganaie
Abhinav Dhall
65
0
0
16 Mar 2025
Learning to Inference Adaptively for Multimodal Large Language Models
Zhuoyan Xu
Khoi Duc Nguyen
Preeti Mukherjee
Saurabh Bagchi
Somali Chaterji
Yingyu Liang
Yin Li
LRM
42
1
0
13 Mar 2025
OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses
Angela Lopez-Cardona
Sebastian Idesis
Miguel Barreda-Ángeles
Sergi Abadal
Ioannis Arapakis
46
0
0
13 Mar 2025
MERGE -- A Bimodal Dataset for Static Music Emotion Recognition
Pedro Lima Louro
Hugo Redinho
Ricardo Santos
Ricardo Malheiro
R. Panda
Rui Pedro Paiva
MoMe
67
3
0
13 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches
Bousselham EL HADDAOUI
R. Chiheb
R. Faizi
A. E. Afia
39
0
0
13 Mar 2025
Who Are You Behind the Screen? Implicit MBTI and Gender Detection Using Artificial Intelligence
Kourosh Shahnazari
Seyed Moein Ayyoubzadeh
41
0
0
12 Mar 2025
Introducing Verification Task of Set Consistency with Set-Consistency Energy Networks
Mooho Song
Hyeryung Son
Jay-Yoon Lee
45
0
0
12 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Q. Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Enhong Chen
3DV
68
3
0
11 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
101
1
0
10 Mar 2025
Gender Encoding Patterns in Pretrained Language Model Representations
Mahdi Zakizadeh
Mohammad Taher Pilehvar
43
0
0
09 Mar 2025
Heterogeneous bimodal attention fusion for speech emotion recognition
Jiachen Luo
Huy Phan
Lin Wang
Joshua Reiss
44
0
0
09 Mar 2025
CeTAD: Towards Certified Toxicity-Aware Distance in Vision Language Models
Xiangyu Yin
Jiaxu Liu
Zhen Chen
Jinwei Hu
Yi Dong
Xiaowei Huang
Wenjie Ruan
AAML
45
0
0
08 Mar 2025
Evaluating Discourse Cohesion in Pre-trained Language Models
Jie He
Wanqiu Long
Deyi Xiong
ELM
55
2
0
08 Mar 2025
Bimodal Connection Attention Fusion for Speech Emotion Recognition
Jiachen Luo
Huy Phan
Lin Wang
Joshua D. Reiss
46
0
0
08 Mar 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
82
1
0
07 Mar 2025
Tgea: An error-annotated dataset and benchmark tasks for text generation from pretrained language models
Jie He
Bo Peng
Yi-Lun Liao
Qun Liu
Deyi Xiong
58
8
0
06 Mar 2025
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking
Nam V. Nguyen
Dien X. Tran
Thanh T. Tran
Anh T. Hoang
Tai V. Duong
Di T. Le
Phuc-Lu Le
31
0
0
02 Mar 2025
Previous
1
2
3
4
5
...
54
55
56
Next