1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 3,476 papers shown
Title
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
19
0
0
24 Jun 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
Yuchen Yang
Yingdong Shi
Cheems Wang
Xiantong Zhen
Yuxuan Shi
Jun Xu
32
1
0
24 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
53
7
0
24 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
31
1
0
24 Jun 2024
Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks
Victor Hugo Nascimento Rocha
I. Silveira
Paulo Pirozelli
Denis Deratani Mauá
Fabio Gagliardi Cozman
29
0
0
21 Jun 2024
Latent Space Translation via Inverse Relative Projection
Valentino Maiorca
Luca Moschella
Marco Fumero
Francesco Locatello
Emanuele Rodolà
34
1
0
21 Jun 2024
CEASEFIRE: An AI-powered system for combatting illicit firearms trafficking
Ioannis Mademlis
Jorgen Cani
Marina Mancuso
C. Paternoster
E. Adamakis
...
Sophia Karagiorgou
George Pantelis
Georgios Stavropoulos
Konstantinos Votis
Georgios Th. Papadopoulos
25
2
0
21 Jun 2024
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Kathleen C. Fraser
Hillary Dawkins
S. Kiritchenko
DeLMO
73
7
0
21 Jun 2024
Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
Zhengxin Yang
Wanling Gao
Luzhou Peng
Yunyou Huang
Fei Tang
Jianfeng Zhan
31
0
0
20 Jun 2024
Temporal Knowledge Graph Question Answering: A Survey
Miao Su
Zixuan Li
Zhuo Chen
Long Bai
Xiaolong Jin
Jiafeng Guo
46
2
0
20 Jun 2024
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Dan S. Nielsen
Kenneth Enevoldsen
Peter Schneider-Kamp
ELM
38
2
0
19 Jun 2024
The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions
Stefan Sylvius Wagner
Maike Behrendt
Marc Ziegele
Stefan Harmeling
32
9
0
18 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
62
6
0
18 Jun 2024
Save It All: Enabling Full Parameter Tuning for Federated Large Language Models via Cycle Block Gradient Descent
Lin Wang
Zhichao Wang
Xiaoying Tang
37
1
0
17 Jun 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs
D. Yaldiz
Yavuz Faruk Bakman
Baturalp Buyukates
Chenyang Tao
Anil Ramakrishna
Dimitrios Dimitriadis
Jieyu Zhao
Salman Avestimehr
39
2
0
17 Jun 2024
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang
Chenchen Yuan
Yao Rong
Felix Steinbauer
Gjergji Kasneci
36
1
0
17 Jun 2024
Can LLMs Learn Macroeconomic Narratives from Social Media?
Almog Gueta
Amir Feder
Zorik Gekhman
Ariel Goldstein
Roi Reichart
21
4
0
17 Jun 2024
FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure
Ziyue Xu
Peilin Zhou
Xinyu Shi
Jiageng Wu
Yikang Jiang
Bin Ke
Jie-jin Yang
Jie Yang
36
5
0
17 Jun 2024
ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation
Yurun Song
Junchen Zhao
Ian G. Harris
S. Jyothi
32
3
0
16 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
46
2
0
15 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
78
4
0
15 Jun 2024
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning
Wenjun Li
Changyu Chen
Pradeep Varakantham
47
2
0
15 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
38
0
0
14 Jun 2024
SememeLM: A Sememe Knowledge Enhanced Method for Long-tail Relation Representation
Shuyi Li
Shaojuan Wu
Xiaowang Zhang
Zhiyong Feng
39
0
0
13 Jun 2024
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
Xuannan Liu
Zekun Li
Peipei Li
Shuhan Xia
Xing Cui
Linzhi Huang
Huaibo Huang
Weihong Deng
Zhaofeng He
36
13
0
13 Jun 2024
The Impact of Initialization on LoRA Finetuning Dynamics
Soufiane Hayou
Nikhil Ghosh
Bin Yu
AI4CE
36
10
0
12 Jun 2024
Adversarial Evasion Attack Efficiency against Large Language Models
João Vitorino
Eva Maia
Isabel Praça
AAML
38
2
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
69
50
0
12 Jun 2024
LT4SG@SMM4H24: Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models
Dasun Athukoralage
Thushari Atapattu
M. Thilakaratne
Katrina Falkner
LM&MA
18
0
0
11 Jun 2024
Paraphrasing in Affirmative Terms Improves Negation Understanding
MohammadHossein Rezaei
Eduardo Blanco
42
1
0
11 Jun 2024
Multimodal Belief Prediction
John Murzaku
Adil Soubki
Owen Rambow
16
0
0
11 Jun 2024
Non-autoregressive Personalized Bundle Generation
Wenchuan Yang
Cheng Yang
Jichao Li
Yuejin Tan
Xin Lu
Chuan Shi
28
0
0
11 Jun 2024
Développement automatique de lexiques pour les concepts émergents : une exploration méthodologique
Revekka Kyriakoglou
Anna Pappa
Jilin He
A. Schoen
P. Laurens
Markarit Vartampetian
P. Larédo
Tita Kyriacopoulou
39
0
0
10 Jun 2024
Evaluating Zero-Shot Long-Context LLM Compression
Chenyu Wang
Yihan Wang
Kai Li
49
0
0
10 Jun 2024
LLM Questionnaire Completion for Automatic Psychiatric Assessment
Gony Rosenman
Lior Wolf
Talma Hendler
31
3
0
09 Jun 2024
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning
Yibo Yang
Xiaojie Li
Zhongzhu Zhou
S. Song
Jianlong Wu
Liqiang Nie
Bernard Ghanem
45
6
0
07 Jun 2024
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
Guangliang Liu
Milad Afshari
Xitong Zhang
Zhiyu Xue
Avrajit Ghosh
Bidhan Bashyal
Rongrong Wang
K. Johnson
27
0
0
06 Jun 2024
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures
Qi Cheng
Michael Boratko
Pranay Kumar Yelugam
T. O’Gorman
Nalini Singh
Andrew McCallum
X. Li
ELM
LRM
34
3
0
06 Jun 2024
Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
Naibin Gu
Peng Fu
Xiyu Liu
Bowen Shen
Zheng-Shen Lin
Weiping Wang
30
6
0
06 Jun 2024
NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Shuo Huang
William MacLean
Xiaoxi Kang
Anqi Wu
Lizhen Qu
Qiongkai Xu
Zhuang Li
Xingliang Yuan
Gholamreza Haffari
30
0
0
06 Jun 2024
Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning
Yang Wu
Chenghao Wang
Ece Gumusel
Xiaozhong Liu
ELM
AILaw
40
4
0
05 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
39
0
0
05 Jun 2024
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
Hao Li
Yuping Wu
Viktor Schlegel
R. Batista-Navarro
Tharindu Madusanka
...
Jiayan Zeng
Xiaochi Wang
Xinran He
Yizhi Li
Goran Nenadic
31
6
0
05 Jun 2024
Space Decomposition for Sentence Embedding
Wuttikorn Ponwitayarat
Peerat Limkonchotiwat
E. Chuangsuwanich
Sarana Nutanong
24
0
0
05 Jun 2024
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Dmytro Kuzmenko
N. Shvai
LM&Ro
27
0
0
05 Jun 2024
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
35
1
0
04 Jun 2024
Deciphering Oracle Bone Language with Diffusion Models
Haisu Guan
Huanxin Yang
Xinyu Wang
Shengwei Han
Yongge Liu
Lianwen Jin
Xiang Bai
Y. Liu
AAML
AI4CE
85
7
0
02 Jun 2024
Multimodal Metadata Assignment for Cultural Heritage Artifacts
Luis Rei
Dunja Mladenić
M. Dorozynski
Franz Rottensteiner
Thomas Schleider
Raphael Troncy
J. Lozano
Mar Gaitán Salvatella
29
6
0
01 Jun 2024
RoBERTa-BiLSTM: A Context-Aware Hybrid Model for Sentiment Analysis
Md. Mostafizer Rahman
Ariful Islam Shiplu
Yutaka Watanobe
Md. Ashad Alam
21
10
0
01 Jun 2024
CONFINE: Conformal Prediction for Interpretable Neural Networks
Linhui Huang
S. Lala
N. Jha
63
2
0
01 Jun 2024