Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 3,071 papers shown
Title
Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models
S. Nercessian
Johannes Imort
Ninon Devis
Frederik Blang
29
1
0
22 Jul 2024
MAVEN-Fact: A Large-scale Event Factuality Detection Dataset
Chunyang Li
Hao Peng
Xiaozhi Wang
Y. Qi
Lei Hou
Bin Xu
Juanzi Li
HILM
33
1
0
22 Jul 2024
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
S. Dashti
A. K. Bardsiri
M. J. Shahbazzadeh
34
3
0
20 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Chuheng Zhang
Lei Song
Li Zhao
Jiang Bian
Kee-Eung Kim
VLM
AAML
29
3
0
20 Jul 2024
Evaluating the Reliability of Self-Explanations in Large Language Models
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
LRM
40
0
0
19 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
29
8
0
19 Jul 2024
ECoh: Turn-level Coherence Evaluation for Multilingual Dialogues
John Mendonça
Isabel Trancoso
A. Lavie
29
3
0
16 Jul 2024
WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models
Xin-Jian Wu
Rui-Song Zhang
Jie Qin
Shijie Ma
Cheng-Lin Liu
VLM
27
1
0
14 Jul 2024
Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation
Ge Gao
Jongin Kim
Sejin Paik
Ekaterina Novozhilova
Yi Liu
Sarah Bonna
Margrit Betke
Derry Wijaya
34
0
0
14 Jul 2024
Low-Rank Interconnected Adaptation Across Layers
Yibo Zhong
Yao Zhou
OffRL
MoE
38
1
0
13 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
35
2
0
12 Jul 2024
FsPONER: Few-shot Prompt Optimization for Named Entity Recognition in Domain-specific Scenarios
Yongjian Tang
Rakebul Hasan
Thomas Runkler
61
2
0
10 Jul 2024
Measuring Sustainability Intention of ESG Fund Disclosure using Few-Shot Learning
Mayank Singh
Nazia Nafis
Abhijeet Kumar
Mridul Mishra
20
0
0
09 Jul 2024
Cybersecurity Defenses: Exploration of CVE Types through Attack Descriptions
Refat Othman
Bruno Rossi
Barbara Russo
24
1
0
09 Jul 2024
Consistent Document-Level Relation Extraction via Counterfactuals
Ali Modarressi
Abdullatif Köksal
Hinrich Schutze
32
1
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
40
43
0
09 Jul 2024
Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities
Avinash Anand
Chayan Tank
Sarthak Pol
Vinayak Katoch
Shaina Mehta
R. Shah
32
4
0
08 Jul 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
Nandini Mundra
Aditya Nanda Kishore
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
30
3
0
08 Jul 2024
Experiments with truth using Machine Learning: Spectral analysis and explainable classification of synthetic, false, and genuine information
Vishnu S Pendyala
Madhulika Dutta
34
0
0
07 Jul 2024
Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions
Zhiwen You
Haejin Lee
Shubhanshu Mishra
Sullam Jeoung
Apratim Mishra
Jinseok Kim
Jana Diesner
27
9
0
07 Jul 2024
Using LLMs to label medical papers according to the CIViC evidence model
Markus Hisch
Xing David Wang
31
0
0
05 Jul 2024
From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs
Marion Bartl
Susan Leavy
35
8
0
05 Jul 2024
Crafting Large Language Models for Enhanced Interpretability
Chung-En Sun
Tuomas P. Oikarinen
Tsui-Wei Weng
25
6
0
05 Jul 2024
MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen
Zhihao Wen
Ge Fan
Zhengyu Chen
Wei Yu Wu
Dayiheng Liu
Zhixu Li
Bang Liu
Yanghua Xiao
31
18
0
04 Jul 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
Xiangyang Li
Kuicai Dong
Yi Quan Lee
Wei Xia
Yichun Yin
Xinyi Dai
Yasheng Wang
Ruiming Tang
57
15
0
03 Jul 2024
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
52
0
0
02 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
34
3
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
60
3
0
01 Jul 2024
IDT: Dual-Task Adversarial Attacks for Privacy Protection
Pedro Faustini
Shakila Mahjabin Tonni
Annabelle McIver
Qiongkai Xu
Mark Dras
SILM
AAML
44
0
0
28 Jun 2024
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
14
0
0
24 Jun 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
Yuchen Yang
Yingdong Shi
Cheems Wang
Xiantong Zhen
Yuxuan Shi
Jun Xu
32
1
0
24 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
53
7
0
24 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
31
1
0
24 Jun 2024
Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks
Victor Hugo Nascimento Rocha
I. Silveira
Paulo Pirozelli
Denis Deratani Mauá
Fabio Gagliardi Cozman
29
0
0
21 Jun 2024
Latent Space Translation via Inverse Relative Projection
Valentino Maiorca
Luca Moschella
Marco Fumero
Francesco Locatello
Emanuele Rodolà
34
1
0
21 Jun 2024
CEASEFIRE: An AI-powered system for combatting illicit firearms trafficking
Ioannis Mademlis
Jorgen Cani
Marina Mancuso
C. Paternoster
E. Adamakis
...
Sophia Karagiorgou
George Pantelis
Georgios Stavropoulos
Konstantinos Votis
Georgios Th. Papadopoulos
20
2
0
21 Jun 2024
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Kathleen C. Fraser
Hillary Dawkins
S. Kiritchenko
DeLMO
71
7
0
21 Jun 2024
Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
Zhengxin Yang
Wanling Gao
Luzhou Peng
Yunyou Huang
Fei Tang
Jianfeng Zhan
31
0
0
20 Jun 2024
Temporal Knowledge Graph Question Answering: A Survey
Miao Su
Zixuan Li
Zhuo Chen
Long Bai
Xiaolong Jin
Jiafeng Guo
46
2
0
20 Jun 2024
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Dan S. Nielsen
Kenneth Enevoldsen
Peter Schneider-Kamp
ELM
35
2
0
19 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
62
6
0
18 Jun 2024
The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions
Stefan Sylvius Wagner
Maike Behrendt
Marc Ziegele
Stefan Harmeling
32
9
0
18 Jun 2024
Save It All: Enabling Full Parameter Tuning for Federated Large Language Models via Cycle Block Gradient Descent
Lin Wang
Zhichao Wang
Xiaoying Tang
34
1
0
17 Jun 2024
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang
Chenchen Yuan
Yao Rong
Felix Steinbauer
Gjergji Kasneci
36
1
0
17 Jun 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs
D. Yaldiz
Yavuz Faruk Bakman
Baturalp Buyukates
Chenyang Tao
Anil Ramakrishna
Dimitrios Dimitriadis
Jieyu Zhao
Salman Avestimehr
39
2
0
17 Jun 2024
Can LLMs Learn Macroeconomic Narratives from Social Media?
Almog Gueta
Amir Feder
Zorik Gekhman
Ariel Goldstein
Roi Reichart
21
4
0
17 Jun 2024
FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure
Ziyue Xu
Peilin Zhou
Xinyu Shi
Jiageng Wu
Yikang Jiang
Bin Ke
Jie-jin Yang
Jie Yang
34
5
0
17 Jun 2024
ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation
Yurun Song
Junchen Zhao
Ian G. Harris
S. Jyothi
32
3
0
16 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
42
2
0
15 Jun 2024
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning
Wenjun Li
Changyu Chen
Pradeep Varakantham
45
2
0
15 Jun 2024
Previous
1
2
3
...
7
8
9
...
60
61
62
Next