ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 3,071 papers shown
Title
Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models
S. Nercessian
Johannes Imort
Ninon Devis
Frederik Blang
29
1
0
22 Jul 2024
MAVEN-Fact: A Large-scale Event Factuality Detection Dataset
Chunyang Li
Hao Peng
Xiaozhi Wang
Y. Qi
Lei Hou
Bin Xu
Juanzi Li
HILM
33
1
0
22 Jul 2024
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
S. Dashti
A. K. Bardsiri
M. J. Shahbazzadeh
34
3
0
20 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Chuheng Zhang
Lei Song
Li Zhao
Jiang Bian
Kee-Eung Kim
VLM
AAML
29
3
0
20 Jul 2024
Evaluating the Reliability of Self-Explanations in Large Language Models
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
LRM
40
0
0
19 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
29
8
0
19 Jul 2024
ECoh: Turn-level Coherence Evaluation for Multilingual Dialogues
John Mendonça
Isabel Trancoso
A. Lavie
29
3
0
16 Jul 2024
WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models
Xin-Jian Wu
Rui-Song Zhang
Jie Qin
Shijie Ma
Cheng-Lin Liu
VLM
27
1
0
14 Jul 2024
Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation
Ge Gao
Jongin Kim
Sejin Paik
Ekaterina Novozhilova
Yi Liu
Sarah Bonna
Margrit Betke
Derry Wijaya
34
0
0
14 Jul 2024
Low-Rank Interconnected Adaptation Across Layers
Yibo Zhong
Yao Zhou
OffRL
MoE
38
1
0
13 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
35
2
0
12 Jul 2024
FsPONER: Few-shot Prompt Optimization for Named Entity Recognition in Domain-specific Scenarios
Yongjian Tang
Rakebul Hasan
Thomas Runkler
61
2
0
10 Jul 2024
Measuring Sustainability Intention of ESG Fund Disclosure using Few-Shot Learning
Mayank Singh
Nazia Nafis
Abhijeet Kumar
Mridul Mishra
20
0
0
09 Jul 2024
Cybersecurity Defenses: Exploration of CVE Types through Attack Descriptions
Refat Othman
Bruno Rossi
Barbara Russo
24
1
0
09 Jul 2024
Consistent Document-Level Relation Extraction via Counterfactuals
Ali Modarressi
Abdullatif Köksal
Hinrich Schütze
32
1
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
40
43
0
09 Jul 2024
Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities
Avinash Anand
Chayan Tank
Sarthak Pol
Vinayak Katoch
Shaina Mehta
R. Shah
32
4
0
08 Jul 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
Nandini Mundra
Aditya Nanda Kishore
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
30
3
0
08 Jul 2024
Experiments with truth using Machine Learning: Spectral analysis and explainable classification of synthetic, false, and genuine information
Vishnu S Pendyala
Madhulika Dutta
34
0
0
07 Jul 2024
Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions
Zhiwen You
Haejin Lee
Shubhanshu Mishra
Sullam Jeoung
Apratim Mishra
Jinseok Kim
Jana Diesner
27
9
0
07 Jul 2024
Using LLMs to label medical papers according to the CIViC evidence model
Markus Hisch
Xing David Wang
31
0
0
05 Jul 2024
From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs
Marion Bartl
Susan Leavy
35
8
0
05 Jul 2024
Crafting Large Language Models for Enhanced Interpretability
Chung-En Sun
Tuomas P. Oikarinen
Tsui-Wei Weng
25
6
0
05 Jul 2024
MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen
Zhihao Wen
Ge Fan
Zhengyu Chen
Wei Yu Wu
Dayiheng Liu
Zhixu Li
Bang Liu
Yanghua Xiao
31
18
0
04 Jul 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
Xiangyang Li
Kuicai Dong
Yi Quan Lee
Wei Xia
Yichun Yin
Xinyi Dai
Yasheng Wang
Ruiming Tang
57
15
0
03 Jul 2024
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
52
0
0
02 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
34
3
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
60
3
0
01 Jul 2024
IDT: Dual-Task Adversarial Attacks for Privacy Protection
Pedro Faustini
Shakila Mahjabin Tonni
Annabelle McIver
Qiongkai Xu
Mark Dras
SILM
AAML
44
0
0
28 Jun 2024
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
14
0
0
24 Jun 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
Yuchen Yang
Yingdong Shi
Cheems Wang
Xiantong Zhen
Yuxuan Shi
Jun Xu
32
1
0
24 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
53
7
0
24 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
31
1
0
24 Jun 2024
Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks
Victor Hugo Nascimento Rocha
I. Silveira
Paulo Pirozelli
Denis Deratani Mauá
Fabio Gagliardi Cozman
29
0
0
21 Jun 2024
Latent Space Translation via Inverse Relative Projection
Valentino Maiorca
Luca Moschella
Marco Fumero
Francesco Locatello
Emanuele Rodolà
34
1
0
21 Jun 2024
CEASEFIRE: An AI-powered system for combatting illicit firearms trafficking
Ioannis Mademlis
Jorgen Cani
Marina Mancuso
C. Paternoster
E. Adamakis
...
Sophia Karagiorgou
George Pantelis
Georgios Stavropoulos
Konstantinos Votis
Georgios Th. Papadopoulos
20
2
0
21 Jun 2024
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Kathleen C. Fraser
Hillary Dawkins
S. Kiritchenko
DeLMO
71
7
0
21 Jun 2024
Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
Zhengxin Yang
Wanling Gao
Luzhou Peng
Yunyou Huang
Fei Tang
Jianfeng Zhan
31
0
0
20 Jun 2024
Temporal Knowledge Graph Question Answering: A Survey
Miao Su
Zixuan Li
Zhuo Chen
Long Bai
Xiaolong Jin
Jiafeng Guo
46
2
0
20 Jun 2024
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Dan S. Nielsen
Kenneth Enevoldsen
Peter Schneider-Kamp
ELM
35
2
0
19 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
62
6
0
18 Jun 2024
The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions
Stefan Sylvius Wagner
Maike Behrendt
Marc Ziegele
Stefan Harmeling
32
9
0
18 Jun 2024
Save It All: Enabling Full Parameter Tuning for Federated Large Language Models via Cycle Block Gradient Descent
Lin Wang
Zhichao Wang
Xiaoying Tang
34
1
0
17 Jun 2024
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang
Chenchen Yuan
Yao Rong
Felix Steinbauer
Gjergji Kasneci
36
1
0
17 Jun 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs
D. Yaldiz
Yavuz Faruk Bakman
Baturalp Buyukates
Chenyang Tao
Anil Ramakrishna
Dimitrios Dimitriadis
Jieyu Zhao
Salman Avestimehr
39
2
0
17 Jun 2024
Can LLMs Learn Macroeconomic Narratives from Social Media?
Almog Gueta
Amir Feder
Zorik Gekhman
Ariel Goldstein
Roi Reichart
21
4
0
17 Jun 2024
FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure
Ziyue Xu
Peilin Zhou
Xinyu Shi
Jiageng Wu
Yikang Jiang
Bin Ke
Jie-jin Yang
Jie Yang
34
5
0
17 Jun 2024
ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation
Yurun Song
Junchen Zhao
Ian G. Harris
S. Jyothi
32
3
0
16 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
42
2
0
15 Jun 2024
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning
Wenjun Li
Changyu Chen
Pradeep Varakantham
45
2
0
15 Jun 2024