ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach


26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
arXiv: 1907.11692

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 3,476 papers shown
Title
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
19
0
0
24 Jun 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
Yuchen Yang
Yingdong Shi
Cheems Wang
Xiantong Zhen
Yuxuan Shi
Jun Xu
32
1
0
24 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
53
7
0
24 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
31
1
0
24 Jun 2024
Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks
Victor Hugo Nascimento Rocha
I. Silveira
Paulo Pirozelli
Denis Deratani Mauá
Fabio Gagliardi Cozman
29
0
0
21 Jun 2024
Latent Space Translation via Inverse Relative Projection
Valentino Maiorca
Luca Moschella
Marco Fumero
Francesco Locatello
Emanuele Rodolà
34
1
0
21 Jun 2024
CEASEFIRE: An AI-powered system for combatting illicit firearms trafficking
Ioannis Mademlis
Jorgen Cani
Marina Mancuso
C. Paternoster
E. Adamakis
...
Sophia Karagiorgou
George Pantelis
Georgios Stavropoulos
Konstantinos Votis
Georgios Th. Papadopoulos
25
2
0
21 Jun 2024
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Kathleen C. Fraser
Hillary Dawkins
S. Kiritchenko
DeLMO
73
7
0
21 Jun 2024
Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
Zhengxin Yang
Wanling Gao
Luzhou Peng
Yunyou Huang
Fei Tang
Jianfeng Zhan
31
0
0
20 Jun 2024
Temporal Knowledge Graph Question Answering: A Survey
Miao Su
Zixuan Li
Zhuo Chen
Long Bai
Xiaolong Jin
Jiafeng Guo
46
2
0
20 Jun 2024
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Dan S. Nielsen
Kenneth Enevoldsen
Peter Schneider-Kamp
ELM
38
2
0
19 Jun 2024
The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions
Stefan Sylvius Wagner
Maike Behrendt
Marc Ziegele
Stefan Harmeling
32
9
0
18 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
62
6
0
18 Jun 2024
Save It All: Enabling Full Parameter Tuning for Federated Large Language Models via Cycle Block Gradient Descent
Lin Wang
Zhichao Wang
Xiaoying Tang
37
1
0
17 Jun 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs
D. Yaldiz
Yavuz Faruk Bakman
Baturalp Buyukates
Chenyang Tao
Anil Ramakrishna
Dimitrios Dimitriadis
Jieyu Zhao
Salman Avestimehr
39
2
0
17 Jun 2024
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang
Chenchen Yuan
Yao Rong
Felix Steinbauer
Gjergji Kasneci
36
1
0
17 Jun 2024
Can LLMs Learn Macroeconomic Narratives from Social Media?
Almog Gueta
Amir Feder
Zorik Gekhman
Ariel Goldstein
Roi Reichart
21
4
0
17 Jun 2024
FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure
Ziyue Xu
Peilin Zhou
Xinyu Shi
Jiageng Wu
Yikang Jiang
Bin Ke
Jie-jin Yang
Jie Yang
36
5
0
17 Jun 2024
ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation
Yurun Song
Junchen Zhao
Ian G. Harris
S. Jyothi
32
3
0
16 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
46
2
0
15 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
78
4
0
15 Jun 2024
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning
Wenjun Li
Changyu Chen
Pradeep Varakantham
47
2
0
15 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
38
0
0
14 Jun 2024
SememeLM: A Sememe Knowledge Enhanced Method for Long-tail Relation Representation
Shuyi Li
Shaojuan Wu
Xiaowang Zhang
Zhiyong Feng
39
0
0
13 Jun 2024
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
Xuannan Liu
Zekun Li
Peipei Li
Shuhan Xia
Xing Cui
Linzhi Huang
Huaibo Huang
Weihong Deng
Zhaofeng He
36
13
0
13 Jun 2024
The Impact of Initialization on LoRA Finetuning Dynamics
Soufiane Hayou
Nikhil Ghosh
Bin Yu
AI4CE
36
10
0
12 Jun 2024
Adversarial Evasion Attack Efficiency against Large Language Models
João Vitorino
Eva Maia
Isabel Praça
AAML
38
2
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
69
50
0
12 Jun 2024
LT4SG@SMM4H24: Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models
Dasun Athukoralage
Thushari Atapattu
M. Thilakaratne
Katrina Falkner
LM&MA
18
0
0
11 Jun 2024
Paraphrasing in Affirmative Terms Improves Negation Understanding
MohammadHossein Rezaei
Eduardo Blanco
42
1
0
11 Jun 2024
Multimodal Belief Prediction
John Murzaku
Adil Soubki
Owen Rambow
16
0
0
11 Jun 2024
Non-autoregressive Personalized Bundle Generation
Wenchuan Yang
Cheng Yang
Jichao Li
Yuejin Tan
Xin Lu
Chuan Shi
28
0
0
11 Jun 2024
Développement automatique de lexiques pour les concepts émergents : une exploration méthodologique
Revekka Kyriakoglou
Anna Pappa
Jilin He
A. Schoen
P. Laurens
Markarit Vartampetian
P. Larédo
Tita Kyriacopoulou
39
0
0
10 Jun 2024
Evaluating Zero-Shot Long-Context LLM Compression
Chenyu Wang
Yihan Wang
Kai Li
49
0
0
10 Jun 2024
LLM Questionnaire Completion for Automatic Psychiatric Assessment
Gony Rosenman
Lior Wolf
Talma Hendler
31
3
0
09 Jun 2024
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning
Yibo Yang
Xiaojie Li
Zhongzhu Zhou
S. Song
Jianlong Wu
Liqiang Nie
Bernard Ghanem
45
6
0
07 Jun 2024
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
Guangliang Liu
Milad Afshari
Xitong Zhang
Zhiyu Xue
Avrajit Ghosh
Bidhan Bashyal
Rongrong Wang
K. Johnson
27
0
0
06 Jun 2024
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures
Qi Cheng
Michael Boratko
Pranay Kumar Yelugam
T. O’Gorman
Nalini Singh
Andrew McCallum
X. Li
ELM
LRM
34
3
0
06 Jun 2024
Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
Naibin Gu
Peng Fu
Xiyu Liu
Bowen Shen
Zheng-Shen Lin
Weiping Wang
30
6
0
06 Jun 2024
NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Shuo Huang
William MacLean
Xiaoxi Kang
Anqi Wu
Lizhen Qu
Qiongkai Xu
Zhuang Li
Xingliang Yuan
Gholamreza Haffari
30
0
0
06 Jun 2024
Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning
Yang Wu
Chenghao Wang
Ece Gumusel
Xiaozhong Liu
ELM
AILaw
40
4
0
05 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
39
0
0
05 Jun 2024
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
Hao Li
Yuping Wu
Viktor Schlegel
R. Batista-Navarro
Tharindu Madusanka
...
Jiayan Zeng
Xiaochi Wang
Xinran He
Yizhi Li
Goran Nenadic
31
6
0
05 Jun 2024
Space Decomposition for Sentence Embedding
Wuttikorn Ponwitayarat
Peerat Limkonchotiwat
E. Chuangsuwanich
Sarana Nutanong
24
0
0
05 Jun 2024
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Dmytro Kuzmenko
N. Shvai
LM&Ro
27
0
0
05 Jun 2024
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
35
1
0
04 Jun 2024
Deciphering Oracle Bone Language with Diffusion Models
Haisu Guan
Huanxin Yang
Xinyu Wang
Shengwei Han
Yongge Liu
Lianwen Jin
Xiang Bai
Y. Liu
AAML
AI4CE
85
7
0
02 Jun 2024
Multimodal Metadata Assignment for Cultural Heritage Artifacts
Luis Rei
Dunja Mladenić
M. Dorozynski
Franz Rottensteiner
Thomas Schleider
Raphael Troncy
J. Lozano
Mar Gaitán Salvatella
29
6
0
01 Jun 2024
RoBERTa-BiLSTM: A Context-Aware Hybrid Model for Sentiment Analysis
Md. Mostafizer Rahman
Ariful Islam Shiplu
Yutaka Watanobe
Md. Ashad Alam
21
10
0
01 Jun 2024
CONFINE: Conformal Prediction for Interpretable Neural Networks
Linhui Huang
S. Lala
N. Jha
63
2
0
01 Jun 2024