ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXivPDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 3,487 papers shown
Title
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text
  data on social disorders in children and adolescents
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents
Hoang-Thang Ta
Abu Bakar Siddiqur Rahman
Lotfollah Najjar
Alexander Gelbukh
16
0
0
30 Apr 2024
FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning
  Leveraging Weight Decomposition
FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition
Yuxuan Yan
Qianqian Yang
Shunpu Tang
Zhiguo Shi
38
13
0
29 Apr 2024
Unknown Script: Impact of Script on Cross-Lingual Transfer
Unknown Script: Impact of Script on Cross-Lingual Transfer
Wondimagegnhue Tufa
Ilia Markov
Piek Vossen
37
0
0
29 Apr 2024
Can Perplexity Predict Fine-Tuning Performance? An Investigation of
  Tokenization Effects on Sequential Language Models for Nepali
Can Perplexity Predict Fine-Tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali
Nishant Luitel
Nirajan Bekoju
Anand Kumar Sah
Subarna Shakya
50
0
0
28 Apr 2024
Meta In-Context Learning Makes Large Language Models Better Zero and
  Few-Shot Relation Extractors
Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors
Guozheng Li
Peng Wang
Jiajun Liu
Yikai Guo
Ke Ji
Ziyu Shang
Zijie Xu
LRM
33
7
0
27 Apr 2024
Automating Customer Needs Analysis: A Comparative Study of Large Language Models in the Travel Industry
Automating Customer Needs Analysis: A Comparative Study of Large Language Models in the Travel Industry
Simone Barandoni
F. Chiarello
Lorenzo Cascone
Emiliano Marrale
Salvatore Puccio
51
5
0
27 Apr 2024
Temporal Scaling Law for Large Language Models
Temporal Scaling Law for Large Language Models
Yizhe Xiong
Xiansheng Chen
Xin Ye
Hui Chen
Zijia Lin
...
Zhenpeng Su
Wei Huang
Jianwei Niu
J. Han
Guiguang Ding
43
9
0
27 Apr 2024
Samsung Research China-Beijing at SemEval-2024 Task 3: A multi-stage
  framework for Emotion-Cause Pair Extraction in Conversations
Samsung Research China-Beijing at SemEval-2024 Task 3: A multi-stage framework for Emotion-Cause Pair Extraction in Conversations
Shen Zhang
Haojie Zhang
Jing Zhang
Xudong Zhang
Yimeng Zhuang
Jinting Wu
35
2
0
25 Apr 2024
Generalization Measures for Zero-Shot Cross-Lingual Transfer
Generalization Measures for Zero-Shot Cross-Lingual Transfer
Saksham Bassi
Duygu Ataman
Kyunghyun Cho
29
0
0
24 Apr 2024
CASPR: Automated Evaluation Metric for Contrastive Summarization
CASPR: Automated Evaluation Metric for Contrastive Summarization
Nirupan Ananthamurugan
Dat Duong
Philip George
Ankita Gupta
Sandeep Tata
Beliz Gunel
19
0
0
23 Apr 2024
Identifying Fairness Issues in Automatically Generated Testing Content
Identifying Fairness Issues in Automatically Generated Testing Content
Kevin Stowe
Benny Longwill
Alyssa Francis
Tatsuya Aoyama
Debanjan Ghosh
Swapna Somasundaran
38
1
0
23 Apr 2024
Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language
  Models
Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models
Vishruth Veerendranath
Vishwa Shah
Kshitish Ghate
30
0
0
22 Apr 2024
Marking: Visual Grading with Highlighting Errors and Annotating Missing
  Bits
Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits
Shashank Sonkar
Naiming Liu
D. B. Mallick
Richard G. Baraniuk
27
4
0
22 Apr 2024
ColA: Collaborative Adaptation with Gradient Learning
ColA: Collaborative Adaptation with Gradient Learning
Enmao Diao
Qi Le
Suya Wu
Xinran Wang
Ali Anwar
Jie Ding
Vahid Tarokh
24
1
0
22 Apr 2024
Do "English" Named Entity Recognizers Work Well on Global Englishes?
Do "English" Named Entity Recognizers Work Well on Global Englishes?
Alexander Shan
John Bauer
Riley Carlson
Christopher D. Manning
25
2
0
20 Apr 2024
LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
Tiancheng Gu
Kaicheng Yang
Dongnan Liu
Weidong Cai
MedIm
29
2
0
19 Apr 2024
SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese
  Social Media Analysis
SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis
Hongzhi Qi
Hanfei Liu
Jianqiang Li
Qing Zhao
Wei-dong Zhai
Dan Luo
Tianyu He
Shuo Liu
Bing Xiang Yang
Guanghui Fu
29
1
0
19 Apr 2024
Transformer-Based Classification Outcome Prediction for Multimodal
  Stroke Treatment
Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment
Danqing Ma
Meng Wang
Ao Xiang
Zongqing Qi
Qin Yang
32
18
0
19 Apr 2024
Latent Concept-based Explanation of NLP Models
Latent Concept-based Explanation of NLP Models
Xuemin Yu
Fahim Dalvi
Nadir Durrani
Marzia Nouri
Hassan Sajjad
LRM
FAtt
24
1
0
18 Apr 2024
Grammatical Error Correction for Code-Switched Sentences by Learners of
  English
Grammatical Error Correction for Code-Switched Sentences by Learners of English
Kelvin Wey Han Chan
Christopher Bryant
Li Nguyen
Andrew Caines
Zheng Yuan
41
2
0
18 Apr 2024
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Yang Lin
Xinyu Ma
Xu Chu
Yujie Jin
Zhibang Yang
Yasha Wang
Hong-yan Mei
46
19
0
15 Apr 2024
A Large-Scale Evaluation of Speech Foundation Models
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
38
19
0
15 Apr 2024
JaFIn: Japanese Financial Instruction Dataset
JaFIn: Japanese Financial Instruction Dataset
Kota Tanabe
Masahiro Suzuki
Hiroki Sakaji
Itsuki Noda
39
1
0
14 Apr 2024
WikiSplit++: Easy Data Refinement for Split and Rephrase
WikiSplit++: Easy Data Refinement for Split and Rephrase
Hayato Tsukagoshi
Tsutomu Hirao
Makoto Morishita
Katsuki Chousa
Ryohei Sasano
Koichi Takeda
38
1
0
13 Apr 2024
Enhancing Visual Question Answering through Question-Driven Image
  Captions as Prompts
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Övgü Özdemir
Erdem Akagündüz
36
10
0
12 Apr 2024
VertAttack: Taking advantage of Text Classifiers' horizontal vision
VertAttack: Taking advantage of Text Classifiers' horizontal vision
Jonathan Rusert
AAML
35
1
0
12 Apr 2024
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced
  Pre-training
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
36
9
0
12 Apr 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia
  Sensor Event Analysis
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
34
3
0
12 Apr 2024
Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
Tianyu Zhu
M. Jung
Jesse Clark
83
1
0
12 Apr 2024
MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
Mobashir Sadat
Cornelia Caragea
32
4
0
11 Apr 2024
High-Dimension Human Value Representation in Large Language Models
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
63
5
0
11 Apr 2024
Continuous Language Model Interpolation for Dynamic and Controllable
  Text Generation
Continuous Language Model Interpolation for Dynamic and Controllable Text Generation
Sara Kangaslahti
David Alvarez-Melis
KELM
29
0
0
10 Apr 2024
Low-Cost Generation and Evaluation of Dictionary Example Sentences
Low-Cost Generation and Evaluation of Dictionary Example Sentences
Bill Cai
Clarence Boon Liang Ng
Daniel Tan
Shelvia Hotama
17
3
0
09 Apr 2024
Event-enhanced Retrieval in Real-time Search
Event-enhanced Retrieval in Real-time Search
Yanan Zhang
Xiaoling Bai
Tianhua Zhou
29
1
0
09 Apr 2024
Comprehensive Study on German Language Models for Clinical and
  Biomedical Text Understanding
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
Ahmad Idrissi-Yaghir
Amin Dada
Henning Schafer
Kamyar Arzideh
Giulia Baldini
...
Peter A. Horn
Christin Seifert
F. Nensa
Jens Kleesiek
Christoph M. Friedrich
AI4MH
29
2
0
08 Apr 2024
PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of
  LLM-generated Text?
PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text?
Kseniia Petukhova
Roman Kazakov
Ekaterina Kochmar
DeLMO
18
2
0
08 Apr 2024
Contextual Chart Generation for Cyber Deception
Contextual Chart Generation for Cyber Deception
David D. Nguyen
David Liebowitz
Surya Nepal
S. Kanhere
Sharif Abuadbba
41
0
0
07 Apr 2024
A Multi-Level Framework for Accelerating Training Transformer Models
A Multi-Level Framework for Accelerating Training Transformer Models
Longwei Zou
Han Zhang
Yangdong Deng
AI4CE
32
1
0
07 Apr 2024
A Morphology-Based Investigation of Positional Encodings
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
24
1
0
06 Apr 2024
Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens Rotation
Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens Rotation
Xinyu Ma
Xu Chu
Zhibang Yang
Yang Lin
Xin Gao
Junfeng Zhao
38
6
0
05 Apr 2024
Investigating the Robustness of Modelling Decisions for Few-Shot
  Cross-Topic Stance Detection: A Preregistered Study
Investigating the Robustness of Modelling Decisions for Few-Shot Cross-Topic Stance Detection: A Preregistered Study
Myrthe Reuver
Suzan Verberne
Antske Fokkens
32
1
0
05 Apr 2024
Data Augmentation with In-Context Learning and Comparative Evaluation in
  Math Word Problem Solving
Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving
Gulsum Yigit
M. Amasyalı
AIMat
44
1
0
05 Apr 2024
Personalized LLM Response Generation with Parameterized Memory Injection
Personalized LLM Response Generation with Parameterized Memory Injection
Kai Zhang
Lizhi Qing
Yangyang Kang
34
11
0
04 Apr 2024
The Impact of Unstated Norms in Bias Analysis of Language Models
The Impact of Unstated Norms in Bias Analysis of Language Models
Farnaz Kohankhaki
D. B. Emerson
David B. Emerson
Laleh Seyyed-Kalantari
Faiza Khan Khattak
52
1
0
04 Apr 2024
FPT: Feature Prompt Tuning for Few-shot Readability Assessment
FPT: Feature Prompt Tuning for Few-shot Readability Assessment
Ziyang Wang
Sanwoo Lee
Hsiu-Yuan Huang
Yunfang Wu
AAML
VLM
27
0
0
03 Apr 2024
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models
Fanxu Meng
Zhaohui Wang
Muhan Zhang
VLM
64
68
0
03 Apr 2024
Toward Informal Language Processing: Knowledge of Slang in Large
  Language Models
Toward Informal Language Processing: Knowledge of Slang in Large Language Models
Zhewei Sun
Qian Hu
Rahul Gupta
Richard Zemel
Yang Xu
38
1
0
02 Apr 2024
A Rationale-centric Counterfactual Data Augmentation Method for
  Cross-Document Event Coreference Resolution
A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution
Bowen Ding
Qingkai Min
Shengkun Ma
Yingjie Li
Linyi Yang
Yue Zhang
46
4
0
02 Apr 2024
Dialogue with Robots: Proposals for Broadening Participation and
  Research in the SLIVAR Community
Dialogue with Robots: Proposals for Broadening Participation and Research in the SLIVAR Community
Casey Kennington
Malihe Alikhani
Heather Pon-Barry
Katherine Atwell
Yonatan Bisk
...
Jivko Sinapov
Angela Stewart
Matthew Stone
Stefanie Tellex
Tom Williams
49
0
0
01 Apr 2024
Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade
  Offs in Large Language Model Training
Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training
Vivian Liu
Yiqiao Yin
40
11
0
01 Apr 2024
Previous
123...101112...686970
Next