arXiv:1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 3,524 papers shown
Title
A Controlled Reevaluation of Coreference Resolution Models
Ian Porada
Xiyuan Zou
Jackie Chi Kit Cheung
35
1
0
31 Mar 2024
SpiralMLP: A Lightweight Vision MLP Architecture
Haojie Mu
Burhan Ul Tayyab
Nicholas Chua
43
0
0
31 Mar 2024
MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
Zebang Cheng
Fuqiang Niu
Yuxiang Lin
Zhi-Qi Cheng
Bowen Zhang
Xiaojiang Peng
23
7
0
31 Mar 2024
DOCMASTER: A Unified Platform for Annotation, Training, & Inference in Document Question-Answering
Alex Nguyen
Zilong Wang
Jingbo Shang
Dheeraj Mekala
33
1
0
30 Mar 2024
Jetsons at FinNLP 2024: Towards Understanding the ESG Impact of a News Article using Transformer-based Models
Parag Dakle
Alolika Gon
Sihan Zha
Liang Wang
SaiKrishna Rallabandi
Preethi Raghavan
19
1
0
30 Mar 2024
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
Aru Maekawa
Satoshi Kosugi
Kotaro Funakoshi
Manabu Okumura
DD
39
10
0
30 Mar 2024
RealKIE: Five Novel Datasets for Enterprise Key Information Extraction
Benjamin Townsend
Madison May
Christopher Wells
SyDa
37
0
0
29 Mar 2024
FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues
Shuang Li
Jiahua Wang
Lijie Wen
LRM
23
0
0
29 Mar 2024
AlloyBERT: Alloy Property Prediction with Large Language Models
Akshat Chaudhari
Chakradhar Guntuboina
Hongshuo Huang
A. Farimani
29
4
0
28 Mar 2024
A Benchmark Evaluation of Clinical Named Entity Recognition in French
N. Bannour
Christophe Servan
Aurélie Névéol
Xavier Tannier
16
0
0
28 Mar 2024
SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages
N. Ousidhoum
Shamsuddeen Hassan Muhammad
Mohamed Abdalla
Idris Abdulmumin
I. Ahmad
...
Thamar Solorio
Nirmal Surange
Krishnapriya Vishnubhotla
Seid Muhie Yimam
Saif M. Mohammad
44
11
0
27 Mar 2024
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said
Yejin Bang
Delong Chen
Nayeon Lee
Pascale Fung
29
25
0
27 Mar 2024
Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Modan Tailleur
Junwon Lee
Mathieu Lagrange
Keunwoo Choi
Laurie M. Heller
Keisuke Imoto
Yuki Okamoto
22
10
0
26 Mar 2024
ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition
Haris Riaz
Razvan-Gabriel Dumitru
Mihai Surdeanu
MU
35
0
0
26 Mar 2024
Data-Efficient 3D Visual Grounding via Order-Aware Referring
Tung-Yu Wu
Sheng-Yu Huang
Yu-Chiang Frank Wang
34
0
0
25 Mar 2024
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
Ayushi Nirmal
Amrita Bhattacharjee
Paras Sheth
Huan Liu
AAML
35
11
0
19 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
39
7
0
18 Mar 2024
CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
36
3
0
18 Mar 2024
Rethinking Multi-view Representation Learning via Distilled Disentangling
Guanzhou Ke
Bo Wang
Xiaoli Wang
Shengfeng He
34
3
0
16 Mar 2024
Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks
Tingyu Qu
Tinne Tuytelaars
Marie-Francine Moens
MoE
33
2
0
14 Mar 2024
AutoLoRA: Automatically Tuning Matrix Ranks in Low-Rank Adaptation Based on Meta Learning
Ruiyi Zhang
Rushi Qiang
Sai Ashish Somayajula
Pengtao Xie
27
13
0
14 Mar 2024
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
56
0
0
13 Mar 2024
ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model
Zhiwei Liu
Boyang Liu
Paul Thompson
Kailai Yang
Sophia Ananiadou
32
3
0
11 Mar 2024
Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT
Aisha Khatun
Anisur Rahman
Md. Saiful Islam
Hemayet Ahmed Chowdhury
A. Tasnim
24
2
0
08 Mar 2024
YYDS: Visible-Infrared Person Re-Identification with Coarse Descriptions
Yunhao Du
Zhicheng Zhao
Fei Su
34
2
0
07 Mar 2024
Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Jianfeng He
Hang Su
Jason (Jinglun) Cai
Igor Shalyminov
Hwanjun Song
Saab Mansour
24
4
0
06 Mar 2024
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
Xiangci Li
Linfeng Song
Lifeng Jin
Haitao Mi
Jessica Ouyang
Dong Yu
26
1
0
06 Mar 2024
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification
Robert Vacareanu
F. Alam
M. Islam
Haris Riaz
Mihai Surdeanu
NAI
27
2
0
05 Mar 2024
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Yuchen Duan
Weiyun Wang
Zhe Chen
Xizhou Zhu
Lewei Lu
Tong Lu
Yu Qiao
Hongsheng Li
Jifeng Dai
Wenhai Wang
ViT
43
44
0
04 Mar 2024
DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation
Chen Xu
Tian Lan
Changlong Yu
Wei Wang
Jun Gao
...
Qunxi Dong
Kun Qian
Piji Li
Wei Bi
Bin Hu
48
0
0
04 Mar 2024
CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
Xi Liu
Ying Guo
Cheng Zhen
Tong Li
Yingying Ao
Pengfei Yan
DiffM
34
3
0
01 Mar 2024
PeLLE: Encoder-based language models for Brazilian Portuguese based on open data
Guilherme Lamartine de Mello
Marcelo Finger
F. Serras
M. Carpi
Marcos Menon Jose
Pedro Henrique Domingues
Paulo Cavalim
27
0
0
29 Feb 2024
PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
Erxin Yu
Jing Li
Chunpu Xu
27
3
0
29 Feb 2024
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Mahdi Karami
Ali Ghodsi
VLM
42
6
0
28 Feb 2024
Cause and Effect: Can Large Language Models Truly Understand Causality?
Swagata Ashwani
Kshiteesh Hegde
Nishith Reddy Mannuru
Mayank Jindal
Dushyant Singh Sengar
Krishna Chaitanya Rao Kathala
Dishant Banga
Vinija Jain
Aman Chadha
LRM
43
18
0
28 Feb 2024
Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Yuiga Wada
Kanta Kaneda
Daichi Saito
Komei Sugiura
34
24
0
28 Feb 2024
Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization
Han Guo
Ramtin Hosseini
Ruiyi Zhang
Sai Ashish Somayajula
Ranak Roy Chowdhury
Rajesh K. Gupta
Pengtao Xie
31
0
0
28 Feb 2024
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
Xi Fang
Weijie Xu
Fiona Anting Tan
Jiani Zhang
Ziqing Hu
Yanjun Qi
Scott Nickleach
Diego Socolinsky
Srinivasan H. Sengamedu
Christos Faloutsos
LMTD
ALM
37
65
0
27 Feb 2024
Acquiring Linguistic Knowledge from Multimodal Input
Theodor Amariucai
Alexander Scott Warstadt
CLL
29
2
0
27 Feb 2024
Investigating Continual Pretraining in Large Language Models: Insights and Implications
Çağatay Yıldız
Nishaanth Kanna Ravichandran
Prishruit Punia
Matthias Bethge
B. Ermiş
CLL
KELM
LRM
48
25
0
27 Feb 2024
Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
Jiri Gesi
Iftekhar Ahmed
42
0
0
26 Feb 2024
Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems
Enrico Liscio
Luciano Cavalcante Siebert
Catholijn M. Jonker
P. Murukannaiah
35
4
0
26 Feb 2024
From Text to Transformation: A Comprehensive Review of Large Language Models' Versatility
Pravneet Kaur
Gautam Siddharth Kashyap
Ankit Kumar
Md. Tabrez Nafis
Sandeep Kumar
Vikrant Shokeen
LM&MA
48
54
0
25 Feb 2024
Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space
Aviad Rom
Kfir Bar
29
1
0
25 Feb 2024
IR2: Information Regularization for Information Retrieval
Jianyou Wang
Kaicheng Wang
Xiaoyue Wang
Weili Cao
R. Paturi
Leon Bergen
46
1
0
25 Feb 2024
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
Hyunjae Kim
Seunghyun Yoon
Trung Bui
Handong Zhao
Quan Tran
Franck Dernoncourt
Jaewoo Kang
CLIP
19
2
0
23 Feb 2024
Second-Order Fine-Tuning without Pain for LLMs:A Hessian Informed Zeroth-Order Optimizer
Yanjun Zhao
Sizhe Dang
Haishan Ye
Guang Dai
Yi Qian
Ivor W. Tsang
66
8
0
23 Feb 2024
Generalizing Reward Modeling for Out-of-Distribution Preference Learning
Chen Jia
34
2
0
22 Feb 2024
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations
Aina Garí Soler
Matthieu Labeau
Chloé Clavel
VLM
34
2
0
22 Feb 2024
Novi jezički modeli za srpski jezik (New Language Models for Serbian)
Mihailo Škorić
13
0
0
22 Feb 2024