ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.03654
  4. Cited By
DeBERTa: Decoding-enhanced BERT with Disentangled Attention

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

5 June 2020
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
    AAML
ArXivPDFHTML

Papers citing "DeBERTa: Decoding-enhanced BERT with Disentangled Attention"

50 / 1,037 papers shown
Title
An Investigation of Evaluation Metrics for Automated Medical Note
  Generation
An Investigation of Evaluation Metrics for Automated Medical Note Generation
Asma Ben Abacha
Wen-wai Yim
George Michalopoulos
Thomas Lin
17
22
0
27 May 2023
Entailment as Robust Self-Learner
Entailment as Robust Self-Learner
Jiaxin Ge
Hongyin Luo
Yoon Kim
James R. Glass
25
3
0
26 May 2023
With a Little Push, NLI Models can Robustly and Efficiently Predict
  Faithfulness
With a Little Push, NLI Models can Robustly and Efficiently Predict Faithfulness
Julius Steen
Juri Opitz
Anette Frank
K. Markert
HILM
16
9
0
26 May 2023
To Revise or Not to Revise: Learning to Detect Improvable Claims for
  Argumentative Writing Support
To Revise or Not to Revise: Learning to Detect Improvable Claims for Argumentative Writing Support
Gabriella Skitalinskaya
Henning Wachsmuth
11
9
0
26 May 2023
Measuring the Effect of Influential Messages on Varying Personas
Measuring the Effect of Influential Messages on Varying Personas
Chenkai Sun
Jinning Li
Hou Pong Chan
ChengXiang Zhai
Heng Ji
11
6
0
25 May 2023
Perturbation-based Self-supervised Attention for Attention Bias in Text
  Classification
Perturbation-based Self-supervised Attention for Attention Bias in Text Classification
Hu Feng
Zhenxi Lin
Qianli Ma
20
4
0
25 May 2023
Revisiting Sentence Union Generation as a Testbed for Text Consolidation
Revisiting Sentence Union Generation as a Testbed for Text Consolidation
Eran Hirsch
Valentina Pyatkin
Ruben Wolhandler
Avi Caciularu
Asi Shefer
Ido Dagan
MoMe
16
6
0
24 May 2023
Deriving Language Models from Masked Language Models
Deriving Language Models from Masked Language Models
Lucas Torroba Hennigen
Yoon Kim
21
11
0
24 May 2023
Self-Evolution Learning for Discriminative Language Model Pretraining
Self-Evolution Learning for Discriminative Language Model Pretraining
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
29
12
0
24 May 2023
Revisiting Token Dropping Strategy in Efficient BERT Pretraining
Revisiting Token Dropping Strategy in Efficient BERT Pretraining
Qihuang Zhong
Liang Ding
Juhua Liu
Xuebo Liu
Min Zhang
Bo Du
Dacheng Tao
VLM
27
9
0
24 May 2023
Towards Adaptive Prefix Tuning for Parameter-Efficient Language Model
  Fine-tuning
Towards Adaptive Prefix Tuning for Parameter-Efficient Language Model Fine-tuning
Zhen-Ru Zhang
Chuanqi Tan
Haiyang Xu
Chengyu Wang
Jun Huang
Songfang Huang
17
29
0
24 May 2023
Dynamic Masking Rate Schedules for MLM Pretraining
Dynamic Masking Rate Schedules for MLM Pretraining
Zachary Ankner
Naomi Saphra
Davis W. Blalock
Jonathan Frankle
Matthew L. Leavitt
19
5
0
24 May 2023
Detecting Multidimensional Political Incivility on Social Media
Detecting Multidimensional Political Incivility on Social Media
Sagi Pendzel
Nir Lotan
Alon Zoizner
Einat Minkov
14
1
0
24 May 2023
Towards Reliable Misinformation Mitigation: Generalization, Uncertainty,
  and GPT-4
Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4
Kellin Pelrine
Anne Imouza
Camille Thibault
Meilina Reksoprodjo
Caleb Gupta
J. Christoph
Jean-François Godbout
Reihaneh Rabbany
UQLM
AI4CE
23
35
0
24 May 2023
Coverage-based Example Selection for In-Context Learning
Coverage-based Example Selection for In-Context Learning
Shivanshu Gupta
Matt Gardner
Sameer Singh
18
39
0
24 May 2023
DialogVCS: Robust Natural Language Understanding in Dialogue System
  Upgrade
DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade
Zefan Cai
Xin Zheng
Tianyu Liu
Xu Wang
H. Meng
Jiaqi Han
Gang Yuan
Binghuai Lin
Baobao Chang
Yunbo Cao
14
4
0
24 May 2023
TACR: A Table-alignment-based Cell-selection and Reasoning Model for
  Hybrid Question-Answering
TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering
Jian Wu
Yicheng Xu
Yan Gao
Jian-Guang Lou
Börje F. Karlsson
Manabu Okumura
LMTD
13
3
0
24 May 2023
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts
Li Zhang
Hainiu Xu
Abhinav Kommula
Chris Callison-Burch
Niket Tandon
25
6
0
24 May 2023
From Characters to Words: Hierarchical Pre-trained Language Model for
  Open-vocabulary Language Understanding
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding
Li Sun
F. Luisier
Kayhan Batmanghelich
D. Florêncio
Changrong Zhang
VLM
12
6
0
23 May 2023
Few-shot Unified Question Answering: Tuning Models or Prompts?
Few-shot Unified Question Answering: Tuning Models or Prompts?
Srijan Bansal
Semih Yavuz
Bo Pang
Meghana Moorthy Bhat
Yingbo Zhou
18
2
0
23 May 2023
Detecting Propaganda Techniques in Code-Switched Social Media Text
Detecting Propaganda Techniques in Code-Switched Social Media Text
Muhammad Salman
Asif Hanif
Shady Shehata
Preslav Nakov
20
5
0
23 May 2023
WYWEB: A NLP Evaluation Benchmark For Classical Chinese
WYWEB: A NLP Evaluation Benchmark For Classical Chinese
Bo Zhou
Qianglong Chen
Tianyu Wang
Xiaoshi Zhong
Yin Zhang
ELM
27
10
0
23 May 2023
Detecting automatically the layout of clinical documents to enhance the
  performances of downstream natural language processing
Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing
C. Gérardin
Perceval Wajsburt
Basile Dura
Alice Calliger
Alexandre Mouchet
X. Tannier
R. Bey
16
1
0
23 May 2023
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data
  Augmentation
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation
Dan Luo
Chen Zhang
Jiahui Xu
Bin Wang
Yiming Chen
Yan Zhang
Haizhou Li
VLM
22
0
0
23 May 2023
Topic-driven Distant Supervision Framework for Macro-level Discourse
  Parsing
Topic-driven Distant Supervision Framework for Macro-level Discourse Parsing
Feng Jiang
Longwang He
Peifeng Li
Qiaoming Zhu
Haizhou Li
6
0
0
23 May 2023
i-Code Studio: A Configurable and Composable Framework for Integrative
  AI
i-Code Studio: A Configurable and Composable Framework for Integrative AI
Yuwei Fang
Mahmoud Khademi
Chenguang Zhu
Ziyi Yang
Reid Pryzant
...
Yao Qian
Takuya Yoshioka
Lu Yuan
Michael Zeng
Xuedong Huang
30
2
0
23 May 2023
Physics of Language Models: Part 1, Learning Hierarchical Language
  Structures
Physics of Language Models: Part 1, Learning Hierarchical Language Structures
Zeyuan Allen-Zhu
Yuanzhi Li
22
15
0
23 May 2023
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of
  Implied Social Biases
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases
Yiming Zhang
Sravani Nanduri
Liwei Jiang
Tongshuang Wu
Maarten Sap
36
7
0
23 May 2023
Open-world Semi-supervised Generalized Relation Discovery Aligned in a
  Real-world Setting
Open-world Semi-supervised Generalized Relation Discovery Aligned in a Real-world Setting
William Hogan
Jiacheng Li
Jingbo Shang
OffRL
13
1
0
22 May 2023
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large
  Language Models in Knowledge Conflicts
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
198
152
0
22 May 2023
Distilling Robustness into Natural Language Inference Models with
  Domain-Targeted Augmentation
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
Joe Stacey
Marek Rei
14
2
0
22 May 2023
DUMB: A Benchmark for Smart Evaluation of Dutch Models
DUMB: A Benchmark for Smart Evaluation of Dutch Models
Wietse de Vries
Martijn B. Wieling
Malvina Nissim
ELM
ALM
MoE
26
6
0
22 May 2023
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods
  by Language Models
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models
Hao Wang
Hirofumi Shimizu
Daisuke Kawahara
21
1
0
22 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large
  Language Models
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
24
6
0
21 May 2023
"What do others think?": Task-Oriented Conversational Modeling with
  Subjective Knowledge
"What do others think?": Task-Oriented Conversational Modeling with Subjective Knowledge
Chao Zhao
Spandana Gella
Seokhwan Kim
Di Jin
Devamanyu Hazarika
Alexandros Papangelis
Behnam Hedayatnia
Mahdi Namazifar
Yang Liu
Dilek Z. Hakkani-Tür
25
7
0
20 May 2023
Prefix Propagation: Parameter-Efficient Tuning for Long Sequences
Prefix Propagation: Parameter-Efficient Tuning for Long Sequences
Jonathan Li
Will Aitken
R. Bhambhoria
Xiao-Dan Zhu
17
14
0
20 May 2023
Complex Claim Verification with Evidence Retrieved in the Wild
Complex Claim Verification with Evidence Retrieved in the Wild
Jifan Chen
Grace Kim
Aniruddh Sriram
Greg Durrett
Eunsol Choi
HILM
14
68
0
19 May 2023
SeeGULL: A Stereotype Benchmark with Broad Geo-Cultural Coverage
  Leveraging Generative Models
SeeGULL: A Stereotype Benchmark with Broad Geo-Cultural Coverage Leveraging Generative Models
Akshita Jha
Aida Mostafazadeh Davani
Chandan K. Reddy
Shachi Dave
Vinodkumar Prabhakaran
Sunipa Dev
23
40
0
19 May 2023
S$^3$HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid
  Question Answering
S3^33HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering
Fangyu Lei
Xiang Li
Yifan Wei
Shizhu He
Yiming Huang
Jun Zhao
Kang Liu
RALM
30
13
0
19 May 2023
Empower Large Language Model to Perform Better on Industrial
  Domain-Specific Question Answering
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
32
47
0
19 May 2023
Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action
  Recognition through Redefined Skeletal Topology Awareness
Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness
Yuxuan Zhou
Zhi-Qi Cheng
Ju He
Bin Luo
Yifeng Geng
Xuansong Xie
29
11
0
19 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited
  Modalities
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
16
114
0
18 May 2023
Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot
  Relation Extractors
Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors
Kai Zhang
Bernal Jiménez Gutiérrez
Yu-Chuan Su
29
66
0
18 May 2023
Ahead-of-Time P-Tuning
Ahead-of-Time P-Tuning
Daniil Gavrilov
Nikita Balagansky
32
1
0
18 May 2023
Diffusion Language Models Generation Can Be Halted Early
Diffusion Language Models Generation Can Be Halted Early
Sofia Maria Lo Cicero Vaina
Nikita Balagansky
Daniil Gavrilov
DiffM
42
0
0
18 May 2023
Large-Scale Text Analysis Using Generative Language Models: A Case Study
  in Discovering Public Value Expressions in AI Patents
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents
Sergio Pelaez
Gaurav Verma
Barbara Ribeiro
P. Shapira
14
13
0
17 May 2023
Explaining black box text modules in natural language with language
  models
Explaining black box text modules in natural language with language models
Chandan Singh
Aliyah R. Hsu
Richard Antonello
Shailee Jain
Alexander G. Huth
Bin-Xia Yu
Jianfeng Gao
MILM
16
46
0
17 May 2023
Machine-Made Media: Monitoring the Mobilization of Machine-Generated
  Articles on Misinformation and Mainstream News Websites
Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites
Hans W. A. Hanley
Zakir Durumeric
DeLMO
16
29
0
16 May 2023
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to
  Support Human-AI Scientific Writing
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing
Hua Shen
Huang Chieh-Yang
Tongshuang Wu
Ting-Hao 'Kenneth' Huang
16
37
0
16 May 2023
UOR: Universal Backdoor Attacks on Pre-trained Language Models
UOR: Universal Backdoor Attacks on Pre-trained Language Models
Wei Du
Peixuan Li
Bo-wen Li
Haodong Zhao
Gongshen Liu
AAML
37
7
0
16 May 2023
Previous
123...161718192021
Next