ResearchTrend.AI
DeBERTa: Decoding-enhanced BERT with Disentangled Attention

5 June 2020
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
    AAML

Papers citing "DeBERTa: Decoding-enhanced BERT with Disentangled Attention"

50 / 1,037 papers shown
Title
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding
ShuWei Feng
Tianyang Zhan
Zhanming Jie
Trung Quoc Luong
Xiaoran Jin
11
1
0
16 May 2023
Knowledge Rumination for Pre-trained Language Models
Yunzhi Yao
Peng Wang
Shengyu Mao
Chuanqi Tan
Fei Huang
Huajun Chen
Ningyu Zhang
KELM
17
3
0
15 May 2023
Adam-Smith at SemEval-2023 Task 4: Discovering Human Values in Arguments with Ensembles of Transformer-based Models
Daniel Schroter
Daryna Dementieva
Georg Groh
9
8
0
15 May 2023
MeeQA: Natural Questions in Meeting Transcripts
Reut Apel
Tom Braude
Amir Kantor
Eyal Kolman
RALM
16
1
0
15 May 2023
Text Classification via Large Language Models
Xiaofei Sun
Xiaoya Li
Jiwei Li
Fei Wu
Shangwei Guo
Tianwei Zhang
Guoyin Wang
RALM
LRM
32
135
0
15 May 2023
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering
Qianglong Chen
Guohai Xu
Mingshi Yan
Ji Zhang
Fei Huang
Luo Si
Yin Zhang
8
9
0
14 May 2023
A Simple and Plug-and-play Method for Unsupervised Sentence Representation Enhancement
Lingfeng Shen
Haiyun Jiang
Lemao Liu
Shuming Shi
13
1
0
13 May 2023
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
Wei-Lin Chen
An-Zi Yen
Cheng-Kuang Wu
Hen-Hsen Huang
Hsin-Hsi Chen
ReLM
LRM
17
10
0
12 May 2023
Overinformative Question Answering by Humans and Machines
Polina Tsvilodub
Michael Franke
Robert D. Hawkins
Noah D. Goodman
12
2
0
11 May 2023
IUST_NLP at SemEval-2023 Task 10: Explainable Detecting Sexism with Transformers and Task-adaptive Pretraining
Hadi Mahmoudi
8
0
0
11 May 2023
THUIR@COLIEE 2023: More Parameters and Legal Knowledge for Legal Case Entailment
Haitao Li
Chang Wang
Weihang Su
Yueyue Wu
Qingyao Ai
Y. Liu
AILaw
ELM
14
16
0
11 May 2023
Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*
João Rodrigues
Luís Gomes
Joao Silva
António Branco
Rodrigo Santos
Henrique Lopes Cardoso
T. Osório
11
43
0
11 May 2023
Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good
Fernando Gonzalez
Zhijing Jin
Bernhard Schölkopf
Tom Hope
Mrinmaya Sachan
Rada Mihalcea
29
5
0
09 May 2023
Attack Named Entity Recognition by Entity Boundary Interference
Yifei Yang
Hongqiu Wu
Hai Zhao
AAML
14
4
0
09 May 2023
COLA: Contextualized Commonsense Causal Reasoning from the Causal Inference Perspective
Zhaowei Wang
Quyet V. Do
Hongming Zhang
Jiayao Zhang
Weiqi Wang
Tianqing Fang
Yangqiu Song
Ginny Y. Wong
Simon See
LRM
21
28
0
09 May 2023
Summarization with Precise Length Control
Lesly Miculicich
Yujia Xie
Song Wang
Pengcheng He
17
2
0
09 May 2023
CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning
Weiqi Wang
Tianqing Fang
Baixuan Xu
Chun Yi Louis Bo
Yangqiu Song
Lei Chen
ReLM
LRM
19
34
0
08 May 2023
Toward Adversarial Training on Contextualized Language Representation
Hongqiu Wu
Y. Liu
Han Shi
Haizhen Zhao
M. Zhang
AAML
13
13
0
08 May 2023
Stanford MLab at SemEval-2023 Task 10: Exploring GloVe- and Transformer-Based Methods for the Explainable Detection of Online Sexism
Hee Jung Choi
Trevor Chow
Aaron Wan
Hong Meng Yam
Swetha Yogeswaran
Beining Zhou
20
1
0
07 May 2023
Unified Demonstration Retriever for In-Context Learning
Xiaonan Li
Kai Lv
Hang Yan
Tianya Lin
Wei-wei Zhu
Yuan Ni
Guotong Xie
Xiaoling Wang
Xipeng Qiu
RALM
VPVLM
19
120
0
07 May 2023
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Zhoujian Sun
Chenyang Zhao
Zheng-Wei Huang
Nai Ding
OffRL
22
1
0
06 May 2023
Pre-training Language Model as a Multi-perspective Course Learner
Beiduo Chen
Shaohan Huang
Zi-qiang Zhang
Wu Guo
Zhen-Hua Ling
Haizhen Huang
Furu Wei
Weiwei Deng
Qi Zhang
11
0
0
06 May 2023
NER-to-MRC: Named-Entity Recognition Completely Solving as Machine Reading Comprehension
Yuxiang Zhang
Junjie Wang
Xinyu Zhu
Tetsuya Sakai
Hayato Yamana
19
2
0
06 May 2023
NorBench -- A Benchmark for Norwegian Language Models
David Samuel
Andrey Kutuzov
Samia Touileb
Erik Velldal
Lilja Ovrelid
Egil Rønningstad
Elina Sigdel
Anna Palatkina
8
23
0
06 May 2023
PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer
Lichang Chen
Heng-Chiao Huang
Varun Madhavan
AAML
111
11
0
03 May 2023
ChatGraph: Interpretable Text Classification by Converting ChatGPT Knowledge to Graphs
Yucheng Shi
Hehuan Ma
Wenliang Zhong
Qiaoyu Tan
Gengchen Mai
Xiang Li
Tianming Liu
Junzhou Huang
AI4MH
11
32
0
03 May 2023
PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives
Silin Gao
Beatriz Borges
B. Su
Stan N. Finkelstein
Saya Kanno
Hiromi Wakaki
Yuki Mitsufuji
Antoine Bosselut
37
19
0
03 May 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
24
17
0
03 May 2023
Mitigating Approximate Memorization in Language Models via Dissimilarity Learned Policy
Aly M. Kassem
26
2
0
02 May 2023
RexUIE: A Recursive Method with Explicit Schema Instructor for Universal Information Extraction
Chengyuan Liu
Fubang Zhao
Yangyang Kang
Jingyuan Zhang
Xiang Zhou
Changlong Sun
Kun Kuang
Fei Wu
33
9
0
28 Apr 2023
Analyzing Vietnamese Legal Questions Using Deep Neural Networks with Biaffine Classifiers
Nguyen Anh Tu
Hoang Thi Thu Uyen
Tu Minh Phuong
Ngo Xuan Bach
AILaw
18
1
0
27 Apr 2023
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Simran Arora
Brandon Yang
Sabri Eyuboglu
A. Narayan
Andrew Hojel
Immanuel Trummer
Christopher Ré
SyDa
47
69
0
19 Apr 2023
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
Zheng Lian
Haiyang Sun
Licai Sun
Kang Chen
Mingyu Xu
...
Meng Wang
Erik Cambria
Guoying Zhao
Björn W. Schuller
Jianhua Tao
22
47
0
18 Apr 2023
MisRoBÆRTa: Transformers versus Misinformation
Ciprian-Octavian Truică
Elena Simona Apostol
11
37
0
16 Apr 2023
Evaluation of Social Biases in Recent Large Pre-Trained Models
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
Alind Jain
13
0
0
13 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
24
17
0
10 Apr 2023
Attention at SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS)
Debashish Roy
Manish Shrivastava
16
1
0
10 Apr 2023
UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for Classifying Common Mental Illnesses on Social Media Posts
Pratinav Seth
Mihir Agarwal
AI4MH
16
1
0
10 Apr 2023
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design
Valentina Pyatkin
Frances Yung
Merel C. J. Scholman
Reut Tsarfaty
Ido Dagan
Vera Demberg
19
12
0
03 Apr 2023
Adapting Pretrained Language Models for Solving Tabular Prediction Problems in the Electronic Health Record
C. McMaster
D. Liew
Douglas E. V. Pires
22
4
0
27 Mar 2023
Salient Span Masking for Temporal Understanding
Jeremy R. Cole
Aditi Chaudhary
Bhuwan Dhingra
Partha P. Talukdar
27
11
0
22 Mar 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
21
7
0
10 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
10
117
0
07 Mar 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq R. Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
11
27
0
07 Mar 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
47
237
0
19 Feb 2023
AutoWS: Automated Weak Supervision Framework for Text Classification
Abhinav Bohra
Huy-Thanh Nguyen
Devashish Khatwani
NoLa
17
0
0
07 Feb 2023
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets
Tosin P. Adewumi
Isabella Sodergren
Lama Alkhaled
Sana Sabah Sabry
F. Liwicki
Marcus Liwicki
12
4
0
28 Jan 2023
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
Max Ryabinin
Tim Dettmers
Michael Diskin
Alexander Borzunov
MoE
15
31
0
27 Jan 2023
Characterizing the Entities in Harmful Memes: Who is the Hero, the Villain, the Victim?
Shivam Sharma
Atharva Kulkarni
Tharun Suresh
Himanshi Mathur
Preslav Nakov
Md. Shad Akhtar
Tanmoy Chakraborty
21
15
0
26 Jan 2023
An Experimental Study on Pretraining Transformers from Scratch for IR
Carlos Lassance
Hervé Déjean
S. Clinchant
16
11
0
25 Jan 2023