ResearchTrend.AI
DeBERTa: Decoding-enhanced BERT with Disentangled Attention

5 June 2020
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
    AAML

Papers citing "DeBERTa: Decoding-enhanced BERT with Disentangled Attention"

50 / 1,015 papers shown
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
Jiaqi Xue
Qian Lou
Mengxin Zheng
26
1
0
23 Oct 2024
Captions Speak Louder than Images (CASLIE): Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data
Xinyi Ling
B. Peng
Hanwen Du
Zhihui Zhu
Xia Ning
26
0
0
22 Oct 2024
Fine-Tuning Large Language Models to Appropriately Abstain with Semantic Entropy
Benedict Aaron Tjandra
Muhammed Razzak
Jannik Kossen
Kunal Handa
Yarin Gal
HILM
28
0
0
22 Oct 2024
Can Large Language Models Act as Ensembler for Multi-GNNs?
Hanqi Duan
Yao Cheng
Jianxiang Yu
Xiang Li
AI4CE
35
0
0
22 Oct 2024
1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification
Ram Mohan Rao Kadiyala
M. V. P. Chandra Sekhara Rao
23
0
0
21 Oct 2024
Mitigating Object Hallucination via Concentric Causal Attention
Yun Xing
Yiheng Li
Ivan Laptev
Shijian Lu
40
18
0
21 Oct 2024
Improve Dense Passage Retrieval with Entailment Tuning
Lu Dai
Hao Liu
Hui Xiong
RALM
35
4
0
21 Oct 2024
Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering
Yanggyu Lee
Jihie Kim
23
1
0
20 Oct 2024
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
German Gritsai
Anastasia Voznyuk
Andrey Grabovoy
Yury Chekhovich
DeLMO
75
1
0
18 Oct 2024
Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement
Zihao Cheng
Li Zhou
Feng Jiang
Benyou Wang
H. Li
DeLMO
39
4
0
18 Oct 2024
Optimizing Preference Alignment with Differentiable NDCG Ranking
Jiacong Zhou
Xianyun Wang
Jun Yu
20
2
0
17 Oct 2024
Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language
Xinmeng Hou
19
1
0
17 Oct 2024
Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Aryan Shrivastava
Jessica Hullman
Max Lamparth
37
6
0
17 Oct 2024
From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization
Catarina G. Belem
Pouya Pezeshkpour
Hayate Iso
Seiji Maekawa
Nikita Bhutani
Estevam R. Hruschka
HILM
65
1
0
17 Oct 2024
Communication-Efficient and Tensorized Federated Fine-Tuning of Large Language Models
Sajjad Ghiasvand
Yifan Yang
Zhiyu Xue
Mahnoosh Alizadeh
Zheng Zhang
Ramtin Pedarsani
FedML
33
3
0
16 Oct 2024
Bridging Large Language Models and Graph Structure Learning Models for Robust Representation Learning
Guangxin Su
Yifan Zhu
Wenjie Zhang
Hanchen Wang
Ying Zhang
31
1
0
15 Oct 2024
HR-Agent: A Task-Oriented Dialogue (TOD) LLM Agent Tailored for HR Applications
Weijie Xu
Jay Desai
Fanyou Wu
Josef Valvoda
Srinivasan H. Sengamedu
LLMAG
36
1
0
15 Oct 2024
Optimizing Encoder-Only Transformers for Session-Based Recommendation Systems
Anis Redjdal
Luis Pinto
Michel Desmarais
18
0
0
15 Oct 2024
BookWorm: A Dataset for Character Description and Analysis
Argyrios Papoudakis
Mirella Lapata
Frank Keller
18
1
0
14 Oct 2024
Disentangling Hate Across Target Identities
Yiping Jin
Leo Wanner
Aneesh Moideen Koya
15
0
0
14 Oct 2024
RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates
Md. Kowsher
Tara Esmaeilbeig
Chun-Nam Yu
Mojtaba Soltanalian
Niloofar Yousefi
27
0
0
14 Oct 2024
GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphs
Yun Zhu
Haizhou Shi
Xiaotang Wang
Yongchao Liu
Yaoke Wang
Boci Peng
Chuntao Hong
Siliang Tang
VLM
43
6
0
14 Oct 2024
Text Classification using Graph Convolutional Networks: A Comprehensive Survey
Syed Mustafa Haider Rizvi
Ramsha Imran
Arif Mahmood
GNN
OOD
FaML
18
0
0
12 Oct 2024
Generation with Dynamic Vocabulary
Yanting Liu
Tao Ji
Changzhi Sun
Yuanbin Wu
Xiaoling Wang
30
0
0
11 Oct 2024
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles
Dom Nasrabadi
24
1
0
11 Oct 2024
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM
LRM
25
0
0
11 Oct 2024
The Effects of Hallucinations in Synthetic Training Data for Relation Extraction
Steven Rogulsky
Nicholas Popovic
Michael Färber
HILM
30
1
0
10 Oct 2024
Evaluating Transformer Models for Suicide Risk Detection on Social Media
Jakub Pokrywka
Jeremi Kaczmarek
Edward Gorzelańczyk
15
0
0
10 Oct 2024
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models
Qingni Wang
Tiantian Geng
Zhiyuan Wang
Teng Wang
Bo Fu
Feng Zheng
25
4
0
10 Oct 2024
Chain and Causal Attention for Efficient Entity Tracking
Erwan Fagnou
Paul Caillon
Blaise Delattre
Alexandre Allauzen
13
2
0
07 Oct 2024
Passage Retrieval of Polish Texts Using OKAPI BM25 and an Ensemble of Cross Encoders
Jakub Pokrywka
12
1
0
06 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
57
15
0
06 Oct 2024
An evaluation of LLM code generation capabilities through graded exercises
Álvaro Barbero Jiménez
ELM
26
0
0
06 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
31
1
0
05 Oct 2024
Variational Language Concepts for Interpreting Foundation Language Models
Hengyi Wang
Shiwei Tan
Zhiqing Hong
Desheng Zhang
Hao Wang
27
3
0
04 Oct 2024
What Matters for Model Merging at Scale?
Prateek Yadav
Tu Vu
Jonathan Lai
Alexandra Chronopoulou
Manaal Faruqui
Mohit Bansal
Tsendsuren Munkhdalai
MoMe
44
12
0
04 Oct 2024
GraphRouter: A Graph-based Router for LLM Selections
Tao Feng
Yanzhen Shen
Jiaxuan You
56
10
0
04 Oct 2024
Measuring and Improving Persuasiveness of Large Language Models
Somesh Singh
Yaman Kumar Singla
Harini SI
Balaji Krishnamurthy
25
3
0
03 Oct 2024
Why context matters in VQA and Reasoning: Semantic interventions for VLM input modalities
Kenza Amara
Lukas Klein
Carsten T. Lüth
Paul Jäger
Hendrik Strobelt
Mennatallah El-Assady
25
1
0
02 Oct 2024
Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting
Siyi Liu
Yang Li
Jiang Li
Shan Yang
Yunshi Lan
LRM
19
1
0
02 Oct 2024
Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding
Yanming Liu
Xinyue Peng
Jiannan Cao
Shi Bo
Yanxin Shen
Tianyu Du
Sheng Cheng
Xun Wang
Jianwei Yin
Xuhong Zhang
58
9
0
02 Oct 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
Seanie Lee
Haebin Seong
Dong Bok Lee
Minki Kang
Xiaoyin Chen
Dominik Wagner
Yoshua Bengio
Juho Lee
Sung Ju Hwang
65
2
0
02 Oct 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Shuhao Chen
Weisen Jiang
Baijiong Lin
James T. Kwok
Yu Zhang
RALM
MQ
40
5
0
30 Sep 2024
HM3: Heterogeneous Multi-Class Model Merging
Stefan Hackmann
MoMe
25
0
0
27 Sep 2024
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
21
5
0
26 Sep 2024
DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking
Devrim Cavusoglu
Secil Sen
Ulas Sert
29
0
0
26 Sep 2024
Data Proportion Detection for Optimized Data Management for Large Language Models
Hao Liang
Keshi Zhao
Yajie Yang
Bin Cui
Guosheng Dong
Zenan Zhou
Wentao Zhang
31
0
0
26 Sep 2024
data2lang2vec: Data Driven Typological Features Completion
Hamidreza Amirzadeh
Sadegh Jafari
Anika Harju
Rob van der Goot
24
1
0
25 Sep 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
40
0
0
25 Sep 2024
Unsupervised Text Representation Learning via Instruction-Tuning for Zero-Shot Dense Retrieval
Qiuhai Zeng
Zimeng Qiu
Dae Yon Hwang
Xin He
William M. Campbell
RALM
16
0
0
24 Sep 2024