DeBERTa: Decoding-enhanced BERT with Disentangled Attention

5 June 2020
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
    AAML

Papers citing "DeBERTa: Decoding-enhanced BERT with Disentangled Attention"

50 / 1,037 papers shown
Title
A fast and sound tagging method for discontinuous named-entity recognition
Caio Corro
19
0
0
24 Sep 2024
ASTE Transformer Modelling Dependencies in Aspect-Sentiment Triplet Extraction
Iwo Naglik
Mateusz Lango
23
0
0
23 Sep 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
25
6
0
23 Sep 2024
Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators
Prasoon Bajpai
Niladri Chatterjee
Subhabrata Dutta
Tanmoy Chakraborty
ELM
11
0
0
21 Sep 2024
"I Never Said That": A dataset, taxonomy and baselines on response clarity classification
Konstantinos Thomas
Giorgos Filandrianos
Maria Lymperaiou
Chrysoula Zerva
Giorgos Stamou
17
0
0
20 Sep 2024
Contextual Breach: Assessing the Robustness of Transformer-based QA Models
Asir Saadat
Nahian Ibn Asad
Md Farhan Ishmam
AAML
29
0
0
17 Sep 2024
Propulsion: Steering LLM with Tiny Fine-Tuning
Md. Kowsher
Nusrat Jahan Prottasha
Prakash Bhat
38
4
0
17 Sep 2024
Deep Fast Machine Learning Utils: A Python Library for Streamlined Machine Learning Prototyping
Fabi Prezja
AI4CE
27
0
0
14 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
14
3
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
58
23
0
10 Sep 2024
RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Chengyuan Liu
Shihang Wang
Fubang Zhao
Kun Kuang
Yangyang Kang
Weiming Lu
Changlong Sun
Fei Wu
23
0
0
09 Sep 2024
Interactive Machine Teaching by Labeling Rules and Instances
Giannis Karamanolakis
Daniel J. Hsu
Luis Gravano
27
0
0
08 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
25
1
0
08 Sep 2024
LATEX-GCL: Large Language Models (LLMs)-Based Data Augmentation for Text-Attributed Graph Contrastive Learning
Haoran Yang
Xiangyu Zhao
Sirui Huang
Qing Li
Guandong Xu
24
4
0
02 Sep 2024
Generating Media Background Checks for Automated Source Critical Reasoning
Michael Schlichtkrull
19
3
0
01 Sep 2024
Post-OCR Text Correction for Bulgarian Historical Documents
Angel Beshirov
Milena Dobreva
Dimitar Dimitrov
Momchil Hardalov
Ivan Koychev
Preslav Nakov
34
1
0
31 Aug 2024
iToT: An Interactive System for Customized Tree-of-Thought Generation
Alan Boyle
Isha Gupta
Sebastian Hönig
Lukas Mautner
Kenza Amara
Furui Cheng
Mennatallah El-Assady
LRM
LM&Ro
32
1
0
31 Aug 2024
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models
Jonathan Bourne
49
4
0
30 Aug 2024
Is text normalization relevant for classifying medieval charters?
Florian Atzenhofer-Baumgartner
Tamás Kovács
16
0
0
29 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
39
2
0
27 Aug 2024
Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning
Sakhinana Sagar Srinivas
Venkataramana Runkana
AI4CE
30
1
0
27 Aug 2024
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task
Lingyun Huang
Jianxu Mao
Yaonan Wang
Junfei Yi
Ziming Tao
VLM
VPVLM
37
1
0
27 Aug 2024
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
30
0
0
24 Aug 2024
Preliminary Investigations of a Multi-Faceted Robust and Synergistic Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision Transformers with Large Language and Multimodal Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
24
0
0
24 Aug 2024
A Comparative Analysis of Faithfulness Metrics and Humans in Citation Evaluation
Weijia Zhang
Mohammad Aliannejadi
Jiahuan Pei
Yifei Yuan
Jia-Hong Huang
Evangelos Kanoulas
HILM
29
4
0
22 Aug 2024
Differentiating Choices via Commonality for Multiple-Choice Question Answering
Wenqing Deng
Zhe Wang
Kewen Wang
Shirui Pan
Xiaowang Zhang
Zhiyong Feng
29
0
0
21 Aug 2024
A Little Confidence Goes a Long Way
J. Scoville
Shang Gao
Devanshu Agrawal
Javed Qadrud-Din
24
0
0
20 Aug 2024
CHECKWHY: Causal Fact Verification via Argument Structure
Jiasheng Si
Yibo Zhao
Yingjie Zhu
Haiyang Zhu
Wenpeng Lu
Deyu Zhou
CML
HILM
LRM
35
1
0
20 Aug 2024
Crossing New Frontiers: Knowledge-Augmented Large Language Model Prompting for Zero-Shot Text-Based De Novo Molecule Design
Sakhinana Sagar Srinivas
Venkataramana Runkana
26
1
0
18 Aug 2024
Chinese Metaphor Recognition Using a Multi-stage Prompting Large Language Model
Jie Wang
Jin Wang
Xuejie Zhang
LRM
23
1
0
17 Aug 2024
Identifying Technical Debt and Its Types Across Diverse Software Projects Issues
Karthik Shivashankar
Mili Orucevic
Maren Maritsdatter Kruke
Antonio Martini
25
1
0
17 Aug 2024
Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius
Xuehao Wang
Weisen Jiang
Shuai Fu
Yu Zhang
AAML
39
0
0
15 Aug 2024
Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding
Bing Hu
Anita Layton
Helen Chen
MedIm
21
1
0
14 Aug 2024
PolyCL: Contrastive Learning for Polymer Representation Learning via Explicit and Implicit Augmentations
Jiajun Zhou
Yijie Yang
Austin M. Mroz
Kim E. Jelfs
SSL
22
1
0
14 Aug 2024
LoRA$^2$: Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models
Jia-Chen Zhang
Yu-Jie Xiong
He-Xi Qiu
Dong-Hai Zhu
Chun-Ming Xia
MoE
24
0
0
13 Aug 2024
Path-LLM: A Shortest-Path-based LLM Learning for Unified Graph Representation
Wenbo Shang
Xuliang Zhu
Xin Huang
34
3
0
10 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
19
0
0
09 Aug 2024
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
Ziyuan Zhuang
Zhiyang Zhang
Sitao Cheng
Fangkai Yang
Jia Liu
Shujian Huang
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
RALM
38
6
0
08 Aug 2024
VideoQA in the Era of LLMs: An Empirical Study
Junbin Xiao
Nanxin Huang
Hangyu Qin
Dongyang Li
Yicong Li
...
Zhulin Tao
Jianxing Yu
Liang Lin
Tat-Seng Chua
Angela Yao
23
10
0
08 Aug 2024
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection
Mervat Abassy
Kareem Elozeiri
Alexander Aziz
Minh Ngoc Ta
Raj Vardhan Tomar
...
Alham Fikri Aji
Artem Shelmanov
Nizar Habash
Iryna Gurevych
Preslav Nakov
DeLMO
48
11
0
08 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
32
0
0
04 Aug 2024
Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs
Afia Anjum
Maksim E. Eren
V. Setlur
Boian Alexandrov
Manish Bhattarai
24
2
0
02 Aug 2024
Synth-Empathy: Towards High-Quality Synthetic Empathy Data
Hao Liang
Linzhuang Sun
Jingxuan Wei
Xijie Huang
Linkun Sun
Bihui Yu
Conghui He
Wentao Zhang
SyDa
40
4
0
31 Jul 2024
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends
Giuliano Martinelli
Martin Larsson
Johannes Wiesel
21
7
0
31 Jul 2024
Cost-Effective Hallucination Detection for LLMs
Simon Valentin
Jinmiao Fu
Gianluca Detommaso
Shaoyuan Xu
Giovanni Zappella
Bryan Wang
HILM
33
4
0
31 Jul 2024
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods
Gabriel Loiseau
Damien Sileo
Damien Riquet
Maxime Meyer
Marc Tommasi
38
0
0
31 Jul 2024
Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation
Chaoyi Ai
Yong-jia Jiang
Shen Huang
Pengjun Xie
Kewei Tu
37
0
0
26 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
50
10
0
26 Jul 2024
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification
Vivi Nastase
Paola Merlo
23
2
0
25 Jul 2024
Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo
E. Hauptmann
AIFin
31
2
0
25 Jul 2024