ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09543
  4. Cited By
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with
  Gradient-Disentangled Embedding Sharing

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen
ArXivPDFHTML

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 138 papers shown
Title
Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules
Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules
Michail Theologitis
V. Samoladas
Antonios Deligiannakis
29
0
0
07 May 2025
Bi-directional Model Cascading with Proxy Confidence
Bi-directional Model Cascading with Proxy Confidence
David Warren
Mark Dras
44
0
0
27 Apr 2025
Span-Level Hallucination Detection for LLM-Generated Answers
Span-Level Hallucination Detection for LLM-Generated Answers
Passant Elchafei
Mervet Abu-Elkheir
HILM
LRM
59
1
0
25 Apr 2025
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Haokai Zhang
Shengtao Zhang
Zijian Cai
Heng Wang
Ruixuan Zhu
Zinan Zeng
Minnan Luo
49
0
0
24 Apr 2025
QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
Fengze Liu
Weidong Zhou
Binbin Liu
Zhimiao Yu
Yifan Zhang
...
Yifeng Yu
Bingni Zhang
Xiaohuan Zhou
Taifeng Wang
Yong Cao
55
0
0
23 Apr 2025
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
S. Bhagat
Ibne Farabi Shihab
Anuj Sharma
27
0
0
17 Apr 2025
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
Jennifer D’Souza
Sameer Sadruddin
Holger Israel
Mathias Begoin
Diana Slawig
57
5
0
09 Apr 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
102
1
0
07 Mar 2025
Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace
Jia-Chen Zhang
Yu-Jie Xiong
Chun-Ming Xia
Dong-Hai Zhu
Xi-He Qiu
64
1
0
03 Mar 2025
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking
Yi-Ling Chung
Aurora Cobo
Pablo Serna
SyDa
HILM
58
0
0
24 Feb 2025
Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language
Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language
Amalie Brogaard Pauli
Isabelle Augenstein
Ira Assent
44
5
0
24 Feb 2025
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
Song Duong
Florian Le Bronnec
Alexandre Allauzen
Vincent Guigue
Alberto Lumbreras
Laure Soulier
Patrick Gallinari
HILM
43
0
0
20 Feb 2025
An Efficient Row-Based Sparse Fine-Tuning
An Efficient Row-Based Sparse Fine-Tuning
Cen-Jhih Li
Aditya Bhaskara
52
0
0
17 Feb 2025
BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop
BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop
Lucas Charpentier
Leshem Choshen
Ryan Cotterell
Mustafa Omer Gul
Michael Y. Hu
...
Candace Ross
Raj Sanjay Shah
Alex Warstadt
Ethan Gotlieb Wilcox
Adina Williams
47
2
0
15 Feb 2025
Behavioral Homophily in Social Media via Inverse Reinforcement Learning: A Reddit Case Study
Behavioral Homophily in Social Media via Inverse Reinforcement Learning: A Reddit Case Study
Lanqin Yuan
Philipp J. Schneider
Marian-Andrei Rizoiu
53
0
0
05 Feb 2025
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
Jan Christian Blaise Cruz
Alham Fikri Aji
41
1
0
22 Jan 2025
AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
Alireza Ghahramani Kure
Mahshid Dehghani
Mohammad Mahdi Abootorabi
Nona Ghazizadeh
Seyed Arshan Dalili
Ehsaneddin Asgari
47
1
0
19 Jan 2025
RAGBench: Explainable Benchmark for Retrieval-Augmented Generation Systems
RAGBench: Explainable Benchmark for Retrieval-Augmented Generation Systems
Robert Friel
Masha Belyi
Atindriyo Sanyal
72
18
0
17 Jan 2025
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models
Fan Wang
Juyong Jiang
Chansung Park
Sunghun Kim
Jing Tang
91
1
0
08 Dec 2024
TabVer: Tabular Fact Verification with Natural Logic
TabVer: Tabular Fact Verification with Natural Logic
Rami Aly
Andreas Vlachos
LMTD
26
0
0
02 Nov 2024
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Kuo-Han Hung
Ching-Yun Ko
Ambrish Rawat
I-Hsin Chung
Winston H. Hsu
Pin-Yu Chen
49
7
0
01 Nov 2024
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models
H. Li
Xiaogeng Liu
SILM
37
4
0
30 Oct 2024
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
German Gritsai
Anastasia Voznyuk
Andrey Grabovoy
Yury Chekhovich
DeLMO
75
1
0
18 Oct 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
Nandan Thakur
Suleman Kazi
Ge Luo
Jimmy J. Lin
Amin Ahmad
VLM
RALM
26
7
0
17 Oct 2024
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
Xiaonan Jing
Srinivas Billa
Danny Godbout
HILM
40
0
0
16 Oct 2024
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with
  Explanatory Argumentative Structures
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures
Ekaterina Sviridova
Anar Yeginbergen
A. Estarrona
Elena Cabrio
S. Villata
Rodrigo Agerri
42
2
0
07 Oct 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
Seanie Lee
Haebin Seong
Dong Bok Lee
Minki Kang
Xiaoyin Chen
Dominik Wagner
Yoshua Bengio
Juho Lee
Sung Ju Hwang
65
2
0
02 Oct 2024
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in
  German Text Classification
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification
Andreas Waldis
Joel Birrer
Anne Lauscher
Iryna Gurevych
25
1
0
26 Sep 2024
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
Haolin Chen
Philip N. Garner
47
1
0
16 Sep 2024
Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation Between the United States and South Africa
Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation Between the United States and South Africa
Hayoung Jung
Prerna Juneja
Tanushree Mitra
MLAU
63
0
0
16 Sep 2024
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for
  Political Text
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text
Michael Burnham
Kayla Kahn
Ryan Yank Wang
Rachel X. Peng
32
4
0
03 Sep 2024
From Prediction to Application: Language Model-based Code Knowledge
  Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with
  Pedagogical Prompting for Comprehensive Programming Education
From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education
Unggi Lee
Jiyeong Bae
Yeonji Jung
Minji Kang
Gyuri Byun
...
Sookbun Lee
Jaekwon Park
Taekyung Ahn
Gunho Lee
Hyeoncheol Kim
AI4Ed
KELM
26
1
0
31 Aug 2024
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models
Jonathan Bourne
49
4
0
30 Aug 2024
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
Adam Gould
Pere-Lluis Huguet-Cabot
S. Dadhania
Francesca Toni
67
7
0
31 Jul 2024
KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting
  Relations in Dialogical Argument Mining
KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting Relations in Dialogical Argument Mining
Zihao Zheng
Zhaowei Wang
Qing Zong
Yangqiu Song
LRM
40
1
0
29 Jul 2024
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
52
0
0
02 Jul 2024
Datasets for Multilingual Answer Sentence Selection
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
36
0
0
14 Jun 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Holy Lovenia
Rahmad Mahendra
Salsabil Maulana Akbar
Lester James Validad Miranda
Jennifer Santoso
...
Genta Indra Winata
Ruochen Zhang
Fajri Koto
Zheng-Xin Yong
Samuel Cahyawijaya
77
9
0
14 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
39
0
0
05 Jun 2024
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
Yuchen Zhuang
Haotian Sun
Yue Yu
Rushi Qiang
Qifan Wang
Chao Zhang
Bo Dai
AAML
41
14
0
05 Jun 2024
Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction
Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction
William Hogan
Jingbo Shang
13
0
0
31 May 2024
Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource
  Language Analysis With Character-Aware Hierarchical Transformers
Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers
Frederick Riemenschneider
Kevin Krahn
27
2
0
30 May 2024
AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering
  System for Commonsense Defying Reasoning
AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning
Mina Ghashami
Soumya Smruti Mishra
LRM
32
1
0
16 May 2024
Explainable Automatic Grading with Neural Additive Models
Explainable Automatic Grading with Neural Additive Models
Aubrey Condor
Z. Pardos
ELM
22
2
0
01 May 2024
FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning
  Leveraging Weight Decomposition
FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition
Yuxuan Yan
Qianqian Yang
Shunpu Tang
Zhiguo Shi
27
13
0
29 Apr 2024
MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology
  with Multimodal Learning
MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
Nadia Saeed
MedIm
30
2
0
27 Apr 2024
Generating Attractive and Authentic Copywriting from Customer Reviews
Generating Attractive and Authentic Copywriting from Customer Reviews
Yu-Xiang Lin
Wei-Yun Ma
34
2
0
22 Apr 2024
Do "English" Named Entity Recognizers Work Well on Global Englishes?
Do "English" Named Entity Recognizers Work Well on Global Englishes?
Alexander Shan
John Bauer
Riley Carlson
Christopher D. Manning
25
2
0
20 Apr 2024
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Yang Lin
Xinyu Ma
Xu Chu
Yujie Jin
Zhibang Yang
Yasha Wang
Hong-yan Mei
44
19
0
15 Apr 2024
JaFIn: Japanese Financial Instruction Dataset
JaFIn: Japanese Financial Instruction Dataset
Kota Tanabe
Masahiro Suzuki
Hiroki Sakaji
Itsuki Noda
39
1
0
14 Apr 2024
123
Next