Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.03654
Cited By
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
5 June 2020
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeBERTa: Decoding-enhanced BERT with Disentangled Attention"
50 / 1,015 papers shown
Title
On the Impact of Noise in Differentially Private Text Rewriting
Stephen Meisenbacher
Maulik Chevli
Florian Matthes
58
0
0
31 Jan 2025
Figurative-cum-Commonsense Knowledge Infusion for Multimodal Mental Health Meme Classification
Abdullah Mazhar
Zuhair Hasan Shaik
Aseem Srivastava
Polly Ruhnke
Lavanya Vaddavalli
Sri Keshav Katragadda
Shweta Yadav
Md. Shad Akhtar
AI4MH
26
1
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Erik Cambria
LM&MA
AILaw
93
151
0
28 Jan 2025
Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction
Kritarth Prasad
Mohammadi Zaki
Pratik Rakesh Singh
Pankaj Wasnik
31
0
0
28 Jan 2025
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
Jan Christian Blaise Cruz
Alham Fikri Aji
41
1
0
22 Jan 2025
ShadowGenes: Leveraging Recurring Patterns within Computational Graphs for Model Genealogy
Kasimir Schulz
Kieran Evans
26
0
0
21 Jan 2025
Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning
Qiming Bao
Gael Gendron
A. Peng
Wanjun Zhong
N. Tan
Yang Chen
Michael Witbrock
J. Liu
LRM
ELM
68
2
0
20 Jan 2025
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
Z. Chen
Mingxiao Li
Z. Chen
Nan Du
Xiaolong Li
Yuexian Zou
53
0
0
19 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
49
3
0
08 Jan 2025
Trust Modeling in Counseling Conversations: A Benchmark Study
Aseem Srivastava
Zuhair Hasan Shaik
Tanmoy Chakraborty
Md. Shad Akhtar
28
0
0
06 Jan 2025
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
Yang Wang
Chenghua Lin
ELM
35
0
0
05 Jan 2025
Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language
Tomek Rutowski
Amir Harati
Elizabeth Shriberg
Yang Lu
Piotr Chlebek
Ricardo Oliveira
32
7
0
03 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
X. Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
59
17
0
31 Dec 2024
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning
Shuguang Chen
Guang Lin
LRM
73
0
0
28 Dec 2024
Adrenaline: Adaptive Rendering Optimization System for Scalable Cloud Gaming
Jin Heo
Ketan Bhardwaj
Ada Gavrilovska
29
0
0
27 Dec 2024
Computational Analysis of Yaredawi YeZema Silt in Ethiopian Orthodox Tewahedo Church Chants
Mequanent Argaw Muluneh
Yan-Tsung Peng
Li Su
40
0
0
25 Dec 2024
Enriching Social Science Research via Survey Item Linking
Tornike Tsereteli
Daniel Ruffinelli
Simone Paolo Ponzetto
LRM
70
0
0
20 Dec 2024
SEKE: Specialised Experts for Keyword Extraction
Matej Martinc
Hanh Thi Hong Tran
Senja Pollak
Boshko Koloski
MoE
60
0
0
18 Dec 2024
ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models
Yuxi Sun
Wei Gao
Jing Ma
Hongzhan Lin
Ziyang Luo
Wenxuan Zhang
ELM
74
0
0
17 Dec 2024
Understanding Knowledge Hijack Mechanism in In-context Learning through Associative Memory
Shuo Wang
Issei Sato
69
0
0
16 Dec 2024
Beyond Discrete Personas: Personality Modeling Through Journal Intensive Conversations
Sayantan Pal
Souvik Das
R. Srihari
78
1
0
15 Dec 2024
MST-R: Multi-Stage Tuning for Retrieval Systems and Metric Evaluation
Yash Malviya
Karan Dhingra
Maneesh Singh
64
0
0
13 Dec 2024
Word Sense Linking: Disambiguating Outside the Sandbox
Andrei Stefan Bejgu
Edoardo Barba
Luigi Procopio
Alberte Fernández-Castro
Roberto Navigli
72
0
0
12 Dec 2024
Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation
Davor Vukadin
Petar Afrić
Marin Šilić
Goran Delač
FAtt
88
2
0
12 Dec 2024
Experimenting with Multi-modal Information to Predict Success of Indian IPOs
Sohom Ghosh
Arnab Maji
N Harsha Vardhan
S. Naskar
64
0
0
08 Dec 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Michael Y. Hu
Aaron Mueller
Candace Ross
Adina Williams
Tal Linzen
Chengxu Zhuang
Ryan Cotterell
Leshem Choshen
Alex Warstadt
Ethan Gotlieb Wilcox
91
7
0
06 Dec 2024
AntLM: Bridging Causal and Masked Language Models
Xinru Yu
Bin Guo
Shiwei Luo
J. Wang
Tao Ji
Yuanbin Wu
CLL
74
1
0
04 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
64
0
0
27 Nov 2024
PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning
Zhen Sun
Tianshuo Cong
Yule Liu
Chenhao Lin
Xinlei He
Rongmao Chen
Xingshuo Han
Xinyi Huang
AAML
72
3
0
26 Nov 2024
TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching
Yuan-Ming Li
An-Lan Wang
Kun-Yu Lin
Yu-Ming Tang
Ling-an Zeng
Jian-Fang Hu
Wei-Shi Zheng
93
6
0
26 Nov 2024
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Y. Fu
Yin Yu
Xiaotian Han
Runchao Li
Xianxuan Long
Haotian Yu
Pan Li
SyDa
57
0
0
25 Nov 2024
Interpreting Language Reward Models via Contrastive Explanations
Junqi Jiang
Tom Bewley
Saumitra Mishra
Freddy Lecue
Manuela Veloso
74
0
0
25 Nov 2024
Development of Pre-Trained Transformer-based Models for the Nepali Language
Prajwal Thapa
Jinu Nyachhyon
Mridul Sharma
Bal Krishna Bal
63
0
0
24 Nov 2024
Exploring Large Language Models for Multimodal Sentiment Analysis: Challenges, Benchmarks, and Future Directions
Shezheng Song
56
0
0
23 Nov 2024
Inducing Human-like Biases in Moral Reasoning Language Models
Artem Karpov
Seong Hah Cho
Austin Meek
Raymond Koopmanschap
Lucy Farnik
Bogdan-Ionut Cirstea
LRM
59
0
0
23 Nov 2024
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages
Bethel Melesse Tessema
Akhil Kedia
Tae-Sun Chung
64
0
0
21 Nov 2024
PatentEdits: Framing Patent Novelty as Textual Entailment
Ryan Lee
Alexander Spangher
Xuezhe Ma
56
0
0
20 Nov 2024
Combining Autoregressive and Autoencoder Language Models for Text Classification
João Gonçalves
75
0
0
20 Nov 2024
Unraveling the Gradient Descent Dynamics of Transformers
Bingqing Song
Boran Han
Shuai Zhang
Jie Ding
Mingyi Hong
AI4CE
21
1
0
12 Nov 2024
Multi-head Span-based Detector for AI-generated Fragments in Scientific Papers
German Gritsai
Ildar Khabutdinov
Andrey Grabovoy
DeLMO
50
3
0
11 Nov 2024
GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification
Priya Mishra
Suraj Racha
Kaustubh Ponkshe
Adit Akarsh
Ganesh Ramakrishnan
28
0
0
08 Nov 2024
Gradient Boosting Trees and Large Language Models for Tabular Data Few-Shot Learning
Carlos Huertas
LMTD
38
1
0
06 Nov 2024
Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI
R. Kaur
Colin Samplawski
Adam Cobb
Anirban Roy
Brian Matejek
...
Daniel Elenius
Alexander M. Berenbeim
John A. Pavlik
Nathaniel D. Bastian
Susmit Jha
26
3
0
04 Nov 2024
Graph-based Confidence Calibration for Large Language Models
Yukun Li
Sijia Wang
Lifu Huang
Li-Ping Liu
UQCV
25
1
0
03 Nov 2024
Zipfian Whitening
Sho Yokoi
Han Bao
Hiroto Kurita
Hidetoshi Shimodaira
27
0
0
01 Nov 2024
MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference Calibration
David Anugraha
Garry Kuwanto
Lucky Susanto
Derry Wijaya
Genta Indra Winata
OSLM
35
2
0
01 Nov 2024
The Automated Verification of Textual Claims (AVeriTeC) Shared Task
M. Schlichtkrull
Yulong Chen
Chenxi Whitehouse
Zhenyun Deng
Mubashara Akhtar
...
Christos Christodoulopoulos
O. Cocarascu
Arpit Mittal
James Thorne
Andreas Vlachos
39
6
0
31 Oct 2024
Graph-Augmented Relation Extraction Model with LLMs-Generated Support Document
Vicky Dong
Hao Yu
Yao Chen
25
0
0
30 Oct 2024
Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings
Yashvir S. Grewal
Edwin V. Bonilla
Thang D. Bui
UQCV
25
3
0
30 Oct 2024
Retrieval-Enhanced Mutation Mastery: Augmenting Zero-Shot Prediction of Protein Language Model
Y. Tan
Ruilin Wang
Banghao Wu
Liang Hong
Bingxin Zhou
21
9
0
28 Oct 2024
Previous
1
2
3
4
5
6
...
19
20
21
Next