v1v2v3 (latest)

WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations

28 August 2018

Mohammad Taher Pilehvar

Jose Camacho-Collados

ArXiv (abs)PDF HTML

Papers citing "WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations"

50 / 339 papers shown

BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024

732

28 Jan 2025

Multi-Objective Hyperparameter Selection via Hypothesis Testing on Reliability Graphs

Amirmohammad Farzaneh

Osvaldo Simeone

875

22 Jan 2025

Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous WordsInternational Conference on Learning Representations (ICLR), 2025

401

09 Jan 2025

JuniperLiu at CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements

Zhu Liu

Zhen Hu

Ying Liu

282

31 Dec 2024

GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary

F. Almeman

Luis Espinosa-Anke

259

09 Dec 2024

Weak-to-Strong Generalization Through the Data-Centric LensInternational Conference on Learning Representations (ICLR), 2024

Changho Shin

John Cooper

Frederic Sala

456

05 Dec 2024

Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models

395

25 Nov 2024

Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation

Sriram Gopalakrishnan

Niladri Chatterjee

Tanmoy Chakraborty

BDL

484

07 Nov 2024

LASER: Attention with Exponential Transformation

Sai Surya Duvvuri

Inderjit Dhillon

196

05 Nov 2024

Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense

Samuel Cahyawijaya

Ruochen Zhang

Holy Lovenia

Jan Christian Blaise Cruz

326

28 Oct 2024

From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition

233

17 Oct 2024

Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable InformationJournal of Biomedical Informatics (JBI), 2024

297

16 Oct 2024

Zeroth-Order Fine-Tuning of LLMs in Random Subspaces

368

11 Oct 2024

ACCEPT: Adaptive Codebook for Composite and Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

277

10 Oct 2024

Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language ModelsInternational Conference on Learning Representations (ICLR), 2024

Vahab Mirrokni

286

09 Oct 2024

Initialization of Large Language Models via Reparameterization to Mitigate Loss SpikesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Kosuke Nishida

Kyosuke Nishida

Kuniko Saito

217

07 Oct 2024

What Matters for Model Merging at Scale?

Prateek Yadav

Tu Vu

Jonathan Lai

Alexandra Chronopoulou

Manaal Faruqui

Joey Tianyi Zhou

Tsendsuren Munkhdalai

MoMe

272

04 Oct 2024

Parameter Competition Balancing for Model MergingNeural Information Processing Systems (NeurIPS), 2024

Jing Li

...

Min Zhang

258

03 Oct 2024

U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

Tung-Yu Wu

Pei-Yu Lo

ReLM LRM

299

02 Oct 2024

Realistic Evaluation of Model Merging for Compositional Generalization

255

26 Sep 2024

Can Language Model Understand Word Semantics as A Chatbot? An Empirical Study of Language Model Internal External Mismatch

Jinman Zhao

256

21 Sep 2024

Distilling Monolingual and Crosslingual Word-in-Context Representations

Yuki Arase

Tomoyuki Kajiwara

229

13 Sep 2024

Fingerprint Vector: Enabling Scalable and Efficient Model Fingerprint Transfer via Vector Addition

315

13 Sep 2024

Building Better Datasets: Seven Recommendations for Responsible Design from Dataset Creators

Will Orr

Kate Crawford

204

30 Aug 2024

Kraken: Inherently Parallel Transformers For Efficient Multi-Device InferenceNeural Information Processing Systems (NeurIPS), 2024

R. Prabhakar

Hengrui Zhang

D. Wentzlaff

294

14 Aug 2024

Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Verna Dankers

Ivan Titov

278

09 Aug 2024

Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack

Xiaoyue Xu

Qinyuan Ye

Xiang Ren

324

23 Jul 2024

Semantic Change Characterization with LLMs using Rhetorics

Jader Martins Camboim de Sá

Marcos Da Silveira

C. Pruski

242

23 Jul 2024

Internal Consistency and Self-Feedback in Large Language Models: A Survey

...

501

19 Jul 2024

Investigating the Contextualised Word Embedding Dimensions Responsible for Contextual and Temporal Semantic Changes

Taichi Aida

Danushka Bollegala

238

03 Jul 2024

To Word Senses and Beyond: Inducing Concepts with Contextualized Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Bastien Liétard

Pascal Denis

Mikaella Keller

253

28 Jun 2024

LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models

147

27 Jun 2024

The Remarkable Robustness of LLMs: Stages of Inference?

Vedang Lad

Wes Gurnee

Max Tegmark

521

27 Jun 2024

BiLD: Bi-directional Logits Difference Loss for Large Language Model DistillationInternational Conference on Computational Linguistics (COLING), 2024

Minchong Li

Feng Zhou

Xiaohui Song

156

19 Jun 2024

When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ting-Yun Chang

Jesse Thomason

Robin Jia

420

19 Jun 2024

UBench: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions

441

18 Jun 2024

Paraphrasing in Affirmative Terms Improves Negation Understanding

MohammadHossein Rezaei

Eduardo Blanco

232

11 Jun 2024

SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with Superposition of Multi Token Embeddings

MohammadAli SadraeiJavaeri

176

07 Jun 2024

BERTs are Generative In-Context LearnersNeural Information Processing Systems (NeurIPS), 2024

David Samuel

231

07 Jun 2024

Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning

207

06 Jun 2024

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity

...

Zhaozhuo Xu

267

05 Jun 2024

UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation

296

31 May 2024

Mixture of Experts Using Tensor Products

163

26 May 2024

Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization

540

24 May 2024

Lessons from the Trenches on Reproducible Evaluation of Language Models

...

370

105

23 May 2024

EMR-Merging: Tuning-Free High-Performance Model MergingNeural Information Processing Systems (NeurIPS), 2024

Chenyu Huang

Peng Ye

Tao Chen

Tong He

Xiangyu Yue

Wanli Ouyang

MoMe

294

23 May 2024

eXmY: A Data Type and Technique for Arbitrary Bit Precision Quantization

Aditya Agrawal

Matthew Hedlund

Blake A. Hechtman

285

22 May 2024

Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion

236

19 May 2024

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMsInternational Conference on Learning Representations (ICLR), 2024

191

05 May 2024

Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuningInternational Conference on Machine Learning (ICML), 2024

Jing Xu

Jingzhao Zhang

233

04 May 2024