Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1808.09121
Cited By
v1
v2
v3 (latest)
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
28 August 2018
Mohammad Taher Pilehvar
Jose Camacho-Collados
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations"
50 / 339 papers shown
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Yibin Wang
Haizhou Shi
Ligong Han
Dimitris N. Metaxas
Hao Wang
BDL
UQLM
732
23
0
28 Jan 2025
Multi-Objective Hyperparameter Selection via Hypothesis Testing on Reliability Graphs
Amirmohammad Farzaneh
Osvaldo Simeone
875
1
0
22 Jan 2025
Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words
International Conference on Learning Representations (ICLR), 2025
Gouki Minegishi
Hiroki Furuta
Yusuke Iwasawa
Y. Matsuo
401
10
0
09 Jan 2025
JuniperLiu at CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements
Zhu Liu
Zhen Hu
Ying Liu
282
0
0
31 Dec 2024
GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary
F. Almeman
Luis Espinosa-Anke
259
1
0
09 Dec 2024
Weak-to-Strong Generalization Through the Data-Centric Lens
International Conference on Learning Representations (ICLR), 2024
Changho Shin
John Cooper
Frederic Sala
456
14
0
05 Dec 2024
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Y. Fu
Yin Yu
Xiaotian Han
Runchao Li
Xianxuan Long
Haotian Yu
Pan Li
SyDa
395
0
0
25 Nov 2024
Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation
Ayan Sengupta
Vaibhav Seth
Arinjay Pathak
Aastha Verma
Natraj Raman
Sriram Gopalakrishnan
Niladri Chatterjee
Tanmoy Chakraborty
BDL
484
3
0
07 Nov 2024
LASER: Attention with Exponential Transformation
Sai Surya Duvvuri
Inderjit Dhillon
196
2
0
05 Nov 2024
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense
Samuel Cahyawijaya
Ruochen Zhang
Holy Lovenia
Jan Christian Blaise Cruz
Elisa Gilbert
Hiroki Nomoto
Alham Fikri Aji
LRM
326
0
0
28 Oct 2024
From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition
Qiyuan Yang
Pengda Wang
Luke D. Plonsky
Frederick L. Oswald
Hanjie Chen
ELM
233
2
0
17 Oct 2024
Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information
Journal of Biomedical Informatics (JBI), 2024
Yingya Li
Timothy A. Miller
Steven Bethard
G. Savova
297
4
0
16 Oct 2024
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces
Ziming Yu
Pan Zhou
Sike Wang
Jia Li
Hua Huang
Hua Huang
368
0
0
11 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
277
2
0
10 Oct 2024
Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models
International Conference on Learning Representations (ICLR), 2024
Zeman Li
Xinwei Zhang
Peilin Zhong
Yuan Deng
Meisam Razaviyayn
Vahab Mirrokni
286
11
0
09 Oct 2024
Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kosuke Nishida
Kyosuke Nishida
Kuniko Saito
217
6
0
07 Oct 2024
What Matters for Model Merging at Scale?
Prateek Yadav
Tu Vu
Jonathan Lai
Alexandra Chronopoulou
Manaal Faruqui
Joey Tianyi Zhou
Tsendsuren Munkhdalai
MoMe
272
43
0
04 Oct 2024
Parameter Competition Balancing for Model Merging
Neural Information Processing Systems (NeurIPS), 2024
Guodong DU
Junlin Lee
Jing Li
Runhua Jiang
Yifei Guo
...
Hanting Liu
Sim Kuan Goh
Jing Li
Daojing He
Min Zhang
MoMe
258
43
0
03 Oct 2024
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
International Conference on Learning Representations (ICLR), 2024
Tung-Yu Wu
Pei-Yu Lo
ReLM
LRM
299
5
0
02 Oct 2024
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
255
11
0
26 Sep 2024
Can Language Model Understand Word Semantics as A Chatbot? An Empirical Study of Language Model Internal External Mismatch
Jinman Zhao
Xueyan Zhang
Xingyu Yue
Weizhe Chen
Zifan Qian
Ruiyu Wang
LRM
256
0
0
21 Sep 2024
Distilling Monolingual and Crosslingual Word-in-Context Representations
Yuki Arase
Tomoyuki Kajiwara
229
0
0
13 Sep 2024
Fingerprint Vector: Enabling Scalable and Efficient Model Fingerprint Transfer via Vector Addition
Zhenhua Xu
Wenpeng Xing
Zhebo Wang
Wenpeng Xing
Chen Jie
Mohan Li
Meng Han
315
2
0
13 Sep 2024
Building Better Datasets: Seven Recommendations for Responsible Design from Dataset Creators
Will Orr
Kate Crawford
204
9
0
30 Aug 2024
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference
Neural Information Processing Systems (NeurIPS), 2024
R. Prabhakar
Hengrui Zhang
D. Wentzlaff
294
1
0
14 Aug 2024
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Verna Dankers
Ivan Titov
278
9
0
09 Aug 2024
Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack
Xiaoyue Xu
Qinyuan Ye
Xiang Ren
324
15
0
23 Jul 2024
Semantic Change Characterization with LLMs using Rhetorics
Jader Martins Camboim de Sá
Marcos Da Silveira
C. Pruski
242
6
0
23 Jul 2024
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang
Chenyang Xi
Zifan Zheng
Ding Chen
Qingchen Yu
...
Rong-Hua Li
Peng Cheng
Zhonghao Wang
Feiyu Xiong
Zhiyu Li
HILM
LRM
501
46
0
19 Jul 2024
Investigating the Contextualised Word Embedding Dimensions Responsible for Contextual and Temporal Semantic Changes
Taichi Aida
Danushka Bollegala
238
0
0
03 Jul 2024
To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Bastien Liétard
Pascal Denis
Mikaella Keller
253
3
0
28 Jun 2024
LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models
Shouchang Guo
Sonam Damani
Keng-hao Chang
VLM
147
3
0
27 Jun 2024
The Remarkable Robustness of LLMs: Stages of Inference?
Vedang Lad
Wes Gurnee
Max Tegmark
Max Tegmark
521
88
0
27 Jun 2024
BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation
International Conference on Computational Linguistics (COLING), 2024
Minchong Li
Feng Zhou
Xiaohui Song
156
6
0
19 Jun 2024
When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ting-Yun Chang
Jesse Thomason
Robin Jia
420
6
0
19 Jun 2024
UBench: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Xunzhi Wang
Zhuowei Zhang
Qiongyu Li
Gaonan Chen
Mengting Hu
Zhixin Han
Bitong Luo
Zhiyu li
Hang Gao
Mengting Hu
ELM
441
3
0
18 Jun 2024
Paraphrasing in Affirmative Terms Improves Negation Understanding
MohammadHossein Rezaei
Eduardo Blanco
232
7
0
11 Jun 2024
SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with Superposition of Multi Token Embeddings
MohammadAli SadraeiJavaeri
Ehsaneddin Asgari
A. Mchardy
Hamid R. Rabiee
VLM
AAML
176
4
0
07 Jun 2024
BERTs are Generative In-Context Learners
Neural Information Processing Systems (NeurIPS), 2024
David Samuel
231
13
0
07 Jun 2024
Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
Naibin Gu
Peng Fu
Xiyu Liu
Bowen Shen
Zheng Lin
Weiping Wang
207
15
0
06 Jun 2024
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
Wentao Guo
Jikai Long
Yimeng Zeng
Zirui Liu
Xinyu Yang
...
Osbert Bastani
Christopher De Sa
Xiaodong Yu
Beidi Chen
Zhaozhuo Xu
267
32
0
05 Jun 2024
UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation
Hanzhang Zhou
Zijian Feng
Zixiao Zhu
Junlang Qian
Kezhi Mao
296
28
0
31 May 2024
Mixture of Experts Using Tensor Products
Zhan Su
Fengran Mo
Prayag Tiwari
Benyou Wang
Jian-Yun Nie
J. Simonsen
MoE
MoMe
163
3
0
26 May 2024
Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization
Zhe Li
Bicheng Ying
Zidong Liu
Chaosheng Dong
Haibo Yang
FedML
540
11
0
24 May 2024
Lessons from the Trenches on Reproducible Evaluation of Language Models
Stella Biderman
Hailey Schoelkopf
Lintang Sutawika
Leo Gao
J. Tow
...
Xiangru Tang
Kevin A. Wang
Genta Indra Winata
Franccois Yvon
Andy Zou
ELM
ALM
370
105
3
23 May 2024
EMR-Merging: Tuning-Free High-Performance Model Merging
Neural Information Processing Systems (NeurIPS), 2024
Chenyu Huang
Peng Ye
Tao Chen
Tong He
Xiangyu Yue
Wanli Ouyang
MoMe
294
73
0
23 May 2024
eXmY: A Data Type and Technique for Arbitrary Bit Precision Quantization
Aditya Agrawal
Matthew Hedlund
Blake A. Hechtman
MQ
285
7
0
22 May 2024
Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion
Pengxiang Lan
Enneng Yang
Yuting Liu
Guibing Guo
Linying Jiang
Jianzhe Zhao
Xingwei Wang
VLM
AAML
236
4
0
19 May 2024
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs
International Conference on Learning Representations (ICLR), 2024
Feiyang Kang
H. Just
Yifan Sun
Himanshu Jahagirdar
Yuanzhi Zhang
Rongxing Du
Anit Kumar Sahu
Ruoxi Jia
191
31
0
05 May 2024
Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning
International Conference on Machine Learning (ICML), 2024
Jing Xu
Jingzhao Zhang
233
11
0
04 May 2024
Previous
1
2
3
4
5
6
7
Next
Page 2 of 7