ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.12471
  4. Cited By
Neural Network Acceptability Judgments
v1v2v3 (latest)

Neural Network Acceptability Judgments

31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
ArXiv (abs)PDFHTML

Papers citing "Neural Network Acceptability Judgments"

50 / 950 papers shown
MobiLLM: Enabling LLM Fine-Tuning on the Mobile Device via Server Assisted Side Tuning
MobiLLM: Enabling LLM Fine-Tuning on the Mobile Device via Server Assisted Side Tuning
Liang Li
Xingke Yang
Wen Wu
Hao Wang
Tomoaki Ohtsuki
Xin Fu
Miao Pan
Xuemin Shen
199
7
0
27 Feb 2025
CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
Zongzhen Yang
Binhang Qi
Hailong Sun
Wenrui Long
Ruobing Zhao
Yantao Du
MoMe
281
4
0
26 Feb 2025
CAMEx: Curvature-aware Merging of Experts
CAMEx: Curvature-aware Merging of ExpertsInternational Conference on Learning Representations (ICLR), 2025
Dung V. Nguyen
Minh H. Nguyen
Luc Q. Nguyen
R. Teo
T. Nguyen
Linh Duy Tran
MoMe
356
6
0
26 Feb 2025
Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing
Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing
Akshat Gupta
Christine Fang
Atahan Ozdemir
Maochuan Lu
Ahmed Alaa
Thomas Hartvigsen
Gopala Anumanchipalli
KELM
281
0
0
26 Feb 2025
Encryption-Friendly LLM Architecture
Encryption-Friendly LLM ArchitectureInternational Conference on Learning Representations (ICLR), 2024
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
497
17
0
24 Feb 2025
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Chenghao Fan
Zhenyi Lu
Sichen Liu
Xiaoye Qu
Xiaoye Qu
Wei Wei
Yu Cheng
MoE
1.1K
9
0
24 Feb 2025
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Subhash Kantamneni
Joshua Engels
Senthooran Rajamanoharan
Max Tegmark
Neel Nanda
352
44
0
23 Feb 2025
Tokenization is Sensitive to Language Variation
Tokenization is Sensitive to Language VariationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Anna Wegmann
Dong Nguyen
David Jurgens
434
5
0
21 Feb 2025
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation
Yue Zhou
Yi-Ju Chang
Yuan Wu
MoMe
500
4
0
21 Feb 2025
Using tournaments to calculate AUROC for zero-shot classification with LLMs
Using tournaments to calculate AUROC for zero-shot classification with LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
WonJin Yoon
Ian Bulovic
Timothy A. Miller
249
1
0
20 Feb 2025
Scalable Model Merging with Progressive Layer-wise Distillation
Scalable Model Merging with Progressive Layer-wise Distillation
Jing Xu
Jiazheng Li
J.N. Zhang
MoMeFedML
645
8
0
18 Feb 2025
An Efficient Sparse Fine-Tuning with Low Quantization Error via Neural Network Pruning
An Efficient Sparse Fine-Tuning with Low Quantization Error via Neural Network Pruning
Cen-Jhih Li
Aditya Bhaskara
396
0
0
17 Feb 2025
Reinforced Lifelong Editing for Language Models
Reinforced Lifelong Editing for Language Models
Zherui Li
Houcheng Jiang
Hao Chen
Baolong Bi
Zhenhong Zhou
Fei Sun
Cunchun Li
Xinze Wang
KELM
613
20
0
09 Feb 2025
CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
Guanduo Chen
Yutong He
Yipeng Hu
Kun Yuan
Binhang Yuan
248
5
0
03 Feb 2025
Harmonic Loss Trains Interpretable AI Models
Harmonic Loss Trains Interpretable AI Models
David D. Baek
Ziming Liu
Riya Tyagi
Max Tegmark
387
3
0
03 Feb 2025
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Akiyoshi Tomihari
Issei Sato
ODL
708
5
0
31 Jan 2025
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024
Yibin Wang
Haizhou Shi
Ligong Han
Dimitris N. Metaxas
Hao Wang
BDLUQLM
730
23
0
28 Jan 2025
Reference-free Evaluation Metrics for Text Generation: A Survey
Reference-free Evaluation Metrics for Text Generation: A Survey
Takumi Ito
Kees van Deemter
Jun Suzuki
ELM
344
9
0
21 Jan 2025
Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training
Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training
Ziqing Wen
Ping Luo
Jun Wang
Xiaoge Deng
Jinping Zou
Kun Yuan
Tao Sun
Dongsheng Li
CLL
324
0
0
13 Jan 2025
A General Framework for Inference-time Scaling and Steering of Diffusion Models
A General Framework for Inference-time Scaling and Steering of Diffusion Models
R. Singhal
Zachary Horvitz
Ryan Teehan
Mengye Ren
Zhou Yu
Kathleen McKeown
Rajesh Ranganath
DiffM
571
101
0
12 Jan 2025
GPT or BERT: why not both?
GPT or BERT: why not both?
Lucas Georges Gabriel Charpentier
David Samuel
347
20
0
31 Dec 2024
Learning from Impairment: Leveraging Insights from Clinical Linguistics
  in Language Modelling Research
Learning from Impairment: Leveraging Insights from Clinical Linguistics in Language Modelling ResearchInternational Conference on Computational Linguistics (COLING), 2024
Dominique Brunato
310
1
0
20 Dec 2024
Weak-to-Strong Generalization Through the Data-Centric Lens
Weak-to-Strong Generalization Through the Data-Centric LensInternational Conference on Learning Representations (ICLR), 2024
Changho Shin
John Cooper
Frederic Sala
451
14
0
05 Dec 2024
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
Kaustubh Ponkshe
Raghav Singhal
Eduard A. Gorbunov
Alexey Tumanov
Samuel Horváth
Praneeth Vepakomma
757
11
0
29 Nov 2024
LoRA-Mini : Adaptation Matrices Decomposition and Selective Training
LoRA-Mini : Adaptation Matrices Decomposition and Selective Training
Ayush Singh
Rajdeep Aher
Shivank Garg
301
1
0
24 Nov 2024
Mitigating Gender Bias in Contextual Word Embeddings
Mitigating Gender Bias in Contextual Word Embeddings
Navya Yarrabelly
Vinay Damodaran
Feng-Guang Su
252
0
0
18 Nov 2024
Model Fusion through Bayesian Optimization in Language Model Fine-Tuning
Model Fusion through Bayesian Optimization in Language Model Fine-TuningNeural Information Processing Systems (NeurIPS), 2024
Chaeyun Jang
Hyungi Lee
Jungtaek Kim
Juho Lee
MoMe
403
4
0
11 Nov 2024
Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation
Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation
Ayan Sengupta
Vaibhav Seth
Arinjay Pathak
Aastha Verma
Natraj Raman
Sriram Gopalakrishnan
Niladri Chatterjee
Tanmoy Chakraborty
BDL
473
3
0
07 Nov 2024
Scalable Efficient Training of Large Language Models with
  Low-dimensional Projected Attention
Scalable Efficient Training of Large Language Models with Low-dimensional Projected AttentionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xingtai Lv
Ning Ding
Kaiyan Zhang
Ermo Hua
Ganqu Cui
Bowen Zhou
213
7
0
04 Nov 2024
Decoupling Dark Knowledge via Block-wise Logit Distillation for
  Feature-level Alignment
Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level AlignmentIEEE Transactions on Artificial Intelligence (IEEE TAI), 2024
Chengting Yu
Fengzhao Zhang
Ruizhe Chen
Zuozhu Liu
Shurun Tan
Er-ping Li
Aili Wang
348
5
0
03 Nov 2024
Magnitude Pruning of Large Pretrained Transformer Models with a Mixture
  Gaussian Prior
Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian PriorJournal of Data Science (JDS), 2024
Mingxuan Zhang
Y. Sun
F. Liang
298
0
0
01 Nov 2024
Improving In-Context Learning with Small Language Model Ensembles
Improving In-Context Learning with Small Language Model Ensembles
M. Mehdi Mojarradi
Lingyi Yang
Robert McCraith
Adam Mahdi
183
6
0
29 Oct 2024
Learning from Response not Preference: A Stackelberg Approach for LLM
  Detoxification using Non-parallel Data
Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Data
Xinhong Xie
Tao Li
Quanyan Zhu
177
4
0
27 Oct 2024
Vulnerability of LLMs to Vertically Aligned Text Manipulations
Vulnerability of LLMs to Vertically Aligned Text ManipulationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Zhen Xiong
Nanyun Peng
Kai-Wei Chang
540
5
0
26 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training
  and Fine-tuning
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
Arijit Das
140
2
0
21 Oct 2024
Implicit Regularization of Sharpness-Aware Minimization for
  Scale-Invariant Problems
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant ProblemsNeural Information Processing Systems (NeurIPS), 2024
Bingcong Li
Liang Zhang
Niao He
283
9
0
18 Oct 2024
From Babbling to Fluency: Evaluating the Evolution of Language Models in
  Terms of Human Language Acquisition
From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition
Qiyuan Yang
Pengda Wang
Luke D. Plonsky
Frederick L. Oswald
Hanjie Chen
ELM
227
3
0
17 Oct 2024
Balancing Label Quantity and Quality for Scalable Elicitation
Balancing Label Quantity and Quality for Scalable Elicitation
Alex Troy Mallen
Nora Belrose
167
3
0
17 Oct 2024
Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information
Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable InformationJournal of Biomedical Informatics (JBI), 2024
Yingya Li
Timothy A. Miller
Steven Bethard
G. Savova
291
4
0
16 Oct 2024
StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples
StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples
Ajay Patel
Jiacheng Zhu
Justin Qiu
Zachary Horvitz
Marianna Apidianaki
Kathleen McKeown
Chris Callison-Burch
301
17
0
16 Oct 2024
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
Hossein Abdi
Mingfei Sun
Andi Zhang
Samuel Kaski
Wei Pan
273
1
0
15 Oct 2024
BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation
BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation
Peijia Qin
Ruiyi Zhang
Pengtao Xie
221
4
0
13 Oct 2024
Text Classification using Graph Convolutional Networks: A Comprehensive
  Survey
Text Classification using Graph Convolutional Networks: A Comprehensive SurveyACM Computing Surveys (ACM CSUR), 2024
Syed Mustafa Haider Rizvi
Ramsha Imran
Arif Mahmood
GNNOODFaML
203
9
0
12 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned ModelsInternational Conference on Learning Representations (ICLR), 2024
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
467
8
0
12 Oct 2024
Parameter-Efficient Fine-Tuning of Large Language Models using Semantic
  Knowledge Tuning
Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge TuningScientific Reports (Sci Rep), 2024
Nusrat Jahan Prottasha
Asif Mahmud
Md. Shohanur Islam Sobuj
Prakash Bhat
Md. Kowsher
Niloofar Yousefi
O. Garibay
299
19
0
11 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
276
2
0
10 Oct 2024
Noise is All You Need: Private Second-Order Convergence of Noisy SGD
Noise is All You Need: Private Second-Order Convergence of Noisy SGD
Dmitrii Avdiukhin
Michael Dinitz
Chenglin Fan
G. Yaroslavtsev
273
1
0
09 Oct 2024
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation
Xinyu Zhou
Simin Fan
Martin Jaggi
TDI
375
3
0
07 Oct 2024
Neuron-Level Sequential Editing for Large Language Models
Neuron-Level Sequential Editing for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Houcheng Jiang
Cunchun Li
Tianyu Zhang
An Zhang
Ruipeng Wang
Tao Liang
Xiang Wang
KELM
233
11
0
05 Oct 2024
Parameter Competition Balancing for Model Merging
Parameter Competition Balancing for Model MergingNeural Information Processing Systems (NeurIPS), 2024
Guodong DU
Junlin Lee
Jing Li
Runhua Jiang
Yifei Guo
...
Hanting Liu
Sim Kuan Goh
Jing Li
Daojing He
Min Zhang
MoMe
244
43
0
03 Oct 2024
Previous
123456...171819
Next