Neural Network Acceptability Judgments

31 May 2018

Alex Warstadt

Amanpreet Singh

Samuel R. Bowman

Papers citing "Neural Network Acceptability Judgments"

50 / 950 papers shown

MobiLLM: Enabling LLM Fine-Tuning on the Mobile Device via Server Assisted Side Tuning

199

27 Feb 2025

CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging

291

26 Feb 2025

CAMEx: Curvature-aware Merging of Experts

International Conference on Learning Representations (ICLR), 2025

356

26 Feb 2025

Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing

Gopala Anumanchipalli

KELM

281

26 Feb 2025

Encryption-Friendly LLM Architecture

International Conference on Learning Representations (ICLR), 2024

498

24 Feb 2025

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

1.1K

24 Feb 2025

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing

Subhash Kantamneni

Joshua Engels

Senthooran Rajamanoharan

Max Tegmark

Neel Nanda

356

23 Feb 2025

Tokenization is Sensitive to Language Variation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Anna Wegmann

Dong Nguyen

David Jurgens

434

21 Feb 2025

Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation

500

21 Feb 2025

Using tournaments to calculate AUROC for zero-shot classification with LLMs

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

WonJin Yoon

Ian Bulovic

Timothy A. Miller

254

20 Feb 2025

Scalable Model Merging with Progressive Layer-wise Distillation

655

18 Feb 2025

An Efficient Sparse Fine-Tuning with Low Quantization Error via Neural Network Pruning

Cen-Jhih Li

Aditya Bhaskara

432

17 Feb 2025

Reinforced Lifelong Editing for Language Models

625

09 Feb 2025

CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models

251

03 Feb 2025

Harmonic Loss Trains Interpretable AI Models

394

03 Feb 2025

Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers

Akiyoshi Tomihari

Issei Sato

ODL

710

31 Jan 2025

BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models

Neural Information Processing Systems (NeurIPS), 2024

732

28 Jan 2025

Reference-free Evaluation Metrics for Text Generation: A Survey

345

21 Jan 2025

Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training

343

13 Jan 2025

A General Framework for Inference-time Scaling and Steering of Diffusion Models

575

101

12 Jan 2025

GPT or BERT: why not both?

Lucas Georges Gabriel Charpentier

David Samuel

360

31 Dec 2024

Learning from Impairment: Leveraging Insights from Clinical Linguistics in Language Modelling Research

International Conference on Computational Linguistics (COLING), 2024

Dominique Brunato

312

20 Dec 2024

Weak-to-Strong Generalization Through the Data-Centric Lens

International Conference on Learning Representations (ICLR), 2024

Changho Shin

John Cooper

Frederic Sala

455

05 Dec 2024

Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning

761

29 Nov 2024

LoRA-Mini : Adaptation Matrices Decomposition and Selective Training

Ayush Singh

Rajdeep Aher

Shivank Garg

304

24 Nov 2024

Mitigating Gender Bias in Contextual Word Embeddings

Navya Yarrabelly

Vinay Damodaran

Feng-Guang Su

252

18 Nov 2024

Model Fusion through Bayesian Optimization in Language Model Fine-Tuning

Neural Information Processing Systems (NeurIPS), 2024

411

11 Nov 2024

Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation

Sriram Gopalakrishnan

Niladri Chatterjee

Tanmoy Chakraborty

BDL

483

07 Nov 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

213

04 Nov 2024

Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2024

348

03 Nov 2024

Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior

Journal of Data Science (JDS), 2024

Mingxuan Zhang

Y. Sun

F. Liang

299

01 Nov 2024

Improving In-Context Learning with Small Language Model Ensembles

183

29 Oct 2024

Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Data

Xinhong Xie

Tao Li

Quanyan Zhu

177

27 Oct 2024

Vulnerability of LLMs to Vertically Aligned Text Manipulations

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

546

26 Oct 2024

Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning

Arijit Das

140

21 Oct 2024

Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems

Neural Information Processing Systems (NeurIPS), 2024

Bingcong Li

Liang Zhang

Niao He

290

18 Oct 2024

From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition

233

17 Oct 2024

Balancing Label Quantity and Quality for Scalable Elicitation

Alex Troy Mallen

Nora Belrose

167

17 Oct 2024

Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information

Journal of Biomedical Informatics (JBI), 2024

294

16 Oct 2024

StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

310

16 Oct 2024

LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models

273

15 Oct 2024

BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation

Peijia Qin

Ruiyi Zhang

Pengtao Xie

225

13 Oct 2024

Text Classification using Graph Convolutional Networks: A Comprehensive Survey

ACM Computing Surveys (ACM CSUR), 2024

Syed Mustafa Haider Rizvi

Ramsha Imran

Arif Mahmood

GNN OOD FaML

207

12 Oct 2024

DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models

International Conference on Learning Representations (ICLR), 2024

Yize Zhao

Christos Thrampoulidis

467

12 Oct 2024

Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning

Scientific Reports (Sci Rep), 2024

Nusrat Jahan Prottasha

Asif Mahmud

Md. Shohanur Islam Sobuj

299

11 Oct 2024

ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

276

10 Oct 2024

Noise is All You Need: Private Second-Order Convergence of Noisy SGD

276

09 Oct 2024

HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation

378

07 Oct 2024

Neuron-Level Sequential Editing for Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Houcheng Jiang

Xiang Wang

239

05 Oct 2024

Parameter Competition Balancing for Model Merging

Neural Information Processing Systems (NeurIPS), 2024

Jing Li

...

Min Zhang

255

03 Oct 2024