arXiv:1805.12471
Neural Network Acceptability Judgments

31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman

Papers citing "Neural Network Acceptability Judgments"

50 / 950 papers shown
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
International Conference on Learning Representations (ICLR), 2024
Cunchun Li
Houcheng Jiang
Kun Wang
Yunshan Ma
Shi Jie
Xiangnan He
Tat-Seng Chua
KELM
525
135
0
03 Oct 2024
Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ji Liu
Jiaxiang Ren
Ruoming Jin
Zijie Zhang
Yang Zhou
P. Valduriez
Dejing Dou
FedML
285
8
0
30 Sep 2024
Analysing Zero-Shot Readability-Controlled Sentence Simplification
International Conference on Computational Linguistics (COLING), 2024
Abdullah Barayan
Jose Camacho-Collados
Fernando Alva-Manchego
233
16
0
30 Sep 2024
Exposing Assumptions in AI Benchmarks through Cognitive Modelling
Jonathan H. Rystrøm
Kenneth C. Enevoldsen
183
0
0
25 Sep 2024
An Effective, Robust and Fairness-aware Hate Speech Detection Framework
Guanyi Mou
Kyumin Lee
301
4
0
25 Sep 2024
Wildlife Product Trading in Online Social Networks: A Case Study on Ivory-Related Product Sales Promotion Posts
International Conference on Web and Social Media (ICWSM), 2024
Guanyi Mou
Yun Yue
Kyumin Lee
Ziming Zhang
OnRL
95
0
0
25 Sep 2024
Unveiling Language Competence Neurons: A Psycholinguistic Approach to Model Interpretability
International Conference on Computational Linguistics (COLING), 2024
Xufeng Duan
Xinyu Zhou
Bei Xiao
Zhenguang G. Cai
MILM
215
9
0
24 Sep 2024
HUT: A More Computation Efficient Fine-Tuning Method With Hadamard Updated Transformation
Geyuan Zhang
Xiaofei Zhou
Chuheng Chen
153
0
0
20 Sep 2024
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models
International Conference on Computational Linguistics (COLING), 2024
Xinyu Zhou
Delong Chen
Samuel Cahyawijaya
Xufeng Duan
Zhenguang G. Cai
247
1
0
19 Sep 2024
Thesis proposal: Are We Losing Textual Diversity to Natural Language Processing?
Josef Jon
225
0
0
15 Sep 2024
Fingerprint Vector: Enabling Scalable and Efficient Model Fingerprint Transfer via Vector Addition
Zhenhua Xu
Wenpeng Xing
Zhebo Wang
Chen Jie
Mohan Li
Meng Han
307
2
0
13 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
225
6
0
10 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
219
1
0
08 Sep 2024
Task-Specific Directions: Definition, Exploration, and Utilization in Parameter Efficient Fine-Tuning
Chongjie Si
Zhiyi Shi
Shifan Zhang
Xiaokang Yang
Hanspeter Pfister
Wei Shen
ALM
425
5
0
02 Sep 2024
Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models
Aradhye Agarwal
Suhas Kamasetty Ramesh
Ayan Sengupta
Tanmoy Chakraborty
325
2
0
26 Aug 2024
TReX- Reusing Vision Transformer's Attention for Efficient Xbar-based Computing
IEEE Transactions on Emerging Topics in Computing (IEEE TETC), 2024
Abhishek Moitra
Abhiroop Bhattacharjee
Youngeun Kim
Priyadarshini Panda
ViT
202
3
0
22 Aug 2024
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Yusuke Sakai
Adam Nohejl
Jiangnan Hang
Hidetaka Kamigaito
Taro Watanabe
ELM
291
9
0
22 Aug 2024
Crafting Tomorrow's Headlines: Neural News Generation and Detection in English, Turkish, Hungarian, and Persian
Cem Uyuk
Danica Rovó
Shaghayegh Kolli
Rabia Varol
Georg Groh
Daryna Dementieva
202
2
0
20 Aug 2024
How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments
Yusuke Ide
Yuto Nishida
Miyu Oba
Justin Vasselli
Hidetaka Kamigaito
Taro Watanabe
304
7
0
19 Aug 2024
LoRA$^2$: Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models
Jia-Chen Zhang
Yu-Jie Xiong
He-Xi Qiu
Dong-Hai Zhu
Chun-Ming Xia
MoE
177
0
0
13 Aug 2024
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Verna Dankers
Ivan Titov
273
9
0
09 Aug 2024
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Zi Liang
Haibo Hu
Qingqing Ye
Yaxin Xiao
Haoyang Li
AAML
ELM
SILM
440
15
0
05 Aug 2024
Task Prompt Vectors: Effective Initialization through Multi-Task Soft-Prompt Transfer
Wei Chen
Long Chen
Ivan Srba
Yu Wu
MoMe
VLM
276
9
0
02 Aug 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xin Zhang
Yanzhao Zhang
Dingkun Long
Wen Xie
Ziqi Dai
...
Pengjun Xie
Fei Huang
Meishan Zhang
Wenjie Li
Min Zhang
321
231
0
29 Jul 2024
Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack
Xiaoyue Xu
Qinyuan Ye
Xiang Ren
320
15
0
23 Jul 2024
Reconstruct the Pruned Model without Any Retraining
Pingjie Wang
Ziqing Fan
Shengchao Hu
Zhe Chen
Yanfeng Wang
Yu Wang
221
2
0
18 Jul 2024
Evaluating Large Language Models with fmeval
Pola Schwöbel
Luca Franceschi
Muhammad Bilal Zafar
Keerthan Vasist
Aman Malhotra
Tomer Shenhar
Pinal Tailor
Pinar Yilmaz
Michael Diamond
Michele Donini
LM&MA
ELM
232
4
0
15 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
240
11
0
10 Jul 2024
Testing learning hypotheses using neural networks by manipulating learning data
Cara Su-Yi Leong
Tal Linzen
208
7
0
05 Jul 2024
Efficient Training of Language Models with Compact and Consistent Next Token Distributions
Ashutosh Sathe
Sunita Sarawagi
205
0
0
03 Jul 2024
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
Ying Zhang
Ziheng Yang
Shufan Ji
KELM
143
2
0
03 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
284
69
0
02 Jul 2024
CPT: Consistent Proxy Tuning for Black-box Optimization
Yuanyang He
Zitong Huang
Xinxing Xu
Rick Siow Mong Goh
Salman Khan
W. Zuo
Yong Liu
Chun-Mei Feng
243
1
0
01 Jul 2024
Exploring Advanced Large Language Models with LLMsuite
Giorgio Roffo
LLMAG
116
4
0
01 Jul 2024
Locate&Edit: Energy-based Text Editing for Efficient, Flexible, and Faithful Controlled Text Generation
Hye Ryung Son
Jay-Yoon Lee
168
3
0
30 Jun 2024
IDT: Dual-Task Adversarial Attacks for Privacy Protection
Pedro Faustini
Shakila Mahjabin Tonni
Annabelle McIver
Xingliang Yuan
Mark Dras
SILM
AAML
219
0
0
28 Jun 2024
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
Longrong Yang
Dong Shen
Chaoxiang Cai
Fan Yang
Size Li
Tingting Gao
Xi Li
MoE
422
8
0
28 Jun 2024
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
A. Bavaresco
Raffaella Bernardi
Leonardo Bertolazzi
Desmond Elliott
Raquel Fernández
...
David Schlangen
Alessandro Suglia
Aditya K Surikuchi
Ece Takmaz
A. Testoni
ALM
ELM
606
177
0
26 Jun 2024
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients
Aashiq Muhamed
Oscar Li
David Woodruff
Mona Diab
Virginia Smith
241
20
0
25 Jun 2024
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
Zachary Horvitz
Ajay Patel
Kanishk Singh
Chris Callison-Burch
Kathleen McKeown
Zhou Yu
307
12
0
21 Jun 2024
Information Guided Regularization for Fine-tuning Language Models
Mandar Sharma
Nikhil Muralidhar
Shengzhe Xu
Raquib Bin Yousuf
Naren Ramakrishnan
279
0
0
20 Jun 2024
Open Generative Large Language Models for Galician
Pablo Gamallo
Pablo Rodríguez
Iria de-Dios-Flores
Susana Sotelo
Silvia Paniagua
Daniel Bardanca
José Ramom Pichel
Marcos Garcia
215
6
0
19 Jun 2024
Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher
Ján Cegin
Róbert Belanec
Jakub Simko
Ivan Srba
Maria Bielikova
213
1
0
18 Jun 2024
GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory
Haoze Wu
Zihan Qiu
Zili Wang
Hang Zhao
Jie Fu
MoE
204
7
0
18 Jun 2024
Knowledge Fusion By Evolving Weights of Language Models
Guodong Du
Yiyao Cao
Hanting Liu
Runhua Jiang
Shuyang Yu
Yifei Guo
Sim Kuan Goh
Jing Li
MoMe
222
16
0
18 Jun 2024
UBench: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Xunzhi Wang
Zhuowei Zhang
Qiongyu Li
Gaonan Chen
Mengting Hu
Zhixin Han
Bitong Luo
Zhiyu Li
Hang Gao
ELM
424
3
0
18 Jun 2024
Style Transfer with Multi-iteration Preference Optimization
Shuai Liu
Jonathan May
245
6
0
17 Jun 2024
FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Bangzheng Li
Ben Zhou
Xingyu Fu
Fei Wang
Dan Roth
Muhao Chen
240
8
0
17 Jun 2024
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Martin Courtois
Malte Ostendorff
Leonhard Hennig
Georg Rehm
287
5
0
10 Jun 2024
SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with Superposition of Multi Token Embeddings
MohammadAli SadraeiJavaeri
Ehsaneddin Asgari
A. Mchardy
Hamid R. Rabiee
VLM
AAML
169
4
0
07 Jun 2024