Red Teaming Language Model Detectors with Language Models. Transactions of the Association for Computational Linguistics (TACL), 2023.
RuCoLA: Russian Corpus of Linguistic Acceptability. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
ADDMU: Detection of Far-Boundary Adversarial Examples with Data and Model Uncertainty Estimation. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Discovering Differences in the Representation of People using Contextualized Semantic Axes. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
On the Paradox of Learning to Reason from Data. International Joint Conference on Artificial Intelligence (IJCAI), 2022.
Acceptability Judgements via Examining the Topology of Attention Maps. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Variation and generality in encoding of syntactic anomaly information in sentence embeddings. BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2021.
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.
Contrastive Fine-tuning Improves Robustness for Neural Rankers. Findings (Findings), 2021.