An Auditing Test To Detect Behavioral Shift in Language Models. International Conference on Learning Representations (ICLR), 2024
Do LLMs write like humans? Variation in grammatical and rhetorical styles. Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2024
Neural Decompiling of Tracr Transformers. IAPR International Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR), 2024