Language Models with Image Descriptors are Strong Few-Shot
Video-Language LearnersNeural Information Processing Systems (NeurIPS), 2022 |
A Study on Transformer Configuration and Training ObjectiveInternational Conference on Machine Learning (ICML), 2022 |
Visually-Augmented Language ModelingInternational Conference on Learning Representations (ICLR), 2022 |
The Unreliability of Explanations in Few-shot Prompting for Textual
ReasoningNeural Information Processing Systems (NeurIPS), 2022 |
MiCS: Near-linear Scaling for Training Gigantic Model on Public CloudProceedings of the VLDB Endowment (PVLDB), 2022 |
mGPT: Few-Shot Learners Go MultilingualTransactions of the Association for Computational Linguistics (TACL), 2022 |
REx: Data-Free Residual Quantization Error ExpansionNeural Information Processing Systems (NeurIPS), 2022 |
In-Context Learning for Few-Shot Dialogue State TrackingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large
Language ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022 |
ZeroGen: Efficient Zero-shot Learning via Dataset GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
Quantifying Memorization Across Neural Language ModelsInternational Conference on Learning Representations (ICLR), 2022 |
Fooling MOSS Detection with Pretrained Language ModelsInternational Conference on Information and Knowledge Management (CIKM), 2022 |
Counterfactual Memorization in Neural Language ModelsNeural Information Processing Systems (NeurIPS), 2021 |
Understanding Jargon: Combining Extraction and Generation for Definition
ModelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel
TrainingInternational Conference on Parallel Processing (ICPP), 2021 |
Creativity and Machine Learning: A SurveyACM Computing Surveys (CSUR), 2021 |
Graphmax for Text GenerationJournal of Artificial Intelligence Research (JAIR), 2021 |
NarrativeTime: Dense Temporal Annotation on a TimelineInternational Conference on Language Resources and Evaluation (LREC), 2019 |