
TrimBERT: Tailoring BERT for Trade-offs
Papers citing "TrimBERT: Tailoring BERT for Trade-offs"
| Title | Venue |
|---|---|
| Simplifying Transformer Blocks | International Conference on Learning Representations (ICLR), 2023 |
| Cramming: Training a Language Model on a Single GPU in One Day | International Conference on Machine Learning (ICML), 2022 |



