CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive
Tensor OptimizationNeural Information Processing Systems (NeurIPS), 2024 |
Assisted Debate Builder with Large Language ModelsEuropean Conference on Artificial Intelligence (ECAI), 2024 |
BAdam: A Memory Efficient Full Parameter Optimization Method for Large
Language ModelsNeural Information Processing Systems (NeurIPS), 2024 |
Flora: Low-Rank Adapters Are Secretly Gradient CompressorsInternational Conference on Machine Learning (ICML), 2024 |