MP-Rec: Hardware-Software Co-Design to Enable Multi-Path RecommendationInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2023 |
The Framework Tax: Disparities Between Inference Efficiency in NLP
Research and DeploymentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
The Flan Collection: Designing Data and Methods for Effective
Instruction TuningInternational Conference on Machine Learning (ICML), 2023 |
A Green(er) World for A.IIEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPS), 2022 |
Mystique: Enabling Accurate and Scalable Generation of Production AI
BenchmarksInternational Symposium on Computer Architecture (ISCA), 2022 |
A Rubric for Human-like Agents and NeuroAIPhilosophical Transactions of the Royal Society of London. Biological Sciences (Phil. Trans. R. Soc. B), 2022 |
FedGPO: Heterogeneity-Aware Global Parameter Optimization for Efficient
Federated LearningIEEE International Symposium on Workload Characterization (IISWC), 2022 |
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language
ModelJournal of machine learning research (JMLR), 2022 |
The Future of Consumer Edge-AI ComputingIEEE pervasive computing (PC), 2022 |
Green Learning: Introduction, Examples and OutlookJournal of Visual Communication and Image Representation (JVCIR), 2022 |
Efficient Methods for Natural Language Processing: A SurveyTransactions of the Association for Computational Linguistics (TACL), 2022 |
Zeus: Understanding and Optimizing GPU Energy Consumption of DNN
TrainingSymposium on Networked Systems Design and Implementation (NSDI), 2022 |
Sustainable Computing -- Without the Hot AirACM SIGEnergy Energy Informatics Review (SEIR), 2022 |
Measuring the Carbon Intensity of AI in Cloud InstancesConference on Fairness, Accountability and Transparency (FAccT), 2022 |
Adversarial Text NormalizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022 |
Position: Tensor Networks are a Valuable Asset for Green AIInternational Conference on Machine Learning (ICML), 2022 |
Towards Climate Awareness in NLP ResearchConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
Data-Centric Green AI: An Exploratory Empirical StudyICT for Sustainability (ICT4S), 2022 |
RecShard: Statistical Feature-Based Memory Optimization for
Industry-Scale Neural RecommendationInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022 |
Carbon Explorer: A Holistic Approach for Designing Carbon Aware
DatacentersInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022 |
On Sampling Collaborative Filtering DatasetsWeb Search and Data Mining (WSDM), 2022 |
Efficient Large Scale Language Modeling with Mixtures of ExpertsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Edge-Native Intelligence for 6G Communications Driven by Federated
Learning: A Survey of Trends and ChallengesIEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021 |