ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.11393
  4. Cited By
First Activations Matter: Training-Free Methods for Dynamic Activation
  in Large Language Models

First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models

21 August 2024
Chi Ma
Mincong Huang
Ying Zhang
Chao Wang
Yujie Wang
Lei Yu
Chuan Liu
Wei Lin
    AI4CE
    LLMSV
ArXivPDFHTML

Papers citing "First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models"

1 / 1 papers shown
Title
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive
  Language Model Pre-training
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
Zexuan Zhong
Mengzhou Xia
Danqi Chen
Mike Lewis
MoE
49
15
0
06 May 2024
1