Towards Greater Leverage: Scaling Laws for Efficient MoE Language Models

4 points | by Anon84 a day ago
