WINA: Weight informed Neuron activation for accelerating LLM inference

2 points | by Ratelman 2 days ago

No comments yet.