FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

20 points | by PaulHoule 17 hours ago

1 comment