Show HN: KVBoost – chunk-level KV cache reuse for HuggingFace, 5–48x faster TTFT

18 points | by pythongiant 8 hours ago

22 comments