Hypura – A storage-tier-aware LLM inference scheduler for Apple Silicon

155 points | by tatef 4 hours ago

51 comments