Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x

16 points | by gmays a day ago

2 comments

redanddead a day ago
You'd think it'd be bigger news on HN.
- axiologist a day ago
  See https://news.ycombinator.com/item?id=47513475 from two days ago.