Pollux – a natively vector quantized LLM with 0.76 bits per parameter

1 points | by pollux_llm 4 hours ago

1 comments