TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B

3 points | by trykhlieb a day ago

1 comments