Training an LLM in Swift, Part 1: Taking matrix mult from Gflop/s to Tflop/s

255 points | by zdw 3 days ago

13 comments