HN Reader

Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
426 points
155 comments
3 days ago by benchmarkist