vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep
(blog.vllm.ai)
42 points | by robertnishihara | 10 hours ago
0 comments