Got to 1199.8 tokens/sec with Devstral Small 2 on my desktop GPU workstation, running vLLM nightly. Works out of the box with Mistral Vibe. Next, it's time to test the big one.