2x RX 9060xt 16gb, is it worth it?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I'm planning to buy 2x RX 9060xt with 16gb each to run Qwen 3.6 27B and alike. Would it be a good investment? How much tk/s should i expect in generation and prefill? I'm planning to use this as a coding agent in a large codebase.
Currently I'm running this on my i7 64gb laptop and I'm getting 3~4 tk/s with MTP and ~50 tk/s prefill. The generation speed is kind of ok, but 50 tk/s prefill is just unusable in my use case... Every read tool call i have to wait 1~2min just for the prefill
[link] [comments]
More from r/LocalLLaMA
-
Palantir CEO rages against closed models
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
-
[Benchmark] Kimi K2.7 Code Q3 on Mac Studio M3 Ultra + RTX PRO 6000 over llama.cpp RPC: prefill improves, no changes in token generation/decode
Jul 2
-
They fit! Mostly.... 2x 3090, Thermaltake Core p3
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.