How're you deploying LLMs in production nowadays? What's the best and most affordable way? [D]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
I've been developing an AI product using LLM APIs (from OpenRouter), but I want to deploy an open-source LLM in my own prod environment, which I can control.
A few reasons behind this:
- I wanna own the complete stack around my product.
- Second, I wanna fine-tune the model for my use case.
So, what's the most affordable but still good platform for this? I'm not an AI engineer, so I don't wanna get stuck in CUDA or Transformers hell. I'm looking for anything that gives me a straight path to a private deployment.
Thanks,