r/MachineLearning · · 1 min read

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.

I've been developing an AI product using LLM APIs (from OpenRouter) but want to deploy an open-source LLM in my own Prod env. which I can control.

Few reasons behind this are:

- I wanna own the complete stack around my product.

- Second I wanna fine-tune the model around my usecase.

So, what's the most affordable but a good platform for this? I'm not an AI engineer so don't wanna stuck in CUDA or Transformers hell, anything which can give me a straight path towards my private deployment.

Thanks,

submitted by /u/Necessary_Gazelle211
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/MachineLearning