Is AI inference getting cheaper or more expensive over time?

GamingChairModel@lemmy.world · 2 months ago

Is AI inference getting cheaper or more expensive over time?

Danitos@reddthat.com · 2 months ago

https://www.tobyord.com/writing/hourly-costs-for-ai-agents. This person analyzed a question very similar to yours. For me, this means:

Cost of running state of the art models is increasing exponentially.
For a given target “intelligence”, the cost is decreasing linearly.

I don’t really know what to make out of that in a broader picture

GamingChairModel@lemmy.world · 2 months ago

Thanks, this is great.

I’ve found that very few people are asking this question. And when I ask people what they think is happening to these costs over time, their opinions vary wildly.

I’m similarly baffled. How is it that this industry generates so much discussion, but nobody is asking this fundamental question, with an ecosystem of comments, discussions, critiques, and corrections on those analyses?

MagicShel@lemmy.zip · 2 months ago

It means that capability growth is going to slow and require more creative ways to improve than just more tokens and more compute. I’ve seen some research that we can create chips that are 10x as efficient but they can only run a single model and aren’t upgradable. If a model is viable for a number of years because genuine improvement is so slow, the math starts to make sense there.

It’s going to be a good thing when we are forced to start looking at more creative ways of improvement.