We did the math on AI’s energy footprint. Here’s the story you haven’t heard.

kalkulat@lemmy.world · 1 year ago

msage@programming.dev · 1 year ago

Which chatbots are getting smarter?

I know AI has potential, but specifically LLMs (which most people mean when talking about AI) seem to have hit their technological limits.

FreedomAdvocate · 1 year ago

Copilot, ChatGPT, pretty much all of them.

msage@programming.dev · 1 year ago

Smarter how? Synthetic benchmarks?

Because I’ve heard the opposite from users and bloggers.

FreedomAdvocate · 1 year ago

So you want me to provide some evidence that it’s getting smarter, but you can’t provide any that it’s getting worse other than anecdotal evidence?

What evidence would you accept?

msage@programming.dev · 1 year ago

Any proof that we have moved past the current architecture.

FreedomAdvocate · 1 year ago

What does “architecture” mean in this scenario?

msage@programming.dev · 1 year ago

Any significant shift in the model, or a complete restructuring of the approach.

As it is, it won’t grow anywhere.

FreedomAdvocate · edit-2 1 year ago

So you’ve got access to all this stuff’s source code and know what has and hasn’t changed with every update?

msage@programming.dev · 1 year ago

No, if there was any major breakthrough, it would be advertised everywhere.

Jakeroxs@sh.itjust.works · 1 year ago

Advanced Reasoning models came out like 4 months ago lol

msage@programming.dev · 1 year ago

Advanced reasoning? Having an LLM talk to itself?

Terrasque@infosec.pub · 1 year ago

Yes, which has improved some tasks measurably. ~20% improvement on programming tasks, as a practical example. It has also improved tool use and agentic tasks, allowing the LLM to plan ahead and adjust its initial approach based on what comes up in later parts.

Having the LLM talk through the task allows it to improve or fix bad decisions made early on, based on new realizations at later stages. Sort of like when a human thinks through how to do something.

Jakeroxs@sh.itjust.works · edit-2 1 year ago

Lul yes but no, but they are clearly better at many types of tasks.

technocrit@lemmy.dbzer0.com · edit-2 1 year ago

For example? Citations?

Pretty sure these “tasks” are meaningless metrics made up by pseudo-scientific grifters.

Jakeroxs@sh.itjust.works · 1 year ago

Small bits of code, language-related tasks, basic context understanding. These aren’t metrics I’ve formally measured; I’ve simply noticed they’ve improved compared to non-reasoning models in my homelab testing. 🤷‍♂️

IsaamoonKHGDT_6143@lemmy.zip · 1 year ago

AlphaFold 3, which can help predict the structures of some proteins. It has limitations, though: it can’t be used in all cases, only where it performs reliably.