im personally excited for it.

  • steam@programming.devOP
    link
    fedilink
    arrow-up
    1
    arrow-down
    18
    ·
    2 days ago

    openai for example claims chatgpt can perform close to or above phd level math. so your claim about it not passing math tests isn’t really correct.

    • trxxruraxvr@lemmy.world
      link
      fedilink
      arrow-up
      12
      ·
      2 days ago

      so your claim about it not passing math tests isn’t really correct.

      You base this statement on claims from the company that’s trying to sell the product. Maybe try to make chatgpt do some sums first before believing these claims.

    • slazer2au@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      2 days ago

      openai for example claims

      Claiming something doesn’t make it true. If it were true. It wouldn’t be a claim it would be a statement of fact which is directly opposed to the point of marketing.

      Seeing as you don’t support your claim that it can do permanent head damage maths, I will support my statement of they suck at maths.

      https://www.theregister.com/software/2026/02/26/ai-models-suck-slightly-less-at-math-than-they-did-last-year/5191967

    • GalacticSushi@piefed.blahaj.zone
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 days ago

      openai … claims

      so your claim … isn’t really correct

      A company selling a product made an unsubstantiated claim to market their product… Therefore any contradictory claim is assumed to be false?

      If you think this passes for sound logic then I’m sure whatever Chat-GPT can do passes for acceptable math to you.

    • brynden_rivers_esq@lemmy.ca
      link
      fedilink
      arrow-up
      7
      ·
      2 days ago

      lol, sorry…the company selling a product made an outlandish claim and you believe it? I got a bridge to sell you!

      I think the fundamental problem is that you haven’t actually used these tools enough to see that the companies peddling them are snake oil salesmen. Nothing they say is true, they’re completely full of shit, and they’re terrified they’re not going to make any of the money they promised their investors they would make.

      And like…if you think about it for a moment it would make sense, given how these tools work, that they can’t do much in the way of reasoning. They just get better and better at replicating the language they’re supposed to mimic, which sometimes even carries the right content! Wow! But that’s only a secondary effect, right?

    • unmagical@lemmy.ml
      link
      fedilink
      arrow-up
      7
      ·
      2 days ago

      That is a testable claim. I just had it calculate the force of gravity between 2 objects and it was wrong by virtue of rounding incorrectly. That’s like highschool level math.

        • slazer2au@lemmy.world
          link
          fedilink
          English
          arrow-up
          7
          ·
          2 days ago

          Why should we have to pay extra for a computer to do maths correctly? That is basic functionality of a computer.

        • unmagical@lemmy.ml
          link
          fedilink
          arrow-up
          6
          ·
          2 days ago

          Chatgpt, as made available by the first search result for “chatgpt.” I chose this approach to match your claim you chose to repeat without verification.

          I am not going to cross shop stochastic regressions with an English LUT in order to find one that works better for math just to appease your suspiciously wheel ladened goal post.

          • steam@programming.devOP
            link
            fedilink
            arrow-up
            2
            arrow-down
            8
            ·
            2 days ago

            ok. but the free models are dumber than pro version. so chatgpt isn’t defined by its base model.

            idc if you buy chatgpt sub or not. im just asking tech questions here

    • one_old_coder@piefed.social
      link
      fedilink
      English
      arrow-up
      4
      ·
      2 days ago

      The math lie was a lie. It turned out that it’s some guy who did all the job. You’ve been lied to like every other sucker.