• dil@piefed.zip
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    1 hour ago

    When training ai, I was always confused that they never gave a sht about tokens, like that was never something you picked any model over another for or reviewed, it often says to ignore the length but make note of how long it took, tbh that could be because of reasoning text not shown to the user

    • dantheclamman@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 minutes ago

      That’s interesting to hear. I wish efficiency had been considered from the start. It seems like there has been a ton of waste. They should only do the calculations needed to achieve the user objective. But they were prioritizing market share over efficiency. Maybe they thought they could afford another couple years of subsidizing wasteful use to build market share, but it hasn’t turned out that way.