• FlashMobOfOne@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 hours ago

    AI does quality review too. Imperfectly, but the individual functions of AI acting within those systems have all improved dramatically in the past 12 months

    I suppose that depends on how we define improvement, because from where I’m sitting, it’s reasonable to be apprehensive about LLM’s and their output when we see spectacular failure after spectacular failure.

    Whether it’s bombing a school in Iran because Claude fucked up the targeting, or an AI agent deleting your email inbox or your production database, or creating a court case out of thin air, or stats in a SCOTUS ruling that are fictitious, over and over and over again the extravagant promises they keep telling us are just around the corner appear to be decidedly half-baked.

    And if you use Teams or Windows and pieces of functionality that worked for two decades are no longer working as designed in a dependable way, I guess I just don’t know what to tell you.

    It makes perfect sense not to trust this technology, and the speed it promises is often mitigated by the fact that you can’t and shouldn’t trust its output, because if you’re the unlucky SOB that doesn’t check a reference, you can literally become national news.

    Further, being that it’s already been trained on the entirety of recorded human knowledge, I’m not sure how it gets better either. You can make it faster, but it’s just going to spit out slop at a faster rate.

    • MangoCats@feddit.it
      link
      fedilink
      English
      arrow-up
      1
      ·
      32 minutes ago

      Whether it’s bombing a school in Iran because Claude fucked up the targeting

      I’m going to call user error on that, and I don’t think it matters what system they were using - they were going to make mistakes.

      an AI agent deleting your email inbox or your production database

      The real error there? Conducting risky operations without backups.

      creating a court case out of thin air

      That’s just big silicon-brass balls. Interns do it too, but you don’t hear about them. On the other hand, trusting the AI or the intern, that’s disbarment levels of reckless.

      It makes perfect sense not to trust this technology

      Or any technology, until we have figured out what it is, and isn’t, capable of doing reliably.

      But, plenty of people still play Russian Roulette, for one reason or another. Is that the revolver manufacturer’s fault?

      being that it’s already been trained on the entirety of recorded human knowledge, I’m not sure how it gets better either

      Better editing.