• melfie@lemy.lol
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    1
    ·
    edit-2
    8 hours ago

    I’m sure LLMs can be useful for automation as long as you know what you’re doing, have tested your prompts rigorously on the specific version of the model and agent you’re using, and have put proper guardrails in place.

    Just blindly assuming a LLM is intelligent and will do the right thing is stupid, though. LLMs take text you give them as input and then output some predicted text based on statistical patterns. That’s all. If you feed it a pile of text with a chat history that says it deleted all your shit, the text it might predict that statistically should come next is an apology. You can feed that same pile of text to 10 different LLMs, and they might all “apologize” to you.

    • HugeNerd@lemmy.ca
      link
      fedilink
      English
      arrow-up
      16
      ·
      7 hours ago

      Or just learn any of the real automation tools that have been programmed by real programmers over the last half century?

      • jj4211@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        ·
        6 hours ago

        Recently someone lamented that just asking for an alarm to be set cost them tons of money and didn’t even work right…

        It was foolish enough to let LLM go to town on automation, but for open ended scenarios, I at least got the logic even if it was stupidly optimistic.

        But implementing an alarm? These people don’t even have rationality to their enthusiasm…

        • Flatfire@lemmy.ca
          link
          fedilink
          English
          arrow-up
          3
          ·
          3 hours ago

          If I remember right, that post wasn’t designed to highlight a practical use-case, but rather to set up a simple task as a “how could I apply this?” type of experimentation. The guy got roasted for it, but I think it’s a very reasonable thing to try because it’s a simple task you can see the direct result of in practice.

          The cost problem was highlighted as well, because if such a simple task is a problem, it can’t possibly scale well.

    • Trainguyrom@reddthat.com
      link
      fedilink
      English
      arrow-up
      4
      ·
      6 hours ago

      Yeah at work I had a realization recently that power automate and similar systems with AI steps are going to be really powerful. Since you have a bunch of deterministic steps you can just have the AI do the one text manipulation bit where you don’t need deterministic output (handy for non-deterministic inputs for example)

  • ReallyCoolDude@lemmy.ml
    link
    fedilink
    English
    arrow-up
    92
    arrow-down
    1
    ·
    1 day ago

    How could any person with some programing literacy event thinking about installing openclaw. A malware ridden by critical bugs

    • Jrockwar@feddit.uk
      link
      fedilink
      English
      arrow-up
      13
      ·
      edit-2
      11 hours ago

      I don’t think there’s anything wrong with running Openclaw. What is way too brave for my taste is giving it access to accounts with your personal data, or the filesystem in your computer. That’s a disaster waiting to happen.

      I run it in an isolated server, and it doesn’t have access to my data - if it goes tits up, it deletes unimportant stuff only. If anyone gets access to the credentials in it, it’s a bunch of budget-limited API keys, so they can spend all of $4 on openrouter. Maybe the riskiest bit is its Google account. I went with the approach of giving it its own Google account, so that it can create docs and calendar events and then add me, rather than getting access to my Google account. But then again… That account has no payment info, nothing that I would be mega worried if it got leaked…

      Sure, it might limit the usefulness a bit, but I think installing something like this is only acceptable if you sandbox it and don’t let it access valuable information. Going full mad scientist on something as “alpha” as this, letting it run wild with your info is nuts.

      • flux@lemmy.ml
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        8 hours ago

        I went with the approach of giving it its own Google account, so that it can create docs and calendar events and then add me, rather than getting access to my Google account.

        I wonder though: if Google can link this account to you as its actual owner, I wonder if there’s a risk if the bot does something against the ToS?

        I hope you have backups of your Google account…

    • XLE@piefed.social
      link
      fedilink
      English
      arrow-up
      54
      ·
      1 day ago

      She’s the head AI Safety Expert for Meta. The field might as well be labeled AI Misunderstander.

      • ReallyCoolDude@lemmy.ml
        link
        fedilink
        English
        arrow-up
        5
        ·
        15 hours ago

        I work with some data sciencetists and ml engineers on web projects. They might be good at etls, fine tuning etx, but dont let them touch anything with a public.layer or infra constraints.

    • 5gruel@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      17
      ·
      1 day ago

      I program medical devices for a living and I have openclaw and nanobot running at home. AMA.

      • melfie@lemy.lol
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        8 hours ago

        I don’t get all the downvotes, unless people misinterpreted your comment and assume you’re using it for medical devices. It’s open source and can be run with locally hosted, open weight models, so no harm in playing around with it as long as you don’t give it access to anything too risky.

        • 5gruel@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          3
          ·
          14 hours ago

          Because i want to work on meaningful things that benefit people directly.

          Because i want to unterstand the capabilities and limitations of openclaw-like agents. LLMs aren’t going away, better be proactive and learn what the hype is about.

          • raspberriesareyummy@lemmy.world
            link
            fedilink
            English
            arrow-up
            7
            arrow-down
            2
            ·
            9 hours ago

            here’s hoping you are just trolling, because people with that kind of approach to medical devices should be in prison.

              • raspberriesareyummy@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                6 hours ago

                there’s no mention of “privately” (some people work at home) and with the introduction, poster is giving the opposite impression - ragebaiting at the very least.

        • 5gruel@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          2
          ·
          14 hours ago

          That’s why unit and integration tests shouldn’t be written by Copilot.

          • jj4211@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            6 hours ago

            Why not, if copilot writes the code and tests, then the tests can be passed so much more easily!

      • ReallyCoolDude@lemmy.ml
        link
        fedilink
        English
        arrow-up
        7
        ·
        24 hours ago

        How you deal with critical vulnerabilities on your system? Do you work with high confidential data and have openclaw os those system? How many medical devices did you have to secure from mass incursion?

  • chuck@lemmy.ca
    link
    fedilink
    English
    arrow-up
    18
    ·
    1 day ago

    Don’t worry ask the pentagon’s grok to taskthe nsa’s chat got to recreate your inbox from their profile of you and meta data of your correspondence 🤣

    • ATS1312@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      7 hours ago

      Last I knew, they switched from Anthropic to chatGPT

      Either way, what Im hearing is you can get private access, with some creativity, to anything the US intelligence apparatus knows.

  • Creat@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    16
    ·
    edit-2
    1 day ago

    Wasn’t this many days ago already, or did it happen again? I remember reading this like 3 or 4 days ago as well.

    • XLE@piefed.social
      link
      fedilink
      English
      arrow-up
      4
      ·
      1 day ago

      This was 3 or 4 days ago.

      I thought of it after Anthropic virtuously announced they would not create autonomous murder devices for the US government (but basically everything else was on the table). Because I’m pretty sure the US military could have just used an Anthropic OpenClaw to bomb civilians as easily as this Facebook AI Safety expert used OpenClaw to destroy her emails.

  • melfie@lemy.lol
    link
    fedilink
    English
    arrow-up
    7
    ·
    1 day ago

    I have no interest in using it, but at least it’s MIT licensed, which puts it ahead of Microslop’s rubbish if nothing else.

    • Elvith Ma'for@feddit.org
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 day ago

      Yeah, but if I understand that correctly, that’s just for the app itself the LLM is very likely still a proprietary one (ChatGPT, Grok,…)

      • Jrockwar@feddit.uk
        link
        fedilink
        English
        arrow-up
        1
        ·
        11 hours ago

        The LLM is whatever you want it to be. Self hosted or from any provider with a compatible endpoint. It’s likely a proprietary one… Because the cost of training LLMs means most are proprietary ones.