Microsoft’s GitHub next month plans to begin using customer interaction data – “specifically inputs, outputs, code snippets, and associated context” – to train its AI models.

  • Nobilmantis@feddit.it
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    3
    ·
    20 hours ago

    Bro, I dont dig this either, but the title is a bit misleading. What they said (and they have been pretty transpartent about it: banner on the site plus email if you have an account) is that they will train their Copilot models from the user interactions with copilot, and you can opt-out.

    Now, I know the importance of defaults, but we are talking about Github, a platform for developers, I would REALLY assume these are the people that REALLY are able to toggle a setting to their preference, especially when they have been properly informed about it.

    Let’s try to save the indignment for when it is justified, this was not executed in a shady way, I would much rather Microsoft do any policy change this way.

    At least thats my opinion lol

  • S4m_S3p1l@infosec.pub
    link
    fedilink
    English
    arrow-up
    10
    ·
    1 day ago

    I’m not surprised, companies are starting to realise that AI is only as useful as the data it’s trained on. If you blast it with all the internet slop we have completely unfiltered, it’s going to start fucking up all it’s responses. It’s not just about the volume of data, it’s about the quality of that data. Sites like Github, and academic journals, contain the exact data that companies need to create well rounded LLMs, that don’t go off on racist rants and declare themselves as “MechaHitler”. That makes data like Github’s pure gold.

    • JigglySackles@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      14 hours ago

      Just made an account there myself. Has it worked nicely for you? (I’m assuming so since you recommend it)

    • mutant_zz@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      ·
      2 days ago

      There’s really not much locking us in to GitHub. Even moving an existing repo is not that hard. I started using Codeberg a few months ago and have yet to see the downside

      • panda_abyss@lemmy.ca
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 day ago

        Yeah, I’m on forgejo and the grass is just as green.

        Unless you want to self host runners to public code — I haven’t figured that out yet. But I run my own server on my own network so I’m not exactly worried about security.

    • trolololol@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 day ago

      I’m keeping my new repo in both GitHub and codeberg, but couldn’t figure out yet a few things:

      How do I get unit tests to run on codeberg? I won’t self host it

      How do I make jitpack see/checkout/build from codeberg?

  • Alfredolin@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    25
    arrow-down
    1
    ·
    2 days ago

    Yeah and Github does not let you use an alias for the login email. For real I got shadowbanned (or something similar): I did not see any warning and could not do any search in a repo and noticed my issues went unanswered… because nobody could fucking see them. So I wrote to support and they told me to use a name.surname email address. I told them to fuck off and never logged in again.

    • Skankhunt420@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      14
      ·
      edit-2
      2 days ago

      Holy shit this is insane!

      Microsoft is truly one of the worst companies for the user experience in my opinion. Its like they hate their users.

      • Alfredolin@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        10
        arrow-down
        4
        ·
        edit-2
        1 day ago

        I have not been accurate. Here was the answer:

        GitHub** (GitHub Support)

        May 30, 2025, 8:49 AM UTC

        Hi there,
         
        Thank you for contacting GitHub Support.
         
        Our abuse detecting systems flagged your account because of the email address you used to register the account. Before we can remove the flag we need you to add and verify a personal, non-disposable, non-aliased email address.
         
        You can add an email address by following the steps here:
         
        https://docs.github.com/github/setting-up-and-managing-your-github-user-account/adding-an-email-address-to-your-github-account
         
        …and you can follow these steps to verify it:
         
        https://docs.github.com/github/getting-started-with-github/verifying-your-email-address#verifying-your-email-address
         
        Once more, we’ll need you to remove the current email address from your account.
         
        To clarify, we don’t need anything ‘traceable’ to you, feel free to use protonmail or tutanota etc. (just examples, we don’t have any particular recommendation here) it just can’t be a “throwaway” or temporary domain for security and deliverability reasons. You are also welcome to connect to GitHub using a VPN or TOR node if and as you wish.
         
        Let us know when you’ve completed these steps and we’ll be happy to review your account again.
         **
        Github support,
        Rio.

        The alias was/is active, verified and verifiable, I even have TOTP and my fucking phone number on that account, I just checked… So no, thanks, I am not going to send you DNA samples.

    • Holytimes@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 day ago

      It’s funny you think that deleting your account is goanna remove the data.

      They will just pull a back up. There’s zero chance these companies are going to risk losing the equivalent of pure gold

        • JigglySackles@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          15 hours ago

          Whether they are or not, these companies are. And despite the GDPR or other protections required by EU countries, I highly doubt that these companies will truly honor that. They will just be better about obfuscating it.

  • Anas@lemmy.world
    link
    fedilink
    English
    arrow-up
    33
    ·
    2 days ago

    I’m already in the process of leaving, not to Codeberg, but to a self-hosted instance of Forgejo.

    • VeryVito@lemmy.ml
      link
      fedilink
      English
      arrow-up
      11
      ·
      2 days ago

      You won’t regret it. I’ve been using it for about a year now, and it rocks.

      • panda_abyss@lemmy.ca
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 day ago

        I like that I can use it as a container repo too

        I mirror the container images I use on my network in case there’s ever a disruption now.

        • originaltnavn@lemmy.zip
          link
          fedilink
          English
          arrow-up
          8
          ·
          2 days ago

          Yes, the only differences are the urls you use when cloning, and the website UI for merge requests and similar. Git is an open source program, github, forgejoe, gitlab, gogs and similar are only managementsoftware for hosting git repositories online.

  • Otter@lemmy.ca
    link
    fedilink
    English
    arrow-up
    204
    ·
    edit-2
    2 days ago

    Date

    As of April 24 you’ll be feeding the Octocat unless you opt out

    Current scope

    The code locker’s revised policy applies to Copilot Free, Pro, and Pro+ customers, as of April 24. Copilot Business and Copilot Enterprise users are exempt thanks to the terms of their contracts. Students and teachers who access Copilot will also be spared.

    To opt out (link edited by me to make it clickable)

    Those affected have the option to opt out in accordance with “established industry practices” – meaning according to US norms as opposed to European norms where opt-in is commonly required. To opt out, GitHub users should visit github.com/settings/copilot/features and disable “Allow GitHub to use my data for AI model training” under the Privacy heading.

      • Otter@lemmy.ca
        link
        fedilink
        English
        arrow-up
        22
        ·
        2 days ago

        Interestingly, mine was still enabled from the last time I must have toggled that setting.

        If they do screw around, they could just train on everything without asking anyone

        • JustEnoughDucks@feddit.nl
          link
          fedilink
          English
          arrow-up
          2
          ·
          19 hours ago

          I would bet literally any amount of money that the button doesn’t stop the AI from training on your data.

        • SCmSTR@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          6
          arrow-down
          1
          ·
          2 days ago

          I hate where society is at right now. I just want to skip ahead to where the social contract makes it standard to prevent this sort of hostile behavior. Or something. I refuse to accept that it’s me, and my age or culture makes me so deeply discordant to current socioeconomic practices.

    • Samsy@lemmy.ml
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      2 days ago

      Strange, I was already opt-out, must be an European thing. We are “opt-out” to a lot of things going on in the world lately.

      • Otter@lemmy.ca
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 days ago

        Do you fall under the affected group? Maybe it’s only listed for those who do

  • chunes@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    ·
    2 days ago

    GitHub is such a shit hole these days. Half the time, they won’t even let me view a repo unless I’m logged in.