Elon Musk said verified accounts would be limited to reading 6,000 posts per day while unverified users will be limited to 600.

  • elghoto@lemmy.world
    link
    fedilink
    English
    arrow-up
    80
    ·
    1 year ago

    I didn’t read the article. But could it be that every platform is trying to limit LLMs to be trained on their data?

      • Pexistralxinz@lemm.ee
        link
        fedilink
        English
        arrow-up
        68
        arrow-down
        1
        ·
        1 year ago

        Seems like the free internet as we knew it is dead. Any site with free, user-generated content to monetize Is about to try and suck every last dime from it.

        • wtfeweguys@lemmy.whynotdrs.org
          link
          fedilink
          English
          arrow-up
          58
          arrow-down
          1
          ·
          1 year ago

          The internet as we knew it wasn’t free. We were the product. Here’s hoping their drive to force payment sends us on to decentralized, open source infrastructure.

        • Zak@lemmy.world
          link
          fedilink
          English
          arrow-up
          30
          arrow-down
          1
          ·
          1 year ago

          Fortunately, we have the user-owned distributed internet to move to.

        • MysteriousSophon21@lemmy.world
          link
          fedilink
          English
          arrow-up
          12
          arrow-down
          1
          ·
          1 year ago

          Yep, capitalist greed ruins everything. Which is why distributed networks run by the community are our best hope for the future.

          Those fuckers would try to ruin this too, by bot attacks, by trying to cut deals with some of the admins or by running their own versions of Lemmy/Mastadon.

        • Marxine@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          1 year ago

          We now have the federated interne though, and I think it’s got a way brighter future.

        • Silviecat44@vlemmy.net
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          Honestly a paid internet is better. Just look at the Fediverse. Internet was never profitable. Now the data collection just needs to stop

        • spaceribs@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          1 year ago

          They were always going to do that, the squeeze is basically required if you’re planning on making a public offering and become beholden to investors.

      • TheWorstNL@lemmy.world
        link
        fedilink
        English
        arrow-up
        25
        arrow-down
        1
        ·
        1 year ago

        Also they haven’t paid Google for using their Cloud so they are moving their data.

    • cort@lemm.ee
      link
      fedilink
      English
      arrow-up
      23
      ·
      1 year ago

      No you misunderstand they desperately want them to be trained with their data. They just want them to pay hundreds of thousands to millions of dollars to do so. Twitter is not buckling under the weight of data scraping, Elon is just pissed that companies are data scraping instead paying his exorbitant API fees.

      • DrakeRichards@lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        ·
        1 year ago

        They just want them to pay hundreds of thousands to millions of dollars to do so.

        This is the hilarious part to me: some companies might pay these fees, but there will be many more who won’t and will instead use actual web scrapers to get their data anyways. As the number of individuals training LLM models increases in the next couple of years, this will create a much more significant traffic load compared to API calls.

        • cort@lemm.ee
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          1 year ago

          Yeah he doesn’t seem to understand he’s not selling the data, the data is public, he’s selling convenience. And if the convenience isn’t worth the price you’ve set, people will just take the extra effort and avoid the expense.

      • itsJoelleScott@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        1 year ago

        Exactly. I do selenium scripting as my main task for work, and as soon as I heard about how high the api rates were my first through was “Jesus, it might slower than straight api calls, and the dynamic xpaths might suck, but I could write a script that scrapes the website for cheaper.” Twitter is hurting for cash right now, and I imagine his effort to raise funds is the end goal here. He instituted the api policy, learned about another side effect, and continues to with the most extreme, devoid of nuance response each time.

        All “in my opinion,” of course.

    • 21racecar12@lemmy.world
      link
      fedilink
      English
      arrow-up
      15
      ·
      1 year ago

      Hmm. Sounds a lot like something /u/spez said. I wouldn’t expect Twitter to be a good LLM source with its current state anyway…Reddit would be a lot better contextually. The reality is Reddit and Twitter are bleeding cash and they’ve got brain-rotted CEOs that don’t pay their bills or have unrealistic plans and timelines for profitability.

      • Marxine@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 year ago

        Most of Twitter’s (and soon Reddit’s) data to be fed to LLMs will be porn sharing bots at this point.

    • fuzzzerd@kbin.social
      link
      fedilink
      arrow-up
      3
      ·
      1 year ago

      Seems like a strange way to enforce it, at the user level vs the api client level, unless they’re trying to guard against screen scraper types.