Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

agnos.is Forums

  1. Home
  2. Lemmy Shitpost
  3. The internet kind of sucks right now

The internet kind of sucks right now

Scheduled Pinned Locked Moved Lemmy Shitpost
lemmyshitpost
151 Posts 95 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • P [email protected]

    What prevents Discord from selling the chat logs to AI companies?

    I This user is from outside of this forum
    I This user is from outside of this forum
    [email protected]
    wrote last edited by
    #109

    Discord's complete lack of indexing. Although it's definitely not impossible to scrape data from Discord it would take more resources than say reddit.

    repletelocum@lemmy.blahaj.zoneR P 2 Replies Last reply
    1
    • nuko147@lemmy.worldN [email protected]

      I have never understood why people moved stuff to the closed Discord server system..

      A This user is from outside of this forum
      A This user is from outside of this forum
      [email protected]
      wrote last edited by
      #110

      Go for the voicechats, stay for the poorly organized forum megathread experience

      1 Reply Last reply
      3
      • A [email protected]

        What if we made it so that the text that any use posts in the forum website has a bunch of nonsense letters mixed in between the letters they posted, but they're all set to a REALLY small font or even taking no space thanks to those special characters, and colored in such a way to make them disappear into the background? That way when a person reads it makes sense but when a scraper gets it it'll just be a jumbled useless mess!

        S This user is from outside of this forum
        S This user is from outside of this forum
        [email protected]
        wrote last edited by
        #111

        Then search engines wouldn't find anything from the forum either.

        A 1 Reply Last reply
        4
        • I [email protected]

          Discord's complete lack of indexing. Although it's definitely not impossible to scrape data from Discord it would take more resources than say reddit.

          repletelocum@lemmy.blahaj.zoneR This user is from outside of this forum
          repletelocum@lemmy.blahaj.zoneR This user is from outside of this forum
          [email protected]
          wrote last edited by
          #112

          But they Index everything.
          Just request your data and you’ll get a neat package of all your messages with timestamps and all.

          M 1 Reply Last reply
          6
          • I [email protected]

            Discord's complete lack of indexing. Although it's definitely not impossible to scrape data from Discord it would take more resources than say reddit.

            P This user is from outside of this forum
            P This user is from outside of this forum
            [email protected]
            wrote last edited by
            #113

            If an AI company pays Discord they won't scrape but get the data directly.

            1 Reply Last reply
            7
            • nuko147@lemmy.worldN [email protected]

              I have never understood why people moved stuff to the closed Discord server system..

              O This user is from outside of this forum
              O This user is from outside of this forum
              [email protected]
              wrote last edited by
              #114

              To build up a close (the two definitions of "close") community. To speak freely (even if you have to respect the TnCs of Discord + community guidelines.)

              1 Reply Last reply
              1
              • R [email protected]
                This post did not contain any content.
                P This user is from outside of this forum
                P This user is from outside of this forum
                [email protected]
                wrote last edited by
                #115

                Self hosting is the way.

                M 1 Reply Last reply
                4
                • R [email protected]
                  This post did not contain any content.
                  M This user is from outside of this forum
                  M This user is from outside of this forum
                  [email protected]
                  wrote last edited by
                  #116

                  Got a lot "knowledge" AI companies are after?

                  1 Reply Last reply
                  0
                  • R [email protected]
                    This post did not contain any content.
                    T This user is from outside of this forum
                    T This user is from outside of this forum
                    [email protected]
                    wrote last edited by
                    #117

                    The forum I call home tolerates a lot of hate speech.

                    I think I'm out, but it's less about the AI scraping and more about moderation.

                    T 1 Reply Last reply
                    1
                    • R [email protected]
                      This post did not contain any content.
                      B This user is from outside of this forum
                      B This user is from outside of this forum
                      [email protected]
                      wrote last edited by
                      #118

                      Its been shit since covid. Everyone constantly online, and really ramping up the stupid as fuck culture wars. "Back in my day" I could log into a chat room, have some fun conversations, and then log off without getting pissed off or pissed on. I could look at movie news, and not be swapped by performative hate or praise for whatever fucking movie is or isnt "woke".

                      Everywhere you go, you see "Be civil" or "Be respectful". But all that really means is, dont question out echo chamber. And if you do, well, turns out not being civil towards you doesnt count.

                      Left and right doesnt matter. Its all hate and performative praise as far as the eye can see.

                      1 Reply Last reply
                      2
                      • P [email protected]

                        Self hosting is the way.

                        M This user is from outside of this forum
                        M This user is from outside of this forum
                        [email protected]
                        wrote last edited by [email protected]
                        #119

                        Doesn't really solve the AI scraping or the silo problem and as Codeberg found out recently, solving the AI scraping DDOS is never ending

                        1 Reply Last reply
                        3
                        • R [email protected]

                          If I'm going to share my information and knowledge publicly on an Internet site, I'd like everyone to have fair and open access to it, not at the whims of a multinational corp to gatekeep for me. So the fact that AI can access it too doesn't discourage me.

                          You have information from me because I choose to share it, not because a site has demanded I give it up without a clear benefit to me in return.

                          M This user is from outside of this forum
                          M This user is from outside of this forum
                          [email protected]
                          wrote last edited by [email protected]
                          #120

                          I think there's a lot of solid arguments against letting AI steal everything, but with the scraping there's an even more immediate problem. They don't rate limit or do it in an intelligent method. It becomes a full blown ddos that has take down entire sites and slowed many more to the point of near uselessness.

                          They're in a very literal sense crashing large chunks of the Internet and causing havoc which costs very real money to fix, either by upping server resources or installing AI scraping mitigation resources so that every still has access to the free information you mention.

                          R 1 Reply Last reply
                          0
                          • explodicle@sh.itjust.worksE [email protected]

                            (IANAL) Wouldn't this count as fair use since the AI sex bot is only using snippets?

                            M This user is from outside of this forum
                            M This user is from outside of this forum
                            [email protected]
                            wrote last edited by
                            #121

                            That's currently being argued in the courts. There's a lot that goes into it from right to distribution, to proving that although the AI bot can't reproduce everything even though it normally doesn't.
                            [https://arstechnica.com/features/2025/06/study-metas-llama-3-1-can-recall-42-percent-of-the-first-harry-potter-book/](A very real example of reproducibility)

                            There's also arguments about how they accessed large amounts of content. The law doesn't just recognize whether you can access something or not, but what you access it for. There's laws about accessing things with the sole purpose of using it to develop a commercial product. All of it is a tangled mess that there's no current clear answer to (legally, morally I think there is but that's very opinionated)

                            1 Reply Last reply
                            1
                            • T [email protected]

                              Depends on the forum, plenty of echo chambers out there...

                              M This user is from outside of this forum
                              M This user is from outside of this forum
                              [email protected]
                              wrote last edited by
                              #122

                              A lot of the forums I'm seeing talked about where more technical or objective kinds. Like in a car forum there'd be repair manuals or parts lists, fountain pen forums would have loads of images comparing inks side by side for different shades and hues. Those are the sorts of knowledge centers being discussed and reminisced about a lot here.

                              T 1 Reply Last reply
                              0
                              • M [email protected]

                                I think there's a lot of solid arguments against letting AI steal everything, but with the scraping there's an even more immediate problem. They don't rate limit or do it in an intelligent method. It becomes a full blown ddos that has take down entire sites and slowed many more to the point of near uselessness.

                                They're in a very literal sense crashing large chunks of the Internet and causing havoc which costs very real money to fix, either by upping server resources or installing AI scraping mitigation resources so that every still has access to the free information you mention.

                                R This user is from outside of this forum
                                R This user is from outside of this forum
                                [email protected]
                                wrote last edited by
                                #123

                                That is definitely a problem that needs to be dealt with, since AI scrapers hogging bandwidth or making sites inaccessible means it is hampering equal access to everyone. Ignoring conventions and not rate limiting itself are harmful to the open internet.

                                So yes, those kinds of AI scraping behaviours should be mitigated, but on the principle of AI ingesting my public data, I'm not against it, if it can access it reasonably and fairly like anyone else.

                                1 Reply Last reply
                                0
                                • S [email protected]

                                  Then search engines wouldn't find anything from the forum either.

                                  A This user is from outside of this forum
                                  A This user is from outside of this forum
                                  [email protected]
                                  wrote last edited by
                                  #124

                                  True... But what if the forum had its own search engine that could ignore the anti-scraping stuff? The issue would be making a good search engine lol

                                  1 Reply Last reply
                                  0
                                  • R [email protected]
                                    This post did not contain any content.
                                    L This user is from outside of this forum
                                    L This user is from outside of this forum
                                    [email protected]
                                    wrote last edited by
                                    #125

                                    Is that what we're up against? I thought every time I voice my mind on forums it gets upvoted or downvoted or ignored, but always ultimately ignored 🤷🏼‍♀️.

                                    Q 1 Reply Last reply
                                    0
                                    • R [email protected]
                                      This post did not contain any content.
                                      V This user is from outside of this forum
                                      V This user is from outside of this forum
                                      [email protected]
                                      wrote last edited by
                                      #126

                                      What is making you stink discord isn't also selling all its data to AI companies?

                                      1 Reply Last reply
                                      18
                                      • R [email protected]
                                        This post did not contain any content.
                                        A This user is from outside of this forum
                                        A This user is from outside of this forum
                                        [email protected]
                                        wrote last edited by
                                        #127

                                        Lol I knew this one year ago

                                        1 Reply Last reply
                                        1
                                        • T [email protected]

                                          People these days dont realise that confidently incorrect people pre-exist facebook.

                                          It's different though.

                                          If you were a flat earther in 1982, you probably would have a weird self published "newspaper" by someone 4 times a year, and two or three books and no platform beyond literally shouting on the street at people who all considered you a moron.

                                          Nowadays, if you're a crackpot, you can instantly find 17.000 other crackpots who will happily not just confirm your idiocy, but make up fake stories to support your bullshit ideas. They will also drag you along by pure crank magnetism into other bullshit. You can spread your bullshit far and wide, and since people are automatically served with similar content, you're even likely to find other idiots like you "in the wild", which is actually an algorithmic bubble.

                                          Before, nobody you met in real life would agree with you. Nowadays, everyone you "meet" online agrees with you.

                                          So yes, confidently incorrect people have always been there, but not in these numbers, and rarely to this level of confidence. That's why people react to vehemently, they rarely ever reach outside their bubble. Your ideas that the world is round aren't the general concept to them, they hear from flat earthers every single hour of the day.

                                          merc@sh.itjust.worksM This user is from outside of this forum
                                          merc@sh.itjust.worksM This user is from outside of this forum
                                          [email protected]
                                          wrote last edited by
                                          #128

                                          Nowadays, if you're a crackpot, you can instantly find 17.000 other crackpots who will happily not just confirm your idiocy, but make up fake stories to support your bullshit ideas.

                                          And because crackpots like this are very engaged in their crackpottery, it's a great place to put ads. That means that the big Internet ad companies all want to be the ones to host those bullshit ideas.

                                          Back in the day, the reason crackpot newspapers had to be self-published is that the big publishers didn't want to have anything to do with the crackpots. But, in the modern world, Google / Meta can find someone who wants to run an add to your crackpottery, so you get the same treatment as a big media publisher. In fact, you might get better treatment because crackpottery may be stickier than say the Boston Globe, so Google / Meta might prefer to work with you because it allows them to show more ads.

                                          1 Reply Last reply
                                          1
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups