agnos.is Forums

lads

programmerhumor
58 Posts 23 Posters 1 Views
  • grysbok@lemmy.sdf.org wrote:

    Like I said, [edit: at one point] Facebook requested my robots.txt multiple times a second. You've not convinced me that bot writers care about efficiency.

    [edit: they've since stopped, possibly because now I give a 404 to anything claiming to be from facebook]

    quill7513@slrpnk.net wrote:
    #49

    You've not convinced me that bot writers care about efficiency.

    And why should bot writers care about efficiency when what they really care about is time? They'll burn all your resources without regard, simply because they're not the ones paying.


      grysbok@lemmy.sdf.org wrote:
      #50

      Yep, they'll just burn taxpayer resources (me and my poor servers) because it's not like they pay taxes anyway (assuming they are either a corporation or not based in the same locality as I am).

      There's only one of me and if I'm working on keeping the servers bare minimum functional today I'm not working on making something more awesome for tomorrow. "Linux sysadmin" is only supposed to be up to 30% of my job.


        grysbok@lemmy.sdf.org wrote:
        #51

        I mean, I enjoy Linux sysadmining, but fighting bots takes time, experimentation, and research, and there's other stuff I should be doing. For example, accessibility updates to our websites. But accessibility doesn't matter a lick if you can't access the website anyway due to timeouts.

        • C wrote:

          There's heavy, and then there's heavy. I don't have any experience dealing with threats like this myself, so I can't comment on what's most common, but we're talking about potentially millions of times more resources for the attacker than the defender here.

          There is a lot of AI hype and AI anti-hype right now, that's true.

          isveryloud@lemmy.ca wrote:
          #52

          I do. I have a client with a limited budget whose websites I'm considering putting behind Anubis because they're getting hammered by AI scrapers.

          It comes in waves, too, so the website may randomly go down or slow down significantly, which is really annoying because it's unpredictable.

          • D wrote:

            It's very intrusive in the sense that it runs a PoW challenge, unsolicited, on the client. That's literally like having a cryptominer running on your computer for each challenge.

            Everyone can do what they want with their own server, of course. But I, for instance, am very fond of scraping. I have FreshRSS running on my server, and the way it works is that when the target website doesn't provide an RSS feed, it scrapes the site to get the articles. I also have another service that scrapes pages to detect changes.

            I think part of the beauty of the internet is being able to automate processes, and software like Anubis puts a globally significant energy tax on these automations.

            Once again, everyone is able to do whatever they want with their server. But the thing I like the least is that the software is being marketed, with some great PR, as part of some great anti-AI crusade; I don't know if that comes from the devs themselves or some other party. I don't like this mostly because I think it is disinformation and manipulative towards people who may be easy to manipulate if you say the right words. I also think it's a discourse that pushes toward radicalization on certain topics, and I'm a firm believer that right now we need to reduce radicalization overall, not increase it.

            xthexder@l.sw0.com wrote:
            #53

            A proof of work challenge is infinitely better than the alternative of "fuck you, you're accessing this through a VPN and the IP is banned for being owned by Amazon (or literally any data center)"

            • R wrote:

              Hail Anubis-chan.

              xylight@lemdro.id wrote:
              #54

              Any "bot stopper" ends up stopping me somehow, including Anubis. I'm pretty sure I've been cursed by the RNG gods because even at 40 KH/s, I get stuck on the pages for like 2 minutes before it tells me success.

              Similar things like hCaptcha or Cloudflare Turnstile either never load or never succeed. reCAPTCHA gaslights me into thinking I was wrong.

              https://iloveanubis.phtn.app/

              • R wrote:

                Correct. Anubis' goal is to decrease the web traffic that hits the server, not to prevent scraping altogether. I should also clarify that this works because it costs the scrapers time with each request, not because it bogs down the CPU.

                xylight@lemdro.id wrote:
                #55

                Why not then just make it a setTimeout or something so that it doesn't nuke the CPU of old devices?

                • mod_pp@lemmy.world wrote:
                  This post did not contain any content.
                  E wrote:
                  #56

                  The block underneath AI is Python.

                  • xylight@lemdro.id wrote:

                    Why not then just make it a setTimeout or something so that it doesn't nuke the CPU of old devices?

                    R wrote:
                    #57

                    Crawlers don't have to follow conventions or specifications. If one has a setTimeout implementation that doesn't wait the specified amount of time and simply executes the callback immediately, it defeats the system. Proof-of-work is meant to ensure that the time factor can't be bypassed, because the computation itself is what takes the time.

                    Anubis is an emergency solution against the flood of scrapers deployed by massive AI companies. Everybody wishes it wasn't necessary.
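
                    To make the difference from a plain timer concrete, here is a minimal hash-based proof-of-work sketch in Python. It is an illustration only and not Anubis's actual implementation: the function names `solve` and `verify`, the challenge string, and the "leading hex zeros" difficulty rule are all assumptions chosen for the example. The point it shows is the asymmetry: the client has to try many nonces, while the server checks the answer with a single hash, so there is no timer the client could simply skip.

```python
import hashlib
from itertools import count


def solve(challenge: str, difficulty: int) -> int:
    """Client side (hypothetical): brute-force a nonce whose SHA-256 digest
    of challenge+nonce starts with `difficulty` hex zeros."""
    prefix = "0" * difficulty
    for nonce in count():
        digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
        if digest.startswith(prefix):
            return nonce


def verify(challenge: str, nonce: int, difficulty: int) -> bool:
    """Server side (hypothetical): one hash is enough to check the work."""
    digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
    return digest.startswith("0" * difficulty)


if __name__ == "__main__":
    # "example-challenge" is a made-up value; a real server would issue a
    # fresh, unpredictable challenge per visitor.
    nonce = solve("example-challenge", 3)
    print(nonce, verify("example-challenge", nonce, 3))
```

                    Each extra hex zero of difficulty multiplies the expected number of hashes the client must compute by 16, while the server's verification cost stays at one hash. That is also why slow devices feel it more than a timer would.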

                    • xylight@lemdro.id wrote:

                      Any "bot stopper" ends up stopping me somehow, including Anubis. I'm pretty sure I've been cursed by the RNG gods because even at 40 KH/s, I get stuck on the pages for like 2 minutes before it tells me success.

                      Similar things like hCaptcha or Cloudflare Turnstile either never load or never succeed. reCAPTCHA gaslights me into thinking I was wrong.

                      https://iloveanubis.phtn.app/

                      T wrote:
                      #58

                      Have you ever had a Blade Runner moment?
