Anubis - Weighs the soul of incoming HTTP requests using proof-of-work to stop AI crawlers

[email protected]

Or even a quick link to the relevant portion of the docs at least would be cool

[email protected]

It could be infinitely wide too if they desired. It shouldn't be that hard to do I wouldn't think. I would suspect they limit the time a chain can use though to eventually escape out, though this still protects data because it obfuscates legitimate data that it wants. The goal isn't to trap them forever. It's to keep them from getting anything useful.

[email protected]

It requires a bunch of browser features that non-user browsers don't have, and the proof-of-work part is like the least relevant piece in this that only gets invoked once a week or so to generate a unique cookie.

I sometimes have the feeling that as soon as some crypto-currency related features are mentioned people shut off part of their brain. Either because they hate crypto-currencies or because crypto-currency scammers have trained them to only look at some technical implementatiin details and fail to see the larger picture that they are being scammed.

[email protected]

I use https://sx.catgirl.cloud/ so I'm already primed to have anime catgirls protecting my webs.

[email protected]

Catgirls, jackalgirls, all embarrassing. Go full-on furry.

[email protected]

This looks like it can can actually fuck up some models, but the unnecessary CPU load it will generate means most websites won't use it unfortunately

[email protected]

What's the ffxiv reference here?

Anubis is from Egyptian mythology.

[email protected]

The names of release versions are famous FFXIV Garleans

[email protected]

That's a tarpit that you're describing, like iocaine or nepthasis. Those are to feed the crawler junk data to try and make their eventual output bad.

Anubis tries to not let the AI crawlers in at all.

[email protected]

So if you try to access a website using this technology via terminal, what happens? The connection fails?

[email protected]

If your browser doesn't have a Mozilla user agent (I.e. like chrome or Firefox) it will pass directly. Most AI crawlers use these user agents to pretend to be human users

[email protected]

That would be reasonable. The people running these things aren't reasonable. They ignore every established mechanism to communicate a lack of consent to their activity because they don't respect others' agency and want everything.

[email protected]

What I'm thinking about is more that in Linux, it's common to access URLs directly from the terminal for various purposes, instead of using a browser.

[email protected]

If you're talking about something like curl, that also uses its own User agent unless asked to impersonate some other UA. If not, then maybe I can't help.

agnos.is Forums

Anubis - Weighs the soul of incoming HTTP requests using proof-of-work to stop AI crawlers