this post was submitted on 21 Mar 2025
156 points (98.1% liked)

Selfhosted

44757 readers
2339 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

I just started using this myself, seems pretty great so far!

Clearly doesn't stop all AI crawlers, but a significantly large chunk of them.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 3 points 18 hours ago* (last edited 18 hours ago) (4 children)

It is not great on many levels.

  • It only runs against the Firefox user agent. This is not great as the user agent can easy be changed. It may work now but tomorrow that could all change.

  • It doesn't measure load so even if your website has only a few people accessing it they will stick have to do the proof of work.

  • The POW algorithm is not well designed and requires a lot of compute on the server which means that it could be used as a denial of service attack vector. It also uses sha256 which isn't optimized for a proof of work type calculation and can be brute forced pretty easily with hardware.

  • I don't really care for the animé cat girl thing. This is more of a personal thing but I don't think it is appropriate.

In summary the Tor implementation is a lot better. I would love to see someone port it to the clearnet. I think this project was created by someone lacking experience which I find a bit concerning.

[–] [email protected] 1 points 25 minutes ago

I use https://sx.catgirl.cloud/ so I'm already primed to have anime catgirls protecting my webs.

[–] [email protected] 10 points 18 hours ago* (last edited 18 hours ago) (1 children)
  1. Doesn't run against Firefox only, it runs against whatever you configure it to. And also, from personal experience, I can tell you that majority of the AI crawlers have keyword "Mozilla" in the user agent.

  2. Yes, this isn't cloudflare, but I'm pretty sure that's on the Todo list. If not, make an issue to the project please.

  3. The computational requirements on the server side are a less than a fraction of the cost what the bots have to spend, literally. A non-issue. This tool is to combat the denial of service that these bots cause by accessing high cost services, such as git blame on gitlab. My phone can do 100k sha256 sums per second (with single thread), you can safely assume any server to outperform this arm chip, so you'd need so much resources to cause denial of service that you might as well overload the server with traffic instead of one sha256 calculation.


And this isn't really comparable to Tor. This is a self hostable service to sit between your web server/cdn and service that is being attacked by mass crawling.

Edit: If you don't like the projects stickers, fork it and remove them. This is open source project.

And Xe who made this project is quite talented programmer. More than likely that you have used some of Xe's services/sites/projects before as well.

[–] [email protected] 3 points 15 hours ago

Xe is insanely talented. If she is who I think she is, then I've watched her speak and her depth of knowledge across computer science topics is insane.

[–] [email protected] 7 points 17 hours ago (1 children)

…you do realize that brute forcing it is the work you use to prove yourself, right? That’s the whole point of PoW

[–] [email protected] 1 points 16 hours ago

True, I should of phrased that better.

The issue is that sha256 is fairly easy to do at scale. Modern high performance hardware is well optimized for it so you could still perform attack with a bunch of GPUs. AI scrapers tend to have a lot of those.

[–] [email protected] 1 points 15 hours ago

I look forward to TOR's PoW coming out for FOSS WAFs