Internet Archive played crucial role in tracking shady CDC data removals

[email protected]

That's a bit more than my home server can handle. I could maybe take some CDC data, but definitely not the full shebang. It would be neat if someone could segment the data so we could save some more critical things.

[email protected]

Is there a way to distribute it so everyone just has parts of it? Aren't there p2p cloud storage solutions that exist?

[email protected]

Here's how to help them: https://github.com/ArchiveTeam/warrior-dockerfile

[email protected]

A couple years ago I read that Filecoin has teamed up with the internet archive to synchronize the data on the Blockchain. I'm not sure how far they are yet, but it's something that could work if it doesn't turn out to be just crypto hype in the end.

[email protected]

although could have been avoidable, it begs the question who was behind the attacks.

I think we can safely say it was Peelon Shmusk, the worlds worst spy!

[email protected]

When the internet archive was attacked a few months ago we were like "who would be dumb and mean enough to do that?". We have new suspects!

[email protected]

Oh, cool, didn't know about this, throwing it on my home lab now.

[email protected]

The problem is you'd need to split it down to an amount that people would be happy hosting and then host it multiple times in case any node goes offline.

Another comment in the thread says it's likely over 100PB today (100,000 terabytes). I'd say 4 copies (spread over different time zones) is a relatively minimal level of redundancy (people may host on machines that aren't powered all the time), and you'd get a network with the most participants at around the 150gb per node mark.

That comes to nearly 3 million participants needed

Which isn't insurmountable, but also not remotely easy to get going from nothing

[email protected]

That's not the Internet Archive; that's a separate group (ArchiveTeam). They're completely unrelated. They use the Internet Archive for storage but are otherwise completely unrelated. The data archived by Archive Team Warrior does not go into the Wayback Machine.

[email protected]

This comment from 8 months ago says 152PB: https://www.reddit.com/r/DataHoarder/comments/1cu79ke/the_archiveteam_has_a_cost_shameboard_of_the_top/l4om4m6/

[email protected]

These guys seem cool but they're not the archive.org from the op article

agnos.is Forums

Internet Archive played crucial role in tracking shady CDC data removals