I am specifically asking about software and needed libraries, not stuff like Wikipedia or the writings of Ernest Hemmingway.
To keep people from archiving all of github on thousands of shucked external hard drives cobbled together all Frankenstein-y to create a postapocalyptic data center assume a ~1TB storage limitation. Though I’m sure that person exists here on Lemmy somewhere :D
20TB hard drives are around $300/each. 12 gets you there with excellent redundancy built in.
Toss them in one of these and you have 200TB, with redundancy and room to grow.
Not cheap to do, but the above would only run about 5-6k.
Mine is similar to this except it’s a rack mount case with bays that holds 15 drives (using 14 right now, 252tb -36tb for parity). All of my drives are 18tb and were bought refurbished in the 160-200 range depending on where prices were at.
To anyone looking to do this I strongly suggest reading about raidz expansion. You do not need to just go out and buy 15 drives, you can do what I did and get 2-3 drives many years ago then just keep popping in another every time it gets full and/or one dies
I’m at 80% utilization. Next project: disk shelf to add more drives