Are we fucked?

A guy had the entirety of Usenet for a ten year period on some tapes. It comes to 18GB when uncompressed. Even at the time it would have been possible for a group to save this. Expensive, but possible.

If we were to "lose" the web we'd be totally screwed and it would all be gone, since people don't even save their own things any more. And with ever larger file formats (4K video, etc.) it's only going to get a whole lot bigger in the future.

Does anyone else worry about the long-term security of all this data we're producing? It seems far too easy to say "Oh, we'll be fine," and hope for the best.

We can always make more porn and shitposts.

storage medium size and transfer rates are increasing too.

Rare Pepes will become even more rare.

Internet archive

>security
You meant safety I guess. Safety is about accidents, security is about malicious intent.

They turned out to be cucks and want to migrate to Canada now because "hurr muh Drumpf regime".

Not fast enough. And that's not what the issue is here. No one is going to have "Facebook" just sat on some shelves, and its massive size would make it impossible to download and store in the meantime.

They're already begging for money. I don't think you should have much faith in their future.

Whether or not they're cucks is irrelevant as long as they keep providing the service that they've been providing.
With the way the U.S. has been going with internet regulations, I'd kind of hope that useful internet tools do find another country to conduct operations from, lest they be raided and dismantled by the alphabet soup.

Wikipedia's been begging for money for years, and they seem to be doing fine.

Wikipedia is significantly smaller than the web archive. In fact I don't even think it's 20GB.

As long as the anime and mangos get backed up that's all I care about

according to their database dumps at en.wikipedia.org/wiki/Wikipedia:Database_download#English-language_Wikipedia the dump of revision logs alongside pages expands to multiple terabytes of strictly text. I'd also assume that Wikipedia's bandwidth costs are significantly higher, even if they don't need as much storage, because of the sheer amount of traffic that they get.

>security is about malicious intent

Security is just safety from a threat, the source doesn't matter. Losing all your shit is a threat, so the word choice is fine.

But you do pose another interesting "what if...?"

If you thought you had any expectation of keeping data on other people's servers, you're a moron.

>pages-articles.xml.bz2 – Current revisions only, no talk or user pages; this is probably what you want, and is approximately 13 GB compressed (expands to over 58 GB when uncompressed)

This is the only thing people would download in a panic, and that there's a torrent for it shows that, like the OP example of Usenet, there is a version out there.

The same can't be said for many of the other major sites.

I think you're missing the point.

We've already lost a shit ton of stuff. Go through your bookmarks (you know, that list of things you said you'd read/reference later) and see how much of it still exists.
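If you actually want to run that bookmark test instead of clicking through by hand, here's a rough sketch in Python. It assumes you've exported your bookmarks to a plain list of URLs; the names `check_url` and `find_dead` are made up for this example. Some servers refuse HEAD requests, so treat a "dead" result as a hint to check manually, not as proof.

```python
import http.client
import urllib.error
import urllib.request


def check_url(url, timeout=10):
    """Return the HTTP status code for url, or None if the request failed outright."""
    try:
        # HEAD avoids downloading the body; some servers answer 405 to it.
        req = urllib.request.Request(
            url, method="HEAD", headers={"User-Agent": "linkrot-check/0.1"}
        )
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404 or 410.
        return err.code
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        # DNS failure, refused connection, timeout, or a malformed URL.
        return None


def find_dead(urls, fetch=check_url):
    """Return (url, status) pairs for every URL that looks dead."""
    dead = []
    for url in urls:
        status = fetch(url)
        if status is None or status >= 400:
            dead.append((url, status))
    return dead


# Example usage (hits the network):
# for url, status in find_dead(["https://example.com/", "https://example.invalid/"]):
#     print(status, url)
```

The `fetch` parameter is only there so the checking logic can be tried out without touching the network.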

You hold a massive archive of conceivably everything, a good amount of which some asshat with money could try to exert ownership rights over, and you face an incoming administration that is unpredictable at best. Don't you want to mitigate as many risks as possible? They've probably got a budget barely enough to keep the lights on. What happens when some corporation with dubious rights decides the archive has something they don't want people to have?

The point is, I save data and you rely on other people to do this for you. Wipe the Internet clean. I don't give a fuck.

duvde discs sealed in tupperware

So how the hell am I supposed to get into PTH?

>the largest library of data and information the human species has ever created (and curated)
>Wipe it clean. I don't give a fuck.

The obvious answer is to have multiple installations, not moving the installation. I mean, what if they do literally lose some of it? I don't mean the data, I mean entire hard drives are missing after the move.
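For what it's worth, checking whether a second installation (or a set of drives that just got moved) still matches the original is not hard. A minimal sketch, assuming both copies are mounted as ordinary directories; `build_manifest` and `compare` are hypothetical names for this example, not anything the archive is known to use.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Map each file's path (relative to root) to its SHA-256 checksum."""
    root = Path(root)
    return {
        path.relative_to(root).as_posix(): sha256_of(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def compare(primary, replica):
    """Return (missing, corrupted): files absent from the replica, and files whose hashes differ."""
    missing = sorted(set(primary) - set(replica))
    corrupted = sorted(
        name for name in primary if name in replica and primary[name] != replica[name]
    )
    return missing, corrupted


# Example usage (paths are placeholders):
# missing, corrupted = compare(build_manifest("/mnt/site_a"), build_manifest("/mnt/site_b"))
```

Build the manifest before the move and again after it: anything in `missing` is your lost hard drive, anything in `corrupted` got damaged along the way.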

>the largest library of data and information the human species has ever created (and curated)
>Wipe it clean. I don't give a fuck.
No it's not. You are a moron aren't you.

Almost nothing on the web today is worth saving.

Are they moving the servers it sits on, or the administrative headquarters?

I have copies of a shitload of free software such as good images of Debian, GRUB, Libreboot stuff, hardware documentation, etc. I don't trust anyone but myself to keep these things safe. If the US government were to somehow try to kill free software, I would distribute it on flash drives. We might lose some stuff like useless YouTube cat videos and stupid news sites but all of the important stuff will still be kept offline by hobbyists like myself.

i wonder how big twitter's dump would be

>No it's not.

It is. Name the bigger ones if it isn't.

>Does any one else worry about the long-term security of all this data we're producing?
No because data that is not saved and properly backed up is worthless

Also, storage is getting cheaper by the year so everything is getting saved on the service end so it can be mined later.

>Entire internet is wiped clean

And nothing of value was lost, IMO.

only informational data is worth saving, like wikis, books and documentaries

This is where I really disagree with the web archive. They just save everything they can and then complain that they can't deal with it. If they had some standards they wouldn't be in dire straits even half as bad as they are. They'd find it easier to preserve things as well as distribute the data.

But instead they worry about playing video games in the browser.

Most of the internet is just low effort content. Why even bother saving it?

this the prices will soar just like i said they would and we will all be rich beyond our wildest dreams

mock me now, but in the future you will know that i was most right

the day internet is lost is the day i kill myself so it doesnt really matter to me

>implying it will
>implying it wont be replaced

that means i will be staying longer then

>implying you will
>implying you won't be replaced

i run this place you idiot. you wouldnt want me dead.

People have been writing books for more than a millennium and you think 25 years of shitposting and porn equals the greatest source of information? Are you 12?

wrong, I'm the one who makes the world, you're just a line of text on a screen

replaced != dead

>a guy had the entirety of usenet for a ten year period
in the time it took to take the tape out and put it on the shelf it was out of date.

Name one library with more (useful) information contained in it than exists on the internet.

You can't, because the internet has all the information in that library and more.

You can fuck off, now.

>he thinks all books in the world have been digitized

most of them have
the shitty short story you wrote as a teenager might not have

Do you seriously not see the difference between "a few libraries scattered around" and "every thing we have at my fingertips"?

Or do you think that the entirety of the books from antiquity onwards were sitting on every bookshelf in the world?

That information will exist with or without the Internet. The information on my computer will exist with or without the Internet. Only your dank memes will disappear because you thought someone else's server would always be there.

Name one book with important information useful to mankind that has not been digitized and made available electronically, or made redundant by another book with that same information already digitized.

I'll wait. Go on.

Probably less than 10% of all the books in the world are worth reading

Yeah but that 10% is different for everyone, so when you take each individual into account it gets close to 100% I reckon.

The book came before the digitizing. If you delete the Internet, that information doesn't disappear.

wanting to read something doesn't make it worth reading
there IS such a thing as universal value

>most
thanks for proving my point
>the shitty short story you wrote as a teenager might not have
i havent written any books

>name a book you have never heard of
thats how retarded you sound

irrelevant

none of us here on this board know what the archives around the world hold (even some of the curators at a few of those don't know everything they have on the shelves). saying that "everything of importance has already been digitized" is fucking naive as shit.

The argument isn't about information disappearing or not. It's about the internet being the greatest (as in size and accessibility) information resource in the world.

>I can't answer the question, because there exists no book with information not already on the internet, and this contradicts my stupid opinion that the internet is not the greatest source of information in the world, the post

It's about OP wetting herself at the thought of Trump deleting the Internet.

first off, i was not the one that initially claimed everything of worth had been digitized so the burden of proof is not on me.
secondly, who are you to judge what is worth saving and not?
thirdly, they found a book they didnt know existed in an archive a few years ago.

you can stop posting now

The Catholic church probably has fuckloads of old books they keep from the public. That's beside the point. If you guys want to save information, save it on your machine. You have about as much control over someone else's server on the Internet as Trump does.

also i never claimed the internet is not a great source of information, just that you and the other people are fucking retarded if you think EVERYTHING is on the internet, because it's not, that this is a fact.

i was going to mention this but im pretty sure the vatican has digitized their collection.

but to say every archive and library in the world has digitized everything they have is just wrong.

Do you really put it past humans to just, as OP said, lose the internet? I don't. And even if it's not that simple (or silly) it's a very delicate creation as well.

There's a million things that could go wrong.

>Do you really put it past humans to just, as OP said, lose the internet?
Do you understand what you're even saying? There are billions of ip addresses and even if a fraction of these take to deleting the information on THEIR servers, all of the information wont simply disappear. My suggestion to you would be to start archiving valuable information if you want it passed down because any control you may think you have over "the internet" you just don't.

>If we were to "lose" the web
That's like imagining if we were to "lose" books.

You can lose yours, I can lose mine, but there's little that could destroy all books everywhere without also killing everyone.

>Name one book with important information useful to mankind
I can't, because no matter which undigitized book I mention you'll disqualify it for not being sufficiently useful to a sufficient number of people.

literally every single library in the world has far more useful information than the internet
ever tried to find out something actually important on the internet? its practically impossible thanks to the near 0 SNR.
50 billion blog posts written by a simpleton with no grasp of any subject beyond the absolute basics. buried in the shit is perhaps one or two pieces of solid gold source material good fucking luck finding them.
the thing about books (non fiction anyway) is that not every cunt can smear their shit on some toiletpaper and then submit it for publishing.

the reason your teacher tells you not to use wikipedia isn't because its wrong or biased or poorly written, its because it only brushes the surface of any topic.

There's a difference between the Internet and the first page of Google

if we "lost" the web, we'd have much bigger problems.. like lots of dead people

hmmmm well i would rather just walk to the library than fuck around on page like 10000 of google search.

Do not agree. If only to make it easy for future historians. Yes, someone in fifty years will be reading your shit posts when researching digital subcultures during the first Digital Revolution.

>Yes someone in fifty years will be reading your shit posts when researching digital subcultures during the first Digital Revolution.

This sounds funny, but not that unrealistic when you think about it

I think we should do a wipe every once in a while. If a site is worth saving, someone will have to save it prior to the wipe. There's too much ancient stuff on there, and it shouldn't get to live forever just because it's on the internet.

signal to noise ratio is getting pretty bad

>mfw collecting rare pepes become the "I should have bought bitcoins" of 2025

all my teachers that have not accepted wikipedia as a source have accepted major printed encyclopedias as a source.

even though they "brush the surface" far less than the online encyclopedia.

I'm pretty sure it's the fact that anybody can edit it and it's always changing. If your teacher goes to check your source, and it's wikipedia, the entry might be completely different and by different authors than it was when you cited it.

We aren't going to do anything because you only have control over your shit.

What are the contents of the Vatican Secret Library? Like hell I'd know even just the book titles that exist in there.

Every day reddit probably produces more content than all the books that have ever been written

Just imagine if he uploaded that.

Most of that data is normie noise and is worth less than literal dog shit.

Only a fraction of data hosted online is relevant to you, and even then you probably don't want all of it.

I literally have every single piece of media and information that I care about on a handful of external drives sitting in a DIY Faraday cage, barring future releases and with the way I archive my shit it all fits onto a single drive, the rest being redundant backups of course.

Everything that isn't in my archives can cease to exist right now and I wouldn't give a single fuck.

This desu, most of it is junk... like this post


Why will you want to save the internet?
Just save the files you have man

>The obvious answer is to have multiple installations, not moving the installation.

I guess reading is hard for you.

>It’s a U.S.-based non-profit and has started to solicit donations to create an “Internet Archive of Canada,” a copy of its current library, which it says will cost millions of dollars.

>I have literally no idea how the internet works: the post

Only the text is what's important tbqf

95% of the video content is retarded trash

No.

Meanwhile, buy some floppies and a floppy drive.

Is the internet even centralized? It seems kind of impossible for the entire internet to just be gone.

Also isnt the NSA basically cataloging everything on the net? Is there even anything that valuable that would be "lost" by deleting the internet?

>thesis on the NEET culture of the millennial generation.
As a guy with a degree in history, I can tell you this will easily happen. Historians write about all kinds of weird shit, and primary sources are king.

I can't wait for them to analyze all of the dank memes and screen caps from Sup Forums.

...They'll probably even look at this thread. Hi future me! :)

what is this? this is the dumbest fucking reply possible.

my new motivation to live is to see the books that come out about this in a few decades time

The web is constantly dying and being reborn.
You think if you upload something its here forever? You're naive.
Every day more and more original websites are getting shut down. The webmaster gets bored, loses his users and doesn't see the point, or simply gets old and dies, and then his relatives don't care about the site (or don't even know about it) and just don't pay the bills.

There are archive sites, but someday even they will be lost, like the first few Sup Forums archives and encyclopedia dramatica.

Eventually everything will be lost. Even if the data remains on some decrepit drive, in a few hundred years no one will be able to read it. How many of you (fuck off audiofags) even have cassette/vhs/8track players still?
If I handed you pic related, would you even know what the fuck it is? Let alone get the data off of it.

Its true, our current history will be lost forever if we dont resort to writing it down in nonelectrical based means and sealing it in caves again.

>No one is going to have "Facebook" just sat on some shelves

well...

by any chance wasnt that bunch of 150 tapes the equivalent of 150 HDD (lets say 2TB) today?

im sure an archive of all memes, gifs, or their webm conversions that takes into account duplicates among funnyjoke 9gag facebook could fit in 3 petabytes. it makes no sense storing profiles and their photos, even those posted publicly into personal profile blog posts (facebook timeline), because you would be storing private irrelevant crap most of the time... so only things posted to facebook Pages timelines should be saved, and those are mostly funnyposting and everyday grievances.

orkut is still archived at google, it had a short run, but a lot of ppl came together in its communities (forums) for any ultra niche subject you could think of, and that makes it historically curious,

unlike facebook's "Pages" timelines, which discourage deep discussion, orkut's communities followed the traditional internet forum / imageboard format; so there's a lot of textual content there, more or less analogous to usenet.

lots of ppl archived orkut when its demise was announced, much like many ppl archive(ed) Sup Forums 8can etc, i seriously doubt anyone is archiving facebook, its just not worth it, its best to just find all the remaining good forums on the deep web and have logged in bots archive that, a lot more effort but isk

>implying social media bullshit is "content"

I mean I still agree with you, but the biggest problem when you boil it down seems to be video, since high quality video as well as the production of content and the re-releasing or saving of old stuff is booming in size without storage being able to keep up. but even then, the original archival versions will always be held by the studio, and film archiving of rare content has been increasing massively as people get more interested in it.

my big worry is that rare, not so popular content that can take up quite a bit of space, such as rare music or anime that is no longer "owned" or looked after by anyone official, will disappear. there is content that only exists on hoarders' hard drives. we already lost (probably) shitloads of rare stuff when megaupload and the other big ddl sites had the squeeze put on them. thankfully private trackers and other such communities are preserving things pretty well.

You're probably the most retarded person in the world. I dug deep for that nugget.

kind of wish the internet would die desu

>We've already lost a shit ton of stuff. Go through your bookmarks (you know, that list of things you said you read/reference later,) and see how much of it still exists.

i know right

i keep telling myself i will research a way to "contextually" rip portion of websites the moment i bookmark them, sometimes you just mass bookmark a dozen articles and you cant even go thru each and save as html+images, let alone the ones that link binaries to download, or are multipage.
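A very rough starting point for the "save it the moment you bookmark it" idea, in Python. This is only a sketch under big assumptions: it grabs the raw HTML of a single URL and nothing else, so images, linked binaries and multipage articles (exactly the painful cases mentioned above) are not handled. `snapshot_name` and `save_snapshot` are hypothetical names, and the `bookmarks` output directory is just a placeholder.

```python
import re
import time
import urllib.request
from pathlib import Path
from urllib.parse import urlparse


def snapshot_name(url, when=None):
    """Build a filename like 20240101T120000Z_example-com-article.html from a URL."""
    when = time.gmtime() if when is None else when
    stamp = time.strftime("%Y%m%dT%H%M%SZ", when)
    parsed = urlparse(url)
    # Collapse everything that is not a letter or digit into single dashes.
    slug = re.sub(r"[^A-Za-z0-9]+", "-", parsed.netloc + parsed.path).strip("-")
    return f"{stamp}_{slug[:80] or 'page'}.html"


def save_snapshot(url, out_dir="bookmarks", timeout=15):
    """Download the raw HTML of url and write it under out_dir; return the file path."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    req = urllib.request.Request(url, headers={"User-Agent": "bookmark-snapshot/0.1"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        body = resp.read()
    target = out / snapshot_name(url)
    target.write_bytes(body)
    return target


# Example usage (hits the network):
# print(save_snapshot("https://example.com/some/article"))
```

The timestamp in the filename means bookmarking the same page twice keeps both versions instead of overwriting the older one.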

the internet is not a single thing you fucking retard
it exists on millions of servers all across the world, many of which have several layers of backups
99.99% of the data on the internet also doesn't matter to anyone

what if the powers that be disconnect the cables, huh?

It doesn't matter, since anything of any substance is recorded in print (scientific discoveries)

Everything else is just shitposting and wouldn't hurt anyone if lost.

what about all the source code? who the fuck prints out source code

>lose the internet?
the internet doesn't exist, it's just a series of tubes beneath the ocean.

and the cloud is just a gazillion HDDs and computers across the globe.

a million things could happen and we just need to remanufacture routers and copper cables across land, and re attach those grids to the light tubes and the internet is back...

this second time around, the commercial internet will interoperate with a myriad of neighborhood networks that were already banding together during the netdiaspora, cross connecting to other independent networks thru multiple points and some tech savvy neighbors just running routers from home and collecting the contribution to electricity by enabling/disabling mac addresses (or some futuristic protocol) depending on money+paper notes dropped in the mailbox.

wanna host something on the Smith's router? just drop an HDD and some extra money for extra electricity by their door and include the secondary crypto keys to prove that namecoin domain ownership. they don't even know who you are

What the fuck are you talking about?


Listen man, even if the majority of internet traffic were to suddenly be destroyed for some reason, it's not like it would set us back to some kind of stone age. The discoveries we've made in the past 3 decades (since the wide adoption of the internet) have been recorded in the doctoral dissertations and theses of those who made those discoveries. We're not going to suddenly step back 500 years because our primary source of backing up our music and porn suddenly stopped existing.

Even if somehow, all books were destroyed at the same time, it wouldn't take us that much longer to build everything back up, since there are plenty of educated/academic people around to help us recreate our archives of knowledge.