... 'size' is really not that important. What matters is the content, it's uniqueness and relative importance.
Libraries, even books (let alone literacy) during antiquity were rare and generally unique, remember, this is before the printed press.
The Library at Alexandria was a 'first-order' source of knowledge and wisdom, a place where 'intellectual luminaries' (i.e. big rabbis of the day) would have studied. Possibly every single book represented a unique bit of knowledge, and each one a historical artifact.
Losing the Library at Alexandria would be like losing 15% of all Universities, professors, all of their papers, textbooks, 15% of authors, playwrights, all of their works, 15% of all written history - basically most or all copies of their works, vanished into thin air, for all of time.
If the Library continued to exist in its fullest form, we may very well have much more of our history pieced together.
The Internet Archive is not even a 'second-order' resource. It's nice to have, and useful particularly for some historical reasons, but if it were to evaporate tomorrow, civilization wouldn't skip a beat.
There is no specialized, arcane knowledge there. Parts of the IA are of course really nice to have, particularly the news which forms a kind of historical record. But old copies of 'HomeDepot.com' and 'McDonalds.com' are just not that important.
We don't use the IA as a source of science, physics, fiction, students don't go to the IA to access authors' works or textbooks on orbital physics.
If the contents of IA were seriously valuable, then quite a number of parties would be interested in maintaining it, first of all, the parties being documented (if HomeDepot.com doesn't care about their 2006 web-site, then maybe it's not so important).
I support the IA and I would hope that it was even a kind of government-sponsored agency, but it's not an 'intellectual foundation'.
(Edit: changed from 25% to 15%, it's just a number obviously, but my point being 'it's some material share of the compendium of knowledge')
At this point in time, there have been lots of niche websites and forums and many have shut down. In some cases, those websites housed actual practical knowledge. Example: I drive a Mazdaspeed3. There was a forum dedicated (partly) to that car, and it went down. Now that it's gone, a lot of information on how to work on it is also gone or much more difficult to come by.
>'size' is really not that important. [...] each one a historical artifact.
The uniqueness of content in the internet archive is significantly greater by virtue of there simply being more unique knowledge at this point. Knowledge which would be lost (often intentionally) if not for efforts such as the internet archive, or that of the Library at Alexandria.
The library made copies of the knowledge for the original owners rather than having the only copy. making every copy most likely non-unique. The value in its indexing is immense especially when you realize that useful information is only useful within a context where it can be applied and cannot be valued outside of it.
>Losing the Library at Alexandria would be like ...
This did not happen.
>If the Library continued to exist in its fullest form, we may very well have much more of our history [...] orbital physics.
Imagine if a similar effort to archive exists today and in a few thousand years we would be able to use it, this is how I see the internet archive. The archive and efforts like it are used as a first source of knowledge by wikipedia, students, developers, researchers, anyone needing to verify a business existing prior to the investor call. If there was not an archive of the Feynman lectures I would know nothing about orbital physics now, if the current hosts are lost I am very very glad that the internet archive has a copy.
>If the contents of IA [...] important).
Personally I would like to know if homedepot existed 14 years ago if I wanted to do business with them and they claimed so, or if they tried to censor anything (that case of the US government having honeypot sites archives with code comes to mind). Things which would be lost if not for the specific functionality you are mentioning. Also there are many many other efforts, unlike you seem to state (e.g. archive.is, ipfs, the-eye.eu).
Perhaps the most powerful argument: https://web.archive.org/web/*/news.ycombinator.com
edit: I have archived our comments for historical context for anyone studying internet discussion in the year 3020 https://web.archive.org/web/20200611105240/https://news.ycom... https://web.archive.org/web/20200611105414/https://news.ycom...
>>>>>>> the IA is not 'knowledge' it's 'information' and most of it is irrelevant to anything - more importantly: All of the important information in the IA is well documented elsewhere. We don't need the IA for much at all.
2) Losing the Library at Alexandria would be like ... This did not happen.
>>>>>>> Of course it did! Lucio Russo / "Forgotten revolution" argues that a large part of the scientific knowledge of the Hellenistic era was lost.
3) "Personally I would like to know if homedepot existed 14 years ago if I wanted to do business with them and they claimed so, or if they tried to censor anything (that case of the US government having honeypot sites archives with code comes to mind)."
>>>>>>> If that's a good example use case - then you've proven my point because there are exactly 0 people in this world using IA to see if 'some business existed 10 years ago in order that they might do business with them due to concerns of honey potting'.
4) "Imagine if a similar effort to archive exists today and in a few thousand years we would be able to use it, this is how I see the internet archive."
>>>>>> Some contents IA might be a little bit useful to a few historians, but beyond that, it will have no meaning.
The IA could be cut down to 1/1000000 it's size, and still, maintain it's historical relevance.
If the IA were vaporized today, it wouldn't really matter that much, there's nothing in there we need, and the important content is saved elsewhere. The NYT has a great archive.
The internet is mostly noise and babble, most of it is not important at all. Some of it is.