Historical programming-language groups disappearing from Google (opens in new tab)

(lwn.net)

737 pointsbeachwood235y ago332 comments

332 comments

204 comments · 48 top-level

jedberg5y ago· 54 in thread

It's funny, when I took a tour of the US Geological Survey, the curator of the collection hated Google (which was just a few blocks away). He said Google is great now, with all their maps, which were far more accurate and had better coverage than the USGS.

But what happens when they get bored with map data and get rid of it?

He had been ordered to turn over all of their historical arial archives for scanning by Google, and then told the USGS would no longer do arial scanning since Google was doing it. But there was no agreement for Google to turn over their arial scans back to the USGS.

At the time we all told him not to worry, Google would never remove data it had collected. Looks like he was a lot smarter than us.

coliveira5y ago

Well, that's the problem with the whole internet. Remember those pages created in the 90s/early 2000s? People thought they were sharing information to the whole world. It turns out that most pages created in the 90s are now inaccessible or have been siloed by big corporations. The fact that we allowed corporations to take over the internet made it an inhospitable place for everyone else without corporate backing.

zrm5y ago

I don't think it's any harder to create a website than it ever was. The problem seems to be that corporations have made it so easy to do it within their silos that people aren't willing to spend ten hours on something they could do in ten minutes, not realizing that they're going to spend a lot more than ten hours creating content which the company will then vaporize at random whenever they feel like it.

quaintdev5y ago

A decade ago, there use to be celebrity websites which had forums, galleries, blogs now it's just Instagram. Hell so many prominent celebrities don't even own a domain name in their name. Also, it's not like the content has improved. Earlier their use to be HQ images in those celeb galleries now the highest resolution image is 1200x1200. The only thing that has improved is how easily a celebrity can reach millions everything else has gone downhill with respect to discussions, forums, galleries, blogs. Most of these are replaced by poor comments section.

It's not just celebrities, so many independent artists are putting up their talent on Instagram and I don't have access to any of it because I need an Instagram account for that. Instagram web version is forcing to sign up if you scroll 1 page down on a profile.

Sometimes I feel like we need to build cutting edge decentralized applications that will burn these walled gardens to the ground. /rant

7 more replies

clusterfish5y ago

A lot of people, clubs and businesses publish their content on Facebooks and Instagrams because those platforms are better for getting your content out to your followers and more people. They are being rational.

Where's the non-proprietory decentralized platform that lets me reach as many people as I can on Facebook? There isn't one.

Why aren't the social functionality of identity / friends / followers / newsfeed / etc. built into browsers in a standardized way?

Facebook is 16 years old. That was a lot of time to figure out an alternative solution, but all we have are experimental projects that rely on adoption that they don't have to be useful.

Corporations aren't going to change how they behave, but it's annoying that us techies are apparently incapable of beating them at our own game.

5 more replies

6ak74rfy5y ago

I went through exactly this thinking recently when I wanted to setup a blog for myself (and migrate an existing one off of WordPress). I tried my best (and I think I succeeded) in ensuring that I am not locked into one vendor, and it was pretty much free.

Someone else mentioned that you can't reach as many people from your silo-ed website as you can if you go through social networks. I found one way you could get best of both worlds - through Medium's import feature[1]. But I don't yet know how effective that is.

Here's a short write-up in case anyone's interested: [2]

[1]: https://help.medium.com/hc/en-us/articles/214550207-Import-a...

[2]: https://ketanvijayvargiya.com/58-setup-blog-and-email-on-cus...

thayne5y ago

I think that's only half of it. The other half is that it's easier for consumers to find content in the silos of the large corporations than content that exists outside of them.

mc325y ago

Yep, it’s even worse. There are some things that don’t have a definitive answer, like many aspects of CoViD. Some pages with what I would categorize as “inquiring” get removed just because they don’t line up with the WHO. This isn’t about questioning vaccines, but rather the unsettled questions around this new disease. Just gets banned... it’s not like I have a stake in that fight _other_ than dismay at private censorship of opinions that don’t toe a given line. It’s rather frightening.

But... many of the same companies will fill your search results or fill affiliate pages with quackery ads just fine...

colonwqbang5y ago

Why do you feel that it was the job of "corporations" to preserve and archive of every page forever?

In my country, all physical books and magazines which are published must be submitted to the government in X copies. The government then keeps an archive.

With webpages, the problem of obtaining X copies never existed. Why couldn't the government have archived webpages like it always did with books?

PaulDavisThe1st5y ago

It is not that it is their job. The problem is the mismatch between the broader public understanding of the lifetime of "a webpage" and reality, when said "webpage" is inside a walled silo (and maybe even when it isn't).

1 more reply

rikroots5y ago

In the UK, this job is handled by the British Library. They have a legal duty to collect annual snapshots of all websites using the .uk TLD https://www.bl.uk/collection-guides/uk-web-archive

metrokoi5y ago

I believe you are misrepresenting the situation. No one expects corporations to archive and preserve all data, especially not data that they are not associated with.

However, if they create a monopoly on that data they have an obligation to preserve it, especially in the case of a corporation outright aquiring data instead of simply "out competing" for data. And as everyone mentions, of course they are in no way legally obligated to do so, but they are by any reasonable standard ethically obligated.

I do think that the government could and should archive data, but there is currently no system in place for doing so and likely will not be for a long time, if ever. Corporations would simply have to maintain the data that they already have.

r3trohack3r5y ago

I'd argue this is a feature not a bug. The internet is a protocol for communication, not archival or retention. Any notion of persistence is owned by nodes in the network. Retrieval from an "archive" over the internet comes in the form of communication. The web introduced hypertext, and a protocol for exchanging hypertext, on top of this communication protocol. But again any notion of persistence of hypertext, and the "links" between hypertext documents, is the responsibility of the nodes in the network.

They were sharing information with the whole world, but in an ephemeral medium.

The web, and internet, is not an inhospitable place for anyone without corporate backing. You can host a somewhat reliable service on a raspberry pi over your home internet connection.

wolco5y ago

You just can't be found. You can self host but unless someone finds you some other way you are excluded.

1 more reply

nullc5y ago

> It turns out that most pages created in the 90s are now inaccessible

Some of that is because search engines have simply stopped returning them in results even though they're still online.

fiddlerwoaroof5y ago

The issue is is that a web page only lasts as long as its funding does: private sites are great, but someone still has to pay for the server and, when they die, it’s probably going to just vanish, unless the internet archive got it.

opportune5y ago

How are big corporations preventing a web server from serving content over HTTP in old-school HTML?

methodin5y ago

Seems like a fundamental truth of Capitalism: privatization and ultimate destruction of anything that can be monetized. Certain things are impossible without money and to make money to have to generate or consume something which leads to a never-ending cycle.

FpUser5y ago

I seem to have a distrust of corps imprinted in my brain since birth and had never fell for their candies/propaganda. All my stuff is always on my own servers with shadowing of course.

kerkeslager5y ago

> He had been ordered to turn over all of their historical arial archives for scanning by Google, and then told the USGS would no longer do arial scanning since Google was doing it. But there was no agreement for Google to turn over their arial scans back to the USGS.

Jeez, that's horrifying. Literally just giving public assets to private corporations.

Polylactic_acid5y ago

Public funded data should be publicly available which includes use by private corporations.

kerkeslager5y ago

Agreed. You seem to be missing the part where this is no longer publicly available because the USGS no longer has the data.

1 more reply

emiliobumachar5y ago

If getting it takes connections or prestige, then yes.

If any entity with a plausible use case could and still can get that data at the cost of the copy, I don't see why not. The whole "copying does not deprive the original owner" meme applies particularly to such public assets.

kerkeslager5y ago

> If any entity with a plausible use case could and still can get that data at the cost of the copy, I don't see why not.

Can you point me to where I can download this data for the cost of a copy? Didn't think so.

1 more reply

blitmap5y ago

I don't like this but if a corporation is a person, they have the same right to it that the rest of the public has.

If the effort to USGS could be quantified in a cost, I'd expect Google to pay USGS to make the public data available?

It does sound awful. I don't know what the right answer is.

kerkeslager5y ago

> I don't like this but if a corporation is a person, they have the same right to it that the rest of the public has.

1. A corporation is not a person. Corporations don't have rights, except inasmuch as the people within the corporation have rights.

2. The problem isn't that Google has access to the data, it's that USGS and the rest of the world no longer have access to the data, except on Google's terms.

1 more reply

xmprt5y ago

Corporations aren't people. I can't get married to Google. If you can point to specific precedence of corporations being given access to certain data on grounds of their personhood then your argument makes sense but just because corporations are considered like people in the context of speech doesn't mean that applies literally everywhere else.

3 more replies

monadic25y ago

Yea most of the wind about “taxpayer dollars being wasted” is just flatulence but this is a straight up robbery.

tingletech5y ago

> But there was no agreement for Google to turn over their arial scans back to the USGS.

That was poor negotiation by USGS Solicitor's Office. Libraries participating in google digitization programs negotiated to keep copies of their scanned materials in the Hathi Trust Digital Library https://www.hathitrust.org

mywittyname5y ago

You act like the Director of the USGS was acting in good faith. It's pretty likely that Eric Schmidt, or similar, already worked things out with high-level officials within the government and the USGS Director was not given any real decision making capabilities.

est315y ago

There are laws for book publishers, requiring that they send copies to your local government's central library. In the US it's the library of congress. Some of the books they don't keep, but they do filter them by which books are important and which aren't. Maybe the same should be done for "viral" posts, such arial scans, and other data deemed important.

johannes12343215y ago

In Germany the national library is also required by law to take care of digital ("körperlose Werke") media.

They are still figuring out what a good way for archival of those is and are quite selective in choice what they archive, but they plan to expand on that

German page: https://www.dnb.de/DE/Sammlungen/DigitaleSammlungen/dgitaleS... English page: https://www.dnb.de/EN/Sammlungen/DigitaleSammlungen/dgitaleS...

wahern5y ago

Electronic works in the U.S. fall within the mandatory deposit statute, but then are excused by Copyright Office administrative rules. However, it seems they've slightly narrowed the electronic works exception (first in 2010, and tentatively for 2020) since the last time I looked: https://www.copyright.gov/rulemaking/ebookdeposit/

tingletech5y ago

I didn't know that LoC has any discretion regarding keeping items mandatorily deposited for copyright registration. Do you have more information on this?

jeffbee5y ago

The USGS is currently in the middle of an 8-year 1.1-billion-dollar program to develop a nationwide digital elevation model from aerial lidar. The data, which is freely accessible, is hosted on AWS. Cute story though. The hackernewses are going to eat it up.

https://www.usgs.gov/core-science-systems/ngp/3dep/3dep-data...

jcrawfordor5y ago

USGS is in the process of collecting that data right now, it's not from the archives, and DEM is different from USGS aerials (which are photographs) and run out of a different USGS office. This is sort of irrelevant.

Making digital data publicly available is pretty new for USGS. Just a few years ago archived aerial imagery had to be ordered by mail and it was a pretty lengthy process. Topo maps (the earlier equivalent of the DEM data to which you refer) were generally ordered on paper as well up to five or so years ago, but they're in a lot more popular use so more third parties got into the business of distributing them. I've relied moderately heavily on both for some of my research and was a very painful process until just recently to get anything older than current. In the meantime, yes, Google had it all at some point, but mostly stopped using it or providing it because they obtained better quality imagery.

Fortunately USGS now has a slippy map for topo and an admittedly rather clunky ESRI query service for aerials.

jeffbee5y ago

USGS has been providing free public access to DEM data for ages. The SRTM has been available via FTP since at least 15 years ago when I first started using it to render hillshaded maps. There's not a secret handshake needed.

https://dds.cr.usgs.gov/srtm/version2_1/

2 more replies

widforss5y ago

I had to order some maps over Antarctica by fax a year ago. The USGS had a functional webshop, but it only served the US, everybody else had to fill out this form including debit card data and send it over. It turned ou my uni actually still had one (1) functional fax machine.

jedberg5y ago

I took the tour 10 years ago. Obviously his objections were heard. I’m glad they listened to the guy.

ponker5y ago

Companies are necessarily managed for the quarter and countries should be managed for the century.

TheOtherHobbes5y ago

The planet should be managed for the centuries, plural.

But we're just not smart enough to understand that, never mind make it happen.

Instead we prefer to cling to the bizarre delusion that billions of individuals with competing interests will somehow spontaneously self-organise into the best of all possible worlds.

elmo2you5y ago

That is indeed pretty much what it says on the tin.

However, to be fair, it has always been the most greedy and self-interested, with already the most disproportionate power to rig the game in their favor, that have been most vocally advocating this system. No surprise there, of course.

What fascinates me is how a majority of people, who certainly do not personally benefit from that system, have been made to believe that they do. Sure, political corruption, cultural indoctrination/propaganda, horrendous general education, and I can think of a few more .. but still I've always been amazed, how it appears to have canceled even basic logic reasoning among so many.

Who knows, maybe one day, it will turn out to not just "correlate" with an addiction to carbs/sugars, of which the country has plenty of problems with too. Junkies have always been easy to manipulate.

Until then, at least it still gives some hope that a growing number of people now realize that this system just doesn't work as it is advertised.

GekkePrutser5y ago

And it's not even true. At least here in Europe (I can't comment on the US as I've never visited), Google Maps is really poor.

It's fine when you travel by car, but when I'm hiking through the hills I'm just walking through an empty square on Google Maps. Volunteer-driven OpenStreetMap is MUCH better. And there the data is actually open and safeguarded.

Governments should support that kind of project instead of corporate privacy-invading playtoys like Google Maps.

wyclif5y ago

In another life, I was a land surveyor and I did a lot of LiDAR work as well as heavy use of USGS data. Almost anybody except the most blinkered in that industry would have seen this coming, I think. It's just one more data point that convinces me that Google should be broken up or at least not allowed to silo previously public categories of data.

parksy5y ago

I had a similar almost to the letter conversation when I did some web work for a much smaller GIS firm back in the day, but wanted to add that in my experience this isn't just a google thing but an issue with governments and outsourcing in general.

Anecdotally, a close relative (and many others in her institute) designed entire curricula of learning modules for a government-owned nationwide technical college, back when online learning was newish, ~20 years ago (I think back when SCORM was fresh). These were tightly integrated into the traditional in-class offerings. A couple of years later a "trim the fat" government slashed internal capabilities and outsourced all "IT" hosting, management, etc.

All of the online learning modules (which would have cost millions in man-hours to develop) were literally handed over as "content" to a company who to this day offers them back to her institute under per-student licenses (that far exceed any "hosting" costs of these basically static resources) over a decade later. This company also profits off licensing to an array of pop-up online "institutes" that don't even approach the pedagogical context needed to ensure quality education outcomes from these resources.

Like a comedy of errors, from time to time some lecturer at her college will want to ask some question about the materials, their boss directs them to the company support (which is a paid service), after the issue escalates through the support tiers and they realise they need the expert knowledge of the author she'll get an email with the question, a process that can take days or weeks when the lecturer could have walked into the office next door and asked her directly, if the company hadn't stripped all author credits from the materials.

If the company decides to shift business models, or goes out of business, or is acquired and scuttled, these assets get blown to the winds.

There's a lot I could say about this situation, but essentially governments in general seem to devalue their assets at taxpayer expense, the IP of these assets could have been better handled rather than just giving it directly to the first company to win the contract all those years ago.

raldi5y ago

> historical arial archives

A font of knowledge

pwdisswordfish25y ago

Is this a valid reason for not using Go?

I am a holdout.

(Not suggesting I am "smarter" than Go users, but I can forsee issues with Go being controlled by Google.)

teddyh5y ago

I will probably never use Go in its current situation.

https://news.ycombinator.com/item?id=8733705

cosmodisk5y ago

I think most of us are young enough to live up to the point where we would look into the mirror asking ourselves what did we do 20 years ago and nobody will really remember because you know,a few bits here,a few bits there,and all it disappeared...

globular-toast5y ago

Not smarter. Wiser.

It sounds awful that Google has the best mapping data in the US. In the UK Google's data is awful, worse than OpenStreetMap and much worse than Ordnance Survey, the national mapping agency.

Spooky235y ago

The funny thing is that this happened already when Google bought DejaNews and broke the interface after a year.

elmo2you5y ago

The underlying problem here might as well be considered a fundamental shortcoming of pure/fundamental capitalism. I make no claims about the value of alternatives, or even if there are any (better ones, that is).

Anything that is (no longer) of commercial value will be "phased out" and dismantled/destroyed. One might still stretch it a bit, by arguing that the commercial value of something can include its future potential value. But I personally know not a single commercial companies that ever choose that over short term cost reductions and "profit optimizations".

Luckily, there are governments who acknowledge this shortcoming and build structures to compensate for it. But when governments decide to leave (almost) everything to commercial markets, then the importance of anything and everything can and will only be measured by it's commercial (contemporary) value/profitability.

People have every right to vote for and support such a system. But then don't complain, when all that you will get is only what such system supports/provides.

irrational5y ago

Isn't killing projects Google's key strength?

TheSpiceIsLife5y ago

Like we have whitewashing and greenwashing, I propose the term:

Googlewashing - to proclaim “Google would never ...”

stcredzero5y ago

He said Google is great now, with all their maps, which were far more accurate and had better coverage than the USGS...But what happens when they get bored with map data and get rid of it?

Looks like he was a lot smarter than us.

If you would've asked me back when Google was new, and we all believed in "Don't be Evil," I would never have thought that Big Tech would end up being the Ministry of Truth and The Memory Hole.

_kp6z5y ago· 28 in thread

Google's handling of these critical archives they were given is pretty abhorrent. The usenet archives should really be made public since there is no business value to them and they don't care about usenet.

neilv5y ago

When Google started, there was maybe an overall altruistic, visionary, principled culture among many pre-Web Internet-y people, and it looked like Google was of that same school of thought.

(This was at the same time that there was a gold rush of IPO plays, hiring anyone who could spell "HTML", and plopping them down in slick office space, Aerons for everyone, and lavish launch parties, with tons of oblivious posturing and self-congratulating. But Google stood out as looking technically smart, at least I believed the "Don't Be Evil", since that was the OG culture, and it seemed a savvy reference to behaviors in industry and awareness of the power that it was clear they would probably have.)

That might be why it wasn't surprising to hear of things like someone entrusting a bunch of old university backup tapes to Google's stewardship.

This has played out with mixed results, and I think Google could be doing much better for humanity and for techie culture.

enneff5y ago

Google didn’t kill Usenet; it was already pretty much dead. Web forums had all but taken their place (and where are their archives now? So much is lost).

If you look at the history, Google basically rescued the data from a collapsing Deja News, and made it available again. A nice gesture, which didn’t serve to benefit Google much in the long term.

If we want to preserve history then we can’t rely on for-profit companies. We need to instead fund non-profits whose specific charter is archival and preservation, like the Internet Archive.

dragonwriter5y ago

> The usenet archives should really be made public

Given the nature of Usenet, they were if anyone wanted them.

wahern5y ago

Various people sent their old tape reels and other backups to Deja News, which compiled everything. But Deja News never made freely available the individual archives or the collection, nor did Google. The oldest stuff is locked away by Google because the only hard copy was destroyed when sent to Deja News. As time wore on most of the remaining fragments that at one point could have been recompiled independently also disappeared.

What Google is doing by refusing to publish the archive or even share it with parties like the Internet Archive is completely unjustifiable and anathema to everything they once stood for.

DaiPlusPlus5y ago

> What Google is doing by refusing to publish the archive or even share it with parties like the Internet Archive is completely unjustifiable and anathema to everything they once stood for.

Couldn't a copyright claim (or something under the GDPR or UK's DPA) be used to regain access to those though?

Just because something is published to a public forum doesn't mean you relinquish your rights.

1 more reply

erik_seaberg5y ago

Google acquired probably the biggest searchable archive, Deja News. What we needed was some kind of self-sustaining org with a strict charter to preserve the archive no matter what.

mjevans5y ago

Archive.org ?

1 more reply

pfortuny5y ago

They were until they were not.

eternalban5y ago

> they don't care about usenet.

They cared enough about to kill it.

HenryKissinger5y ago

Controversial question: Why should we preserve code that no one uses anymore? Why should we not allow some information to be simply lost?

jlokier5y ago

Because it's a cultural artifact, of its time. It's history. And some people would like to be able to read it, or do other things with it.

Personally I'd like to be able to link to my own posts from that time, for when people asked me what I used to do. But I can't find them any more.

These groups are mostly not code. They are conversations, design discussions, ideological discussions, jokes, that sort of thing.

Like what we have now in social media, except back then there was pretty much only Usenet, and it had a very different feel than the current social networks.

They are where things ideas like the smiley, and free and open source software, and utopian ideas of internet culture were developed. All the early internet memes. And of course all the knowledge people shared.

Conducted in public at the time and thought to be archived for the long term.

zxcvbn40385y ago

Wonder what people will think in a hundred years when they read that everyone believed the universe was made up almost entirely of invisible and intangible matter? It'll be some future generation's flat earth joke.

1 more reply

ornxka5y ago

As someone else pointed out, losing information is bad because we can't know what value it might have in the future, only what value it has to us today. A lot of things from the past that we are certain had no value to people at the time (such as literal garbage heaps) are of immense value to historians today in understanding the past and the context within which those "worthless" things existed.

You're right though that a decision will probably have to be made at some point about what to keep and what to toss (how big is YouTube, exactly? Are we really going to keep every video, in its original resolution, forever?), but this is just plaintext, it takes up almost no space. The decision doesn't even have to be made, since it's easy to find the means to store this, so why bother making it? Kicking the can down the road is actually the best decision in this case, since the people of the future will (hopefully) have a clearer understanding about what was important in our own past than we do currently.

johnfn5y ago

Why should we preserve old websites that no one uses? Why bother with historical documentation at all?

It's because, at the time, you don't know what information is going to be important and what is just garbage. Documents that are apparently useless today could become fascinating tomorrow.

ghaff5y ago

No, it's a reasonable question. We're not going to preserve, certainly not in a findable way, every piece of digital flotsam that has ever been summoned into existence. In general, we probably should save what we can of Usenet for historical value as balanced against the fact that the archives are tiny in the scheme of things. They're probably also messy but that's probably OK.

Interestingly, when some people saved a great deal of the Usenet archives pre-Deja News, one of them said something to the effect of they wished they had prioritized saving social discussions and so forth because, by and large, saving discussions about a bug in a long ago version of SunOS probably wasn't very interesting.

nitrogen5y ago

saving discussions about a bug in a long ago version of SunOS probably wasn't very interesting.

Honestly even that sounds pretty fascinating:

It could help someone gather stats on the nature, frequency, and severity of bugs over time and across companies from another angle.

It could provide a fresh perspective on modern OSes by showing how historic OSes did things.

And it might be good material for a course on the history of software engineering practices, showing classes of bugs that have been eliminated, and styles of development and customer support that worked or didn't work.

1 more reply

enneff5y ago

Why not? Our capacity for storage has been increasing exponentially such that yesterday’s data is basically of negligible size compared to what we are producing today. There’s no reason to delete history.

1 more reply

minerjoe5y ago

You assumption "no one uses anymore" is glaringly wrong in this case.

Those archives are full of useful and informative information.

Not everthing changes fast. Common Lisp has been around for 30 years basically unchanged. The discussions back there can be truly informative for today.

It does take time to wade thought it, but people have been collecting (via the google archive, when it existed, sigh) curated lists.

https://www.xach.com/naggum/articles/ https://www.xach.com/rpw3/articles/

jedberg5y ago

For the same reason we don't just tear down the pyramids and build condos there.

There are still interesting things to be learned from ancient artifacts.

joshuamorton5y ago

But we do tear down old condos to build new ones. Should we also endeavor to retain every geocities and myspace page?

And if not, what makes comp.lang more like the pyramids than geocities?

1 more reply

pfortuny5y ago

Do you know about cuneiform? Lots of what is known are just ledgers and exercise books...

Never forget that we do not know the future.

Zenst5y ago

Future digital tourism.

That or risk future archaeologists thinking COBOL was some God of the time and the natives built large metal obelisks in dedicated worship temples.

rolph5y ago

why do mennonites and other such groups use low/deprecated technologies? partially due to religious creed, but also because when the electricity is gone, oil lamps still function, and horses dont need a petrol pump to keep running.

likewise many people are clinging to the local operating system rather than moving to the SAAS model.

so what happens if we lose the oldschool languages and platforms entirely, for whatever reason ?

if TBTF corporations are somehow hobbled or neutralized, we need old hand tools to build a tech newtopia from the rubble. if those tools are destroyed then we are beholden to a system that stands on very thin ice.

Avicebron5y ago

I would add to this that not all forward progress is necessarily good or well thought out. If there is value in an old thing that hasn't been unlocked yet, and it is lost to history, we become collectively worse for wear. Things like Lisp are old and pretty darn cool to have as an option.

I second the need to rebuild from the rubble is often overlooked, especially by corporations driven by profit centered goals.

bordercases5y ago

The thought process and conversations that produced the code give insight into how to more generally produce code of that kind. Typically code currently in use is in continuity with code that was previously used, either as a system dependency or conceptual dependency. So it's still useful to have history around, like it would be to have comments in current code.

sgillen5y ago

Well I think it’s ok in general for some information to be lost, but I think a lot of HN users value this specific information.

quantified5y ago

I’m sad to see that this was downvoted, it’s contains the key questions. I think they have good answers.

1) Eventually, everything will be lost anyway. The original print of King Kong is gone. A fire at Universal Studios wiped out the masters for a lot of music at once https://en.wikipedia.org/wiki/2008_Universal_fire . Floods destroy family photos all the time. But those are examples of the forces of decay, of natural entropy, of error. The Library of Alexandria probably contained a lot of useless crap but also nuggets we’d want to know today. Information is memories, useful information is useful memories, and there’s no compelling REASON to lose it. Other sections of usenet history were wiped out when Google acquired it (a lot of comp.database.olap content I had a hand in) and groups of people just lost a knowledge base.

2) It’s not simply code that no one uses anymore. It’s a knowledge base on how and why, debates over constructs and usage that are useful beyond code-sharing snippets a la Stack Overflow.

3) There is an argument for letting some information get lost or at least super-obscure, but it’s hard to see this being a good example. Tide Pod Challenge videos come to mind. GDPR and right to be forgotten mandate something akin to information loss.

4) I posted this elsewhere but I’ll share here too: there was a comment made on the original article about preserving prior art for IP (patent) purposes. That alone is in the public interest. Irrelevant to your questions in general, but pertinent to each of them in this case.

JetSpiegel5y ago

It belongs in a museum!

DoctorNick5y ago· 17 in thread

It's becoming clear to me that Google has become a far, far worse monopoly than Microsoft ever was. Microsoft just controlled our computers; Google controls our access to history.

nabla95y ago

Google is becoming worse monopoly trough natural evolution of it's core business. It seems more offhand way. Network effects and economies of scale. Microsoft monopoly grew by planning and plotting. Bill Gates had genuinely sinister motivations and used deception and dirty play.

To fix problems caused by Google, you need to change the principles of competition law. Microsoft was knowingly doing lots of stuff that violated laws. It was just very hard to prove it.

PaulStatezny5y ago

Do you have any links to evidence of these statements about Microsoft? I'd be interested to read more.

nabla95y ago

It's kind of weird to get this question when you lived it and there seems to be relatively little to Google.

I mean, it was all in the news, trade magazines, business journals. Blackmailing OEM's, intentionally breaking things and making them incompatible. At least the legal battles are documented somewhere and Wikipedia has something about them, but they were just the tip of the iceberg.

https://en.wikipedia.org/wiki/Microsoft_litigation

https://en.wikipedia.org/wiki/United_States_v._Microsoft_Cor....

https://en.wikipedia.org/wiki/Browser_wars

There must be book somewhere.

Dan Gilmour's articles in San Jose Mercury news from 90's should be somewhere.

Basically small software startups had to have Microsoft Strategy. They had to find way to stay out of Microsoft radar or MS would steal their work, their developers or block them. You sue them like Stack did and MS just stalls few years and pays few millions in damages. It was worth of losing in court to protect monopoly.

Big OEM's like Dell had to do what MS said or MS would up their price. It was straight blackmail from monopoly position.

DoctorNick5y ago

Yeah, that's the main reason that there isn't as much push back to the control that Google is starting to exert over all of us.

ajuc5y ago

How evil you are is a function of how much power you have and for how long.

beagle35y ago

Potentially far worse - but Google did not yet stop progress for 10 years in multiple fields the way MS did.

They sucked the air out of advertising (in cooperation with Facebook) leaving none for others. But I consider that a small loss.

Microsoft did that for operating systems, productivity software, stalled the web with IE6, and more.

Google is capable of much more damage, for sure. But they haven’t done that damage just yet.

skinkestek5y ago

> Google is capable of much more damage, for sure. But they haven’t done that damage just yet.

That is changing extremely fast.

hysan5y ago

Disagree in that they’ve already done plenty of damage.

Easiest example is with RSS - entered the RSS Reader market for free and at a loss and effectively killed competition because you cannot compete with that. Then subsequently killed Google Reader. This chain of moves essentially drove RSS to being obsolete which in turn made everyone far more reliant on Google and social media.

Now extend this to other products that they’ve started for free and subsequently killed. It’s not the same as embrace, extend, extinguish, but the result is the same. You kill off competition and stunt progress.

beagle35y ago

I don’t think RSS is a good example. Everyone I know who used google reader switched to a different RSS reader;

It’s mostly that RSS isn’t monetizable as easily as web pages. I think FB and Twitter dropping their feeds had a more significant effect; regardless, RSS was always niche.

2 more replies

_lbaq5y ago

> Google did not yet stop progress for 10 years in multiple fields the way MS did.

I beg the difference, Gmail have not changed much since I signed up 16 year:ish etc

They are all the same, as soon as competition goes away, this happens.

pbhjpbhj5y ago

But Gmail is interoperable with other mail systems and they didn't create incompatible extensions to email (AFAIAA); that's quite different to how IE6 was.

If Gmail required emails themselves to be in a special format that broke other MUA and IE6 wouldn't render standards compliant emails in a way you could read. That would be analogous to what IE6 was up to.

2 more replies

anchpop5y ago

I'm still upset with them for killing Inbox

2 more replies

moomin5y ago

I dunno, I look out at the world and think that maybe making journalism unprofitable may have had some negative effects that are a bit bigger than web standards not advancing that fast?

1 more reply

Lammy5y ago

> stalled the web with IE6

By inventing XMLHTTPRequest?

WorldMaker5y ago

> Microsoft did that for operating systems, productivity software, stalled the web with IE6

Android, GSuite, Chrome

beagle35y ago

Google are smart enough to maintain a duopoly (iOS, Office/365, Safari) Whereas Microsoft tried to kill all competitors (and all too often succeeded). That’s a huge difference.

1 more reply

Stubb5y ago

And each other.

none102875y ago· 9 in thread

Google has bought dejanews and has profited immensely from open source and open information.

So I do think they have an obligation either a) to make the whole archive available for anyone or b) maintain it properly.

Properly means restoring the fast UI from around 2004.

imglorp5y ago

If you found a human at Google instead of a bot, it would probably say their only obligation is to their shareholders.

It's probably not a good idea to depend on a public company to steward an important community.

Does the Internet Archive have copies of all the old stuff at least?

lstodd5y ago

Their only obligation, if we take for granted that there are any humans left at Google, is keeping the aforementioned bots powered.

Which is sad, but expected.

dependenttypes5y ago

There are quite a few humans at Google, both in HN and at twitter. Sadly all of them that I talked with seemed like people that I would not want to interact with again.

1 more reply

zentiggr5y ago

Wait, Google feels any obligations at all? I thought they only made decisions based on what's most likely to maximize their growth?

specialist5y ago

"... their only obligation is to their shareholders."

That'd be an improvement.

Page & Brin retain controlling interest, despite their minority stake.

nine_k5y ago

How did it profit from the Usenet archives? Genuinely curious.

goatinaboat5y ago

How did it profit from the Usenet archives? Genuinely curious.

Dejanews was the seed material for Google Groups, any profit derived from that (ads) was from content posted to Usenet by people who never intended for it to be used for that.

joshuamorton5y ago

Groups doesn't (and didn't ever?) Show ads as far as I know. So you're reaching for second or third order effects at best.

1 more reply

microtherion5y ago

I remember how awesome the initial version of the Google usenet archive was. It's horrifying how much they have let the UX deteriorate.

icheishvili5y ago· 7 in thread

This type of behavior is why I can never consider GCP. How many people have been burned at this point by Google randomly shutting down something they rely on?

john-shaffer5y ago

I've had two Google accounts shut down in the last six months with no explanation. There is no appeal. The consumer services I've used (Feed Reader, Play Music) have been shut down, and the cloud service I was most interested in was luckily shut down before I was able to use it. (They used to have a service to resize & manipulate images in Blob Storage. I found a good AWS alternative[1] instead). I cannot rely on Google for anything at all, and definitely not for something as important as cloud services.

[1] https://github.com/awslabs/serverless-image-handler

firebaze5y ago

Are there any indications to you why your accounts got shut down? Any pattern you noticed?

I - as most of us - have a personal google account, and our company uses a google business account. While I'm following news regarding google cancelling accounts at will, I fail to notice a reliable pattern: (alleged) fraud and other illegal stuff seems to comprise a good part of it, but at most 30-50%.

john-shaffer5y ago

No, there is no pattern. The last one happened when I got a new Android phone. I logged in on my work account and my personal account, and the work account got suspended. It said "suspicious app", but the only app I used it with was Google Meet. The personal account was used for much more, but didn't get suspended. I half suspect that they deliberately have false alarms so they can act like they're more secure, but it's more likely just a horrible, unaccountable AI.

I treat all Google accounts as throwaways now and don't use the work email at all because I want to know that I can actually receive emails that are sent to me. That's a huge problem even without randomly losing access, because their spam filter has a ton of false positives and those emails don't get forwarded to my real address.

3 more replies

hobofan5y ago

> Play Music

Play Music has not been shut down (yet), and you can transfer everything to Youtube Music, which is available at the same price (and in my opinon a superior product).

john-shaffer5y ago

Some randomly selected people can transfer everything to YouTube Music. I can't, and it may be months before Google would allow me to. It's exactly that kind of treatment that makes me feel like Google has zero respect for its customers.

Spotify is generally better than Play Music though, so it was for the best in the end.

Gibbon15y ago

Google Achilles heel is they have two businesses

a) Spy on people and sell the data to advertisers.

b) Use that data to directly push ads

That's basically incompatible with b2b services. Or consumer services. As a customer you're judged by how valuable the data they are collecting on you is. Which is less than a support call costs. That bleeds into every facet of their business. As such even if you pay them money you get the same treatment because they can't think any different.

rsa255195y ago

> sell the data to advertisers

Do they?

2 more replies

aidenn05y ago· 7 in thread

Anyone know if anyone not google has newsgroup archives publicly accessible (The Internet Archive maybe?)

rikroots5y ago

I found this Usenet Historical Collection link - https://archive.org/details/usenethistorical - in a previous HN thread (https://news.ycombinator.com/item?id=16667796).

I have no idea how useful the collection may prove to be. I found 'comp' but it doesn't offer a webpage view, just a link to download a file. https://archive.org/details/usenet-comp

u801e5y ago

Maybe someone could set up a public inbox[1] instance that allows access to those groups either via HTTP or NNTP.

[1] https://public-inbox.org/README.html

bensw5y ago

It should be the full archive.

eej715y ago

https://www.eternal-september.org/

I think you have to register. Not sure how much history is there.

dependenttypes5y ago

A lot of posts are missing from this one.

u801e5y ago

Most free and ISP based usenet feeds had a lot of missing posts, especially since they allowed older posts to expire. Even the commercial usenet providers only started their archives about 12 years ago.

avodonosov5y ago

https://www.xach.com/naggum/articles/notes.html

kazinator5y ago· 6 in thread

The vast majority of the spam content is injected into these newsgroups via Google Groups itself, and is not even seen on other NNTP servers.

Blocking posting access to these newsgroups from GG is generally a good thing for those newsgroups.

Not being able to search the archive is the unfortunate collateral damage though. Google is not obliged to provide a Usenet archive, I suppose.

Formerly obtained deep links to the content also do not work!

If you formely cited a comp.lang.lisp article by giving a direct link into Google Groups, people navigating it now get a permission error.

dependenttypes5y ago

What would be a good free NNTP server or NNTP archive?

giancarlostoro5y ago

The D programming language forums work as a NNTP server as well as web forums. I have in the past downloaded all content from the forum allowing me to have fully offline archives of threads. This is so underrated. I think NNTP could make forums much more superior although it feels like there arent many clients springing up AFAICT.

jcranmer5y ago

Adding some new NNTP features to Thunderbird was my introduction to open-source software and ultimately led me to being one of the primary maintainers.

NNTP is a wonderful protocol, arguably the simplest of the 4 mailnews protocols (IMAP, POP, SMTP, and NNTP). While it seems to share the same basic format as RFC822 messages, it actually tends to avoid some of the more inane issues with the RFC822 formatting (generally prohibiting comments and whitespace folding).

Unfortunately, the internet by the early 2000s started turning more and more into an HTTP(S)-only zone. Usenet itself hemorrhaged its population base, especially as ISPs shut down their instances (e.g., because someone found one child porn instance somewhere in alt.binaries.*).

4 more replies

kazinator5y ago

What you can do with NNTP is run a local NNTP caching server. Then connect to that server instead of the real one. Your caching server can retain articles as long as you want; much longer than the upstream server.

(Though mere long article retention is not necessarily the best archive interface, of course.)

Disclaimer: I'm not well-versed in the solutions in this space. Maybe there is some NNTP cacher out there that also has a web archive interface into it or whatever.

WalterBright5y ago

Yes, and I have 100% of the D newsgroups archived back to the very first post. Anyone can get them from the D NNTP server. I also wrote a program to create static web pages from them:

https://github.com/DigitalMars/ngArchiver

and the generated pages:

https://digitalmars.com/d/archives/digitalmars/D/index.html

When we were working on the history of the D programming language paper, this was an invaluable resource.

1 more reply

kazinator5y ago

I've been using the NNTP server provided by https://www.aioe.org/ for quite a few years.

There is also https://www.eternal-september.org/ which I used.

AOIE requires no authentication. The Eternal September server requires account registration via the web site; then you use an authenticated NNTP connection.

There are other servers out there.

These sites do not provide any archive.

bawana5y ago· 4 in thread

is google sinking? Between their mothballing/deletion of services and the obnoxious signup ads on youtube. I am wondering what is going on?

wegs5y ago

It's not doing so hot:

1) Hiring standards have drifted downwards over the past 15 years. Google used to be super-elite, compact, do-no-evil, massive-profit-per-employee. It's now a 140,000 person organization, and at that scale, standards just aren't high. You have a team of dozens of incompetent people doing what one person used to do.

2) With COVID19, ad revenues have crashed. It's not clear the impact on Google.

3) The smart, ethical folks on top (folks like Larry, Sergei, and Eric) are gone, and replaced with professional managers. They were smart to pick an internal CEO, but most of their executive team comes from places like Microsoft, Oracle, or Morgan. Having known a number of professional executives, the key skill is climbing executive ladders and moving into positions of power, not running successful companies.

4) Their products are increasingly starting to crash-and-burn, especially in B2B. Their culture relies on automated systems over people, and their automated systems have taken down tons of mission-critical businesses. Automated works well at 1000 people supporting 7 billion in B2C (small elite team model), and not so well for a massive, 100k person company.

5) I've switched mostly to non-Google products because they're better for what I need. AOL was massive too at one point. Losing the tech edge is not good. I still use gmail.

On the other hand, their revenues have continued to rise exponentially since they started. So perhaps they're doing fine?

jcrawfordor5y ago

The cases of IBM and more arguably Microsoft and Oracle tell us that if you reach a certain scale, you can continue to coast for a very, very long time after you lose relevance. Microsoft might be an example of having the glide time to regain it depending on how Windows 10 and Azure go over years, but it's far easier for a large corporation to spend billions defending their shrinking turf than to make the big changes usually required to regain it. The problem is that defending your shrinking turf can show positive revenue numbers for quite some time, so it all looks great while it happens...

gumby5y ago

Basically neglect and boredom (I say this not an an accusatory way). They have a huge gusher of cash that comes in pretty much regardless of what they do, so there is no penalty if their focus slips. You see this in other companies with this “problem” like Valve. This also happens with monopolies— note that standard oil made more money after being broken up than before that happened.

In google’s case you can see this boredom on Android, on the number of products announced and casually killed (they might be excellent standalone products but can’t move the needle on earnings for the benefit of Wall Street so why bother?).

Contrast this to early Intel in the Grove era: they were on top of the world with the memory business so they pivoted to something else. Google has had the same two products for almost 20 years. The later Intel has been more like that.

Another contrast: they don’t know what to do about the advertising downturn, so are cutting back on hiring and such, while FB is trying to double down.

tmpz225y ago

No they've just secured their kingdom enough that they can do whatever they want.

userbinator5y ago· 3 in thread

One thing that's become extremely clear to me over the last decade or so is that almost all tech companies simply do not care about the past, and I suspect at least part of that is so their narrative of progress can be subjected to fewer challenges from those who look back and compare.

Also, and this may be a bit of a tangential point, but the "deny the past because it has something bad" that Google has effectively done here is uncomfortably close to the set of recent and far more political events.

SyneRyder5y ago

> do not care about the past...

You just reminded me of a quote from an electronic music documentary 25 years ago. One of the Detroit techno artists insisted on taking the filmmakers to a historic theatre that had been left to crumble & turned into a car park:

"In America especially, nobody tends to care about these kinds of things. People in America tend to let this shit just die, let it go. No respect for the history. I, being a techno, electronic, high-tech futurist musician, I totally believe in the future! But as well, I believe in a historic and well kept past. I believe there are some things that are important. Now, maybe this is more important like this, because in this atmosphere, you can realize how much people don't care, how much they don't respect. And it can make you realize how much you should respect."

- Derrick May, DJ/Composer, Universal Techno (1996)

https://youtube.com/watch?v=tdox6H7FJBU&t=955s

The segment starts at 16:00 in the video and is about 2 minutes long.

Lammy5y ago

I don't think it's quite as simple as "Americans don't care about the past" when discussing cities like Detroit. The actual reason those places were left to rot is a lot worse imo, and it's the same reason that led to San Francisco becoming such a (cheap) haven for LGBT people / artists / etc in the 1970s and '80s: http://cornersideyard.blogspot.com/2020/06/repost-personal-s...

jolmg5y ago

> almost all tech companies simply do not care about the past

You may be surprised that it's not just companies. It's not hard to find people who think it's better for old stuff to just be deleted.

WoodenChair5y ago· 3 in thread

I read the article and I read the threads here, and maybe I missed it—but why did these groups disappear? Were they banned due to bad words or a mistaken spam filter?

DanBC5y ago

Here's what I get:

https://groups.google.com/forum/#!forum/comp.lang.forth

> Banned Content Warning

> The group that you are attempting to view (comp.lang.forth) has been identified as containing spam, malware or other malicious content. Content in this group is now limited to view-only mode for those with access.

> Group owners can request an appeal after they have taken steps to clean up potentially offensive content in the forum. For more information about content policies on Google Groups, please see our Help Centre article on abuse and our Terms of Service.

There's no content available for me.

jjgreen5y ago

Forth is pretty grim, but I wouldn't go that far ...

ngcc_hk5y ago

Is there means you access and archive it or is too late?

totalforge5y ago· 3 in thread

SELF FOOT SHOOT DUP

astrobe_5y ago

Actually, what I saw on comp.lang.forth the last few times I checked it (coincidentally, I tried yesterday) makes the news not really surprising.

Aside from the spam, it gradually switched from passionate but respectful debates to name calling and plain insults from newbies to what remained of the veterans.

One could read very long arguments between Elizabeth D. Rather, CEO at that time of Forth, Inc. which she founded with C. Moore somewhere in the 70ies, and Jeff Fox (RIP), who was working at that time with Moore; Moore left his first company to pursue its adventures in hardware, making different "Forth processors", which eventually led to the RTX2000 which powered, notably, the Rosetta probe.

zentiggr5y ago

Or Factor style,

[ SELF FOOT SHOOT ] 1000 REP

DonHopkins5y ago

BEGIN ME FUCK AGAIN

jeffbee5y ago· 2 in thread

The fact that nobody had enough fucks to give to archive these groups tells you everything you need to know about decentralized peer-to-peer proof-of-work blockchain nerd hobbies. This content exists on a completely open peer-to-peer content distribution network and here you are whining that one company -- the company that already rescued this archive in a midnight U-Haul run 20 years ago -- failed to archive it.

dabockster5y ago

Seriously! I have the same issue with a lot of modern online communities/projects too. They all assume whatever platform they're currently publishing on will be there forever.

Brb archiving my Twitter posts

perl4ever5y ago

>The fact that nobody had enough fucks to give to archive these groups

Well, you assume. Maybe it was just decentralized enough you haven't heard about it.

summerlight5y ago· 2 in thread

https://www.lumendatabase.org/notices/search?utf8=%E2%9C%93&...

Looks like there has been (likely automated, nearly all of them are the same Italian phrase) mechanical legal complaints and it probably caused this instance of automated blocking going wild.

As an engineer I can understand the desire to automate everything, but please at least have some heuristics to detect this kind of easy-to-detect mechanical behavior before giving the model a full authority to block anyone it doesn't like.

__void5y ago

Okay, I did some research, and I think I figured out what caused these usenet group banning.

A Genoese lawyer has been a victim of harassing and heavy doxing for some time, you can find many twitter accounts accusing him of paedophilia in cahoots with epstein, berlusconi, the pope and so on (no, I'm not kidding; clearly the stalker has obvious mental sanity problems).

The stalker is very prolific and is wallpapering the internet with his copy-paste-accouse in every corner, from newspaper comments to ancient forums to usenet. The lawyer report and ask for removation where he can but also he does not seem very worried because it seems that this issue goes on from two years ...

I don't think I can say the name of the subjects in question but in any case I'm archiving the harassment accounts before proceeding with the report, then I'll try to get in touch with the lawyer and see if he can request a new, less "coarse" censorship.

__void5y ago

I am Italian and this is very interesting: all the requests were made with the topic "Stalking Diffamation Illegal processing of personal data" but this (https://www.lumendatabase.org/notices/21395773) is simply fantastic. It seem that a fool with persecution mania has reported half usenet and the bots was auto-triggered...

haecceity5y ago· 2 in thread

So Google Groups archives usenet stuff? Where are the usenet stuff hosted originally? How do I connect to it without Google Groups?

ghaff5y ago

The Internet is a distributed system. Usenet was never centrally hosted anywhere AFAIK. It was scattered around lots of individual systems. You'd have to look up the detailed history but Deja News brought together what it could at one point. It was subsequently purchased by Google and it was folded into Google Groups.

icedchai5y ago

Back in the old days (mid 90's and earlier), most universities and large corporations had their own Usenet servers. These peered with other servers, either over the Internet with NNTP, or through older protocols like UUCP using modems.

I had a UUCP news feed from a local internet provider when I was in high school, back in 1993 or so.

DonHopkins5y ago· 2 in thread

Since when were Forth and Lisp historical programming languages??! People still use them. HARUMPH!

velosol5y ago

If it makes you feel better, comp.lang.python is also blocked:

https://groups.google.com/g/comp.lang.python/

DonHopkins5y ago

No, because I like Python too, but it would make me feel much better if comp.lang.perl was blocked.

synack5y ago· 1 in thread

Just recently I collected all of the archives of comp.lang.ada I could find and imported them into a public-inbox repository. There's a gap around 1992 that I couldn't find a copy of, but it's otherwise complete. It took a few days to get everything into the right format and get SpamAssassin dialed in, but it would certainly be possible to do this for the other comp.* groups if one had the patience.

https://archive.legitdata.co/

https://archive.legitdata.co/comp.lang.ada/

https://public-inbox.org/README.html

sneeuwpopsneeuw5y ago

I would personally very much appreciate it if the ada recources could be placed or archived again on the internet. Lately I had the feeling even books where a better option for finding information about the language.

rdiddly5y ago· 1 in thread

Either those Usenet groups are not part of the world, or they don't consist of information, or Google just failed at "organizing the world's information."

StavrosK5y ago

Google has definitely failed. Finding anything that's not frecent is basically impossible.

jolmg5y ago· 1 in thread

> since there is no other comprehensive archive after Google's purchase of Dejanews around 20 years ago

Was I naive in thinking that The Internet Archive would have long archived this type of thing?

foresto5y ago

The Internet Archive is younger than Deja News. Someone would have had to provide the data. Did they?

If you want to look, you might start here: https://archive.org/details/usenet

1 more reply

LockAndLol5y ago· 1 in thread

Why are people even relying on Google to keep any product alive? It's a business, not a charity. They don't do a single thing out of good will. It always has the goal of getting money in the short or long term. Knowing their quarterly obligations to shareholders, that's probably short term.

These groups should be putting more effort into federalisation and decentralisation. Make it possible to store all of this data in a distributed fashion and stop relying on a central authority for archiving purposes.

nabla95y ago

Those groups are running on decentralized system and open protocol https://en.wikipedia.org/wiki/Usenet

The problem is that there is no other searchable archives.

fizixer5y ago· 1 in thread

Can anyone tell me how Google got hold of the whole usenet (I know it was like 15-20 years ago) which looks to me like a community service kinda thing.

Like when Google decided it's going to host comp.lang.c, can there be only one comp.lang.c on the internet, or can someone else start hosting comp.lang.c as well?

rjsw5y ago

That isn't how it works, usenet is distributed, you can still access it using non-Google servers.

ipunchghosts5y ago· 1 in thread

i would like to find the quickbasic archives. anyone know how i can get them?

DanBC5y ago

https://groups.google.com/forum/#!forum/microsoft.public.bas...

Not safe for work!!!

gnabgib5y ago· 1 in thread

This is editorialized (actual title: "Some Usenet groups suspended in Goggle Groups"), or on LWN[1] "Historical programming-language groups disappearing from Google" (basically the same content)

[1]: https://lwn.net/Articles/827233/

dang5y ago

Ok, we've changed to that from https://support.google.com/accounts/thread/61391913?hl=en. Thanks!

Animats5y ago

"He who controls the present controls the past. He who controls the past controls the future" - Orwell, "1984"

fmajid5y ago

> Usenet predates Google's spam handling tools

In fact Usenet predates spam itself, since the first spam (Canter & Siegel) was on Usenet itself in 1994 (I was there).

CrankyBear5y ago

No, no, no. These groups and other Usenet groups archives must be preserved. They're our history.

imhoguy5y ago

Anyone looking for a hobby? It is time to become a data hoarder https://www.reddit.com/r/DataHoarder/

msie5y ago

WTF Google? Are you now so full of young programmers who have no respect for programming history? You’ve lost all greek cred that’s for sure.

mark_l_watson5y ago

Too many people and companies don’t appreciate culture enough. Maintaining a cultural record should apparently not be left to just one company.

Thanks for posting this, it reminded me to donate again to archive.org, which I just did.

I use ‘culture’ to include anything creative, anything that we experience as humans. Everything should be preserved, schools should be well funded, as should the arts.

lkirk5y ago

Is this something that the internet archive would preserve?

avodonosov5y ago

There is a comp.lang.lisp archive published in 2009.

> In 2009, Ron Garret published a 700MB archive file of all of comp.lang.lisp

https://www.xach.com/naggum/articles/notes.html

rurban5y ago

Ridiculous. They are blaming missing moderators, but only Google would be able to solve the spam problem. They open now these old forums, and Gmail is mostly spam free. Now you cannot even browse the archives. Where is the internet police when you need them.

zxcvbn40385y ago

For a long time I've wanted to revisit some the old Usenet stuff. I knew someone in the who ran a commercial usenet feed service in the early 90s and their whole setup depended heavily on low level backplane configuration, number of spindles, disk rotation speed, etc. - a lot of details that AWS hides from most of us. Using everything I've learned about distributed systems in the last thirty years I bet I could build a really awesome news feed today.

Of course the downside of Usenet was most people expected conversations to disappear after a couple weeks or a month but there was always some jerk that kept everything and refused to delete anything.

cptnapalm5y ago

I was learning C, once upon a time, and had a bug that I couldn't figure out. It worked fine on Linux/x86, but was wrong on Solaris/sparc64. Deep Google diving found a newsgroup post from 1992 or so with a very similar problem; it was an endian problem. My search-fu may have been weak, but an old newsgroup post that helped me solve my problem, not stackoverflow or any other site.

NewEntryHN5y ago

Either this archive exists elsewhere, either now is not the proper time for panic -- it was when Google became sole owner of this archive.

smsm425y ago

I think everybody should have learned the lesson now - do not trust Google - or any other major megacorp, but especially Google - to preserve any data for longer that they are contractually obliged to. If there needs to be historic preservation, it should be done by independent organization specifically created for that purpose.

Arjuna1445y ago

They are really shooting their own feet which such moves. They confirm, validate and strengthen the already existing trend to avoid vendor lock in at all cost and move to open, possibly self-hosted and export friendly platforms!

This is really bad marketing

jolmg5y ago

> Perhaps Google can be convinced to restore the content

The support ticket was deleted, so I guess not.

ryanmarsh5y ago

Thank god. I said some really dumb shit on those lists in my youth that I regret.

grappler5y ago

This kind of thing makes it really easy to get interested, and stay interested, in decentralization tech.

Once you see things in this light, the new flavor of the month online service just doesn't hold any allure.

quantified5y ago

(Repeating one of the comments from the post):

> Has anyone (EFF?) considered the aspect of destroying evidence of prior art in the public domain?

I think there’s a case to be made for stewardship of these groups for that reason.

Havoc5y ago

I'm hearing a fair bit of chatter in SEO circles about google de-indexing pages so this certainly rings true.

I guess there was this unjustified assumption that google only adds & never subtracts.

hosh5y ago

Maybe it is something that a non-profit dedicated towards preserving knowledge and internet content (such as Internet Archive) should be handling anyways.

bawolff5y ago

Maybe these types of historical archives can be turned over to internet archive. I trust them a lot more than google for this.

Igelau5y ago

If an AI decided to shut off comp.lang.lisp, I'd say it's officially too late to solve the Alignment Problem.

photon-torpedo5y ago

Guess comp.lang.lisp has too many posts with (((code))) in them... ;)

ZinniaZirconium5y ago

alt.sex is still there and you don't get an adult content warning unless you choose the desktop version.

Ijumfs5y ago

It was a terrible idea to entrust ANYTHING to Google.

Time to de-Google the whole Web.

staycoolboy5y ago

On the plus side, evidence of my awful usenet etiquette from the late 80's is disappearing with some of these groups.

j / k navigate · click thread line to collapse

332 comments

204 comments · 48 top-level

jedberg5y ago· 54 in thread

But what happens when they get bored with map data and get rid of it?

At the time we all told him not to worry, Google would never remove data it had collected. Looks like he was a lot smarter than us.

coliveira5y ago

zrm5y ago

quaintdev5y ago

Sometimes I feel like we need to build cutting edge decentralized applications that will burn these walled gardens to the ground. /rant

7 more replies

clusterfish5y ago

Where's the non-proprietory decentralized platform that lets me reach as many people as I can on Facebook? There isn't one.

Why aren't the social functionality of identity / friends / followers / newsfeed / etc. built into browsers in a standardized way?

Facebook is 16 years old. That was a lot of time to figure out an alternative solution, but all we have are experimental projects that rely on adoption that they don't have to be useful.

Corporations aren't going to change how they behave, but it's annoying that us techies are apparently incapable of beating them at our own game.

5 more replies

6ak74rfy5y ago

Here's a short write-up in case anyone's interested: [2]

[1]: https://help.medium.com/hc/en-us/articles/214550207-Import-a...

[2]: https://ketanvijayvargiya.com/58-setup-blog-and-email-on-cus...

thayne5y ago

I think that's only half of it. The other half is that it's easier for consumers to find content in the silos of the large corporations than content that exists outside of them.

mc325y ago

But... many of the same companies will fill your search results or fill affiliate pages with quackery ads just fine...

colonwqbang5y ago

Why do you feel that it was the job of "corporations" to preserve and archive of every page forever?

In my country, all physical books and magazines which are published must be submitted to the government in X copies. The government then keeps an archive.

With webpages, the problem of obtaining X copies never existed. Why couldn't the government have archived webpages like it always did with books?

PaulDavisThe1st5y ago

1 more reply

rikroots5y ago

In the UK, this job is handled by the British Library. They have a legal duty to collect annual snapshots of all websites using the .uk TLD https://www.bl.uk/collection-guides/uk-web-archive

metrokoi5y ago

I believe you are misrepresenting the situation. No one expects corporations to archive and preserve all data, especially not data that they are not associated with.

r3trohack3r5y ago

They were sharing information with the whole world, but in an ephemeral medium.

The web, and internet, is not an inhospitable place for anyone without corporate backing. You can host a somewhat reliable service on a raspberry pi over your home internet connection.

wolco5y ago

You just can't be found. You can self host but unless someone finds you some other way you are excluded.

1 more reply

nullc5y ago

> It turns out that most pages created in the 90s are now inaccessible

Some of that is because search engines have simply stopped returning them in results even though they're still online.

fiddlerwoaroof5y ago

opportune5y ago

How are big corporations preventing a web server from serving content over HTTP in old-school HTML?

methodin5y ago

FpUser5y ago

I seem to have a distrust of corps imprinted in my brain since birth and had never fell for their candies/propaganda. All my stuff is always on my own servers with shadowing of course.

kerkeslager5y ago

Jeez, that's horrifying. Literally just giving public assets to private corporations.

Polylactic_acid5y ago

Public funded data should be publicly available which includes use by private corporations.

kerkeslager5y ago

Agreed. You seem to be missing the part where this is no longer publicly available because the USGS no longer has the data.

1 more reply

emiliobumachar5y ago

If getting it takes connections or prestige, then yes.

kerkeslager5y ago

> If any entity with a plausible use case could and still can get that data at the cost of the copy, I don't see why not.

Can you point me to where I can download this data for the cost of a copy? Didn't think so.

1 more reply

blitmap5y ago

I don't like this but if a corporation is a person, they have the same right to it that the rest of the public has.

If the effort to USGS could be quantified in a cost, I'd expect Google to pay USGS to make the public data available?

It does sound awful. I don't know what the right answer is.

kerkeslager5y ago

> I don't like this but if a corporation is a person, they have the same right to it that the rest of the public has.

1. A corporation is not a person. Corporations don't have rights, except inasmuch as the people within the corporation have rights.

2. The problem isn't that Google has access to the data, it's that USGS and the rest of the world no longer have access to the data, except on Google's terms.

1 more reply

xmprt5y ago

3 more replies

monadic25y ago

Yea most of the wind about “taxpayer dollars being wasted” is just flatulence but this is a straight up robbery.

tingletech5y ago

> But there was no agreement for Google to turn over their arial scans back to the USGS.

mywittyname5y ago

est315y ago

johannes12343215y ago

In Germany the national library is also required by law to take care of digital ("körperlose Werke") media.

They are still figuring out what a good way for archival of those is and are quite selective in choice what they archive, but they plan to expand on that

German page: https://www.dnb.de/DE/Sammlungen/DigitaleSammlungen/dgitaleS... English page: https://www.dnb.de/EN/Sammlungen/DigitaleSammlungen/dgitaleS...

wahern5y ago

tingletech5y ago

I didn't know that LoC has any discretion regarding keeping items mandatorily deposited for copyright registration. Do you have more information on this?

jeffbee5y ago

https://www.usgs.gov/core-science-systems/ngp/3dep/3dep-data...

jcrawfordor5y ago

Fortunately USGS now has a slippy map for topo and an admittedly rather clunky ESRI query service for aerials.

jeffbee5y ago

https://dds.cr.usgs.gov/srtm/version2_1/

2 more replies

widforss5y ago

jedberg5y ago

I took the tour 10 years ago. Obviously his objections were heard. I’m glad they listened to the guy.

ponker5y ago

Companies are necessarily managed for the quarter and countries should be managed for the century.

TheOtherHobbes5y ago

The planet should be managed for the centuries, plural.

But we're just not smart enough to understand that, never mind make it happen.

Instead we prefer to cling to the bizarre delusion that billions of individuals with competing interests will somehow spontaneously self-organise into the best of all possible worlds.

elmo2you5y ago

That is indeed pretty much what it says on the tin.

Until then, at least it still gives some hope that a growing number of people now realize that this system just doesn't work as it is advertised.

GekkePrutser5y ago

And it's not even true. At least here in Europe (I can't comment on the US as I've never visited), Google Maps is really poor.

Governments should support that kind of project instead of corporate privacy-invading playtoys like Google Maps.

wyclif5y ago

parksy5y ago

If the company decides to shift business models, or goes out of business, or is acquired and scuttled, these assets get blown to the winds.

raldi5y ago

> historical arial archives

A font of knowledge

pwdisswordfish25y ago

Is this a valid reason for not using Go?

I am a holdout.

(Not suggesting I am "smarter" than Go users, but I can forsee issues with Go being controlled by Google.)

teddyh5y ago

I will probably never use Go in its current situation.

https://news.ycombinator.com/item?id=8733705

cosmodisk5y ago

globular-toast5y ago

Not smarter. Wiser.

It sounds awful that Google has the best mapping data in the US. In the UK Google's data is awful, worse than OpenStreetMap and much worse than Ordnance Survey, the national mapping agency.

Spooky235y ago

The funny thing is that this happened already when Google bought DejaNews and broke the interface after a year.

elmo2you5y ago

People have every right to vote for and support such a system. But then don't complain, when all that you will get is only what such system supports/provides.

irrational5y ago

Isn't killing projects Google's key strength?

TheSpiceIsLife5y ago

Like we have whitewashing and greenwashing, I propose the term:

Googlewashing - to proclaim “Google would never ...”

stcredzero5y ago

He said Google is great now, with all their maps, which were far more accurate and had better coverage than the USGS...But what happens when they get bored with map data and get rid of it?

Looks like he was a lot smarter than us.

If you would've asked me back when Google was new, and we all believed in "Don't be Evil," I would never have thought that Big Tech would end up being the Ministry of Truth and The Memory Hole.

_kp6z5y ago· 28 in thread

neilv5y ago

When Google started, there was maybe an overall altruistic, visionary, principled culture among many pre-Web Internet-y people, and it looked like Google was of that same school of thought.

That might be why it wasn't surprising to hear of things like someone entrusting a bunch of old university backup tapes to Google's stewardship.

This has played out with mixed results, and I think Google could be doing much better for humanity and for techie culture.

enneff5y ago

Google didn’t kill Usenet; it was already pretty much dead. Web forums had all but taken their place (and where are their archives now? So much is lost).

If you look at the history, Google basically rescued the data from a collapsing Deja News, and made it available again. A nice gesture, which didn’t serve to benefit Google much in the long term.

If we want to preserve history then we can’t rely on for-profit companies. We need to instead fund non-profits whose specific charter is archival and preservation, like the Internet Archive.

dragonwriter5y ago

> The usenet archives should really be made public

Given the nature of Usenet, they were if anyone wanted them.

wahern5y ago

What Google is doing by refusing to publish the archive or even share it with parties like the Internet Archive is completely unjustifiable and anathema to everything they once stood for.

DaiPlusPlus5y ago

> What Google is doing by refusing to publish the archive or even share it with parties like the Internet Archive is completely unjustifiable and anathema to everything they once stood for.

Couldn't a copyright claim (or something under the GDPR or UK's DPA) be used to regain access to those though?

Just because something is published to a public forum doesn't mean you relinquish your rights.

1 more reply

erik_seaberg5y ago

Google acquired probably the biggest searchable archive, Deja News. What we needed was some kind of self-sustaining org with a strict charter to preserve the archive no matter what.

mjevans5y ago

Archive.org ?

1 more reply

pfortuny5y ago

They were until they were not.

eternalban5y ago

> they don't care about usenet.

They cared enough about to kill it.

HenryKissinger5y ago

Controversial question: Why should we preserve code that no one uses anymore? Why should we not allow some information to be simply lost?

jlokier5y ago

Because it's a cultural artifact, of its time. It's history. And some people would like to be able to read it, or do other things with it.

Personally I'd like to be able to link to my own posts from that time, for when people asked me what I used to do. But I can't find them any more.

These groups are mostly not code. They are conversations, design discussions, ideological discussions, jokes, that sort of thing.

Like what we have now in social media, except back then there was pretty much only Usenet, and it had a very different feel than the current social networks.

Conducted in public at the time and thought to be archived for the long term.

zxcvbn40385y ago

1 more reply

ornxka5y ago

johnfn5y ago

Why should we preserve old websites that no one uses? Why bother with historical documentation at all?

It's because, at the time, you don't know what information is going to be important and what is just garbage. Documents that are apparently useless today could become fascinating tomorrow.

ghaff5y ago

nitrogen5y ago

saving discussions about a bug in a long ago version of SunOS probably wasn't very interesting.

Honestly even that sounds pretty fascinating:

It could help someone gather stats on the nature, frequency, and severity of bugs over time and across companies from another angle.

It could provide a fresh perspective on modern OSes by showing how historic OSes did things.

1 more reply

enneff5y ago

1 more reply

minerjoe5y ago

You assumption "no one uses anymore" is glaringly wrong in this case.

Those archives are full of useful and informative information.

Not everthing changes fast. Common Lisp has been around for 30 years basically unchanged. The discussions back there can be truly informative for today.

It does take time to wade thought it, but people have been collecting (via the google archive, when it existed, sigh) curated lists.

https://www.xach.com/naggum/articles/ https://www.xach.com/rpw3/articles/

jedberg5y ago

For the same reason we don't just tear down the pyramids and build condos there.

There are still interesting things to be learned from ancient artifacts.

joshuamorton5y ago

But we do tear down old condos to build new ones. Should we also endeavor to retain every geocities and myspace page?

And if not, what makes comp.lang more like the pyramids than geocities?

1 more reply

pfortuny5y ago

Do you know about cuneiform? Lots of what is known are just ledgers and exercise books...

Never forget that we do not know the future.

Zenst5y ago

Future digital tourism.

That or risk future archaeologists thinking COBOL was some God of the time and the natives built large metal obelisks in dedicated worship temples.

rolph5y ago

likewise many people are clinging to the local operating system rather than moving to the SAAS model.

so what happens if we lose the oldschool languages and platforms entirely, for whatever reason ?

Avicebron5y ago

I second the need to rebuild from the rubble is often overlooked, especially by corporations driven by profit centered goals.

bordercases5y ago

sgillen5y ago

Well I think it’s ok in general for some information to be lost, but I think a lot of HN users value this specific information.

quantified5y ago

I’m sad to see that this was downvoted, it’s contains the key questions. I think they have good answers.

2) It’s not simply code that no one uses anymore. It’s a knowledge base on how and why, debates over constructs and usage that are useful beyond code-sharing snippets a la Stack Overflow.

JetSpiegel5y ago

It belongs in a museum!

DoctorNick5y ago· 17 in thread

It's becoming clear to me that Google has become a far, far worse monopoly than Microsoft ever was. Microsoft just controlled our computers; Google controls our access to history.

nabla95y ago

To fix problems caused by Google, you need to change the principles of competition law. Microsoft was knowingly doing lots of stuff that violated laws. It was just very hard to prove it.

PaulStatezny5y ago

Do you have any links to evidence of these statements about Microsoft? I'd be interested to read more.

nabla95y ago

It's kind of weird to get this question when you lived it and there seems to be relatively little to Google.

https://en.wikipedia.org/wiki/Microsoft_litigation

https://en.wikipedia.org/wiki/United_States_v._Microsoft_Cor....

https://en.wikipedia.org/wiki/Browser_wars

There must be book somewhere.

Dan Gilmour's articles in San Jose Mercury news from 90's should be somewhere.

Big OEM's like Dell had to do what MS said or MS would up their price. It was straight blackmail from monopoly position.

DoctorNick5y ago

Yeah, that's the main reason that there isn't as much push back to the control that Google is starting to exert over all of us.

ajuc5y ago

How evil you are is a function of how much power you have and for how long.

beagle35y ago

Potentially far worse - but Google did not yet stop progress for 10 years in multiple fields the way MS did.

They sucked the air out of advertising (in cooperation with Facebook) leaving none for others. But I consider that a small loss.

Microsoft did that for operating systems, productivity software, stalled the web with IE6, and more.

Google is capable of much more damage, for sure. But they haven’t done that damage just yet.

skinkestek5y ago

> Google is capable of much more damage, for sure. But they haven’t done that damage just yet.

That is changing extremely fast.

hysan5y ago

Disagree in that they’ve already done plenty of damage.

beagle35y ago

I don’t think RSS is a good example. Everyone I know who used google reader switched to a different RSS reader;

It’s mostly that RSS isn’t monetizable as easily as web pages. I think FB and Twitter dropping their feeds had a more significant effect; regardless, RSS was always niche.

2 more replies

_lbaq5y ago

> Google did not yet stop progress for 10 years in multiple fields the way MS did.

I beg the difference, Gmail have not changed much since I signed up 16 year:ish etc

They are all the same, as soon as competition goes away, this happens.

pbhjpbhj5y ago

But Gmail is interoperable with other mail systems and they didn't create incompatible extensions to email (AFAIAA); that's quite different to how IE6 was.

2 more replies

anchpop5y ago

I'm still upset with them for killing Inbox

2 more replies

moomin5y ago

I dunno, I look out at the world and think that maybe making journalism unprofitable may have had some negative effects that are a bit bigger than web standards not advancing that fast?

1 more reply

Lammy5y ago

> stalled the web with IE6

By inventing XMLHTTPRequest?

WorldMaker5y ago

> Microsoft did that for operating systems, productivity software, stalled the web with IE6

Android, GSuite, Chrome

beagle35y ago

Google are smart enough to maintain a duopoly (iOS, Office/365, Safari) Whereas Microsoft tried to kill all competitors (and all too often succeeded). That’s a huge difference.

1 more reply

Stubb5y ago

And each other.

none102875y ago· 9 in thread

Google has bought dejanews and has profited immensely from open source and open information.

So I do think they have an obligation either a) to make the whole archive available for anyone or b) maintain it properly.

Properly means restoring the fast UI from around 2004.

imglorp5y ago

If you found a human at Google instead of a bot, it would probably say their only obligation is to their shareholders.

It's probably not a good idea to depend on a public company to steward an important community.

Does the Internet Archive have copies of all the old stuff at least?

lstodd5y ago

Their only obligation, if we take for granted that there are any humans left at Google, is keeping the aforementioned bots powered.

Which is sad, but expected.

dependenttypes5y ago

There are quite a few humans at Google, both in HN and at twitter. Sadly all of them that I talked with seemed like people that I would not want to interact with again.

1 more reply

zentiggr5y ago

Wait, Google feels any obligations at all? I thought they only made decisions based on what's most likely to maximize their growth?

specialist5y ago

"... their only obligation is to their shareholders."

That'd be an improvement.

Page & Brin retain controlling interest, despite their minority stake.

nine_k5y ago

How did it profit from the Usenet archives? Genuinely curious.

goatinaboat5y ago

How did it profit from the Usenet archives? Genuinely curious.

Dejanews was the seed material for Google Groups, any profit derived from that (ads) was from content posted to Usenet by people who never intended for it to be used for that.

joshuamorton5y ago

Groups doesn't (and didn't ever?) Show ads as far as I know. So you're reaching for second or third order effects at best.

1 more reply

microtherion5y ago

I remember how awesome the initial version of the Google usenet archive was. It's horrifying how much they have let the UX deteriorate.

icheishvili5y ago· 7 in thread

This type of behavior is why I can never consider GCP. How many people have been burned at this point by Google randomly shutting down something they rely on?

john-shaffer5y ago

[1] https://github.com/awslabs/serverless-image-handler

firebaze5y ago

Are there any indications to you why your accounts got shut down? Any pattern you noticed?

john-shaffer5y ago

3 more replies

hobofan5y ago

> Play Music

Play Music has not been shut down (yet), and you can transfer everything to Youtube Music, which is available at the same price (and in my opinon a superior product).

john-shaffer5y ago

Spotify is generally better than Play Music though, so it was for the best in the end.

Gibbon15y ago

Google Achilles heel is they have two businesses

a) Spy on people and sell the data to advertisers.

b) Use that data to directly push ads

rsa255195y ago

> sell the data to advertisers

Do they?

2 more replies

aidenn05y ago· 7 in thread

Anyone know if anyone not google has newsgroup archives publicly accessible (The Internet Archive maybe?)

rikroots5y ago

I found this Usenet Historical Collection link - https://archive.org/details/usenethistorical - in a previous HN thread (https://news.ycombinator.com/item?id=16667796).

I have no idea how useful the collection may prove to be. I found 'comp' but it doesn't offer a webpage view, just a link to download a file. https://archive.org/details/usenet-comp

u801e5y ago

Maybe someone could set up a public inbox[1] instance that allows access to those groups either via HTTP or NNTP.

[1] https://public-inbox.org/README.html

bensw5y ago

It should be the full archive.

eej715y ago

https://www.eternal-september.org/

I think you have to register. Not sure how much history is there.

dependenttypes5y ago

A lot of posts are missing from this one.

u801e5y ago

avodonosov5y ago

https://www.xach.com/naggum/articles/notes.html

kazinator5y ago· 6 in thread

The vast majority of the spam content is injected into these newsgroups via Google Groups itself, and is not even seen on other NNTP servers.

Blocking posting access to these newsgroups from GG is generally a good thing for those newsgroups.

Not being able to search the archive is the unfortunate collateral damage though. Google is not obliged to provide a Usenet archive, I suppose.

Formerly obtained deep links to the content also do not work!

If you formely cited a comp.lang.lisp article by giving a direct link into Google Groups, people navigating it now get a permission error.

dependenttypes5y ago

What would be a good free NNTP server or NNTP archive?

giancarlostoro5y ago

jcranmer5y ago

Adding some new NNTP features to Thunderbird was my introduction to open-source software and ultimately led me to being one of the primary maintainers.

4 more replies

kazinator5y ago

(Though mere long article retention is not necessarily the best archive interface, of course.)

Disclaimer: I'm not well-versed in the solutions in this space. Maybe there is some NNTP cacher out there that also has a web archive interface into it or whatever.

WalterBright5y ago

Yes, and I have 100% of the D newsgroups archived back to the very first post. Anyone can get them from the D NNTP server. I also wrote a program to create static web pages from them:

https://github.com/DigitalMars/ngArchiver

and the generated pages:

https://digitalmars.com/d/archives/digitalmars/D/index.html

When we were working on the history of the D programming language paper, this was an invaluable resource.

1 more reply

kazinator5y ago

I've been using the NNTP server provided by https://www.aioe.org/ for quite a few years.

There is also https://www.eternal-september.org/ which I used.

AOIE requires no authentication. The Eternal September server requires account registration via the web site; then you use an authenticated NNTP connection.

There are other servers out there.

These sites do not provide any archive.

bawana5y ago· 4 in thread

is google sinking? Between their mothballing/deletion of services and the obnoxious signup ads on youtube. I am wondering what is going on?

wegs5y ago

It's not doing so hot:

2) With COVID19, ad revenues have crashed. It's not clear the impact on Google.

5) I've switched mostly to non-Google products because they're better for what I need. AOL was massive too at one point. Losing the tech edge is not good. I still use gmail.

On the other hand, their revenues have continued to rise exponentially since they started. So perhaps they're doing fine?

jcrawfordor5y ago

gumby5y ago

Another contrast: they don’t know what to do about the advertising downturn, so are cutting back on hiring and such, while FB is trying to double down.

tmpz225y ago

No they've just secured their kingdom enough that they can do whatever they want.

userbinator5y ago· 3 in thread

SyneRyder5y ago

> do not care about the past...

- Derrick May, DJ/Composer, Universal Techno (1996)

https://youtube.com/watch?v=tdox6H7FJBU&t=955s

The segment starts at 16:00 in the video and is about 2 minutes long.

Lammy5y ago

jolmg5y ago

> almost all tech companies simply do not care about the past

You may be surprised that it's not just companies. It's not hard to find people who think it's better for old stuff to just be deleted.

WoodenChair5y ago· 3 in thread

I read the article and I read the threads here, and maybe I missed it—but why did these groups disappear? Were they banned due to bad words or a mistaken spam filter?

DanBC5y ago

Here's what I get:

https://groups.google.com/forum/#!forum/comp.lang.forth

> Banned Content Warning

There's no content available for me.

jjgreen5y ago

Forth is pretty grim, but I wouldn't go that far ...

ngcc_hk5y ago

Is there means you access and archive it or is too late?

totalforge5y ago· 3 in thread

SELF FOOT SHOOT DUP

astrobe_5y ago

Actually, what I saw on comp.lang.forth the last few times I checked it (coincidentally, I tried yesterday) makes the news not really surprising.

Aside from the spam, it gradually switched from passionate but respectful debates to name calling and plain insults from newbies to what remained of the veterans.

zentiggr5y ago

Or Factor style,

[ SELF FOOT SHOOT ] 1000 REP

DonHopkins5y ago

BEGIN ME FUCK AGAIN

jeffbee5y ago· 2 in thread

dabockster5y ago

Seriously! I have the same issue with a lot of modern online communities/projects too. They all assume whatever platform they're currently publishing on will be there forever.

Brb archiving my Twitter posts

perl4ever5y ago

>The fact that nobody had enough fucks to give to archive these groups

Well, you assume. Maybe it was just decentralized enough you haven't heard about it.

summerlight5y ago· 2 in thread

https://www.lumendatabase.org/notices/search?utf8=%E2%9C%93&...

Looks like there has been (likely automated, nearly all of them are the same Italian phrase) mechanical legal complaints and it probably caused this instance of automated blocking going wild.

__void5y ago

Okay, I did some research, and I think I figured out what caused these usenet group banning.

__void5y ago

haecceity5y ago· 2 in thread

So Google Groups archives usenet stuff? Where are the usenet stuff hosted originally? How do I connect to it without Google Groups?

ghaff5y ago

icedchai5y ago

I had a UUCP news feed from a local internet provider when I was in high school, back in 1993 or so.

DonHopkins5y ago· 2 in thread

Since when were Forth and Lisp historical programming languages??! People still use them. HARUMPH!

velosol5y ago

If it makes you feel better, comp.lang.python is also blocked:

https://groups.google.com/g/comp.lang.python/

DonHopkins5y ago

No, because I like Python too, but it would make me feel much better if comp.lang.perl was blocked.

synack5y ago· 1 in thread

https://archive.legitdata.co/

https://archive.legitdata.co/comp.lang.ada/

https://public-inbox.org/README.html

sneeuwpopsneeuw5y ago

rdiddly5y ago· 1 in thread

Either those Usenet groups are not part of the world, or they don't consist of information, or Google just failed at "organizing the world's information."

StavrosK5y ago

Google has definitely failed. Finding anything that's not frecent is basically impossible.

jolmg5y ago· 1 in thread

> since there is no other comprehensive archive after Google's purchase of Dejanews around 20 years ago

Was I naive in thinking that The Internet Archive would have long archived this type of thing?

foresto5y ago

The Internet Archive is younger than Deja News. Someone would have had to provide the data. Did they?

If you want to look, you might start here: https://archive.org/details/usenet

1 more reply

LockAndLol5y ago· 1 in thread

nabla95y ago

Those groups are running on decentralized system and open protocol https://en.wikipedia.org/wiki/Usenet

The problem is that there is no other searchable archives.

fizixer5y ago· 1 in thread

Can anyone tell me how Google got hold of the whole usenet (I know it was like 15-20 years ago) which looks to me like a community service kinda thing.

Like when Google decided it's going to host comp.lang.c, can there be only one comp.lang.c on the internet, or can someone else start hosting comp.lang.c as well?

rjsw5y ago

That isn't how it works, usenet is distributed, you can still access it using non-Google servers.

ipunchghosts5y ago· 1 in thread

i would like to find the quickbasic archives. anyone know how i can get them?

DanBC5y ago

https://groups.google.com/forum/#!forum/microsoft.public.bas...

Not safe for work!!!

gnabgib5y ago· 1 in thread

This is editorialized (actual title: "Some Usenet groups suspended in Goggle Groups"), or on LWN[1] "Historical programming-language groups disappearing from Google" (basically the same content)

[1]: https://lwn.net/Articles/827233/

dang5y ago

Ok, we've changed to that from https://support.google.com/accounts/thread/61391913?hl=en. Thanks!

Animats5y ago

"He who controls the present controls the past. He who controls the past controls the future" - Orwell, "1984"

fmajid5y ago

> Usenet predates Google's spam handling tools

In fact Usenet predates spam itself, since the first spam (Canter & Siegel) was on Usenet itself in 1994 (I was there).

CrankyBear5y ago

No, no, no. These groups and other Usenet groups archives must be preserved. They're our history.

imhoguy5y ago

Anyone looking for a hobby? It is time to become a data hoarder https://www.reddit.com/r/DataHoarder/

msie5y ago

WTF Google? Are you now so full of young programmers who have no respect for programming history? You’ve lost all greek cred that’s for sure.

mark_l_watson5y ago

Too many people and companies don’t appreciate culture enough. Maintaining a cultural record should apparently not be left to just one company.

Thanks for posting this, it reminded me to donate again to archive.org, which I just did.

I use ‘culture’ to include anything creative, anything that we experience as humans. Everything should be preserved, schools should be well funded, as should the arts.

lkirk5y ago

Is this something that the internet archive would preserve?

avodonosov5y ago

There is a comp.lang.lisp archive published in 2009.

> In 2009, Ron Garret published a 700MB archive file of all of comp.lang.lisp

https://www.xach.com/naggum/articles/notes.html

rurban5y ago

zxcvbn40385y ago

cptnapalm5y ago

NewEntryHN5y ago

Either this archive exists elsewhere, either now is not the proper time for panic -- it was when Google became sole owner of this archive.

smsm425y ago

Arjuna1445y ago

This is really bad marketing

jolmg5y ago

> Perhaps Google can be convinced to restore the content

The support ticket was deleted, so I guess not.

ryanmarsh5y ago

Thank god. I said some really dumb shit on those lists in my youth that I regret.

grappler5y ago

This kind of thing makes it really easy to get interested, and stay interested, in decentralization tech.

Once you see things in this light, the new flavor of the month online service just doesn't hold any allure.

quantified5y ago

(Repeating one of the comments from the post):

> Has anyone (EFF?) considered the aspect of destroying evidence of prior art in the public domain?

I think there’s a case to be made for stewardship of these groups for that reason.

Havoc5y ago

I'm hearing a fair bit of chatter in SEO circles about google de-indexing pages so this certainly rings true.

I guess there was this unjustified assumption that google only adds & never subtracts.

hosh5y ago

Maybe it is something that a non-profit dedicated towards preserving knowledge and internet content (such as Internet Archive) should be handling anyways.

bawolff5y ago

Maybe these types of historical archives can be turned over to internet archive. I trust them a lot more than google for this.

Igelau5y ago

If an AI decided to shut off comp.lang.lisp, I'd say it's officially too late to solve the Alignment Problem.

photon-torpedo5y ago

Guess comp.lang.lisp has too many posts with (((code))) in them... ;)

ZinniaZirconium5y ago

alt.sex is still there and you don't get an adult content warning unless you choose the desktop version.

Ijumfs5y ago

It was a terrible idea to entrust ANYTHING to Google.

Time to de-Google the whole Web.

staycoolboy5y ago

On the plus side, evidence of my awful usenet etiquette from the late 80's is disappearing with some of these groups.

j / k navigate · click thread line to collapse