The top-ranking HTML editor on Google is an SEO scam

(casparwre.de)

1743 points · caspii · 5y ago · 395 comments

227 comments · 71 top-level

ilamont 5y ago · 47 in thread

Same story for various Wordpress plugins and widgety things that live in site footers.

Google has turned into a cesspool. Half the time I find myself having to do ridiculous search contortions to get somewhat useful results - appending site: .edu or .gov to search strings, searching by time periods to eliminate new "articles" that have been SEOed to the hilt, or taking out yelp and other chronic abusers that hijack local business results.
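
Those search contortions can be scripted. A minimal sketch, assuming the `site:` and `-site:` operators and `OR` behave the way searchers in this thread use them; `build_query` and its parameters are hypothetical names, not part of any search engine's API:

```python
def build_query(terms, exact=(), sites=(), exclude_sites=()):
    """Assemble a search string from plain terms and common operators."""
    parts = list(terms)
    # Quote phrases that must match exactly (avoids "did you mean" rewrites)
    parts += [f'"{phrase}"' for phrase in exact]
    if sites:
        # Restrict results to any of the given sites or TLDs
        parts.append(" OR ".join(f"site:{s}" for s in sites))
    # Drop chronic offenders from the results
    parts += [f"-site:{s}" for s in exclude_sites]
    return " ".join(parts)


query = build_query(
    ["html", "editor"],
    sites=[".edu", ".gov"],
    exclude_sites=["pinterest.com", "yelp.com"],
)
print(query)
```

The helper only builds the string; how well each operator is honored is up to the search engine.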

XorNot5y ago

Also phone problems: Google a problem with a phone and the top hit will be a whole bunch of churned out articles with generic copy on the cause (sometimes there are bugs in the software, so reboot your phone).

duskwuff5y ago

Any technical issue, really. There's a ton of autogenerated content out there with low-effort troubleshooting tips. A lot of it is used as lead generation for scammy antivirus/antimalware/"cleaner" software, paid tech support, or outright tech support scams.

gerdesj5y ago

"Google has turned into a cesspool."

That's a bit harsh but I agree that it is starting to fail to live up to the expectations I had with Google when it came out and destroyed Altavista in a spectacular shower of sparks.

Could I tender: "uBlacklist" as a stop gap, amongst others as we await Google being given a right old kicking?

Despite being a staunch Arch Linux user I have to deal with rather a lot of MS Windows related stuff. Being able to filter out that bloody awful Microsoft Social thing gets me closer to decent results. The majority of the next 10-100 results will be copy-and-paste clones of someone's blog but a human is able to get in reasonably quickly. I'm toying with blocking Stack Overflow and other (cough) stalwarts to see if results get better for me.

In my opinion: the www has hit a crossroads or perhaps a Spaghetti Junction or a Magic Roundabout for the last five years or so and continuing. However the exits are connected to the entrances on these road systems (take a look at them - they are real junctions. The MR is particularly terrifying but it works really well.)

I still won't use words like cesspool for this but I am increasingly losing my patience over the standard of results from Google. Those featured things (not the Ads - that's fine) at the top, which add #blah_blah to the URL to colour search terms yellow, are not working for me. The quality of the results featured in a box is often rubbish too. It would be nice to be able to turn all that stuff off.

I understand that Google are trying to "be" the internet to try and keep the stock ticker pointing north but there seems to be a point when they have overreached themselves and I think that was passed several years ago. I also increasingly feel that Google thinks that it knows best and has removed many choices from their various UIs - that comes across as a bit arrogant.

Many years ago I left Altavista behind for Google. I will move again if I feel I have to. Of course that's not much in the grand scheme of things and I'll probably only take around 100,000 people with me but they have friends - still probably not a big deal.

oska5y ago

I appreciate a lot of what you're saying in this comment but I disagree with this sentiment:

> not the Ads - that's fine

In my strongly held opinion, push advertising is not fine and it's the root cause of all the problems you are discussing. We will only exit this mess that the web has become when everyone blocks push advertising by default. People should only see advertising when they are interested in being advertised to, e.g. sites you consciously choose to go to that advertise products & services, like the old Yellow Pages phonebooks.

Spooky23 5y ago

I don’t think Google is the cesspool, I think Google is a search engine for an internet that is the cesspool.

We’re moving to the vision of information services that were pioneered by AOL, Prodigy, etc. Honestly, we’re there already.

smegger001 5y ago

I wish I could have 2010 Google search as an alternative to 2021 Google search.

emptyparadise5y ago

I'm amazed that there isn't anything like uBlock Origin for search results.

p5a0u9l5y ago

Comparing Google now to Alta Vista is not very helpful. They don't get to rest on their laurels. Search is less helpful now, and it's not clear to me that they care enough to do something about it.

thaliaarchi5y ago

Anecdotally DuckDuckGo seems to have fewer sponsored sites than Google. DDG also makes it easy to block low-quality sites because it adds a data-domain attribute to the root of every search result. I recently started this mini uBlock Origin filter list for that (suggestions welcome!):

    ! Hide low-quality results on DuckDuckGo
    duckduckgo.com##[data-domain="w3schools.com"]
    duckduckgo.com##[data-domain$=".w3schools.com"]
    duckduckgo.com##[data-domain="w3schools.in"]
    duckduckgo.com##[data-domain$=".w3schools.in"]
    duckduckgo.com##[data-domain="download.cnet.com"]
    !! Stack Exchange mirrors
    duckduckgo.com##[data-domain="exceptionshub.com"]
    duckduckgo.com##[data-domain="intellipaat.com"]
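
A list like this can be generated rather than typed by hand. A minimal sketch, assuming DuckDuckGo still puts a `data-domain` attribute on each result as described above; `ddg_filters` is a hypothetical helper and the domain list is only an example:

```python
def ddg_filters(domains):
    """Return uBlock Origin cosmetic filters hiding DuckDuckGo results per domain."""
    lines = ["! Hide low-quality results on DuckDuckGo"]
    for domain in domains:
        # Exact attribute match hides the bare domain
        lines.append(f'duckduckgo.com##[data-domain="{domain}"]')
        # Suffix match ($=) hides every subdomain of it
        lines.append(f'duckduckgo.com##[data-domain$=".{domain}"]')
    return lines


filters = ddg_filters(["w3schools.com", "pinterest.com"])
print("\n".join(filters))
```

The output follows the same two-rule pattern as the hand-written list: one exact match and one suffix match per domain.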

raverbashing5y ago

Great idea. Though I've noticed DDG promotes "blogspam" articles more often than the authoritative sources.

Let's say, if I search for a python builtin library, I want to go to the python website, not some "Python 101" blog post about it.

bassdropvroom5y ago

Great tip! I've been using DDG's official addon but this means one less addon. Thanks!

zem5y ago

pinterest.com would clean up another large chunk of crap

elchupanebre5y ago

The reason for that is actually rational: when Amit Singhal was in charge the search rules were written by hand. Once he was fired, the Search Quality team switched to machine learning. The ML was better in many ways: it produced higher quality results with a lot less effort. It just had one possibly fatal flaw: if some result was wrong there was no recourse. And that's what you are observing now: search quality is good or excellent most of the time while sometimes it's very bad and G can't fix it.

robbrown451 5y ago

I wouldn't call that rational. There is no reason you can't apply human weighting on top of ML.

Honestly, I don't believe for a minute they "can't fix it." They do this sort of thing all the time, for instance when ML shows dark skinned people for a search for gorilla, they obviously have recourse.
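
As an illustration of human weighting layered on top of model output, here is a minimal sketch. The scores, domains and the `rerank` helper are all made up for the example; nothing here reflects how Google's ranking actually works.

```python
def rerank(results, manual_weights):
    """Multiply each model score by a hand-set per-domain weight, then sort."""
    adjusted = [
        # Domains without a manual entry keep their model score (weight 1.0)
        (domain, score * manual_weights.get(domain, 1.0))
        for domain, score in results
    ]
    return sorted(adjusted, key=lambda item: item[1], reverse=True)


# Hypothetical model scores: the scam site ranks first before any correction
results = [("scam-editor.example", 0.9), ("docs.example", 0.8), ("blog.example", 0.5)]
# A human reviewer demotes one domain
weights = {"scam-editor.example": 0.1}

ranked = rerank(results, weights)
print([domain for domain, _ in ranked])
```

The model's scores stay untouched; the manual table is a separate, auditable layer that can be edited when a specific result is wrong.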

cookiengineer5y ago

> G can't fix it.

Yes, they can. They should simply stop measuring only positives, and start measuring negatives - e.g. people that press the back button of their browser, or click the second, third, fourth result afterwards...which should hint the ML classifiers that the first result was total crap in the first place.

But I guess this is exactly what happens when your business model pays you for leads to sites that carry your ads: you end up with weird ethics, as your company profits from those scammers more than from legit websites.

From an ML point of view google's search results are the perfect example of overfitting. Kinda ironic that they lead the data science research field and don't realize this in their own product, but teach this flaw everywhere.
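
The "measure negatives" idea can be sketched as a quick-return rate over a click log. This is an illustration only: the log format, the `quick_return_rate` helper and the 5-second threshold are assumptions, not a description of any real ranking signal.

```python
from collections import defaultdict


def quick_return_rate(clicks, max_dwell=5.0):
    """Per URL, the fraction of clicks where the user came straight back."""
    total = defaultdict(int)
    bounced = defaultdict(int)
    for url, dwell_seconds, returned_to_results in clicks:
        total[url] += 1
        # A short visit followed by a return to the results page counts as a negative
        if returned_to_results and dwell_seconds <= max_dwell:
            bounced[url] += 1
    return {url: bounced[url] / total[url] for url in total}


# Hypothetical log rows: (url, seconds on page, came back to results?)
clicks = [
    ("spam.example", 2.0, True),
    ("spam.example", 3.5, True),
    ("spam.example", 40.0, False),
    ("docs.example", 120.0, False),
    ("docs.example", 4.0, True),
]
rates = quick_return_rate(clicks)
print(rates)
```

A high rate for a top-ranked URL would be the hint that the first result did not answer the query.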

coliveira5y ago

My impression is that the ML algorithms at Google have the goal of increasing profitability from search. If that is the case, the quality of search will tend to be secondary to displaying pages that bring more revenue.

jeromegv5y ago

Blatantly false that Google has "no recourse", Google can put on penalty and bring domains down.

humaniania5y ago

"Request manual review of search results" button?

wingworks5y ago

I really don't like how easy it is to fake a "new" article on Google. You can just re-publish an old article and stick a new date on it, and Google takes it at face value and uses the new date.

sellyme5y ago

You can also do the opposite: post something today and say it was up on your site in 2003.

Makes it really difficult to find old pages about something that recently exploded in popularity, because the age filter just doesn't work.

BigJono5y ago

I ran into this for the first time yesterday when trying to find out new info about a footy player. Some article from 15 years ago talking about how he had a good first game, tagged as 5th june 2021. Like, wtf?

colordrops5y ago

Google Search is ripe for disruption. It's been over 20 years now and they are not dynamic or interesting at all anymore.

LeoPanthera5y ago

I still think that the "Yahoo!" style web directory is a good model. A catalogue of hand-curated links has increasing value as the quality of Google results goes down.

Edit: ...and using the search box results in a 404 so I guess it's really dead huh.

Edit 2: Apparently this is the successor! https://curlie.org/en

[1]: https://dmoz-odp.org

lemmiwinks5y ago

The irony being that 20 (more like 25?) years ago Yahoo search was ripe for disruption... by Google :)

Halt and Catch Fire [1] (as a nerd, I can say it's one of the few TV series that got the hacker spirit right) had a few episodes about the Google disruption.

Like some people often say here, things come and go in circles...

[1]: https://en.wikipedia.org/wiki/Halt_and_Catch_Fire_(TV_series...

rickspencer3 5y ago

Neeva.com

I am in the pre-release program. The hardest initial thing to get used to was not immediately scrolling down to the bottom to avoid all of the spam.

I suspect that their methods are not much different than Google, but the experience has been so much better.

emodendroket5y ago

It's so easy to do better! Just look at what a rousing success Cuil was.

duskwuff5y ago

Free WordPress* themes are particularly bad in this regard. Since they're expected to contain HTML anyway, it's altogether too easy for the author of a theme to include a couple of links to a site they want to promote. Some themes take this to the next level by obfuscating the code that generates the promotional links, and/or including other code which makes the site not work properly if the links are removed.

*: and themes for other web applications, but mostly WordPress these days

normac2 5y ago

Hmn. I would agree about all crap being mixed in there, but in terms of overall results (both wrt. SEO crap and other irrelevant stuff), my experience has been that the quality troughed something like 2-3 years ago and then came back (my guess is that they're incorporating all of the AI they've been doing throughout the company into search). To me it feels like it's about 80% of its best right now.

I bet it's that we do different types of searches.

jamiek88 5y ago

Ugh Pinterest results.

wlesieutre5y ago

I swear, Pinterest must have employees working undercover in the Image Search team for Google to have let them destroy image search results the way they have.

It's literally never the original source for anything, but you can bet it's most of the first 10 pages of results. Then it doesn't even let you right click to open the image file, and dumps you to a login prompt if you click on anything. THAT'S NOT EVEN YOUR IMAGE STOP TELLING ME WHAT I CAN DO WITH IT.

ajsnigrutin5y ago

I'd expect a company like Google, which tracks what kind of socks you have on every day, to also track its own search engine... user mistakenly clicks on a Pinterest link, user immediately clicks back and looks for something else... is it so hard to assume that they don't want Pinterest results, because they're useless, and somehow lower their SEO score? Nooo, of course not, just put the Pinterest results near the top until the user puts "-pinterest" in the search bar.

newacct583 5y ago

> Google has turned into a cesspool.

All these same sites appear near the top of Bing searches too. There's nothing particularly Google-specific to this story. It's about SEO hacking that will work against anyone with a PageRank-style system.

Mediterraneo10 5y ago

Indeed. I recently noticed this while relying on DDG for documentation for Common Lisp, a language I am still learning. The top-ranking site for any Common Lisp function was an SEO scam site, where clearly someone had hired freelancers to take preexisting CLisp documentation and rewrite it, in poor-quality English, until it would no longer be detectable as a copyright violation, then loaded it with ads.

(I just checked and this copycat documentation site has, thankfully, now been pushed down a bit in DDG results.)

worble5y ago

I think it's high time we had a webring resurgence. It's impossible to get anywhere with plain search anymore; what we need is curated websites where other domain owners are happy to say "I endorse the people running this site, so if you like my stuff you'll like them too".

jimbob45 5y ago

This is my view too. Yes, I’d love to go back to a time when Google’s algorithms were unknown enough for SEO to be futile but those days are gone and the problem isn’t limited to Google.

cookiengineer5y ago

I also noticed that Apple users see way more fake online shop results than Linux users, from the same IP, with regularly cleared browser cache and identical search terms.

Those fake shops are part of discussions in politics right now. Usually they're registered in Ireland or Malta as companies due to their specific banking laws. They make millions with those scams and people can't tell the difference between legit online shops and fake ones, because the legit ones actually look crappier than the fake ones when it comes to website design.

In Germany, we have at least for hardware the "geizhals" website which is kind of an index for all kinds of electronics shops and they try to verify as much as possible.

But for other online shop sectors (e.g. clothing or home stuff) I wouldn't trust anything. Even on Amazon I got scammed a lot and heard absurd things from others...like getting packages with no content in them and Amazon refusing to see that the seller is a scammer etc.

ping_pong5y ago

Google is a cesspool because the spammers and SEO-hackers are in full force, and Google is only reactive to these threats these days. I mean, does it really matter if they are making hundreds of billions of dollars a year? They seem to be doing something right.

The only time something will change is when traffic starts decreasing to their site, but it's good enough such that people won't change. Look at Facebook, I don't know anyone who uses it as much as they used to 10 years ago, but it's making the most money it ever has. Why on earth would any behavior change? From their points of view, everyone is happy with it!

luke2m5y ago

I don’t like google and don’t really want to defend it, but this is more of a lots of crappy websites problem than a google problem.

worik5y ago

Google, to justify its huge capital worth, should deal with that crap. Why else bother?

naikrovek5y ago

google isn't the cesspool, people who want to appear at the top of a list of search results are doing whatever it takes to create a cesspool, because that's what it takes to earn more money.

being willing to make other things in order to have more money always creates cesspools.

prepend5y ago

Google’s mission was “organize the world’s information and make it useful” and they are doing a poorer job now than historically.

Of course there are scammers, that’s part of what makes organizing so hard.

Cynically, I think that Google is worse at filtering scammers because they care less now. Half the page is ads so they make money either way.

Retric5y ago

Google is a cesspool because it’s their job to fix it and they failed. I stopped using Google search because of how far it’s fallen.

beepbooptheory5y ago

If it's the only way to make money, it doesn't really feel like the burden is on the people to make a cleaner pool.

cyanydeez5y ago

don't forget adding quotes to things to stop the random "did you mean to spell this?" crap

basically, like everything in modernity, it's a race to the bottom of the infinite dullards of popular

paulpauper5y ago

I wish duckduckgo had better results. google still better

torbital5y ago

I can't remember the last time I searched on Google without appending "reddit" to the end.

lupire5y ago

> Half the time I find myself having to do ridiculous search contortions to get somewhat useful results - appending site: .edu or .gov

A great opportunity for students and public servants to sell premium URLs.

cybice 5y ago · 22 in thread

As a web developer I have a strong feeling that we are writing the web for Googlebot and not for people. For every website I have created, I got a list from SEO of what to add: 200 links at the bottom of each page, different titles, headers, metas, human-readable URLs without query params, all those canonical URLs, nofollow rules, etc. Most of these things are invisible to users and exist only for Googlebot.

mercury_craze5y ago

It never seemed important to the CEO of a previous company I worked for that we had something to say, only that we gave off the impression that we had something to say. We hired an outsourced blog writing service to fill our wordpress instance with generic, inoffensive platitudes and listicles poorly cribbed from Wikipedia and the ONS. Squint a little bit and you could convince yourself there was value to it, but nobody with any experience in the problem space would treat it as anything more than marketing fluff. His hope was always that one day we would get rewarded by the great Google algorithm and appear on the first page for search terms we were convinced our users were looking for, but the end result was that our blog was largely designed to be read by robots.

It's the same thing as the tweaks you have to perform for SEO optimisation, some have questionable value to the end user but you jump through the hoops anyway because it's what is done, by pleasing the robots you're rewarded with a higher search position.

nimbleal5y ago

Fortunately with GPT3 and the like I’d imagine this approach will soon have had its day. Not that I’m optimistic about whatever will replace it.

ridaj5y ago

Much of it is driven by cargo cult SEO, throwing everything and the kitchen sink into the page in the completely unproven hope that it'll somehow game the rankings.

Jenk5y ago

It is cargo cult but it's cargo cult because it is the way to "success". Company A have great page ranking, and blog about how they think they got there. Company B also have great page ranking, but think they did something different to Company A, so they blog about it, too. Everyone else reads both blogs and intersects what both companies did, and implement those changes. Iterate for every difference you encounter and voila.. you now have your rubber stamp SEO method.

lvncelot5y ago

Even (or rather, especially) if all SEO advice is correct, it still means that Google effectively has a lot of control over the shape of the modern web through the indirect pressure of SEO alone.

spiderfarmer5y ago

I run every piece of SEO advice as an experiment before implementing it across my network, and a lot of the advice actually brings results.

hliyan5y ago

I know that it's being done, but I don't know if it's necessary. I frequently find good old unstyled HTML pages from the 90's internet (the ones with Prev/Next/Up links, like this: https://tldp.org/LDP/abs/html/here-docs.html) at the top of Google results.

pbhjpbhj5y ago

I didn't check but to my recollection that domain is pretty old, and domain age is supposed to be a principal metric for trust (which in turn is a strong signal for page rank). So, ...

I mean it's pretty reasonable, if a site has been around a long time it's going to be generally 'good'.

ricardo81 5y ago

Some of the technical SEO is good though, like simply making the page crawlable and content being in a logical order.

The "fiddle with H1" or "write X amount of words" or "buy Y number of links with a % of anchor text" is silly.

tomcooks5y ago

> Some of the technical SEO is good though, like simply making the page crawlable and content being in a logical order.

Semantic HTML has been created to help screen readers and browsers understand content organization, it having been hijacked by SE is just a side-effect.

lvncelot5y ago

Yes, but even for those, it means that we are left to hope that what's good for a crawler and what's good for e.g. a screen-reader will still align in the future. Right now it feels almost coincidental.

hyperhopper5y ago

The problem is that the internet at its conception was just a way to host content, not a way to discover content. When discovery was done via word of mouth or extra-internet means, the websites themselves were just for the people that viewed them.

Now, when the website needs to not only contain content, but also be its own advertisement, writing it in a way that will maximize virality is the natural course of action to make sure the site actually gets seen.

This will likely remain true until there is a method of finding webpages that is not based on automated scraping or on the page itself.

Sharlin5y ago

On the contrary, the Web, being a hypertext system, was definitely always about discovering content. If you found an interesting website, it would typically link to other interesting sites. There used to be ways to systematize these ad-hoc linkings, such as Web rings. And the first attempts to catalogue and categorize the contents of the (then tiny) Web were in the form of human-curated directories à la Yahoo. It’s just that in just a few years it became apparent that this approach could not scale, and search engines based on automatic crawlers became the norm – but again, critically, these too are of course fundamentally dependent on the Web’s discoverability by following hyperlinks!

novaleaf5y ago

worse: paid content farms / ai to generate crap "articles" by the boatload, targeting every organic search term 5 different ways.

The result is that ACTUALLY USEFUL articles are buried on page 5. Any slightly helpful bit of content in the top articles is repeated (using different grammar of course) in all the other "top" articles.

tootie5y ago

What I tell every client is that 90% of SEO is in writing good, relevant content. Technical SEO is more like housekeeping. Adding footer links is redundant if you have a sitemap and good navigation. If your users can find stuff easily, crawlers can too. The biggest technical things that I make a stink over are canonical URLs and https.
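
The two technical checks named here, a canonical URL and https, are easy to automate. A minimal sketch using only the standard library; `CanonicalFinder` and `audit` are hypothetical names, and the check is deliberately shallow (it does not fetch pages or validate certificates):

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Record the href of the first <link rel="canonical"> in a document."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        # rel may hold several space-separated tokens
        rel = (attributes.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel and self.canonical is None:
            self.canonical = attributes.get("href")


def audit(html):
    """Report the canonical URL, if any, and whether it uses https."""
    finder = CanonicalFinder()
    finder.feed(html)
    url = finder.canonical
    return {"canonical": url, "https": bool(url) and url.startswith("https://")}


page = '<html><head><link rel="canonical" href="https://example.com/post"></head></html>'
report = audit(page)
print(report)
```

A page without the tag yields `None` for the canonical URL, which is the case worth flagging.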

Cthulhu_5y ago

On the other hand, Google over the years has tweaked their algorithms and recommendations to match up with what makes a good site, in terms of content and markup.

growt5y ago

Human readable Urls don't sound that bad.

loonster5y ago

Extremely useful for when a link dies and there is no useful archive.

apples_oranges5y ago

As a mobile developer (sometimes) I rarely/never see apps that don't have Google SDKs bundled either..

atatatat5y ago

That sounds...like a great reason not to get into "mobile" dev and stick to PWAs.

pjc50 5y ago

Well, yes, because googlebot is the gatekeeper of popularity and income for websites. Got to appease the decision maker.

Mauricebranagh5y ago

Apart from stuffing 200 links in the footer why is this bad?

shmiga 5y ago · 19 in thread

SEO is so broken, it's not about website content or website quality. It's about how much money you pay to some punks - "SEO experts" who are hacking a system. I'm so sick of that.

mrtksn5y ago

If you Google stuff like "opening hours of ..." in Turkish (probably in other languages too), for many years the search results have been nothing but news websites spamming Google, including the Turkish franchise of CNN, CNN Turk.

The format goes like this: Lately people are searching for XYZ but is it safe to search for XYZ? What do experts say about XYZ? To find out, continue reading our article.

Then it's followed by a wall of text made of keywords (in sentences that don't make sense); if you are lucky, the opening hours (which are often not accurate) will be somewhere down the text.

But it doesn't stop there. Even actual news articles are written for the consumption of the Google bot: the sentences often don't make sense, and they are repeated multiple times with synonyms swapped in for one of the words, making it into a lengthy article that doesn't have any meat beyond the title.

I argue that the problem is not SEO experts with low ethics, the problem is the way the business is structured. SEO experts don't do it for the sake of the art but because they are paid to do it. They are paid to do it because it has a positive ROI on bringing eyeballs and people pay Google for eyeballs, then Google pays those who generate the eyeballs.

Isn't it better for Google and everyone involved if you can't find what you are looking for, continuing your search brings more eyeballs? It's not like you are going to switch to Bing? You are also not going to abandon the internet and go to a library.

dspillett5y ago

I've not seen it for opening times (UK here) but the same pattern is very visible elsewhere.

Entertainment/news sites are chock full of pages like "<whatever>, what we know so far, release date, cast, will it be renewed, has it been cancelled..." that spend many paragraphs saying "we know nothing, randomly plucking crap out of thin air we could guess something-or-other but that remains to be confirmed". A new news story, film, show, or even just a hint of something, and the pages go up to try to capture early clicks. Irritatingly, they are often not updated quickly when real information becomes available or that information changes (particularly over the last year, which has affected release dates). I have several sites DNS blocked because that annoys me less than getting one of these useless/out-of-date pages more often than not when I follow one of their links.

eino5y ago

> It's not like you are going to switch to Bing

From personal experience, I switched to another tool (DDG) a couple of years ago. When I occasionally try Google, for 95% of common requests I'm appalled by the results: the top is only SEO garbage. For very specific and precise searches (where people are not trying to game the system), Google is still the best, though.

Avamander5y ago

> Then it's followed by wall of text made of keywords.

I've noticed a rise of that as well. With some searches such spam is all I've received. But that's really a problem in all languages Google supports I think.

There's even malware that infects websites and generates such content; not sure what the point of that is. Does anyone know?

lodovic5y ago

> It's not like you are going to switch to Bing?

I changed the default search engine from Google to Bing and DDG in all browsers. Google does have better results, so sometimes I still need to use them. But for 90% of generic queries such as the weather, product information, or finding a company's website, Bing is good enough.

1vuio0pswjnm7 5y ago

Fix the system? People who comment online seem to think the concept of the "search engine" cannot be improved, except by Google. The list of inactive search engines at https://en.wikipedia.org/wiki/Search_engine is depressing. The problem for us is that the supposed innovator Google has little financial incentive to improve the system regarding "content" or "quality". As long as the traffic keeps coming, the ad revenue keeps coming. Their best bet is to promote what's "popular" ("top-ranking"). Because the traffic keeps coming no matter what Google does, "content" and "quality" are not really their major concerns. There are no true alternatives for users. Bing is basically a Google clone. No new ideas. Other search engines, like DDG, just piggyback off Google or Bing crawlers. Not sure about Baidu, Yandex or others but I suspect they are more or less Google clones as well. In every case, advertising dictates design. No new ideas.

raxxorrax5y ago

If a topic or search term is present in any form of news article (If one paper has them, they all have them shortly after), the search results are just extremely bad. You know that Google promotes its media friends and by now Google results look like an ad list. They haven't stopped innovating, they are moving backwards.

It would need an option to ignore any form of news media in search results.

apples_oranges5y ago

I'm trying alternative search engines from time to time and they are much weaker than Google. So yeah, I'd bet on them to improve stage. The others first need to catch up.

topicseed5y ago

True to some extent but it is improving with Google updates. Now, there is a way to go still, and some legit websites get hit by updates unfortunately, but overall fewer and fewer scams pass through.

SEO used to be extremely gameable (seniority of site, keyword stuffing, backlinks), but these levers aren't as obvious now, if at all.

shmiga5y ago

That is great, but can google change their algo to some point where it works differently? Their ad business is there in the web.

maze-le5y ago

I wonder why Google is not more rigorous about that. Google search has been riddled for years with "optimized" content nobody wants. It's become so bad that even my non-techie friends are beginning to switch to DuckDuckGo, which is not better per se (probably worse at contextualizing).

bungle5y ago

Getting stuff on PageRank feels like a game. Getting stuff out of Google feels like a game too. To the point that moving to an alternative feels worth it, at least to try.

WarOnPrivacy5y ago

Everyone wonders about that. Googling most phone numbers returns nothing but pages of spam links.

A decade from now, Google will have made no improvement.

DelightOne5y ago

That's why for certain things Google is useless. Have to add certain keywords to avoid the SEO content to get comparisons, reviews, forums.

One day Google may introduce multiple search rankings, where one of them is SEO and another is the "useful things". But I don't hold my breath.

mikevin5y ago

I still do this but I'm 99% sure Google and DDG max out at around 3 keywords these days. I just get results for the top 3 SEO keywords, no matter how much I try to refine my search.

Maybe it's just because I'm searching for technical stuff but DDG and Google are both a big source of frustration for me.

DDG thinks I mistype most of my queries and will desperately try to correct my 'mistake' because "surely nobody is really searching for documentation about ARM32 bootloaders, they just mistyped when they were really trying to look for a webshop that sells 32 different ARMchairs and ARMy boots.".

Google will understand my input at least half of the time but uses that power to show me the power of websites that do some article/keyword scraping and run GPT on it, or this great new Medium blogpost with two paragraphs of someone copying a Wikipedia summary of what ARM is and copy pasting build instructions from a GitHub README.

I've tried searching github.com itself but that's just a nice way to find out that apparently most of the data they store is just scraped websites, input for ML models or dictionaries and they will happily show me all 9K forks of the one repo that contains the highest density of these keywords.

/rant

alpaca128 5y ago

It's also so frustrating to get results for websites which present themselves in the search results with "Results for <your query>"...only to show "no results found" when you actually click on them.

Good thing /etc/hosts has no size limit.
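
For anyone keeping such a blocklist, the entries can be generated from a plain list of domains. A minimal sketch; `hosts_entries` is a hypothetical helper, the `0.0.0.0` sink address is one common choice rather than a requirement, and the domains are placeholders. The hosts file has no wildcard syntax, so each subdomain needs its own line.

```python
def hosts_entries(domains, sink="0.0.0.0"):
    """Format hosts-file lines that point each domain at a sink address."""
    seen = []
    for domain in domains:
        # Normalize case and whitespace, skip blanks and duplicates
        name = domain.strip().lower()
        if name and name not in seen:
            seen.append(name)
    return [f"{sink} {name}" for name in seen]


entries = hosts_entries(["Spam.example", "spam.example ", "www.spam.example", ""])
print("\n".join(entries))
```

The printed lines can be appended to the hosts file by hand; the sketch does not touch the file itself.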

ma2rten5y ago

The useful thing would instantly become useless because people would start gaming it.

enknamel5y ago

Or .... how much money you pay Google. This is working as intended for a free search engine.

AtNightWeCode5y ago

This is not how it generally works, I would say. It is more about how much you pay Google and how good your page is. I worked with several SEO experts and none of them suggested scams like this. The risk of doing something like this is too high for many companies.

qeternity5y ago· 8 in thread

I have little to no experience in SEO. Does Google have a history of weighing in on situations like this and manually penalizing bad actors? If so, I would love a link to read about.

RileyJames5y ago

I agree with some of the other comments: Google's actions on SEO are always shrouded in a little "algorithmic" mystery. That said, they do apply "manual action" penalties to individual websites.

Using google search console you can determine if a manual action has been applied to your own website: https://support.google.com/webmasters/answer/9044175?hl=en

Rather than determine the ranks, these actions remove / punish offending websites from the ranks, effectively making room for 'good' actors.

Manual actions often come after a significant change in ranking algorithm or policy, and can be reverted / resolved in some cases. This usually requires removing or disavowing (in the case of unauthorized or unresponsive sites) the links pointing to a website.

vgeek5y ago

You may want to dig into http://www.seobook.com/blog for an opinionated (albeit typically objectively correct) perspective on many things related to the SEO industry. There are a few studies about Thumbtack (with GV investment), RapGenius and eBay penalties and their subsequent recoveries.

jboynyc5y ago

Here you go: https://www.mattcutts.com/blog/

bliteben5y ago

https://www.mattcutts.com/blog/join-the-us-digital-service/

wow that's amazing, I guess I sort of quit reading blogs like this when all the RSS readers died.

aww_dang5y ago

https://support.google.com/webmasters/answer/9044175?hl=en

>Google issues a manual action against a site when a human reviewer at Google has determined that pages on the site are not compliant with Google's webmaster quality guidelines. Most manual actions address attempts to manipulate our search index. Most issues reported here will result in pages or sites being ranked lower or omitted from search results without any visual indication to the user.

cocoafleck5y ago

https://www.wsj.com/articles/how-google-interferes-with-its-... I'm sure that the title tells you that the article has an opinion (not unbiased), but I think it is a useful source.

silviot5y ago

They state that they don't manually pick results, but improve their algorithms to solve these problems. They prefer to share the least amount of details though, since it would better inform SEO spammers.

AtNightWeCode5y ago

Not true, you can get penalized and you may be notified about it in the Google Search Console.

shanecleveland5y ago· 7 in thread

Could it be that Scorecounter is paying for their links to be embedded, as opposed to them being the owner/developer of both sites? If so, and provable, can they be flagged in some way?

Doesn't say much for Google's ability to determine relevancy in linking or recognizing suspicious link growth. Or perhaps it just takes some time ...

topicseed5y ago

Google used to impose manual penalties for unnatural links BUT this gave rise to, you guessed it, competitors buying unnatural links for their enemy and waiting for the penalty to be given.

Nowadays, unnatural links are mostly ignored.

duskwuff5y ago

Probably. It'd be weird for a SEO spammer to put the effort into building a popular HTML editor/optimizer just to inject links to a few sites they own and operate. It's far more likely that they're offering that link injection as a service.

dstick5y ago

If I’m not mistaken, paying for links is still very much against Google’s policies. Whatever weight that should carry... in my opinion you should always try to be as independent from Google as possible. It’s such a huge liability.

shanecleveland5y ago

Clearly. But I guess it is not outright proven that they are technically buying links. Though they would likely fall under some form of bad behavior in Google's eyes.

And, buying or otherwise, I am not sure what the mechanism is for bringing this to Google's attention.

I doubt there is another acquisition channel for a project like this that would compare to SEO (and not just Google).

enriquto5y ago

> paying for links is still very much against Google’s policies.

quite a strange thing to say about a company whose business is based on selling links (to ads)

vitus5y ago

> as opposed to them being the owner/developer of both sites?

If they're not owned by the same entity, then this blog post is rather odd: https://html-online.com/articles/scoreboard/

(To be fair, that entire blog seems odd...)

shanecleveland5y ago

Agreed. Sure seems that way. Though that may actually make it less likely to be a violation than if one was paying the other for the links. Not within the spirit of the terms, but may not be a violation either.

commandlinefan5y ago· 6 in thread

I suspect this will only get worse over time. There was a time when, if you wanted to put a site online, you (or somebody that represented you) made a point of understanding everything that went into it. But, even as what's considered a professional web site has gotten exponentially more complicated, too many people see setting up an online presence as something like printing a brochure: details irrelevant. Somebody who does understand the details is going to use them to their advantage.

onion2k5y ago

> There was a time when, if you wanted to put a site online, you (or somebody that represented you) made a point of understanding everything that went into it.

I've been making websites for 24 years. Making a website has always been quite hard, especially for a nontechnical user, and there have always been scammers happy to take their money. What's worse is that a lot of the time the scammers believe they're actually selling a good service. There have always been people happy to chuck any old rubbish up on a domain and call it a website, even if it was full of scammy links, stuffed keywords the same color as the background or in tiny text, with JS that overwrote your browser history and blocked the back button, with no context menu, etc etc.

It's annoying, and sad, for those of us who care and consider ourselves professional. But it definitely wasn't any better years ago.

julianz5y ago

True. A company we bought in the very early 2000's was paying $1000 a month to an SEO "expert". The expert hadn't noticed that the site had a robots.txt file that was excluding all search bots but was still happy to take their money and produce faked up reports about how busy they'd been pushing search terms around.
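
For anyone wondering how a robots.txt can shut out every crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The file contents, the `blocks_all_crawlers` helper and the example.com URL are illustrative assumptions, not the actual site's configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that disallows every path for every user agent.
BLOCK_ALL = """\
User-agent: *
Disallow: /
"""


def blocks_all_crawlers(robots_txt, probe_url="https://example.com/"):
    """Return True if this robots.txt text forbids a crawler from fetching probe_url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # "Googlebot" is only an example agent name; the wildcard rule applies to it.
    return not parser.can_fetch("Googlebot", probe_url)


print(blocks_all_crawlers(BLOCK_ALL))                       # True
print(blocks_all_crawlers("User-agent: *\nDisallow:\n"))   # False
```

An empty `Disallow:` value means nothing is blocked, which is why the second call reports that crawlers are allowed.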

adventured5y ago

I agree, there has been a clear, negative direction of stacking complexity in Web development for the past 20 years. It's one of the primary reasons Wordpress has 1/3 of the Web and there is a cottage industry of developers that specialize in just hacking at Wordpress to make it do things it's not particularly great at. Most people and most businesses can't come remotely close to building their own high-functioning sites (from scratch) in a cost effective manner, while getting all the critical details (eg building for SEO) right. So you get an obese do-everything CMS, and throw in some plug-ins, to sort of shim the problem.

Why is Shopify worth $150 billion? Well, other than the bubble, this effect is why. People can't easily build their own ecommerce sites, can't integrate everything they need to, in a way that doesn't cost them a small fortune.

Wix is a pretty mediocre service, clunky and slow. It's worth $15 billion? How in the world does that happen. Well, building sites is super difficult for most people. The opportunity to make that problem better is, apparently, huge.

Moru5y ago

What they value is the users, not the platform as such.

bentcorner5y ago

Feels similar to "Reflections on Trusting Trust".

Could someone inject links into content in such a way that you cannot find the link in your own source or even your hosting stack?

bombcar5y ago

You could modify the web server to modify the code in a similar way to the reflections paper.

But even more imaginative would be to work it into the kernel or the ssl layer somehow.

didip5y ago· 5 in thread

The old Google would have hunted these down mercilessly (Panda update in 2011). What happened to Google these days?

cirno5y ago

They have no competition to care anymore. Their closest competitor, Bing, has a 2.24% market share which consists mostly of people who don't bother to change their default browser's default search engine. Competition is necessary to breed innovation. See for example, IE6.

progx5y ago

That is true! Why should google do something? They say "use ads", to make money.

Use other search engines is the only way to do something.

rchaud5y ago

"SEO scammers got you down? Call Google Ads now!"

adrr5y ago

I’ve always wondered if Google AdWords hurts your SEO. Let’s say you sell widgets and searching for widgets you are ranked 5. You buy AdWords to be on the top. Since people click your AdWords ad that’s on top, they are less likely to click the organic listing, thus penalizing your organic listing since it’s not getting clicks. Google factors in organic listing click counts when determining ranking since they are a strong signal.

rondrabkin5y ago

Yes they would have. OMG that was 10 years ago and ...what is new in Google search these last 10 years. Maybe a lot but I don't see it. I just see ads and, when I do some long tail query most of the results are just random sites in russia or whatever with keyword salad (is there a word for that kind of site?)

DaveExeter5y ago· 4 in thread

This is clever!

https://www.google.com/search?q=%22Learn+how+to+solve+a+Rubi...

jesseryoung5y ago

I find it hilarious that this made its way into an Amazon listing for some waterproofing chemical. https://web.archive.org/web/20210607233655/https://www.amazo...

duskwuff5y ago

I find it even funnier that it appears in a research paper:

https://www.researchsquare.com/article/rs-8615/v1

(It's on page 24, at the bottom of the References section.)

autorun5y ago

It's even funnier that "SEO" appears as a related search at the bottom of that search URL. It has nothing to do with the query, other than that a lot of people (us) came from an SEO article.

caspiiOP5y ago

Amazing!

31tor5y ago· 3 in thread

Google is getting worse and worse. It's harder than ever to find real information. All you get is SEO scams trying to lure you in and sell you stuff. It's tragic. I miss the old internet.

Johnythree5y ago

This: As an electronic engineer I would often search for component data sheets. Usually the sheet I wanted would be the first hit. These days however I get pages and pages of crap sites that want to sell me the data sheet. Or even pages that say that they don't actually have it.

pbhjpbhj5y ago

To be fair, this is how the web has changed too. High-value content has been duplicated and hidden behind pay walls when in the early days (ie my early days on the internet/web) everyone seemed to come with their own content and share freely.

1 more reply

jaclaz5y ago

Yes, and what I always wonder about is that while I can understand the crappy sites that want money for an otherwise freely available documentation, I cannot understand the reason behind those sites (they are not only related to electronics) that come up in search (as they do have the very specific keyword/part number searched for) only to say "Sorry we don't have any of these, nor anything related".

dataviz10005y ago· 3 in thread

I'd put my money that they included 'online scoreboard' as the first phrase in the `<meta name="description" content="Online scoreboard">` tag per Google's recommendations for SEO which put them ahead.[0] Also, to this one point of many when searching for `online leaderboard` you get the top spot because leaderboard is not included in the other website's description.

The network tab in devtools isn't loading Google Analytics on your site. I think the bigger conspiracy is that Google isn't giving high search result rankings to websites that don't include Google Analytics. Part of the reason is they use time on site after following through a search result link as a dimension of quality for that search result. If that makes sense? They give 10 search results and their algorithm can tell if the search result satisfies the end user's request if they don't go back to the search results but rather continue on that site.

Lastly, clicking through a search result to your site might not give the searching user what they are looking for. Amazon discovered every time a person has to click they are far less likely to purchase an item so they created one click. Your competition makes it visually clear what their site does. You probably would get far more retention on the original click to your site if you have an image of what the end product looks like in a hero, front and center (with all the meta tags described in Google's document on SEO of course.) That way people won't click back to the search results page which Google is tracking as a dimension.

[0] https://static.googleusercontent.com/media/www.google.dk/en/...

lloyddobbler5y ago

> I'd put my money that they included 'online scoreboard' as the first phrase in the `<meta name="description" content="Online scoreboard">` tag per Google's recommendations for SEO which put them ahead.[0] Also, to this one point of many when searching for `online leaderboard` you get the top spot because leaderboard is not included in the other website's description.

While I like the thought progression you're going through, this is a "not really." Google has confirmed a number of times over the past 15+ years (going back to the Matt Cutts era) and even in the document you linked that the meta description does nothing to influence ranking in the SERPs. However, the meta title does influence ranking.

I'm on mobile, so unable to dig in right now - but my guess is either this has something to do with the meta title, or the specific anchor text of the backlinks that are getting inserted via the app in question.

Aside from that, agree 100% with your other assessments.
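
To compare the two tags being discussed on any given page, a small sketch with Python's standard `html.parser` can pull out both the `<title>` text and the meta description. The `HeadTagExtractor` class, the `extract_head_tags` helper and the sample page (including its title) are assumptions made for illustration; only the description value comes from the comment quoted above.

```python
from html.parser import HTMLParser


class HeadTagExtractor(HTMLParser):
    """Collect the <title> text and the meta description of an HTML document."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = None
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attr_map.get("name") or "").lower() == "description":
            self.description = attr_map.get("content")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Only text that appears between <title> and </title> is kept.
        if self._in_title:
            self.title += data


def extract_head_tags(html):
    """Return (title, description) for the given HTML string."""
    parser = HeadTagExtractor()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.description


# Hypothetical page used only to exercise the parser.
SAMPLE = """<html><head>
<title>Scoreboard Example</title>
<meta name="description" content="Online scoreboard">
</head><body></body></html>"""

print(extract_head_tags(SAMPLE))  # ('Scoreboard Example', 'Online scoreboard')
```

This only shows what a page declares. It says nothing about how much weight a search engine gives either tag.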

LocalPCGuy5y ago

This (and another comment you made deeper down) makes me think you may be stuck on the mechanical/technical side of SEO. While it is important, it is not nearly as important as it used to be, say 10 years ago. It's probably much less than 50% of your overall ranking. Yes, you need to get it right, but relevant back links and the associated link text on the sites linking TO you will have a far greater effect. Google prioritizes what OTHERS say about your site more than any factors you can control. Mainly because people abuse those factors significantly.

caspiiOP5y ago

I only switched Google Analytics off last week! I had it on for the whole time before and it made no difference.

superasn5y ago· 3 in thread

Google really needs to come up with a better way than backlinks to rank sites.

It's 2021 and surprisingly for all the billion dollar A.I. it can still be gamed with a bunch of unrelated links with little or no connection from the article to the site.

Also it's pretty unnatural and shady to get these backlinks. For my own SaaS site almost every blogger I contacted for a review just straight up asked me money in exchange for link. What the software did was of no consequence to this exchange. Most sites which have these "list of 10 XYZ" are just similar money making scams yet they rank so highly on Google.

P.S. And likewise I too get dozens of emails daily with "offers" from free article to actual dollar amounts just for putting a paid link. These SEO guys are just relentless because such shenanigans are working great at beating Google so far.

dafelst5y ago

Backlinks are not as important as Google would have you think, they are a pretty weak ranking factor except in the deep tail of the web.

Google (and others) keep up the narrative that they're important so that black and grey hat SEO folks keep focusing effort in the wrong places.

Source: ran the web spam detection team on a different well known search engine

somehnguy5y ago

>Most sites which have these "list of 10 XYZ" are just similar money making scams yet they rank so highly on Google.

I was just talking to my SO about this the other day when we were trying to find an air purifier for allergies. I'm the kind of person that likes to compare products a ton before dropping more than about ~$100 on anything. The way the internet has become in the last 10-15 years has made this increasingly more difficult. You really have to dig to find in-depth unbiased content on anything someone stands to make money from. For every 1 good review there are 100 'top 10 best ranked' blogspam sites..

marcodiego5y ago

I don't think there are incentives for a change. The way it is done now is probably more profitable and the competition is doing exactly the same.

fogof5y ago· 2 in thread

> and my personal favorite: a blog post on Kaspersky.com

Wow, embarrassing for Kaspersky as a computer security focused site to be a victim of this.

When I searched for "Rubiks" as it said to do, I couldn't find it though. Has the Kaspersky post been changed?

caspiiOP5y ago

Yeah, looks like they removed it.

zulban5y ago

Embarrassing but understandable. Computer security isn't about perfection, which is impossible. It's about vigilance, resilience, backups, and responding quickly. I'd say they nailed it, here.

alphabetting5y ago· 2 in thread

It's worth noting the scam site is the top result in Bing and DuckDuckGo as well

drzaiusapelord5y ago

Yep this! How can you really beat SEO when people can just try new things all day and see if it helps their rankings? I don't feel there's a solution here. Everyone suffers under SEO types just trying to bring scammy things to the top of the results page.

topicseed5y ago

The thing is, I have a site in a very competitive niche that's full of black hat SEO tactics, and I am doing my white hat best hoping that Google tanks these sites when they update algos over time, and I'd then be the best placed to take their spots over.

But in the meantime, yep... It sucks.

nostromo5y ago· 2 in thread

I've heard so much about how PageRank isn't that important to Google anymore -- but there are many reports of SEO tricks that get people on the first page of Google for common queries. It seems like it's still quite important after all.

advisedwang5y ago

They may have abandoned the actual page-rank scoring system (a quite specific implementation) without wholly abandoning the idea of using "who links to who" as a quality signal.
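
As a reference point for what "who links to who" means as a signal, here is a minimal sketch of the textbook PageRank power iteration. It is the classic published formulation with a damping factor of 0.85, not a description of whatever Google runs today; the `pagerank` function and the three-page toy graph are assumptions for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank over a dict {page: [pages it links to]}."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page gets the same small baseline share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its damped rank evenly among its outgoing links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Toy graph: two pages link to "hub", and "hub" links back to "a".
ranks = pagerank({"a": ["hub"], "b": ["hub"], "hub": ["a"]})
print(sorted(ranks, key=ranks.get, reverse=True))  # ['hub', 'a', 'b']
```

The page that collects the most inbound links ends up on top, which is exactly the property that injected backlinks try to exploit.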

Lammy5y ago

Those can both be true. PageRank is a relic of a time when search engines more consistently returned the same results for the same query. These days we're all filter bubbled with personalized results

stefan_5y ago· 2 in thread

I'm so happy the author is an ethical SEO scammer and will not stoop to these tactics.

alvah5y ago

What would you have the author do? Market his business or build it and let them come?

caspiiOP5y ago

Big difference

jrochkind15y ago· 2 in thread

> The creators of Scorecounter also made an online HTML editor

Or paid the entity running the malware HTML editor. It's probably injecting links to a variety of sites who paid them for placement.

discmonkey5y ago

[Disclosure: I work at Google] Unless I'm misreading this, the publisher of the HTML editor can easily replace links on some pretty large websites. Then it can also hack Google to remain a top HTML editor. This is a very clever virus!

janmo5y ago

He's probably selling that service on blackhatworld

fnord775y ago· 2 in thread

I searched for my local USPS store today. Every result was some SEO crap. No usps.gov result on the first page.

bombcar5y ago

The United States Post Office in Local Town USA is a great place to buy stamps. Find out about Local Town USA mail service here at randomlocaltownmailservice dot com.

dpedu5y ago

Well, it's USPS.com, so... :-)

lopatin5y ago· 2 in thread

Sorry in advance for off topic, but seeing as both this website and Reddit are currently down with the same 503 Varnish cache error, I wanted to ask how that would be possible? Surely Reddit uses their own infrastructure and not some shared hosted server where this could happen across the board?

Edit: Wow, this is much bigger than just those two sites. Looks like half the internet is down. https://downdetector.com/

Wronnay5y ago

Seems like this is connected to the https://fastly.com outage ;-)

fluential5y ago

Terraform.io and Reddit.com show the same error msg, looks like a CDN issue?

imaginamundo5y ago· 1 in thread

Well, that is unfortunate.

In 2015 I was fired over some issues on a site that I was working on and some friction with the company owner. Two months before I was fired, I reported that some links to other sites unrelated to our service (some porn and some scam pages) were on the initial page. After that I heard from my ex-coworkers that a manager from another area of the company said that I was fired because I was linking porn on some pages of our service. I didn’t know at the time that those tools existed, and only today did I realize that this is a possible explanation.

I was really sad with that manager and didn’t understand the reason to lie to my friends about the reason for my dismissal. But it is nice to know what may have caused the issue. Better late than never hahaha.

geek_at5y ago

Wow that sucks. It's not just HTML Cleaners though. A few years ago (before snowden) I analyzed free proxy servers and found that most of them blocked https and many even injected JS or HTML into all requests [1].

I also wrote a tutorial on how you can build an infecting proxy too [2]. Doesn't work anymore though since HTTPS is everywhere. Thank god

[1] https://blog.haschek.at/2015-analyzing-443-free-proxies [2] https://blog.haschek.at/2013/05/why-free-proxies-are-free-js...

dalbasal5y ago· 1 in thread

This is apropos...

Google's old link-based authority algorithm, pagerank, isn't analysing the same web anymore. I think there's barely any signal in links these days.

The first major event was Google itself. Once you use something as a metric, it becomes currency. SEO vs anti-spam became a defining cat and mouse game. This kind of stuff was born then, and antispam was meant to curb it.

The second major event was user generated content. The old link pages and blogrolls die slowly. Comments, twitter, and such become the way links are shared. High signal, but extremely spam prone. Google tapped out of this early, and mostly ignore user generated content.

The third major event is facebook, and facebook like ways of doing things. This made most regular people's content unindexable. Search for esoteric keywords used to return a lot of forum results. Still does, to an extent. The thread is usually years, or decades old. What's left on the open web is a subset, a non random subset.

Wikipedia is one of the last sites that does "hypertext" the way pagerank assumes the web works.

In any case, I feel like search (or what search used to be) is in decline. There isn't as much web to search anymore, in a sense. The broad brush way of doing antispam (eg user generated content is just ignored) makes more sense. Why deal with all that noise/spam, just to search what's left of the old web.

What's left? User behaviour, a la analytics. That makes for more feedback loops and winner takes most dynamics. Localisation became localisation to your bubble. Meanwhile "officialness" measures aren't against google's ethic/aesthetic anymore. They got burned by the "fake news^" crisis, and the quick fix was officialness. In for a penny. In for a pound.

Meanwhile, web search is increasingly just another thing that google search does. It searches "your" data, content of your devices, search history and NN generated whatnot. It searches news, ads, returns answers to questions, does math... There's nothing new about seo scams, antispam just isn't Google's primary solution anymore. Just default to other ways of returning results.

I'm calling it. Web search is dead. Long live the new websearch.

^Circa 2015 usage, not the current

ricardo815y ago

>first major event

IIRC with PageRank there were very specific values associated with 'toolbar PageRank', e.g. a PR7 link could be sold for $1K a month. Understandable because at that time there was no context to PageRank at all, it was simply about being linked to by an "authority". This was 20 years ago though.

lurquer5y ago· 1 in thread

No great fan of Google, but a large component of the problem is the Library of Babel phenomenon: there’s just too much crap being published.

Let’s face it... the early internet was interesting because the only people who could use it (and publish on it) were smart eccentrics. That was its charm. The technological hurdle served as the curator: you might have been a crazy white supremacist, anarchist, conspiracy theorist, or ‘expert’ in how to grow radishes or some other bizarrely eclectic field... but all of them were necessarily a bit smarter than the average bear just by virtue of knowing how to host content and access it; not a trivial task in the late 90’s.

Maybe it’s time to think up some convoluted alternate network that is a royal pain-in-the-ass to use. Perhaps there the eclectic and useful content creators will once again arise (and searching their trove will be a snap as most everything there will be fresh, unique, and interesting.) It will exist, I suppose, for a few years before tools are made to enable grandma to easily use it.

chrisfrantz5y ago

I think that’s somewhat the promise of Web 3.0 at this point. Painful to use and relatively empty. However, it’s mostly people hyping random crypto instead of actually creating value.

janmo5y ago· 1 in thread

It probably started with the guy adding something like "Edited using XXXX Editor tool" to make himself some publicity. Seeing that it worked he started selling those backlinks for a fortune.

slim5y ago

Circa 1999 I was running a web design studio. We added that link to all the websites we designed, then the next logical step was to make it link to a page with our entire portfolio which in turn linked to our website. That boosted the SEO of all our customers, and in turn boosted ours exponentially.

CR0075y ago· 1 in thread

I believe that Google search quality has degraded a ton. Not surprised here.

By the way, I develop proprietary software. I hope that someone at Google reads this and stops indexing all those pirate websites where people steal from others. Not torrents, I'm talking about those websites where they even sell you paid access to stolen stuff.

Seriously, Google? You can't filter "nulled"?

linuxfan20215y ago

That must suck. All my life I've developed open-source software so I don't mind about people sharing it, but to think that people would not only take it, but actually SELL it for their own profit? People are lame.

munk-a5y ago· 1 in thread

Imagine my surprise to find that the scammy site at the top of searches isn't W3Schools - that cesspool of terrible references.

Once I discovered that everything I would ever need was better explained on MDN, my life as a web developer strongly improved.

forgotpwd165y ago

Can you provide an example of terrible reference in W3Schools?

HelloNurse5y ago· 1 in thread

Lately I've seen many automatically generated trash pages in high-ranking Google results, typically copied or "AI"-generated plausible text and spam links, suggesting that gaming Google ranking algorithms is a solved problem.

b0afc375b55y ago

Yes, those stackoverflow clones and github clones are really annoying. You think someone might finally have an answer to your problem, but it's just a copy-paste of the stackoverflow you've read previously.

helsinkiandrew5y ago· 1 in thread

What I don't understand is why does Google continue to use metrics that are so easy to fake and game.

Surely some kind of fairly trivial NN/Not very deep learning system can classify HTML content so that out of context links (like "Learn how to solve a Rubic Cube" in a Seventh Day Adventists sabbath lesson) and content that is copied is ignored or marked down.

Whilst I'm sure GPT-3 could be used to create more realistic looking fake content - this would eliminate 99% of the script kiddies creating low value SEO spamming sites.

nickodell5y ago

Are non-sequiturs always malicious? For example, suppose you have a news site, and it has a story about Ukraine, followed by a story about school shootings. Even if two links next to one another are unrelated, that doesn't prove that they're not genuine.

dmje5y ago· 1 in thread

Every time I think about developing a product I'm put off by the hell that is SEO. The whole landscape is just horrible, full of snake oil sellers. It's not like the end result is any good either: man, Google results are so fkn bland these days. You stand zero chance of finding anything interesting, it's all just MONETIZED in such a boring way. Yes, I want the old web back.

aembleton5y ago

Ignore the SEO and concentrate on the product. You can worry about SEO later, there are even specialists that can help you with that.

mjthompson5y ago· 1 in thread

Just an observation: your competitor site is using AMP pages,* while you don't appear to be. I suspect without knowing that Google take this into account in ranking.

https://en.wikipedia.org/wiki/Accelerated_Mobile_Pages

* forgive my RAS syndrome

caspiiOP5y ago

I know. But I'd rather shoot myself in the foot than use AMP.

ziftface5y ago· 1 in thread

While I think this is extremely shady and will avoid ever using a tool like this in the future, does it actually break google's TOS? It seems like a valid defense could be made.

shanecleveland5y ago

This is a valid question. Though, I would argue in this case that they have found a loophole more than anything, if they are not in violation of the TOS.

As others have pointed out and the author acknowledges, he is technically injecting links when his users embed their scoreboard on their website through an auto-included link-back to his site.

Now, I don't frown upon this. It is not deceptive and its placement is more than relevant.

The same cannot be said for the scheme the author uncovered. But whether it is violating Google's TOS is another question. I'm not sure of the answer.

lifeisstillgood5y ago· 1 in thread

So I just tried one of these sites and I cannot reproduce the scam: my output from htmltidy.net seems to work fine and I cannot find any weird backlinks.

Any notes on how to reproduce?

bombcar5y ago

Keep trying, it seems to only do it sometimes. I’ve used them and am always checking my code and have seen it a few times.

Maybe clear cookies and try from a different browser?

thar3275y ago· 1 in thread

How does ahrefs.com get the information on any website's backlinks? This is more interesting than the original article itself

aembleton5y ago

With web crawlers. They index the web to give you this information: https://ahrefs.com/big-data

Mauricebranagh5y ago· 1 in thread

WTF is a html cleaner and why would you use one.

aembleton5y ago

It can be used to simplify HTML, removing comments, attributes, classes or anything you like. You would use it to simplify and shorten your HTML.

Here's an example of one https://html-cleaner.com/
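
For a rough idea of what such a tool does internally, here is a minimal sketch built on Python's standard `html.parser`. It drops comments and every attribute that is not on a small keep-list. The `SimpleCleaner` class, the `clean_html` helper and the keep-list are assumptions for illustration, not how any of the sites mentioned here actually work.

```python
from html import escape
from html.parser import HTMLParser


class SimpleCleaner(HTMLParser):
    """Re-emit HTML without comments and without attributes outside keep_attrs."""

    def __init__(self, keep_attrs=("href", "src", "alt")):
        # convert_charrefs=False keeps entities such as &amp; untouched in text.
        super().__init__(convert_charrefs=False)
        self.keep_attrs = set(keep_attrs)
        self.out = []

    def _kept(self, attrs):
        parts = []
        for name, value in attrs:
            if name in self.keep_attrs:
                parts.append(' {}="{}"'.format(name, escape(value or "", quote=True)))
        return "".join(parts)

    def handle_starttag(self, tag, attrs):
        self.out.append("<{}{}>".format(tag, self._kept(attrs)))

    def handle_startendtag(self, tag, attrs):
        self.out.append("<{}{} />".format(tag, self._kept(attrs)))

    def handle_endtag(self, tag):
        self.out.append("</{}>".format(tag))

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append("&{};".format(name))

    def handle_charref(self, name):
        self.out.append("&#{};".format(name))

    # handle_comment and handle_decl are left at their no-op defaults,
    # so comments and doctype declarations are dropped from the output.


def clean_html(html):
    cleaner = SimpleCleaner()
    cleaner.feed(html)
    cleaner.close()
    return "".join(cleaner.out)


SAMPLE = '<p class="intro" style="color:red">Hi <!-- note --><a href="/x" id="l1">there</a> &amp; bye</p>'
print(clean_html(SAMPLE))  # <p>Hi <a href="/x">there</a> &amp; bye</p>
```

Running something like this locally means the output contains only what went in, minus the stripped parts.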

1 more reply

FridayoLeary5y ago· 1 in thread

Several points; the title is slightly misleading, i initially thought (in my ignorance) that OP was referring to a company employee, also this article is surprisingly open in 'naming and shaming' his competitor.

caspiiOP5y ago

Yikes, should I have shown more discretion in naming the competitor?

4 more replies

lumpa5y ago

Amazing how such a simple approach can achieve content injection on a diverse network of unrelated websites, to the point of raising the profile of the vector and increasing the chances of further spread.

I hope someone figures out which other campaigns were run with these tools. Also, whether you can find output with the link injections in source code, like on GitHub or distro packages.
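If you want to check your own pages, one simple approach (a sketch, not a forensic tool) is to diff the links in what you fed the editor against the links in what it gave back. The function names and the `example` URLs below are invented for illustration.

```python
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collects the set of href values on <a> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.add(href)


def extract_hrefs(html):
    collector = HrefCollector()
    collector.feed(html)
    return collector.hrefs


def injected_links(before, after):
    """Return links present in the tool's output but not in the input."""
    return sorted(extract_hrefs(after) - extract_hrefs(before))


before = '<p>Scores: <a href="https://mysite.example/">home</a></p>'
after = before + '<p><a href="https://unrelated.example/rubiks">solve a cube</a></p>'
print(injected_links(before, after))
```

Given bombcar's report that the injection only happens sometimes, a check like this would need to run on every output, not just once.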

RileyJames5y ago

Totally agree with all the comments here, seo broke google, and they don't care. Probably sells more adwords in the end.

I found uBlacklist from this thread, and the subscription functionality enables some collaborative effort.

So I've started making a list, but unfortunately there aren't many uBlacklist subscription lists out there yet.

Be interested to see how far this could go: https://github.com/rjaus/awesome-ublacklist/

baby5y ago

Same story for Chrome a while back. I formatted my father's computer because he had a bunch of malware. The first thing he did was to google “chrome” and download the first result. Which was an ad. Which was malware.

chrischen5y ago

Similar to “my ip address” https://news.ycombinator.com/item?id=27415897.

Google just seems to give way too much weight to domain name matches with the search keyword.

slugiscool995y ago

Honestly kind of a brilliant SEO strategy. Makes me nervous to use any free online tool

bluedino5y ago

Plus google returns results that link to shit like http://edva.implantologiadentalecroazia.it/somewhat-related-...

And then you get redirected to some prize-winning spam site.

I love getting a search result that includes Google Books because those are usually useful. That’s what Google was best at, bringing in things that weren’t regular web pages.

forgotpwd165y ago

Scummy but pretty smart setting up such a network of sites. Even more reverse figuring it out. Bravo to the author.

Wronnay5y ago

https://web.archive.org/web/20210608085551/https://casparwre...

przemub5y ago

I am a programmer but it gets harder and harder to find good results for anything outside my niche, not to mention outside programming. Are there any alternatives being worked on? Googling increasingly feels like a complete waste of time.

bredren5y ago

> Now if you are feeling very magnanimous, you could argue that the editor is a freemium tool, and that added links are how you pay for the free version.

This unknown exchange of value for “free” products and services is what everyone from Facebook and Google down to malware-like browser extensions do to extract difficult-to-acquire resources.

People don’t understand how their personal data, internet connection (residential proxy network node), or in this case, publicly displayed website are being monetized or used indirectly for monetization.

People don’t know or are tricked into allowing themselves or their resources to serve as an ugly cost externality to some other clean-looking business endeavor.

mpva5y ago

You should report this to Google; it's clearly a huge violation of the terms of service.

napolux5y ago

Worked with SEO until 2020. It's really a scam. I know people making 1000$ per month by just generating a bunch of interlinked domains with semi-random text like: 'phone number $randomNumber'.

gitgud5y ago

> Now if you are feeling very magnanimous, you could argue that the editor is a freemium tool, and that added links are how you pay for the free version.

Well, unfortunately this is basically how every freemium tool works. They have some way of advertising, in exchange for free use of the tool.

Even reputable CMS tools like WordPress include back links to wordpress on a new site and themes.

Although, this is much less common with open-source free tools, as the community resists these kinds of changes.

No such thing as a free lunch!

tcarambat10105y ago

The site has been hammered down by google now and will be unranked since this story broke.

https://html-online.com/editor/

In case you cannot view it the banner across the site says now "Goodbye!

This site has been penalized for unnatural link building and will be removed from Google Search

Please bookmark if you wish to continue use of the site.

We are sorry and are working on fixing the problem to recover from the penalty. "

They are only sorry they got caught

mtnGoat5y ago

I've learned that almost any keyword related to saleable products has been gamed on Google. All I see are affiliate links and people who keep track of which website format Google likes this week.

More importantly, what I've also learned is that Bing search results are less of an affiliate link cesspool because fewer SEO spammers are working at gaming Bing's results.

raverbashing5y ago

So people edit html using these "online tools" (because <p> is hard apparently) and then nobody cares to proofread, and even whole paragraphs are added without anyone noticing.

Great.

Nobody cares about the content apparently. Nobody checks if the generated HTML makes sense. It's all about spinning the wheel.

Sigh.

ricardo815y ago

On the positive side I suppose, at least it was just regular HTML, could've been injected JS.

clydethefrog5y ago

I have been thinking, it almost seems we are going back to the old way you would browse the web - with homepages and a page with relevant urls to other places of the web. Except the homepage is now your instagram account and a linktree in the bio.

kontxt5y ago

Summary: https://www.kontxt.io/document/d/9LogKuJbXwihQd6nj0Y0iSP8h8t...

joeyoungblood5y ago

SEO professional here, I can't see the article due to the wide-spread outage but have reviewed the comments here, comments in closed FB groups, and injected links on sites from this tool, as well as the tool's admission of such links on their site. This tool is most likely owned by a publisher attempting to steal SEO link value from user websites; it is also possible they are selling these links outright or via a PBN system. This type of link building was a common practice in the early 00's, used by CMS theme developers and tool makers alike to gain link value. Google took a stand against "widget links", which is likely what these would be classified as, and as recently as 2016 even warned against their usage: https://developers.google.com/search/blog/2016/09/a-reminder...

A year later Google's John Mueller, a trends analyst who often also acts as a liaison between Google and the webmaster community, stated that Google might automatically apply a 'nofollow' attribute to these types of links, effectively killing their ability to siphon SEO link value to improve themselves: https://www.seroundtable.com/google-auto-nofollow-widget-lin...

We have noted in our agency research for clients several similar usages over the past few years that appear to be giving websites positive value instead of either being ignored or penalized, including a WordPress plugin that injects links on government and collegiate websites. The way Google assigns value based on links has changed quite a bit over the past 5 years and there is a chance they no longer penalize for widget links (unlikely) OR that their ability to detect them has degraded significantly (my guess is the latter).

One thing is for certain, Google absolutely retains the ability to manually devalue links and penalize a website for violating their guidelines. They do not enjoy negative press or community discussions on search quality like this one and in the past have taken swift action when such issues arose in the media.

At our agency we advise clients against this type of link building as it has no long-term value for a brand and could cause long-term pain instead. SEO should be used to help new brands gain a competitive advantage against more established incumbents such as a startup taking on Amazon or a new SaaS tool providing valuable data to an industry.

trungdq885y ago

Sorry for the plug, but this is the exact reason why I build https://devutils.app

Developers paste their data to online websites too frequently these days.

LoveMortuus5y ago

I personally think that SEO is kinda a waste of time. Because why would we have to adapt to a bot, a program. When the program should have to adapt to us.

The problem is that people will always try to game the system :/

marcodiego5y ago

If you're using google to find adequate tools, you're searching the wrong place. Wikipedia (even stack overflow) seems way better and has curated lists of open source tools for a particular purpose. Of course, wikipedia can be edited but IME, it is way better than google to find adequate tools.

If you're decided on googling for a suggestion of a tool, at least include "open source". Even if you're searching for proprietary tools, you'll probably find the traditional "it has better X, Y, compared to proprietary tool W" review.

ChrisArchitect5y ago

so lazy/terrible developers were using random tools online and not noticing injected spam links into their pages. Whatever man, you're getting beat because of it, better find a new strategy. There's tons of link spam and stuff out there but google's results are still good for real content for the most part if you build it up

joeyoungblood5y ago

This HTML editor site has now been penalized by Google for unnatural link practices as I had assumed they would be.

Giorgi5y ago

Ok, that's ultra-smart. Something you would think "how come I never thought of that?"

FranchuFranchu5y ago

I wonder how many similar tools are also like this. We can't trust search engines anymore.

ra33o5y ago

Begin to make your own content and make it public... Oh, like a blog. Like in the old days.

vagab0nd5y ago

This sounds like something a reinforcement learning agent would do to maximize income.

progx5y ago

google search is garbage, that's it.

We had a competitor who stuffs his page full of SEO garbage words. Our software is used 100 times more than his; more people search for our software, click it, use it and link to it. But who is in 1st place in the search results? Right, the SEO spammer, with the slower page, full of shiny SEO words that have nothing to do with the software.

@google i wait for working AI that detects such garbage sites!

codehawke5y ago

We need an open source Google, yesterday.

tartoran5y ago

Do away with google. Problem fixed

jmspring5y ago

The problem with an algorithm, you can find ways to game the algorithm.

funman75y ago

Don’t the kids say nowadays first link on google always sus

billyharris5y ago

This isn't the only website ranking in first position using such blackhat SEO tactics, but if everyone calls them out like you do, they surely won't remain at the top for long.

diveanon5y ago

Easiest and most effective google SEO is to just buy ads.

mastrsushi5y ago

>Some highly-ranked online tools for editing or “cleaning” HTML seem to be secretly injecting links into their output to push themselves and affiliated sites up the search engine rankings.

You can't pretend this isn't funny as fuck lol.

sova5y ago

While Google still has market dominance, I wonder if privacy-centric search engines will be the future. I am considering putting substantial effort towards such a goal, you can sign up to learn about when at puubl.com [1]

[1] https://www.puubl.com/

ilamont5y ago· 47 in thread

Same story for various Wordpress plugins and widgety things that live in site footers.

XorNot5y ago

duskwuff5y ago

4 more replies

gerdesj5y ago

"Google has turned into a cesspool."

That's a bit harsh but I agree that it is starting to fail to live up to the expectations I had with Google when it came out and destroyed Altavista in a spectacular shower of sparks.

Could I tender: "uBlacklist" as a stop gap, amongst others as we await Google being given a right old kicking?

oska5y ago

I appreciate a lot of what you're saying in this comment but I disagree with this sentiment:

> not the Ads - that's fine

Spooky235y ago

I don’t think Google is the cesspool, I think Google is a search engine for an internet that is the cesspool.

We’re moving to the vision of information services that were pioneered by AOL, Prodigy, etc. Honestly, we’re there already.

1 more reply

smegger0015y ago

I wish I could have 2010 Google search as an alternative to 2021 Google search.

3 more replies

emptyparadise5y ago

I'm amazed that there isn't anything like uBlock Origin for search results.

3 more replies

p5a0u9l5y ago

Comparing Google now to Alta Vista is not very helpful. They don't get to rest on their laurels. Search is less helpful now, and it's not clear to me that they care enough to do something about it.

1 more reply

thaliaarchi5y ago

    ! Hide low-quality results on DuckDuckGo
    duckduckgo.com##[data-domain="w3schools.com"]
    duckduckgo.com##[data-domain$=".w3schools.com"]
    duckduckgo.com##[data-domain="w3schools.in"]
    duckduckgo.com##[data-domain$=".w3schools.in"]
    duckduckgo.com##[data-domain="download.cnet.com"]
    !! Stack Exchange mirrors
    duckduckgo.com##[data-domain="exceptionshub.com"]
    duckduckgo.com##[data-domain="intellipaat.com"]
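If the list gets long, the lines can be generated rather than typed. This is just a sketch that reproduces the pattern above: one rule for the exact domain and one `$=` rule for its subdomains. It assumes DuckDuckGo's result markup still carries the `data-domain` attribute these filters rely on, and `ddg_hide_filters` is a made-up helper name.

```python
def ddg_hide_filters(domains):
    """Yield cosmetic filter lines hiding a domain and its subdomains."""
    for domain in domains:
        # Exact match, e.g. w3schools.com
        yield f'duckduckgo.com##[data-domain="{domain}"]'
        # Suffix match, e.g. www.w3schools.com
        yield f'duckduckgo.com##[data-domain$=".{domain}"]'


print("\n".join(ddg_hide_filters(["w3schools.com", "pinterest.com"])))
```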

raverbashing5y ago

Great idea. Though I've noticed DDG promotes "blogspam" articles more often than the authoritative sources.

Let's say, if I search for a python builtin library, I want to go to the python website, not some "Python 101" blog post about it.

bassdropvroom5y ago

Great tip! I've been using DDG's official addon but this means one less addon. Thanks!

zem5y ago

pinterest.com would clean up another large chunk of crap

elchupanebre5y ago

robbrown4515y ago

I wouldn't call that rational. There is no reason you can't apply human weighting on top of ML.

1 more reply

cookiengineer5y ago

> G can't fix it.

1 more reply

coliveira5y ago

jeromegv5y ago

Blatantly false that Google has "no recourse", Google can put on penalty and bring domains down.

humaniania5y ago

"Request manual review of search results" button?

1 more reply

wingworks5y ago

I really don't like how easy it is to fake a "new" article on Google. You can just re-publish an old article and stick a new date on it and Google takes it at face value and uses the new date.

sellyme5y ago

You can also do the opposite: post something today and say it was up on your site in 2003.

Makes it really difficult to find old pages about something that recently exploded in popularity, because the age filter just doesn't work.

BigJono5y ago

1 more reply

colordrops5y ago

Google Search is ripe for disruption. It's been over 20 years now and they are not dynamic or interesting at all anymore.

LeoPanthera5y ago

I still think that the "Yahoo!" style web directory is a good model. A catalogue of hand-curated links has increasing value as the quality of Google results goes down.

Edit: ...and using the search box results in a 404 so I guess it's really dead huh.

Edit 2: Apparently this is the successor! https://curlie.org/en

[1]: https://dmoz-odp.org

3 more replies

lemmiwinks5y ago

The irony being that 20 (more like 25?) years ago Yahoo search was ripe for disruption... by Google :)

Halt and Catch Fire [1] (As a nerd, I can say it's one of the few TV series that got the hackers spirit correctly) had a few episodes about the Google disruption.

Like some people often say here, things come and go in circles...

[1]: https://en.wikipedia.org/wiki/Halt_and_Catch_Fire_(TV_series...

rickspencer35y ago

Neeva.com

I am in the pre-release program. The hardest initial thing to get used to was not immediately scrolling down to the bottom to avoid all of the spam.

I suspect that their methods are not much different than Google, but the experience has been so much better.

3 more replies

emodendroket5y ago

It's so easy to do better! Just look at what a rousing success Cuil was.

1 more reply

duskwuff5y ago

*: and themes for other web applications, but mostly WordPress these days

normac25y ago

I bet it's that we do different types of searches.

jamiek885y ago

Ugh Pinterest results.

wlesieutre5y ago

I swear, Pinterest must have employees working undercover in the Image Search team for Google to have let them destroy image search results the way they have.

2 more replies

ajsnigrutin5y ago

newacct5835y ago

> Google has turned into a cesspool.

Mediterraneo105y ago

(I just checked and this copycat documentation site has, thankfully, now been pushed down a bit in DDG results.)

2 more replies

worble5y ago

3 more replies

jimbob455y ago

This is my view too. Yes, I’d love to go back to a time when Google’s algorithms were unknown enough for SEO to be futile but those days are gone and the problem isn’t limited to Google.

cookiengineer5y ago

I also noticed that Apple users see way more fake online shop results than Linux users, from the same IP, with regularly cleared browser cache and identical search terms.

In Germany, we have at least for hardware the "geizhals" website which is kind of an index for all kinds of electronics shops and they try to verify as much as possible.

ping_pong5y ago

luke2m5y ago

I don’t like google and don’t really want to defend it, but this is more of a lots of crappy websites problem than a google problem.

worik5y ago

Google, to justify its huge capital worth, should deal with that crap. Why else bother?

naikrovek5y ago

google isn't the cesspool, people who want to appear at the top of a list of search results are doing whatever it takes to create a cesspool, because that's what it takes to earn more money.

being willing to make other things in order to have more money always creates cesspools.

prepend5y ago

Google’s mission was “organize the world’s information and make it useful” and they are doing a poorer job now than historically.

Of course there are scammers, that’s part of what makes organizing so hard.

Cynically, I think Google is worse at filtering scammers because they care less now. Half the page is ads so they make money either way.

Retric5y ago

Google is a cesspool because it’s their job to fix it and they failed. I stopped using Google search because of how far it’s fallen.

beepbooptheory5y ago

If it's the only way to make money, it doesn't really feel like the burden is on the people to make a cleaner pool.

1 more reply

cyanydeez5y ago

dont forget adding quotes to things to stop the random "did you mean to spell this?" crap

basically, like everything in modernity, its a race to the bottom of the infinite dullards of popular

paulpauper5y ago

I wish duckduckgo had better results. google still better

torbital5y ago

I can't remember the last time I searched on Google without appending "reddit" to the end.

lupire5y ago

> Half the time I find myself having to do ridiculous search contortions to get somewhat useful results - appending site: .edu or .gov

A great opportunity for students and public servants to sell premium URLs.

cybice5y ago· 22 in thread

mercury_craze5y ago

nimbleal5y ago

Fortunately with GPT3 and the like I’d imagine this approach will soon have had its day. Not that I’m optimistic about whatever will replace it.

1 more reply

ridaj5y ago

Much of it driven by cargo cult SEO, throwing everything and the kitchen sink into the page in the completely unproven hope that it'll somehow game the rankings.

Jenk5y ago

lvncelot5y ago

Even (or rather, especially) if every SEO advice is correct, it still means that Google effectively has a lot of control over the shape of the modern web, alone through indirect pressure via SEO.

spiderfarmer5y ago

I am running every piece of SEO advice as an experiment before implementing it across my network and a lot of the advice actually brings results.

hliyan5y ago

pbhjpbhj5y ago

I didn't check but to my recollection that domain is pretty old, domain age is supposed to be a principal metric for trust (which in turn is a strong signal for page rank). So, ...

I mean it's pretty reasonable, if a site has been around a long time it's going to be generally 'good'.

ricardo815y ago

Some of the technical SEO is good though, like simply making the page crawlable and content being in a logical order.

The "fiddle with H1" or "write X amount of words" or "buy Y number of links with a % of anchor text" is silly.

tomcooks5y ago

> Some of the technical SEO is good though, like simply making the page crawlable and content being in a logical order.

Semantic HTML has been created to help screen readers and browsers understand content organization, it having been hijacked by SE is just a side-effect.

2 more replies

lvncelot5y ago

hyperhopper5y ago

This will likely be true until there is a method of finding webpages that is not based on automated scraping or the page itself.

Sharlin5y ago

2 more replies

novaleaf5y ago

worse: paid content farms / ai to generate crap "articles" by the boatload, targeting every organic search term 5 different ways.

tootie5y ago

Cthulhu_5y ago

On the other hand, Google over the years has tweaked their algorithms and recommendations to match up with what makes a good site, in terms of content and markup.

growt5y ago

Human readable Urls don't sound that bad.

loonster5y ago

Extremely useful for when a link dies and there is no useful archive.

apples_oranges5y ago

As a mobile developer (sometimes) I rarely/never see apps that don't have Google SDKs bundled either..

atatatat5y ago

That sounds...like a great reason not to get into "mobile" dev and stick to PWAs.

pjc505y ago

Well, yes, because googlebot is the gatekeeper of popularity and income for websites. Got to appease the decision maker.

Mauricebranagh5y ago

Apart from stuffing 200 links in the footer why is this bad?

shmiga5y ago· 19 in thread

SEO is so broken, it's not about website content or website quality. It's about how much money you pay to some punks - "SEO experts" who are hacking a system. I'm so sick of that.

mrtksn5y ago

The format goes like this: Lately people are searching for XYZ but is it safe to search for XYZ? What experts say for XYZ? To find out continue to read our article.

Then it's followed by wall of text made of keywords(in sentences that don't make sense), if you are lucky there would be the opening hours(which are often not accurate) somewhere down the text.

dspillett5y ago

I've not seen it for opening times (UK here) but the same pattern is very visible elsewhere.

1 more reply

eino5y ago

> It's not like you are going to switch to Bing

3 more replies

Avamander5y ago

> Then it's followed by wall of text made of keywords.

I've noticed a rise of that as well. With some searches such spam is all I've received. But that's really a problem in all languages Google supports I think.

There's even malware that infects websites and generates such content, not sure what's the point of that. Anyone knows?

1 more reply

lodovic5y ago

> It's not like you are going to switch to Bing?

1 more reply

1vuio0pswjnm75y ago

raxxorrax5y ago

It would need an option to ignore any form of news media in search results.

apples_oranges5y ago

I'm trying alternative search engines from time to time and they are much weaker than Google. So yeah, I'd bet on them to improve stage. The others first need to catch up.

1 more reply

topicseed5y ago

True to some extent but it is improving with Google updates. Now, there is a way to go still, and some legit websites get hit by updates unfortunately, but overall fewer and fewer scams pass through.

SEO used to be extremely gameable (seniority of site, keyword stuffing, backlinks), but these levers aren't as obvious now, if at all.

shmiga5y ago

That is great, but can google change their algo to some point where it works differently? Their ad business is there in the web.

1 more reply

maze-le5y ago

bungle5y ago

Getting stuff on PageRank feels like a game. Getting stuff out of Google feels like a game too. To the point that moving to an alternative feels worth it, at least to try.

WarOnPrivacy5y ago

Everyone wonders about that. Googling most phone numbers returns nothing but pages of spam links.

A decade from now, Google will have made no improvement.

DelightOne5y ago

That's why for certain things Google is useless. Have to add certain keywords to avoid the SEO content to get comparisons, reviews, forums.

One day Google may introduce multiple search rankings, where one of them is SEO and another is the "useful things". But I don't hold my breath.

mikevin5y ago

I still do this but I'm 99% sure Google and DDG max out at around 3 keywords these days. I just get results for the top 3 SEO keywords, no matter how much I try to refine my search.

Maybe it's just because I'm searching for technical stuff but DDG and Google are both a big source of frustration for me,

/rant

alpaca1285y ago

It's also so frustrating to get results for websites which present themselves in the search results with "Results for <your query>"...only to show "no results found" when you actually click on them.

Good thing /etc/hosts has no size limit.

ma2rten5y ago

The useful thing would instantly become useless because people would start gaming it.

2 more replies

enknamel5y ago

Or .... how much money you pay Google. This is working as intended for a free search engine.

AtNightWeCode5y ago

qeternity5y ago· 8 in thread

I have little to no experience in SEO. Does Google have a history of weighing in on situations like this and manually penalizing bad actors? If so, I would love a link to read about.

RileyJames5y ago

I agree with some of the other comments, googles actions on SEO are always shrouded in a little "algorithmic" mystery. That said, they do apply "manual action" penalties to individual websites.

Using google search console you can determine if a manual action has been applied to your own website: https://support.google.com/webmasters/answer/9044175?hl=en

Rather than determine the ranks, these actions remove / punish offending websites from the ranks, effectively making room for 'good' actors.

vgeek5y ago

jboynyc5y ago

Here you go: https://www.mattcutts.com/blog/

bliteben5y ago

https://www.mattcutts.com/blog/join-the-us-digital-service/

wow that's amazing, I guess I sort of quit reading blogs like this when all the RSS readers died.

1 more reply

aww_dang5y ago

https://support.google.com/webmasters/answer/9044175?hl=en

cocoafleck5y ago

https://www.wsj.com/articles/how-google-interferes-with-its-... I'm sure that the title tells you that the article has an opinion (not unbiased), but I think it is a useful source.

silviot5y ago

AtNightWeCode5y ago

Not true, you can get penalized and you may be notified about it in the Google Search Console.

1 more reply

shanecleveland5y ago· 7 in thread

Could it be that Scorecounter is paying for their links to be embedded, as opposed to them being the owner/developer of both sites? If so, and provable, can they be flagged in some way?

Doesn't say much for Google's ability to determine relevancy in linking or recognizing suspicious link growth. Or perhaps it just takes some time ...

topicseed5y ago

Google used to impose manual penalties for unnatural links BUT this gave rise to, you guessed it, competitors buying unnatural links for their enemy and waiting for the penalty to be given.

Nowadays, unnatural links are mostly ignored.

duskwuff5y ago

dstick5y ago

shanecleveland5y ago

Clearly. But I guess it is not outright proven that they are technically buying links. Though they would likely fall under some form of bad behavior in Google's eyes.

And, buying or otherwise, I am not sure what the mechanism is for bringing this to Google's attention.

I doubt there is another acquisition channel for a project like this that would compare to SEO (and not just Google).

enriquto5y ago

> paying for links is still very much against Google’s policies.

quite a strange thing to say about a company whose business is based on selling links (to ads)

1 more reply

vitus5y ago

> as opposed to them being the owner/developer of both sites?

If they're not owned by the same entity, then this blog post is rather odd: https://html-online.com/articles/scoreboard/

(To be fair, that entire blog seems odd...)

shanecleveland5y ago

commandlinefan5y ago· 6 in thread

onion2k5y ago

There was a time when, if you wanted to put a site online, you (or somebody that represented you) made a point of understanding everything that went into it.

It's annoying, and sad, for those of us who care and consider ourselves professional. But it definitely wasn't any better years ago.

julianz5y ago

1 more reply

adventured5y ago

Moru5y ago

What they value is the users, not the platform as such.

bentcorner5y ago

Feels similar to "Reflections on Trusting Trust".

Could someone inject links into content in such a way that you cannot find the link in your own source or even your hosting stack?

bombcar5y ago

You could modify the web server to modify the code in a similar way to the reflections paper.

But even more imaginative would be to work it into the kernel or the ssl layer somehow.

didip5y ago· 5 in thread

The old Google would have hunted these down mercilessly (Panda update in 2011). What happened to Google these days?

cirno5y ago

progx5y ago

That is true! Why should google do something? They say "use ads", to make money.

Use other search engines is the only way to do something.

rchaud5y ago

"SEO scammers got you down? Call Google Ads now!"

adrr5y ago

2 more replies

rondrabkin5y ago

DaveExeter5y ago· 4 in thread

This is clever!

https://www.google.com/search?q=%22Learn+how+to+solve+a+Rubi...

jesseryoung5y ago

I find it hilarious that this made its way into an Amazon listing for some waterproofing chemical. https://web.archive.org/web/20210607233655/https://www.amazo...

duskwuff5y ago

I find it even funnier that it appears in a research paper:

https://www.researchsquare.com/article/rs-8615/v1

(It's on page 24, at the bottom of the References section.)

2 more replies

autorun5y ago

It's even freaking funny that "SEO" appears as a Related search at the bottom of that search url. It has nothing to do other than a lot of people (we) come from a SEO article

caspiiOP5y ago

Amazing!

31tor5y ago· 3 in thread

Google is getting worse and worse. It's harder than ever to find real information. All you get is SEO scams trying to lure you in and sell you stuff. It's tragic. I miss the old internet.

Johnythree5y ago

pbhjpbhj5y ago

1 more reply

jaclaz5y ago

dataviz10005y ago· 3 in thread

[0] https://static.googleusercontent.com/media/www.google.dk/en/...

lloyddobbler5y ago

Aside from that, agree 100% with your other assessments.

1 more reply

LocalPCGuy5y ago

2 more replies

caspiiOP5y ago

I only switched Google Analytics off last week! I had it on for the whole time before and it made no difference.

1 more reply

superasn5y ago· 3 in thread

Google really needs to come up with a better way than backlinks to rank sites.

It's 2021 and surprisingly for all the billion dollar A.I. it can still be gamed with a bunch of unrelated links with little or no connection from the article to the site.

dafelst5y ago

Backlinks are not as important as Google would have you think, they are a pretty weak ranking factor except in the deep tail of the web.

Google (and others) keep up the narrative that they're important so that black and grey hat SEO folks keep focusing effort in the wrong places.

Source: ran the web spam detection team on a different well known search engine

somehnguy5y ago

>Most sites which have these "list of 10 XYZ" are just similar money making scams yet they rank so highly on Google.

marcodiego5y ago

I don't think there are incentives for a change. The way it is done now is probably more profitable and the competition is doing exactly the same.

fogof5y ago· 2 in thread

> and my personal favorite: a blog post on Kaspersky.com

Wow, embarrassing for Kaspersky as a computer security focused site to be a victim of this.

When I searched for "Rubiks" as it said to do, I couldn't find it though. Has the Kaspersky post been changed?

caspiiOP5y ago

Yeah, looks like they removed it.

zulban5y ago

Embarrassing but understandable. Computer security isn't about perfection, which is impossible. It's about vigilance, resilience, backups, and responding quickly. I'd say they nailed it, here.

alphabetting5y ago· 2 in thread

It's worth noting the scam site is the top result in Bing and DuckDuckGo as well

drzaiusapelord5y ago

topicseed5y ago

But in the meantime, yep... It sucks.

nostromo5y ago· 2 in thread

advisedwang5y ago

They may have abandoned the actual page-rank scoring system (a quite specific implementation) without wholly abandoning the idea of using "who links to who" as a quality signal.

Lammy5y ago

Those can both be true. PageRank is a relic of a time when search engines more consistently returned the same results for the same query. These days we're all filter bubbled with personalized results

stefan_5y ago· 2 in thread

I'm so happy the author is an ethical SEO scammer and will not stoop to these tactics.

alvah5y ago

What would you have the author do? Market his business or build it and let them come?

caspiiOP5y ago

Big difference

jrochkind15y ago· 2 in thread

> The creators of Scorecounter also made an online HTML editor

Or paid the entity running the malware HTML editor. It's probably injecting links to a variety of sites who paid them for placement.

discmonkey5y ago

janmo5y ago

He's probably selling that service on blackhatworld

fnord775y ago· 2 in thread

I searched for my local USPS store today. Every result was some SEO crap. No usps.gov result on the first page.

bombcar5y ago

The United States Post Office in Local Town USA is a great place to buy stamps. Find out about Local Town USA mail service here at randomlocaltownmailservice dot com.

dpedu5y ago

Well, it's USPS.com, so... :-)

lopatin5y ago· 2 in thread

Edit: Wow, this is much bigger than just those two sites. Looks like half the internet is down. https://downdetector.com/

Wronnay5y ago

Seems like this is connected to the https://fastly.com outage ;-)

fluential5y ago

Terraform.io and Reddit.com show the same error message; looks like a CDN issue?

imaginamundo5y ago· 1 in thread

Well, that is unfortunate.

geek_at5y ago

I also wrote a tutorial on how you can build an infecting proxy too [2]. Doesn't work anymore though since HTTPS is everywhere. Thank god

[1] https://blog.haschek.at/2015-analyzing-443-free-proxies [2] https://blog.haschek.at/2013/05/why-free-proxies-are-free-js...

dalbasal5y ago· 1 in thread

This is apropos...

Google's old link-based authority algorithm, PageRank, isn't analysing the same web anymore. I think there's barely any signal in links these days.

Wikipedia is one of the last sites that does "hypertext" the way pagerank assumes the web works.

I'm calling it. Web search is dead. Long live the new websearch.

^Circa 2015 usage, not the current

ricardo815y ago

>first major event

lurquer5y ago· 1 in thread

No great fan of Google, but a large component of the problem is the Library of Babel phenomenon: there’s just too much crap being published.

chrisfrantz5y ago

I think that’s somewhat the promise of Web 3.0 at this point. Painful to use and relatively empty. However, it’s mostly people hyping random crypto instead of actually creating value.

janmo5y ago· 1 in thread

It probably started with the guy adding something like "Edited using XXXX Editor tool" to give himself some publicity. Seeing that it worked, he started selling those backlinks for a fortune.

slim5y ago

CR0075y ago· 1 in thread

I believe that Google search quality has degraded a ton. Not surprised here.

Seriously, Google? You can't filter "nulled"?

linuxfan20215y ago

munk-a5y ago· 1 in thread

Imagine my surprise to find that the scammy site at the top of searches isn't w3schools - that cesspool of terrible references.

Once I discovered that everything I would ever need was better explained on MDN, my life as a web developer improved considerably.

forgotpwd165y ago

Can you provide an example of a terrible reference in W3Schools?

HelloNurse5y ago· 1 in thread

b0afc375b55y ago

helsinkiandrew5y ago· 1 in thread

What I don't understand is why does Google continue to use metrics that are so easy to fake and game.

Whilst I'm sure GPT-3 could be used to create more realistic looking fake content - this would eliminate 99% of the script kiddies creating low value SEO spamming sites.

nickodell5y ago

dmje5y ago· 1 in thread

aembleton5y ago

Ignore the SEO and concentrate on the product. You can worry about SEO later, there are even specialists that can help you with that.

mjthompson5y ago· 1 in thread

Just an observation: your competitor site is using AMP pages,* while you don't appear to be. I suspect without knowing that Google take this into account in ranking.

https://en.wikipedia.org/wiki/Accelerated_Mobile_Pages

* forgive my RAS syndrome

caspiiOP5y ago

I know. But I'd rather shoot myself in the foot than use AMP.

ziftface5y ago· 1 in thread

While I think this is extremely shady and will avoid ever using a tool like this in the future, does it actually break google's TOS? It seems like a valid defense could be made.

shanecleveland5y ago

This is a valid question. Though, I would argue in this case that they have found a loophole more than anything, if they are not in violation of the TOS.

As others have pointed out and the author acknowledges, he is technically injecting links when his users embed their scoreboard on their website through an auto-included link-back to his site.

Now, I don't frown upon this. It is not deceptive and its placement is more than relevant.

The same cannot be said for the scheme the author uncovered. But whether it is violating Google's TOS is another question. I'm not sure of the answer.

lifeisstillgood5y ago· 1 in thread

So I just tried one of these sites, and I cannot reproduce the scam: my output from htmltidy.net seems to work fine and I cannot find any weird backlinks.

Any notes on how to reproduce?

bombcar5y ago

Keep trying, it seems to only do it sometimes. I’ve used them and am always checking my code and have seen it a few times.

Maybe clear cookies and try from a different browser?
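
If you want to check more systematically than by eyeballing the output, one option is to diff the set of link targets before and after running the tool. This is only a rough sketch using Python's standard `html.parser`; the function names and the sample `before`/`after` strings are made up for illustration and are not real output from any of these sites.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html):
    """Return the set of link targets found in an HTML string."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return set(collector.hrefs)


def injected_links(original_html, tool_output_html):
    """Return links present in the tool's output but not in the input."""
    return extract_links(tool_output_html) - extract_links(original_html)


# Hypothetical sample data, not captured from a real tool.
before = '<p>Read <a href="https://example.com/docs">the docs</a>.</p>'
after = ('<p>Read <a href="https://example.com/docs">the docs</a>. '
         'Learn to <a href="https://spam.example/cube">solve a cube</a>.</p>')

print(sorted(injected_links(before, after)))
```

Since the injection reportedly only happens some of the time, you would probably need to run the same input through the tool repeatedly and compare each result.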

thar3275y ago· 1 in thread

How does ahrefs.com get the information on any website's backlinks? This is more interesting than the original article itself.

aembleton5y ago

With web crawlers. They index the web to give you this information: https://ahrefs.com/big-data

Mauricebranagh5y ago· 1 in thread

WTF is an HTML cleaner and why would you use one?

aembleton5y ago

It can be used to simplify HTML, removing comments, attributes, classes or anything you like. You would use it to simplify and shorten your HTML.

Here's an example of one https://html-cleaner.com/
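
To make that concrete, here is a minimal sketch of the idea in Python, using only the standard library. It is not how html-cleaner.com or any other site actually works; the `HtmlCleaner` class, the `clean` helper and the default list of dropped attributes are assumptions for illustration. It drops comments and a configurable set of attributes and passes everything else through.

```python
from html import escape
from html.parser import HTMLParser


class HtmlCleaner(HTMLParser):
    """Re-emits HTML without comments and without selected attributes."""

    def __init__(self, drop_attrs=("class", "style")):
        # Keep entity references as written instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.drop_attrs = set(drop_attrs)
        self.out = []

    def _render_tag(self, tag, attrs, closing=""):
        kept = [(n, v) for n, v in attrs if n not in self.drop_attrs]
        rendered = "".join(
            f" {n}" if v is None else f' {n}="{escape(v, quote=True)}"'
            for n, v in kept
        )
        return f"<{tag}{rendered}{closing}>"

    def handle_starttag(self, tag, attrs):
        self.out.append(self._render_tag(tag, attrs))

    def handle_startendtag(self, tag, attrs):
        self.out.append(self._render_tag(tag, attrs, " /"))

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")

    # handle_comment is deliberately not overridden: the default
    # implementation does nothing, so comments are dropped.


def clean(html, drop_attrs=("class", "style")):
    cleaner = HtmlCleaner(drop_attrs)
    cleaner.feed(html)
    cleaner.close()
    return "".join(cleaner.out)


messy = '<p class="x" id="a">Hi<!-- note --> <b style="color:red">there</b></p>'
print(clean(messy))  # prints: <p id="a">Hi <b>there</b></p>
```

The point relevant to this thread is that a cleaner like this should only ever remove things; if the output contains links that were not in the input, something else is going on.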

FridayoLeary5y ago· 1 in thread

caspiiOP5y ago

Yikes, should I have shown more discretion in naming the competitor?

lumpa5y ago

I hope someone figures out which other campaigns were run with these tools. Also, whether you can find output with the link injections in source code, like on GitHub or distro packages.

RileyJames5y ago

Totally agree with all the comments here, seo broke google, and they don't care. Probably sells more adwords in the end.

I found uBlacklist from this thread, and the subscription functionality enables some collaborative effort.

So I've started making a list, but unfortunately there aren't many uBlacklist subscription lists out there yet.

Be interested to see how far this could go: https://github.com/rjaus/awesome-ublacklist/

baby5y ago

chrischen5y ago

Similar to “my ip address” https://news.ycombinator.com/item?id=27415897.

Google just seems to give way too much weight to domain name matches with the search keyword.

slugiscool995y ago

Honestly kind of a brilliant SEO strategy. Makes me nervous to use any free online tool

bluedino5y ago

Plus google returns results that link to shit like http://edva.implantologiadentalecroazia.it/somewhat-related-...

And then you get redirected to some prize-winning spam site.

I love getting a search result that includes Google Books because those are usually useful. That’s what Google was best at, bringing in things that weren’t regular web pages.

forgotpwd165y ago

Scummy but pretty smart, setting up such a network of sites. Even more so reverse-engineering it. Bravo to the author.

Wronnay5y ago

https://web.archive.org/web/20210608085551/https://casparwre...

przemub5y ago

bredren5y ago

> Now if you are feeling very magnanimous, you could argue that the editor is a freemium tool, and that added links are how you pay for the free version.

This unknown exchange of value for “free” products and services is what everyone from Facebook and Google down to malware-like browser extensions do to extract difficult-to-acquire resources.

People don’t know or are tricked into allowing themselves or their resources to serve as an ugly cost externality to some other clean-looking business endeavor.

mpva5y ago

You should report this to Google; it's clearly a huge violation of the terms of service.

napolux5y ago

Worked with SEO until 2020. It's really a scam. I know people making $1000 per month by just generating a bunch of interlinked domains with semi-random text like: 'phone number $randomNumber'.

gitgud5y ago

> Now if you are feeling very magnanimous, you could argue that the editor is a freemium tool, and that added links are how you pay for the free version.

Well, unfortunately this is basically how every freemium tool works. They have some way of advertising, in exchange for free use of the tool.

Even reputable CMS tools like WordPress include back links to wordpress on a new site and themes.

Although, this is much less common with open-source free tools, as the community resists these kinds of changes.

No such thing as a free lunch!

tcarambat10105y ago

Since this story broke, the site has been hammered by Google and will be removed from the search results.

https://html-online.com/editor/

In case you cannot view it, the banner across the site now says: "Goodbye!

This site has been penalized for unnatural link building and will be removed from Google Search

Please bookmark if you wish to continue use of the site.

We are sorry and are working on fixing the problem to recover from the penalty. "

They are only sorry they got caught

mtnGoat5y ago

More importantly, what I've also learned is that Bing search results are less of an affiliate link cesspool because fewer SEO spammers are working at gaming Bing's results.

raverbashing5y ago

So people edit html using these "online tools" (because <p> is hard apparently) and then nobody cares to proofread, and even whole paragraphs are added without anyone noticing.

Great.

Nobody cares about the content apparently. Nobody checks if the generated HTML makes sense. It's all about spinning the wheel.

Sigh.

ricardo815y ago

On the positive side I suppose, at least it was just regular HTML, could've been injected JS.

clydethefrog5y ago

kontxt5y ago

Summary: https://www.kontxt.io/document/d/9LogKuJbXwihQd6nj0Y0iSP8h8t...

joeyoungblood5y ago

trungdq885y ago

Sorry for the plug, but this is the exact reason why I build https://devutils.app

Developers paste their data to online websites too frequently these days.

LoveMortuus5y ago

I personally think that SEO is kind of a waste of time. Why should we have to adapt to a bot, a program, when the program should be adapting to us?

The problem is that people will always try to game the system :/

marcodiego5y ago

ChrisArchitect5y ago

joeyoungblood5y ago

This HTML editor site has now been penalized by Google for unnatural link practices, as I had assumed it would be.

Giorgi5y ago

Ok, that's ultra-smart. Something you would think "how come I never thought of that?"

FranchuFranchu5y ago

I wonder how many similar tools are also like this. We can't trust search engines anymore.

ra33o5y ago

Begin to make your own content and make it public... Oh, like a blog. Like in the old days.

vagab0nd5y ago

This sounds like something a reinforcement learning agent would do to maximize income.

progx5y ago

google search is garbage, that's it.

@google I'm waiting for a working AI that detects such garbage sites!

codehawke5y ago

We need an open source Google, yesterday.

tartoran5y ago

Do away with google. Problem fixed

jmspring5y ago

The problem with an algorithm is that you can find ways to game it.

funman75y ago

Don’t the kids say nowadays first link on google always sus

billyharris5y ago

This isn't the only website ranking in first position using such blackhat SEO tactics, but if everyone calls them out like you do, they surely won't remain at the top for long.

diveanon5y ago

Easiest and most effective google SEO is to just buy ads.

mastrsushi5y ago

>Some highly-ranked online tools for editing or “cleaning” HTML seem to be secretly injecting links into their output to push themselves and affiliated sites up the search engine rankings.

You can't pretend this isn't funny as fuck lol.

sova5y ago

[1] https://www.puubl.com/
