Open source code with profanity in comments is statistically better (opens in new tab)

(blog.desdelinux.net)

292 pointsdev_snd2y ago214 comments

214 comments

From the research paper:

> we calculate the swear factor as the number of swearwords divided by the lines of code

That's what I suspected. Assuming that most swear words will be contained in comments, what this is actually measuring is the ratio of comments to code. In other words, code that is more heavily commented is better.

I think we already knew this.

That said I would like to see a more critical analysis. First control for comment density. Then compare code quality to swearing in comments and also variable names.

tremon2y ago

It is anecdata, but I can confirm this is the case for my code.

I tend to focus more on documenting the surprising code paths, not the mundane. And when my code needs to do something special because some other component (library, hardware, API) has issues, there's usually some colourful language describing the sad state of the world outside my control.

taneq2y ago

Also any mature, well maintained code will have found a lot more of those truly ‘wtf’ bugs and edge cases, which often involve same colourful language when we finally figure them out.

balaji12y ago

that's *** brilliant I @#@#@ know that feeling

midoridensha2y ago

>...code that is more heavily commented is better. >I think we already knew this

Who's "we"?

In my many years of software development, I've found a very large fraction of developers use very few, or even zero comments, and it's getting worse. Just look at the posts below here: there's a bunch of people arguing that comments are useless or harmful. It's no wonder that software sucks so much these days, since apparently no one believes in documentation or code maintenance any more.

briantakita2y ago

> It's no wonder that software sucks so much these days, since apparently no one believes in documentation or code maintenance any more.

I think this comment explains why software gets worse in many cases:

> Of course nowadays, this is legacy nonsense. Everything uses UTF-8 for "char", and what doesn't is broken and terrible anyway. But the old ways stayed with us, and the stupidity of it as well.

The problem is the "legacy nonsense" tends to accumulate over time & as people depend on it, takes a long time to finally remove.

> They are so hilariously misdesigned and insufficient, I can't even fathom how this shit was _standardized_.

They did their best given their circumstances & abilities. Now we must forever pay the price.

> Several decades later, the moronic standard committees noticed that this was (still is) kind of a bad situation. Instead of fixing the situation, they added more garbage on top of it. (Probably for the sake of "compatibility").

At least they tried...

> All in all, I believe this proves that software developers as a whole and as a culture produce worse results than drug addicted butt fucked monkeys randomly hacking on typewriters while inhaling the fumes of a radioactive dumpster fire fueled by chinese platsic toys for children and Elton John/Justin Bieber crossover CDs for all eternity.

Yeah! Time to get back to work...

Credit to https://news.ycombinator.com/item?id=36626018 for pointing this out.

jancsika2y ago

> In other words, code that is more heavily commented is better.

It could also be that understanding code in any non-trivial project is likely to back the developer into a corner where they become frustrated and swear at the computer.

More importantly, the lack of swearing might be a sign that the devs lack the competence to know when they are cornered.

matheusmoreira2y ago

I think anger is a sign the developer actually cares about what they are doing. In my experience, people who don't care aren't at all irritated by the imperfections of the software they have to use, they just accept it, slay their dragon and move on. People who care tend to get very angry about what's ultimately philosophical matters.

1 more reply

darkerside2y ago

By the time you are swearing in your comments, I'm pretty sure you know you're cornered

Mathnerd3142y ago

He don't seem to use the swear factor anywhere. The actual statistical comparison (Table 3.1) is simply mean SoftWipe score of repos with swears (5.87) vs. mean SoftWipe score of repos with 4+ stars (5.41). The increase is due to 2-3 clusters of swear repos with SoftWipe score ~7.5 and ~20k lines of code. It seems like he deduplicated the repos based on URL, not content, and Github could have biased the results returned in the GitHub search, so I wonder if it is simply sample bias.

paper: https://cme.h-its.org/exelixis/pubs/JanThesis.pdf

tessierashpool2y ago

as long as we’re designing the ideal experiment for someone else to do, let’s throw in the commit messages as well.

I wouldn’t be surprised if code quality goes up with comment curses and down again with commit message curses.

0cf8612b2e1e2y ago

My favorite infamous example being the MPV C locale commit: https://github.com/mpv-player/mpv/commit/1e70e82baa9193f6f02...

You can really feel the author's rage at the state of the world.

1 more reply

Brian_K_White2y ago

The observation of the association is already valid since it doesn't try to say anything that no one can say.

Saying "control for comment density" presumes one knows how to even do that or how to even define it.

How do you decide that a given line of code or comment should weigh more or less than another?

If a codebase has both a lot of swear words and a lot of all other words, so what?

slowmovintarget2y ago

That assumes that most comments contain cursing.

comfypotato2y ago

No it doesn’t. If any more than 0% of comments contain profanity, then on average code with more profanity will be better.

1 more reply

mr_00ff002y ago

Isn’t commented code no longer considered a good idea in most companies?

I used to work for a bank and the policy is no comments unless absolutely necessary, because comments become out of date. Doxygen is the only real comments allowed.

flatline2y ago

It really depends on the comments. Best practice is to comment “why” something was done, not “what” is being done, or “how”. Every programmer can read code, most code should be pretty self-explanatory. But in any sufficiently complex routine, there are going to be some things that a programmer struggled to get working the first time, that they had to work around, or that was simply unintuitive. These should be commented. Generally I’ve found uncommented code bases to be thrown together haphazardly and to be of lower quality. Same with those that only trivially leave comment breadcrumbs about what is being done, duplicating the code itself.

5 more replies

paxys2y ago

That's an idiotic policy and definitely not something that is industry standard.

Code gets out of date as well, so let's just stop writing it altogether..

3 more replies

hakunin2y ago

Pasting again my 4 reasons to leave a code comment:

1. An odd business requirement (share the origin story)

2. It took research (summarize with links)

3. Multiple options were considered (justify decision)

4. Question in a code review (answer in a comment)

And the article on how/what/why in code: https://max.engineer/maintainable-code

2 more replies

IshKebab2y ago

No, that is stupid. People just don't want you to write comments like

    // Set foo to true
    foo = true;

Somebody saw one too many comments like that and overreacted. As long as you a) don't write comments describing what is self evident from the code, and b) try to make the code as descriptive as possible, then it's fine. Comment away.

1 more reply

gen2202y ago

If you're curious to read a well-earned take on comments: http://antirez.com/news/124.

Inline comments are a reflection on the authors' abilities to write good comments. They can be kinda useless, actually-bad, or really helpful.

One canonical example of a "good comment" is explaining why a strange or not-the-least-complex approach was taken to implementing a certain solution. The code is like chesterton's fence, and the comment is a post explaining why it's there. That way, future readers can better assess for themselves whether it's worth their time trying to tear down the fence.

makeworld2y ago

Wow, that seems strange to me. Seems like the policy should be to make comments when needed, and keep them up to date.

furyofantares2y ago

No-comments are better than low-effort, low-quality, unmaintained comments, for sure.

You can imagine a world where all the projects that aren't realistically going to spend the effort on high-quality maintained comments makes the correct choice to skip comments unless absolutely necessary. And where projects that are realistically going to put effort into high-quality, maintained comments, do so.

In this world, comment density would correlate highly with code quality per line of code. Profanity might not, I'm not sure. I do think you'd still find profanity in high-effort, high-quality, maintained comments, but it might indicate lower quality surrounding code, not higher.

And it would still be unclear whether the existence of comments are a cause of higher quality code, or just a proxy for amount of effort and care taken per line of code.

throwaway143562y ago

While I carefully keep others away from my code the notes say I have a complicated relationship with my future self. While comments should be the least useful to me I've tried many formats and found that it is spectacular to have the full elaborate comment above each bite sized nugget of code. I mean that what was solved as a single thing after breaking down the larger problem.

The result is that I don't read any code at all. The whole thing is compiled to the native format that is human language. The code is great for illustration.

If I keep it in separate files as documentation it takes to much effort to find and update. It takes needles extra effort and is less precise.

It is just a personal preference of course but if one had any experience writing code in any language it should be easy to grasp say at 4 am while drunk.

almet2y ago

I'm not sure the ratio of comments to LoC is a sign of good quality code.

Too many comments might actually be a bad thing. It's more lines to maintain, and sometimes the comments just tell what the code is doing where there is no need to.

1 more reply

vezuchyy2y ago

If you have a process where every commit is well documented, you don't need much comments since you can rely on whatever is your analogue for git blame. It's not a lack of comments, it's actually the opposite but aside from the code base.

When I worked at SAP where VCS for ABAP is ancient and has no analogue for git blame we had a practice of putting a SAP Note next to every code change, since some of the things that we had to implement are dictated by business/legislation, so you need a proper explanation from time to time. Without it, the code becomes unmaintainable.

1 more reply

smrtinsert2y ago

I get where they are going with it - every block of code should probably be obvious and final since it has one item to do well. Unfortunately there are always times when n random fields will be used for a conditional that is completely non-obvious. Comments will always be necessary to some degree.

fx19942y ago

that's why three and I think the fourth will leave our company after original developer left undocumented and hard to understand code (it's pretty complex and has tons of hacks to work on different OS'es), first year they learn what the hell is that code, how it works and they are not allowed to comment anything (I asked few times why we waste so much time for basic stuff, they said it is unnecessary...ok I guess). Now they hired original developer for tons of money just to consult newest developer and explain him code. Reasonable I guess.

jacobsenscott2y ago

I never write comments, and I think they typically have a negative correlation to quality (this code is so f**ed up, plain english is, of all things, more clear and precise than code!). Unless you are releasing code publicly, and you are documenting the public API, I've never seen a valuable comment. I've seen plenty of harmful comments. However I wholeheartedly endorse using this otherwise useless but common language feature for swearing.

I think swearing in comments indicates you are unburdened by bureaucracy and pointy haired bosses (because they prohibit such things), which would certainly lead to better code.

chinchilla20202y ago

you've never seen a useful comment?

1 more reply

vharuck2y ago

Possible explanation: swearing is more likely to be committed into code by people who either (1) own the code, or (2) know they're too valuable to be punished. So it self-selects.

I personally have very different commenting styles between my work and personal projects. Not that any of it's good.

atleastoptimal2y ago

Imo, it's because swearing indicates frustration, and frustration indicates effort.

hinkley2y ago

Or as Good to Great calls it, "Confronting the brutal facts".

"This is bullshit" is an important realization. If you can't say it, then things will stay miserable.

throwaway86892y ago

So I need to use more profanity at work, then ask for a raise?

(But I concede that effort and productivity are not the same thing.)

monksy2y ago

Alternative explanation in the same vane as your theory:

The cognitive and time cost of compliance for language policing takes away from valuable programming and planning involved in developing solutions. (i.e. "banned words" [swear words] and politicalized words [whitelist/blacklist,etc])

Antoher possibility is the people who don't want to deal with that are gone and we're seeing a loss of their contributions.

Forgeties792y ago

That’s a pretty big leap IMO. What gives you that impression?

3 more replies

moffkalast2y ago

Possible explanation: The code for the fast inv sqrt is copy pasted everywhere and is skewing the results.

NoMoreNicksLeft2y ago

God save me from working somewhere that I could be punished for code comments.

Who's the narc on your team that would even point it out? It's not like HR has some commit hook on the repos filtering for this stuff...

bawolff2y ago

I'd bet a lot of the non-profanity code is people open sourcing code just to be impressive on resumes or for school, where the profanity code is probably real code.

Sounds likely to be a classic case of correlation != causation

bitofhope2y ago

Rorschach test for programmers: give your confident gut feeling explanation for this phenomenon.

I'll do mine: there's likely a correlation between needing to maintain a professional conduct which includes forgoing foul language (you're programming at work) and writing code under time pressure where getting a product ready for release is more important than strict adherence to clean programming practice (you're programming at work).

Everyone post your favourite conjecture!

jerf2y ago

Everything is correlated: https://gwern.net/everything

Take almost any two things like this and you're actually virtually guaranteed to draw out some weak, but quite likely statistically significant, correlation.

What lies behind that correlation is probably a entropic mishmash of so many factors that it defies human explanation, and also, defies any attempt to try to "harness" the forces that seem to appear. It could be that all the siblings to the comment are right all at once.

I'll cop to just glancing at the graphs, but they don't look out of line for this effect to me intuitively.

Also backing this is that more-or-less the same article/thesis could easily have been written for the opposite correlation.

dogleash2y ago

> Everyone post your favourite conjecture!

Places uptight enough that developers never swear in comments are uptight in other ways that lead to poor team dynamics which hinders quality.

1 more reply

painted-now2y ago

My gut feeling: when you start to submit swear words in your code, it indicates that you "breathe" the code and know it in and out.

The other extreme: if you have no idea what you are doing, you might try to mimic "corp speak" in your code to hide the fact that you actually have no clue.

In other words: it needs some confidence in your ability to assess some aspect of the code in order to use swear words.

bawolff2y ago

This seems unlikely to be true in this case because the study was looking at github projects, and it seems unlikely the sample had enough code from "uptight" work places, to have an affect one way or another

lcnPylGDnU4H9OF2y ago

The developer who knows what they're doing is also more likely to be 1) overworked because they do much of the useful stuff and 2) cognizant of bureaucracy which gets in the way of them doing useful stuff.

jghn2y ago

I remember there being a startup in the Dotcom era, I forget the name but for people familiar with Cambridge, MA it was where the IDEO is now. They were notorious for a few things, but one of them was writing open source software with a lot of profanity.

I thought this was cool, and was talking excitedly about it to my boss and some of the senior devs. They were less amused. Cut 20 years later and I too am less impressed by this.

Not that I think it's *bad* per se, I'm not clutching pearls or anything. But I never find myself thinking what the code really needs are profanities in the comments. Whereas back then I thought it'd be funny/cool and went out of my way to do so when I could. Which wasn't often.

didntcheck2y ago

Swearing for the sake of it does look childish, yes. I've noticed that in a few streaming TV shows, where they've gotten too excited over a lack of censorship that they just end up looking like teenagers who still think saying "fuck" is an act of rebellion

On the other hand, I'd like to write something like "this is a bit shit but will be replaced later" because that's how I naturally speak. Sanitising it to "crap" or "poor" just makes me feel like I'm teaching a youth club or something, and it is a minor pipeline stall in my train of thought while I do a mental synonym search

nomel2y ago

I wonder if swearing can help "free the mind" in some way, with the "rebellion" opening up more, perhaps non-standard/out of the box, "fucking good" ideas?

1 more reply

seadan832y ago

I hear this, comments generally should not draw attention to themselves. For this, short & terse win. I routinely look to cut any unnecessary words from comments.

It was the most painful code review where I asked someone to remove a joke they wrote in the comments. It was a good joke, funny, short, in good taste, I loved it, but.. distracting and unnecessary.

rfw3002y ago

I don’t think anyone is saying it’s causation, the correlation is in and of itself interesting!

bawolff2y ago

I mean i think the article is implying that. However i think the bigger thing is the correlation is misleading due to the sample being the long tail of github projects, which i dont think is representitive of "production" open source projects and certainly not software in general.

MoSattler2y ago

So, you're saying that my code won't improve simply by sprinkling F-Bombs everywhere?

mikrl2y ago

The C code so impressive they had to remove it from K&R:

if (*some_bullshit >= shit_tolerance){

fucks_given = 0;

exit(IM_DONE);

}

1 more reply

bawolff2y ago

Correct: fork bombs rarely help

passwordoops2y ago

There's only one way to find out!

gweinberg2y ago

Nobody suggested causation. The idea that you can improve code quality by adding profane comments is so self-evidently absurd that nobody would even suggest such a thing. Except you kind of just did.

zitterbewegung2y ago

I would bet the opposite because I can make a blind assertion.

bawolff2y ago

You're beting that people swear in code in order to impress future employers?

1 more reply

betamike2y ago

I skimmed the paper, and it looks like they are looking for swearing _anywhere_ in the repos' code, not just comments.

I would be curious to see the ratio of swearing in comments vs code identifiers. I'd also be curious to see if the repos with swearing in their comments just have more comments in total. Perhaps the correlation is, "code with more comments is more likely to be higher quality".

tombert2y ago

The jury is still out if I'm a good programmer, but I did one time need to use a hashmap that had to grow to about ~100gb in size. Because of that, I ended up calling it "bigassHashTable".

It makes me happy that it remained being called that for quite awhile.

squeaky-clean2y ago

I remember a day at a previous job when our CEO came in and told us we weren't an early stage startup anymore and had to start acting like it. Remove profanity and inside jokes from the code, and no more Quake during lunch breaks. Morale took a big hit that day.

2 more replies

DonHopkins2y ago

Swearing in the comments is for goodie goodies. Bad assed programmers swear in public apis like class names, functions, variables, and documentation!

blarghyblarg2y ago

The best programmers I've worked with swore at their coworkers regularly, but never in their code.

They were not great people, and I'd happily kick them in the face if I would encounter no legal or professional repercussions, but, there definitely does seem to be some correlation (in my experience) between being abrasive and being a skilled programmer.

3 more replies

andrewedstrom2y ago

I'm sure the top comment here will be something like "this is invalid because no way can you assign a numerical value to code quality! wtf?!"

I'm withholding my own judgement on that.

For anyone curious, the authors are coming up with a code quality score using an open-source tool called SoftWipe[0]. From the paper:

> SoftWipe is an open source tool and benchmark to assess, rate, and review scientific software written in C or C++ with respect to coding standard adherence. The coding standard adherence is assessed using a set of static and dynamic code analysers such as Lizard (https://github.com/terryyin/lizard) or the Clang address sanitiser (https: //clang.llvm.org/). It returns a score between 0 (low adherence) and 10 (good adherence). In order to simplify our experimental setup, we excluded the compilation warnings, which require a difficult to automate compilation of the assessed software, from the analysis using the --exclude-compilation option.

[0]: https://github.com/adrianzap/softwipe

jtbayly2y ago

The obvious question is whether the source code for this tool has profanity in it…

cjsplat2y ago

While at Sun in the early 2000's, I was part of the due diligence team for an acquisition and had two days to review the entire code base of a 3 year old, 50 person software team.

This was standard practice, and the M&A policies knew that there was no way to actually understand all the code so there was a policy document to describe what to look for.

Of course the red flag things were unexpected 3rd party copyrights and/or license terms in case the code was encumbered.

But "swear words" were on the yellow flag list, in addition to "ToDo", "XXXX", and "Fix Me" types of things.

I remember thinking about places I have been in the past and that the people used those style comments tended to be the better programmers.

I mentioned this to the person leading the evaluation, and was told that point of noticing these kinds of comments was to look a more closely at the nearby code and try to decide if major functionality was missing or being faked.

It all worked out for that acquisition, but I remember being curious about whatever deal had gone bad in the distant past that made them codify this specific practice.

KolmogorovComp2y ago

Correlation is not causality. Swearing in the comments will not magically make your code better, but fixing a hidden bugs that you have been chasing for weeks will certainly make you swear when fixed.

fsckboy2y ago

> Correlation is not causality.

I'm fond of pointing out, despite every time I get downvoted, that causation is the thing we have no knowledge of, and therefore correlation is all we have. As Feynman said about gravity, there is no how or why to gravity, as far as we know it's simply a property of matter. But of course, that means we only know that because of the perfect correlation between matter and gravity, including every time we conduct an experiment about it; but still we have no cause to point to.

dash22y ago

A reasonable working definition of causality, used by almost all scientists today, is that X causes Y if a change in X, unaccompanied by any other change, changes Y. At root, this is indeed a statement about correlations, but it's a special kind of correlation, which is hard to estimate from observational data where many other things may change along with X.

danans2y ago

While that may be the case, the correlation coefficient of matter and gravity is so close to 1 that we can't tell the difference and the correlation coefficient of swearing in code to good code is far less.

WalterBright2y ago

Maybe gravity causes matter.

1 more reply

bitofhope2y ago

Sounds like you're suggesting a causal relationship the other way, though. As per this explanation, putting effort into debugging edge cases will statistically cause the comments to swear more.

nomel2y ago

From the article:

> This means that swearing will not automatically improve the quality of your code.

skrebbel2y ago

My pet theory is that this is because honest, emotional comments are much more useful than the usual “professional” style that try to hide it when you have no clue what you’re doing.

When it’s clear someone was stuck, frustrated, banging their head against the wall etc while writing a particular bit of code, you can refactor a lot less defensively because you know the crappy parts weren’t secretly there for a reason.

I love real, honest, emotional comments. Pour all the frustration in there. Future you and your colleagues will thank you.

jamesgreenleaf2y ago

I think so too. Profanity, in small amounts, is an indicator of honesty.

Everyone swears sometimes. If you never do it in front of others, it signals that you're always filtering yourself.

version_five2y ago

I remember reading that people who swear a lot are statistically smarter. I'm sure there are lots of caveats to that, as with the code.

How long will it be before someone who doesn't understand causality starts encouraging developers to write profane comments? It wouldn't be any more absurd than lots of other non-causal behaviors I've seen pushed because somebody successful does them.

LeifCarrotson2y ago

Failing to provide the typical social signals seems correlated either with extreme competence - they don't need to use polite language or other signals to boost employability - or with complete incompetence. There's a skill floor that cuts off the latter from this dataset when they can't configure an SSL certificate for their git client; their curses at "unable to get local issuer certificate" or "fatal: repository not found" are not uploaded to the Internet.

ahamm2y ago

I believe Nassim Taleb wrote about this in relation to 'virtue signaling' - not swearing being a sign that people are trying to 'signal' professionalism in order to keep their jobs, and therefore are more likely to have less actual professional skills (which if good enough would secure their jobs regardless of profanity) and vice-versa. Same for hoodies over ties, etc. Although all this probably gets flipped now that we have people writing papers like this - context is everything.

halkony2y ago

Do you remember where he wrote about that? Sounds like something in Antifragile, but let's be honest, any one of his books is better called a corpus.

WalterBright2y ago

> I remember reading that people who swear a lot are statistically smarter.

I'd need more evidence of that. My anecdotal experience is that saying "fuck" a lot is indicative of a lack of imagination. For example, Winston Churchill's legendary devastating insults, with no profanity necessary.

davely2y ago

There is probably some cognitive overhead required to actually not swear in certain situations, especially if you’re prone to doing it.

“What the f… heck is this, kiddo?”

Definitely gotta utilize those brain wrinkles in that case.

yongjik2y ago

Sorry for being off topic, but let me introduce you to the only true metric of code quality: WTF/minute.

https://www.osnews.com/story/19266/wtfsm/

One wonders if profanity in the source code interferes with reviewers and skews this important metric ...

yk2y ago

> Next, Strehmel and his team quantified the compliance of these two different sets of open source code with coding standards. The results were presented as an indicator of the quality of the source code through the SoftWipe tool.

I would read that study as coding standards lead to profanity. (Not sure wether or not coding standards should be correlated with code quality, I just think it is obvious that the measure is correlated with the conclusion in an obvious way.)

[Post posting:] Also looking at the plots, it seems that the two distributions are different, first the swear word distribution seems to be wider and second it has a clear outlier at "software quality" 8, so if anything it is an indication that something much more complex is going on.

gridspy2y ago

My Hypothesis

1. Passionate developers often swear more often when they feel safe to do so

2. Developers work better in a "safe environment" where they are not judged / forced to follow other guidelines by social or employment pressure.

And another point : those places where it's unsafe (often due to managerial micromanagement) are miserable places to work. That can drive away skilled developers or suppress them if they remain.

All this is assuming the research metric is real, though I'm not sure it is. If the metric for "code quality" is actually "precision following a coding standard" you'd have though that rigid adherence to procedure would lead to a higher score?

l0b02y ago

Absolutely. Passion and trust → swearing and quality.

hinkley2y ago

Fuckin'-A

dev_sndOP2y ago

Here's the link to the original full PDF: https://cme.h-its.org/exelixis/pubs/JanThesis.pdf

scns2y ago

Reminds me of a study. It showed, that swearing enables you to tolerate pain better. It was simple. Two groups, both had to put their hands into ice water. The group that was allowed to swear could do it longer.

I'd hypothesize, that programmers, who actually care about quality, swear more.

Individuals with AD(H)D might have a have a lower tolerance to pain. This, coupled with wide open sensual channels and decreased impulse control, might be a contributing factor.

[Edit] added parenthesis and link

Not correlated to swearing, but AD(H)D:

https://www.youtube.com/watch?v=XdT4DIiX7Nk

mpweiher2y ago

> I'd hypothesize, that programmers, who actually care about quality, swear more.

Ding ding ding, I think we have a winner!

If you're not moved to profanity by most code-bases, you're either not paying attention or don't understand.

makeitdouble2y ago

An alternative take:

Swearing was more abundant in the earlier days and the code that survived until today is probably better that what got lost along the way.

In general the coding population has grown, we're more used to coding in corporate settings with code reviews, commit message processing etc. and the bulk of devs aren't just as emotional in writing their comments (some will still swear like sailors, but it's not the norm)

> The study relied solely on the source code written in C.

This in particular, probably reduced the number of hobby and beginner's project in the study.

cozzyd2y ago

Improve your C code with this one neat trick!

  #define fuck if 
  #define shit else 
  #define ass return

fnordpiglet2y ago

When we open sourced the Netscape Navigator a major undertaking was code sanitation. This including excising licensed libraries etc (resulting in an initial release that wasn’t able to compile), but also removing enormous amounts of profanity and references to how evil Microsoft was.

DonHopkins2y ago

JWZ sure could flame in the comments, especially about Motif.

http://www.art.net/~hopkins/Don/unix-haters/x-windows/motif....

gweinberg2y ago

I've noticed the same effect in HNN posts. The more profanity there is the comments, the better the original post! Unfortunately there's no good way to take advantage of this; optimizing to the metric destroys the value of the metric.

saintradon2y ago

Isn't there a study that shows that people can tolerate pain better when they swear? This doesn't really surprise me. I'll swear in my code if I really feel like it but commits and everything else I keep more presentable.

pyeri2y ago

It's just that you've evolved biological filters that "block" the swearing from getting voiced out as being presentable takes the priority? The way our society is progressing right now, such evolution seems to be the direction that we all will take eventually.

mikecoles2y ago

My code, by twisting this finding, is bug-free.

alpaca1282y ago

> Much of the community considers profanity as a vulgar display of lack of intelligence and education, because why use profanity when you have a rich vocabulary?

Why not use the full range of one's vocabulary?

bregma2y ago

Ten thousand bilious blue blistering barnacles that's a tremendous idea!

Or by "full range" did you mean "limit it to a few well-worn cliches"?

aosmith2y ago

This is a normal part of software... You find something really bad, git blame says it's your own, you leave a vulgar comment about how bad it is for the next guy.

ratel2y ago

My favorite (almost) obscene quote I found reviewing code, although I never could find the back story to it:

"Which idiot wrote this crap?

You did!

Which idiot hired me?"

I think this also points to the statistical significance. Code that has been worked over a couple of times and/or has been worked on by different people for all those hard and fringe problems will be better, but also accumulate more comments venting the trouble people had fixing them. It does not seem very interesting.

ydnaclementine2y ago

One rule I live by is I never ever swear in comments or commits, just not worth it. Even in personal projects.

But one of my favorite projects to ctrl-f for "fuck" is in the jedi outcast source code. Since it is proprietary and was a good game: https://github.com/search?q=repo%3Agrayj%2FJedi-Outcast+fuck...

evilotto2y ago

I'm guessing the multiple instances of

  i  = 0x5f3759df - ( i >> 1 );

in the results are one of those inverse-square floating point bit tricks.

arp2422y ago

> But one of my favorite projects to ctrl-f for "fuck" is in the jedi outcast source code.

https://www.youtube.com/watch?v=R_b2B5tKBUM

mike_hock2y ago

> Sign in to search code on GitHub

lolwat

danans2y ago

I bet there are a lot of less visible but stronger correlations to code quality, including incentive structures, programmer time spent to code ratio, quality of tools, quality of documentation, etc.

Swearing in code, however, is much easier to quantify, and of course chosen to chuff up those who think swearing itself is a virtue.

It would be a mistake to draw the conclusion that allowing swearing in code will improve code quality.

tsukikage2y ago

"In 2018, Adam Farley, a contributor to the OpenJDK project, the presence of profanity in the source code."

Someone accidentally a verb.

koromak2y ago

"As part of your study, reviewed and analyzed over 3800 open source code containing profanity in English and over 7600 profanity-free open source code on GitHub."

Wow, over 3800 code? Thats so many code! And its my study? Even better!

jraph2y ago

They didn't their sentence.

It makes it quite.

z3t42y ago

Or that those that need to adhere to strict linters, formatting, etc are less happy with their life and thus use more profanity. eg. "code quality" tools that does not have any benefits besides finding potential bugs that does not effect the state of the program, like lines that are 71 characters long instead of 70 characters.

wjholden2y ago

This story and the resulting discussion here on HN are such a great example of data mining with statistical methods. The researchers found a non-obvious result using statistics. Now we're all speculating about the underlying cause, trying to apply our domain knowledge to explain the result.

sircastor2y ago

Anecdotally, it seems to me that I work with a lot of folks that swear frequently but not in their code comments.

jansommer2y ago

I sometimes feel like swearing in the comments or commit messages, which can be the first thought coming to mind, and spend a few resources on writing in a kinder way.

Perhaps I could use this as an excuse for not reaching a deadline...

pyeri2y ago

Probably same goes about the people? The emotionally charged person that swears and gives a mouthful usually ACTS better than the cold calculated one that speaks the right words but full of cunning inside?

moonchrome2y ago

Being passionate about code correlates with quality - shocking

BizarreByte2y ago

I find it a bit suspect swearing would ever even get though a proper code review. It’s extremely unprofessional, I would tell someone to remove it.

MacsHeadroom2y ago

And this is precisely how language policing slows down technical progress.

BizarreByte2y ago

No, this is why you should be a professional. Swearing doesn’t belong in your employers codebase, it’s tasteless, looks bad, and may age very, very poorly.

klysm2y ago

What harm does it cause?

KerrAvon2y ago

you're from the east coast, aren't you?

klysm2y ago

At least in New England we swear quite a bit I’m not sure on what basis you are making that accusation

BizarreByte2y ago

I’m not from America at all.

charonn02y ago

I only swear in commit messages. Am I doing it wrong?

pickingdinner2y ago

Not to get too philosophical, but does profanity measure the children in the room, or does it measure the adults in the room?

Schrodinger's chat (room).

briantakita2y ago

Until every fucking wanker who reads this article adds profanity to their shitty code expecting their bullshit to be better.

coding1232y ago

in other words, it increases the chance that the programmer is in a specific locale (like the US?) such that the location has less bad programmers than other locations.

And probably, increases the chance that the person is fed up with fixing someone else's code - hence the anger

twodave2y ago

The best CS professor I ever had always said that the #1 language among programmers is profanity.

paxys2y ago

I'd first like to know how they judged what is "good" vs "bad" code.

DonHopkins2y ago

The original terminal emulator terminal.el in gnu emacs, written by mly (Richard Mlynarik), was particularly salty. I finally tracked down a copy, but it looks like somebody complained and in 1990 it was begrudgingly cleaned up a bit, so some of the worst stuff was moved out into a separate file called term-nasty.el for posterity (you, here, now), so as not to give "in to the pressure to censor obscenity that currently threatens freedom of speech and of the press in the US" (oh, Richard <3 ):

https://opensource.apple.com/source/emacs/emacs-59.0.80/emac...

1990-08-26 Richard Stallman (rms@mole.ai.mit.edu)

* terminal.el: Move possibly offensive comments to term-nasty.el.

https://www.digiater.nl/openvms/freeware/v10/emacs/common/li...

[...]

    ;; disgusting unix-required shit
    ;;  Are we living twenty years in the past yet?

    (defun te-losing-unix ()
      nil)

[...]

    ;; (A version of the following comment which might be distractingly offensive
    ;; to some readers has been moved to term-nasty.el.)
    ;; unix lacks ITS-style tty control...
    (defun te-process-output (preemptable)
      ;;>> There seems no good reason to ever disallow preemption
      (setq preemptable t)

[...]

              ;; I suppose if I split the guts of this out into a separate
              ;;  function we could trivially emulate different terminals
              ;; Who cares in any case?  (Apart from stupid losers using rlogin)

[...]

                                     (?\C-b . te-backward-char)
                                     ;; should be C-d, but un*x
                                     ;;  pty's won't send \004 through!
                                     ;; Can you believe this?

[...]

                                     ;; Did I ask to be sent these characters?
                                     ;; I don't remember doing so, either.
                                     ;; (Perhaps some operating system or
                                     ;; other is completely incompetent...)

[...]

                         ;;-- Not-widely-known (ie nonstandard) flags, which mean
                         ;; o writing in the last column of the last line
                         ;;   doesn't cause idiotic scrolling, and
                         ;; o don't use idiotische c-s/c-q sogenannte
                         ;;   ``flow control'' auf keinen Fall.
                         "LP:NF:"
                         ;;-- For stupid or obsolete programs
                         "ic=^p_!:dc=^pd!:al=^p^o!:dl=^p^k!:ho=^p=  :"
                         ;;-- For disgusting programs.
                         ;; (VI? What losers need these, I wonder?)
                         "im=:ei=:dm=:ed=:mi:do=^p^j:nl=^p^j:bs:")))

[...]

              (setq te-process
                    (start-process "terminal-emulator" (current-buffer)
                                   "/bin/sh" "-c"
                                   ;; Yuck!!! Start a shell to set some terminal
                                   ;; control characteristics.  Then start the
                                   ;; "env" program to setup the terminal type
                                   ;; Then finally start the program we wanted.
                                   (format "%s; exec %s"
                                           te-stty-string
                                           (mapconcat 'te-quote-arg-for-sh
                                                      (cons program args) " ")))))

[...]

    ;;;; what a complete loss

[...]

https://www.digiater.nl/openvms/freeware/v10/emacs/common/li...

    ;;; term-nasty.el --- Damned Things from terminfo.el
    ;;; This file is in the public domain, and was written by Stallman and Mlynarik

    ;;; Commentary:

    ;; Some people used to be bothered by the following comments that were
    ;; found in terminal.el.  We decided they were distracting, and that it
    ;; was better not to have them there.  On the other hand, we didn't want
    ;; to appear to be giving in to the pressure to censor obscenity that
    ;; currently threatens freedom of speech and of the press in the US.
    ;; So we decided to put the comments here.

    ;;; Code:

    These comments were removed from te-losing-unix.
      ;(what lossage)
      ;(message "fucking-unix: %d" char)

    This was before te-process-output.
    ;; fucking unix has -such- braindamaged lack of tty control...

    And about the need to handle output characters such as C-m, C-g, C-h
    and C-i even though the termcap doesn't say they may be used:
    ;fuck me harder
    ;again and again!
    ;wa12id!!
    ;(spiked)

    ;;; term-nasty.el ends here

Note to the gentle readers: "wa12id" stands for "with a 12 inch dildo".

Jamie Zawinski kept Lucid Emacs nasty:

https://groups.google.com/g/gnu.misc.discuss/c/U5oXKOfWinQ/m...

Noah Friedman, Aug 3, 1992, 4:54:20 AM

In article <15i2n9...@hal.com> wood...@hal.com (Nathan Hess) writes:

>In article <FRIEDMAN.9...@nutrimat.gnu.ai.mit.edu>, friedman@gnu (Noah Friedman) writes:

>>It's by no means necessary, but it's funny.

>Along the same lines, look at lisp/terminal.el

Of course, terminal.el is actually useful, albeit not terribly powerful.

(and terminal.el is pretty mild compared to some of the other things I've seen written by mly. :-))

Incidentally, a lot of terminal.el has been rewritten in version 19.

Too bad... I liked all the variable names and comments in the original.

Jamie Zawinski, Aug 5, 1992, 12:40:38 AM

In the FSF-distributed Emacs 19, the obscenities (will) have been stripped from terminal.el, though they are preserved in a file called term-nasty.el, to avoid appearing to bow to the censors.

In Lucid GNU Emacs, terminal.el will remain as nasty as it ever was.

-- Jamie "Truth, Justice, and the Fucking First Amendment" Zawinski

francasso2y ago

This is an example where correlation does imply causation IMHO

grayhatter2y ago

The author of the paper suggests it's because the author of the code cares more, and is more passionate. Do you think it's just random chance there is a correlation, or do you have a better explanation for the results?

ftxbro2y ago

adding swears into codebases would improve their quality

twodave2y ago

Article should have included some juicy examples. 4/10

bjornsing2y ago

Sure, that means someone cares.

pak9rabid2y ago

// fuckin eh

stainablesteel2y ago

someone hired a team to review 10k repos just for this?

ftxbro2y ago

now we let goodhart's take its course

happytiger2y ago

Fuck yea it is.

OnlyMortal2y ago

Fuck that code!

helmsb2y ago

Correlation ≠ Causation

yodsanklai2y ago

I'm surprised this is still a thing. I suppose this is associated with "toxic masculinity" which is frowned upon nowadays. I'm always a bit worried that I forget to edit my swearing and that it goes to code review.

j / k navigate · click thread line to collapse

214 comments

whoopdedo2y ago

From the research paper:

> we calculate the swear factor as the number of swearwords divided by the lines of code

I think we already knew this.

That said I would like to see a more critical analysis. First control for comment density. Then compare code quality to swearing in comments and also variable names.

tremon2y ago

It is anecdata, but I can confirm this is the case for my code.

taneq2y ago

Also any mature, well maintained code will have found a lot more of those truly ‘wtf’ bugs and edge cases, which often involve same colourful language when we finally figure them out.

balaji12y ago

that's *** brilliant I @#@#@ know that feeling

midoridensha2y ago

>...code that is more heavily commented is better. >I think we already knew this

Who's "we"?

briantakita2y ago

> It's no wonder that software sucks so much these days, since apparently no one believes in documentation or code maintenance any more.

I think this comment explains why software gets worse in many cases:

> Of course nowadays, this is legacy nonsense. Everything uses UTF-8 for "char", and what doesn't is broken and terrible anyway. But the old ways stayed with us, and the stupidity of it as well.

The problem is the "legacy nonsense" tends to accumulate over time & as people depend on it, takes a long time to finally remove.

> They are so hilariously misdesigned and insufficient, I can't even fathom how this shit was _standardized_.

They did their best given their circumstances & abilities. Now we must forever pay the price.

At least they tried...

Yeah! Time to get back to work...

Credit to https://news.ycombinator.com/item?id=36626018 for pointing this out.

jancsika2y ago

> In other words, code that is more heavily commented is better.

It could also be that understanding code in any non-trivial project is likely to back the developer into a corner where they become frustrated and swear at the computer.

More importantly, the lack of swearing might be a sign that the devs lack the competence to know when they are cornered.

matheusmoreira2y ago

1 more reply

darkerside2y ago

By the time you are swearing in your comments, I'm pretty sure you know you're cornered

Mathnerd3142y ago

paper: https://cme.h-its.org/exelixis/pubs/JanThesis.pdf

tessierashpool2y ago

as long as we’re designing the ideal experiment for someone else to do, let’s throw in the commit messages as well.

I wouldn’t be surprised if code quality goes up with comment curses and down again with commit message curses.

0cf8612b2e1e2y ago

My favorite infamous example being the MPV C locale commit: https://github.com/mpv-player/mpv/commit/1e70e82baa9193f6f02...

You can really feel the author's rage at the state of the world.

1 more reply

Brian_K_White2y ago

The observation of the association is already valid since it doesn't try to say anything that no one can say.

Saying "control for comment density" presumes one knows how to even do that or how to even define it.

How do you decide that a given line of code or comment should weigh more or less than another?

If a codebase has both a lot of swear words and a lot of all other words, so what?

slowmovintarget2y ago

That assumes that most comments contain cursing.

comfypotato2y ago

No it doesn’t. If any more than 0% of comments contain profanity, then on average code with more profanity will be better.

1 more reply

mr_00ff002y ago

Isn’t commented code no longer considered a good idea in most companies?

I used to work for a bank and the policy is no comments unless absolutely necessary, because comments become out of date. Doxygen is the only real comments allowed.

flatline2y ago

5 more replies

paxys2y ago

That's an idiotic policy and definitely not something that is industry standard.

Code gets out of date as well, so let's just stop writing it altogether..

3 more replies

hakunin2y ago

Pasting again my 4 reasons to leave a code comment:

1. An odd business requirement (share the origin story)

2. It took research (summarize with links)

3. Multiple options were considered (justify decision)

4. Question in a code review (answer in a comment)

And the article on how/what/why in code: https://max.engineer/maintainable-code

2 more replies

IshKebab2y ago

No, that is stupid. People just don't want you to write comments like

    // Set foo to true
    foo = true;

1 more reply

gen2202y ago

If you're curious to read a well-earned take on comments: http://antirez.com/news/124.

Inline comments are a reflection on the authors' abilities to write good comments. They can be kinda useless, actually-bad, or really helpful.

makeworld2y ago

Wow, that seems strange to me. Seems like the policy should be to make comments when needed, and keep them up to date.

furyofantares2y ago

No-comments are better than low-effort, low-quality, unmaintained comments, for sure.

And it would still be unclear whether the existence of comments are a cause of higher quality code, or just a proxy for amount of effort and care taken per line of code.

throwaway143562y ago

The result is that I don't read any code at all. The whole thing is compiled to the native format that is human language. The code is great for illustration.

If I keep it in separate files as documentation it takes to much effort to find and update. It takes needles extra effort and is less precise.

It is just a personal preference of course but if one had any experience writing code in any language it should be easy to grasp say at 4 am while drunk.

almet2y ago

I'm not sure the ratio of comments to LoC is a sign of good quality code.

Too many comments might actually be a bad thing. It's more lines to maintain, and sometimes the comments just tell what the code is doing where there is no need to.

1 more reply

vezuchyy2y ago

1 more reply

smrtinsert2y ago

fx19942y ago

jacobsenscott2y ago

I think swearing in comments indicates you are unburdened by bureaucracy and pointy haired bosses (because they prohibit such things), which would certainly lead to better code.

chinchilla20202y ago

you've never seen a useful comment?

1 more reply

vharuck2y ago

Possible explanation: swearing is more likely to be committed into code by people who either (1) own the code, or (2) know they're too valuable to be punished. So it self-selects.

I personally have very different commenting styles between my work and personal projects. Not that any of it's good.

atleastoptimal2y ago

Imo, it's because swearing indicates frustration, and frustration indicates effort.

hinkley2y ago

Or as Good to Great calls it, "Confronting the brutal facts".

"This is bullshit" is an important realization. If you can't say it, then things will stay miserable.

throwaway86892y ago

So I need to use more profanity at work, then ask for a raise?

(But I concede that effort and productivity are not the same thing.)

monksy2y ago

Alternative explanation in the same vane as your theory:

Antoher possibility is the people who don't want to deal with that are gone and we're seeing a loss of their contributions.

Forgeties792y ago

That’s a pretty big leap IMO. What gives you that impression?

3 more replies

moffkalast2y ago

Possible explanation: The code for the fast inv sqrt is copy pasted everywhere and is skewing the results.

NoMoreNicksLeft2y ago

God save me from working somewhere that I could be punished for code comments.

Who's the narc on your team that would even point it out? It's not like HR has some commit hook on the repos filtering for this stuff...

bawolff2y ago

I'd bet a lot of the non-profanity code is people open sourcing code just to be impressive on resumes or for school, where the profanity code is probably real code.

Sounds likely to be a classic case of correlation != causation

bitofhope2y ago

Rorschach test for programmers: give your confident gut feeling explanation for this phenomenon.

Everyone post your favourite conjecture!

jerf2y ago

Everything is correlated: https://gwern.net/everything

Take almost any two things like this and you're actually virtually guaranteed to draw out some weak, but quite likely statistically significant, correlation.

I'll cop to just glancing at the graphs, but they don't look out of line for this effect to me intuitively.

Also backing this is that more-or-less the same article/thesis could easily have been written for the opposite correlation.

dogleash2y ago

> Everyone post your favourite conjecture!

Places uptight enough that developers never swear in comments are uptight in other ways that lead to poor team dynamics which hinders quality.

1 more reply

painted-now2y ago

My gut feeling: when you start to submit swear words in your code, it indicates that you "breathe" the code and know it in and out.

The other extreme: if you have no idea what you are doing, you might try to mimic "corp speak" in your code to hide the fact that you actually have no clue.

In other words: it needs some confidence in your ability to assess some aspect of the code in order to use swear words.

bawolff2y ago

lcnPylGDnU4H9OF2y ago

jghn2y ago

I thought this was cool, and was talking excitedly about it to my boss and some of the senior devs. They were less amused. Cut 20 years later and I too am less impressed by this.

didntcheck2y ago

nomel2y ago

I wonder if swearing can help "free the mind" in some way, with the "rebellion" opening up more, perhaps non-standard/out of the box, "fucking good" ideas?

1 more reply

seadan832y ago

I hear this, comments generally should not draw attention to themselves. For this, short & terse win. I routinely look to cut any unnecessary words from comments.

It was the most painful code review where I asked someone to remove a joke they wrote in the comments. It was a good joke, funny, short, in good taste, I loved it, but.. distracting and unnecessary.

rfw3002y ago

I don’t think anyone is saying it’s causation, the correlation is in and of itself interesting!

bawolff2y ago

MoSattler2y ago

So, you're saying that my code won't improve simply by sprinkling F-Bombs everywhere?

mikrl2y ago

The C code so impressive they had to remove it from K&R:

if (*some_bullshit >= shit_tolerance){

fucks_given = 0;

exit(IM_DONE);

}

1 more reply

bawolff2y ago

Correct: fork bombs rarely help

passwordoops2y ago

There's only one way to find out!

gweinberg2y ago

Nobody suggested causation. The idea that you can improve code quality by adding profane comments is so self-evidently absurd that nobody would even suggest such a thing. Except you kind of just did.

zitterbewegung2y ago

I would bet the opposite because I can make a blind assertion.

bawolff2y ago

You're beting that people swear in code in order to impress future employers?

1 more reply

betamike2y ago

I skimmed the paper, and it looks like they are looking for swearing _anywhere_ in the repos' code, not just comments.

tombert2y ago

The jury is still out if I'm a good programmer, but I did one time need to use a hashmap that had to grow to about ~100gb in size. Because of that, I ended up calling it "bigassHashTable".

It makes me happy that it remained being called that for quite awhile.

squeaky-clean2y ago

2 more replies

DonHopkins2y ago

Swearing in the comments is for goodie goodies. Bad assed programmers swear in public apis like class names, functions, variables, and documentation!

blarghyblarg2y ago

The best programmers I've worked with swore at their coworkers regularly, but never in their code.

3 more replies

andrewedstrom2y ago

I'm sure the top comment here will be something like "this is invalid because no way can you assign a numerical value to code quality! wtf?!"

I'm withholding my own judgement on that.

For anyone curious, the authors are coming up with a code quality score using an open-source tool called SoftWipe[0]. From the paper:

[0]: https://github.com/adrianzap/softwipe

jtbayly2y ago

The obvious question is whether the source code for this tool has profanity in it…

cjsplat2y ago

While at Sun in the early 2000's, I was part of the due diligence team for an acquisition and had two days to review the entire code base of a 3 year old, 50 person software team.

This was standard practice, and the M&A policies knew that there was no way to actually understand all the code so there was a policy document to describe what to look for.

Of course the red flag things were unexpected 3rd party copyrights and/or license terms in case the code was encumbered.

But "swear words" were on the yellow flag list, in addition to "ToDo", "XXXX", and "Fix Me" types of things.

I remember thinking about places I have been in the past and that the people used those style comments tended to be the better programmers.

It all worked out for that acquisition, but I remember being curious about whatever deal had gone bad in the distant past that made them codify this specific practice.

KolmogorovComp2y ago

fsckboy2y ago

> Correlation is not causality.

dash22y ago

danans2y ago

WalterBright2y ago

Maybe gravity causes matter.

1 more reply

bitofhope2y ago

Sounds like you're suggesting a causal relationship the other way, though. As per this explanation, putting effort into debugging edge cases will statistically cause the comments to swear more.

nomel2y ago

From the article:

> This means that swearing will not automatically improve the quality of your code.

skrebbel2y ago

My pet theory is that this is because honest, emotional comments are much more useful than the usual “professional” style that try to hide it when you have no clue what you’re doing.

I love real, honest, emotional comments. Pour all the frustration in there. Future you and your colleagues will thank you.

jamesgreenleaf2y ago

I think so too. Profanity, in small amounts, is an indicator of honesty.

Everyone swears sometimes. If you never do it in front of others, it signals that you're always filtering yourself.

version_five2y ago

I remember reading that people who swear a lot are statistically smarter. I'm sure there are lots of caveats to that, as with the code.

LeifCarrotson2y ago

ahamm2y ago

halkony2y ago

Do you remember where he wrote about that? Sounds like something in Antifragile, but let's be honest, any one of his books is better called a corpus.

WalterBright2y ago

> I remember reading that people who swear a lot are statistically smarter.

davely2y ago

There is probably some cognitive overhead required to actually not swear in certain situations, especially if you’re prone to doing it.

“What the f… heck is this, kiddo?”

Definitely gotta utilize those brain wrinkles in that case.

yongjik2y ago

Sorry for being off topic, but let me introduce you to the only true metric of code quality: WTF/minute.

https://www.osnews.com/story/19266/wtfsm/

One wonders if profanity in the source code interferes with reviewers and skews this important metric ...

yk2y ago

gridspy2y ago

My Hypothesis

1. Passionate developers often swear more often when they feel safe to do so

2. Developers work better in a "safe environment" where they are not judged / forced to follow other guidelines by social or employment pressure.

And another point : those places where it's unsafe (often due to managerial micromanagement) are miserable places to work. That can drive away skilled developers or suppress them if they remain.

l0b02y ago

Absolutely. Passion and trust → swearing and quality.

hinkley2y ago

Fuckin'-A

dev_sndOP2y ago

Here's the link to the original full PDF: https://cme.h-its.org/exelixis/pubs/JanThesis.pdf

scns2y ago

I'd hypothesize, that programmers, who actually care about quality, swear more.

Individuals with AD(H)D might have a have a lower tolerance to pain. This, coupled with wide open sensual channels and decreased impulse control, might be a contributing factor.

[Edit] added parenthesis and link

Not correlated to swearing, but AD(H)D:

https://www.youtube.com/watch?v=XdT4DIiX7Nk

mpweiher2y ago

> I'd hypothesize, that programmers, who actually care about quality, swear more.

Ding ding ding, I think we have a winner!

If you're not moved to profanity by most code-bases, you're either not paying attention or don't understand.

makeitdouble2y ago

An alternative take:

Swearing was more abundant in the earlier days and the code that survived until today is probably better that what got lost along the way.

> The study relied solely on the source code written in C.

This in particular, probably reduced the number of hobby and beginner's project in the study.

cozzyd2y ago

Improve your C code with this one neat trick!

  #define fuck if 
  #define shit else 
  #define ass return

fnordpiglet2y ago

DonHopkins2y ago

JWZ sure could flame in the comments, especially about Motif.

http://www.art.net/~hopkins/Don/unix-haters/x-windows/motif....

gweinberg2y ago

saintradon2y ago

pyeri2y ago

mikecoles2y ago

My code, by twisting this finding, is bug-free.

alpaca1282y ago

> Much of the community considers profanity as a vulgar display of lack of intelligence and education, because why use profanity when you have a rich vocabulary?

Why not use the full range of one's vocabulary?

bregma2y ago

Ten thousand bilious blue blistering barnacles that's a tremendous idea!

Or by "full range" did you mean "limit it to a few well-worn cliches"?

aosmith2y ago

This is a normal part of software... You find something really bad, git blame says it's your own, you leave a vulgar comment about how bad it is for the next guy.

ratel2y ago

My favorite (almost) obscene quote I found reviewing code, although I never could find the back story to it:

"Which idiot wrote this crap?

You did!

Which idiot hired me?"

ydnaclementine2y ago

One rule I live by is I never ever swear in comments or commits, just not worth it. Even in personal projects.

But one of my favorite projects to ctrl-f for "fuck" is in the jedi outcast source code. Since it is proprietary and was a good game: https://github.com/search?q=repo%3Agrayj%2FJedi-Outcast+fuck...

evilotto2y ago

I'm guessing the multiple instances of

  i  = 0x5f3759df - ( i >> 1 );

in the results are one of those inverse-square floating point bit tricks.

arp2422y ago

> But one of my favorite projects to ctrl-f for "fuck" is in the jedi outcast source code.

https://www.youtube.com/watch?v=R_b2B5tKBUM

mike_hock2y ago

> Sign in to search code on GitHub

lolwat

danans2y ago

I bet there are a lot of less visible but stronger correlations to code quality, including incentive structures, programmer time spent to code ratio, quality of tools, quality of documentation, etc.

Swearing in code, however, is much easier to quantify, and of course chosen to chuff up those who think swearing itself is a virtue.

It would be a mistake to draw the conclusion that allowing swearing in code will improve code quality.

tsukikage2y ago

"In 2018, Adam Farley, a contributor to the OpenJDK project, the presence of profanity in the source code."

Someone accidentally a verb.

koromak2y ago

"As part of your study, reviewed and analyzed over 3800 open source code containing profanity in English and over 7600 profanity-free open source code on GitHub."

Wow, over 3800 code? Thats so many code! And its my study? Even better!

jraph2y ago

They didn't their sentence.

It makes it quite.

z3t42y ago

wjholden2y ago

sircastor2y ago

Anecdotally, it seems to me that I work with a lot of folks that swear frequently but not in their code comments.

jansommer2y ago

I sometimes feel like swearing in the comments or commit messages, which can be the first thought coming to mind, and spend a few resources on writing in a kinder way.

Perhaps I could use this as an excuse for not reaching a deadline...

pyeri2y ago

moonchrome2y ago

Being passionate about code correlates with quality - shocking

BizarreByte2y ago

I find it a bit suspect swearing would ever even get though a proper code review. It’s extremely unprofessional, I would tell someone to remove it.

MacsHeadroom2y ago

And this is precisely how language policing slows down technical progress.

BizarreByte2y ago

No, this is why you should be a professional. Swearing doesn’t belong in your employers codebase, it’s tasteless, looks bad, and may age very, very poorly.

klysm2y ago

What harm does it cause?

KerrAvon2y ago

you're from the east coast, aren't you?

klysm2y ago

At least in New England we swear quite a bit I’m not sure on what basis you are making that accusation

BizarreByte2y ago

I’m not from America at all.

charonn02y ago

I only swear in commit messages. Am I doing it wrong?

pickingdinner2y ago

Not to get too philosophical, but does profanity measure the children in the room, or does it measure the adults in the room?

Schrodinger's chat (room).

briantakita2y ago

Until every fucking wanker who reads this article adds profanity to their shitty code expecting their bullshit to be better.

coding1232y ago

in other words, it increases the chance that the programmer is in a specific locale (like the US?) such that the location has less bad programmers than other locations.

And probably, increases the chance that the person is fed up with fixing someone else's code - hence the anger

twodave2y ago

The best CS professor I ever had always said that the #1 language among programmers is profanity.

paxys2y ago

I'd first like to know how they judged what is "good" vs "bad" code.

DonHopkins2y ago

https://opensource.apple.com/source/emacs/emacs-59.0.80/emac...

1990-08-26 Richard Stallman (rms@mole.ai.mit.edu)

* terminal.el: Move possibly offensive comments to term-nasty.el.

https://www.digiater.nl/openvms/freeware/v10/emacs/common/li...

[...]

    ;; disgusting unix-required shit
    ;;  Are we living twenty years in the past yet?

    (defun te-losing-unix ()
      nil)

[...]

    ;; (A version of the following comment which might be distractingly offensive
    ;; to some readers has been moved to term-nasty.el.)
    ;; unix lacks ITS-style tty control...
    (defun te-process-output (preemptable)
      ;;>> There seems no good reason to ever disallow preemption
      (setq preemptable t)

[...]

              ;; I suppose if I split the guts of this out into a separate
              ;;  function we could trivially emulate different terminals
              ;; Who cares in any case?  (Apart from stupid losers using rlogin)

[...]

                                     (?\C-b . te-backward-char)
                                     ;; should be C-d, but un*x
                                     ;;  pty's won't send \004 through!
                                     ;; Can you believe this?

[...]

                                     ;; Did I ask to be sent these characters?
                                     ;; I don't remember doing so, either.
                                     ;; (Perhaps some operating system or
                                     ;; other is completely incompetent...)

[...]

                         ;;-- Not-widely-known (ie nonstandard) flags, which mean
                         ;; o writing in the last column of the last line
                         ;;   doesn't cause idiotic scrolling, and
                         ;; o don't use idiotische c-s/c-q sogenannte
                         ;;   ``flow control'' auf keinen Fall.
                         "LP:NF:"
                         ;;-- For stupid or obsolete programs
                         "ic=^p_!:dc=^pd!:al=^p^o!:dl=^p^k!:ho=^p=  :"
                         ;;-- For disgusting programs.
                         ;; (VI? What losers need these, I wonder?)
                         "im=:ei=:dm=:ed=:mi:do=^p^j:nl=^p^j:bs:")))

[...]

              (setq te-process
                    (start-process "terminal-emulator" (current-buffer)
                                   "/bin/sh" "-c"
                                   ;; Yuck!!! Start a shell to set some terminal
                                   ;; control characteristics.  Then start the
                                   ;; "env" program to setup the terminal type
                                   ;; Then finally start the program we wanted.
                                   (format "%s; exec %s"
                                           te-stty-string
                                           (mapconcat 'te-quote-arg-for-sh
                                                      (cons program args) " ")))))

[...]

    ;;;; what a complete loss

[...]

https://www.digiater.nl/openvms/freeware/v10/emacs/common/li...

    ;;; term-nasty.el --- Damned Things from terminfo.el
    ;;; This file is in the public domain, and was written by Stallman and Mlynarik

    ;;; Commentary:

    ;; Some people used to be bothered by the following comments that were
    ;; found in terminal.el.  We decided they were distracting, and that it
    ;; was better not to have them there.  On the other hand, we didn't want
    ;; to appear to be giving in to the pressure to censor obscenity that
    ;; currently threatens freedom of speech and of the press in the US.
    ;; So we decided to put the comments here.

    ;;; Code:

    These comments were removed from te-losing-unix.
      ;(what lossage)
      ;(message "fucking-unix: %d" char)

    This was before te-process-output.
    ;; fucking unix has -such- braindamaged lack of tty control...

    And about the need to handle output characters such as C-m, C-g, C-h
    and C-i even though the termcap doesn't say they may be used:
    ;fuck me harder
    ;again and again!
    ;wa12id!!
    ;(spiked)

    ;;; term-nasty.el ends here

Note to the gentle readers: "wa12id" stands for "with a 12 inch dildo".

Jamie Zawinski kept Lucid Emacs nasty:

https://groups.google.com/g/gnu.misc.discuss/c/U5oXKOfWinQ/m...

Noah Friedman, Aug 3, 1992, 4:54:20 AM

In article <15i2n9...@hal.com> wood...@hal.com (Nathan Hess) writes:

>In article <FRIEDMAN.9...@nutrimat.gnu.ai.mit.edu>, friedman@gnu (Noah Friedman) writes:

>>It's by no means necessary, but it's funny.

>Along the same lines, look at lisp/terminal.el

Of course, terminal.el is actually useful, albeit not terribly powerful.

(and terminal.el is pretty mild compared to some of the other things I've seen written by mly. :-))

Incidentally, a lot of terminal.el has been rewritten in version 19.

Too bad... I liked all the variable names and comments in the original.

Jamie Zawinski, Aug 5, 1992, 12:40:38 AM

In the FSF-distributed Emacs 19, the obscenities (will) have been stripped from terminal.el, though they are preserved in a file called term-nasty.el, to avoid appearing to bow to the censors.

In Lucid GNU Emacs, terminal.el will remain as nasty as it ever was.

-- Jamie "Truth, Justice, and the Fucking First Amendment" Zawinski

francasso2y ago

This is an example where correlation does imply causation IMHO

grayhatter2y ago

ftxbro2y ago

adding swears into codebases would improve their quality

twodave2y ago

Article should have included some juicy examples. 4/10

bjornsing2y ago

Sure, that means someone cares.

pak9rabid2y ago