The Linux kernel has surpassed one million git commits (opens in new tab)

(git.kernel.org)

250 pointscalmingsolitude4y ago127 comments

127 comments

totorovirus4y ago

In next version of Sid Meier's Civilization there should be Linux Kernel in technology tree.

1_player4y ago

Wouldn't that make it available to only one civilization?

Should be a shared Wonder all civilizations would benefit from.

nicklecompte4y ago

Shared wonder is a much better idea - every civilization that researches Computers tech will have operating systems, but only one gets to develop Linux (let’s say it grants a science boost to everyone who finished researching Computers, but a huge culture/diplomacy boost to the civ that actually “built” Linux).

Similar to every civ gets a library, but there’s only one Library of Alexandria, etc.

1 more reply

Waterluvian4y ago

The tech tree is researchable by everyone.

1 more reply

eru4y ago

Technologies are usually available for everyone to research.

1 more reply

Redoubts4y ago

Well, it could also be like the Manhattan Project. Once you do it, everyone can get nukes.

indy4y ago

And one of the disasters should be a craze for Bitcoin mining.

peterkelly4y ago

The next time you hear someone make an argument against remote work, remind them of what has been achieved in the last 30 years by the Linux kernel developers working across so many different countries and time zones.

wheels4y ago

So, I like working remotely, but there are many fallacies in your statement:

Distributed open source projects are obviously an example of survivorship bias: the people who thrive in them are those that work well in a distributed environment. Contributors are also usually self-motived (again, by survivorship) in a way that one wouldn't expect of rank-and-file office workers.

Also, the existence of a thing doesn't show that its genesis was an optimal path to its current state.

Additionally, large open source projects are not necessarily good at converging upon a particular goal. They do well when the goals of many individuals or small groups aggregate well. It would be very difficult to convince all of the kernel team to, e.g. optimize for mobile performance for the next two years.

Nobody contests that large things can be done by distributed teams. Usually there's contention that a particular set of employees, that work on a particular set of projects / goals can be transitioned to working that way.

u801e4y ago

> Distributed open source projects are obviously an example of survivorship bias: the people who thrive in them are those that work well in a distributed environment. Contributors are also usually self-motived (again, by survivorship) in a way that one wouldn't expect of rank-and-file office workers.

Couldn't the same argument be made for those who thrive in an office environment and need to be around other people to work effectively as well as those who are not self-motivated?

I guess it really depends on what the majority prefers from a worker/contributor point of view.

whazor4y ago

Interesting enough, the Linux kernel as a whole is growing exponentially in size but this is mainly because of new drivers being added and maintained. The base code has a linear growth. The interfaces allow driver developers to work on the kernel asynchronously, and are I think the key.

readams4y ago

Just imagine how much easier it would have been if they were all in the same place.

GoOnThenDoTell4y ago

The core devs meet up all the time at conferences

NieDzejkob4y ago

Perhaps I'm blind, but I can't actually see the number of commits on this page.

dorianmariefr4y ago

visible on the github repo https://github.com/torvalds/linux

gigatexal4y ago

The world is so much better off with Linux and Linus and his merry band of hackers. They do an amazing job keeping up with the sheer amount of work that goes into the kernel from everyday needs like you and me to patch sets from companies. But what I am most proud of is git. It’s not perfect but since learning its warts it’s the Swiss Army knife of awesome just like GNU find is for me on the terminal.

sfgweilr4f4y ago

Which files get the most commits?

Which sections of each files get more than usual commits? eg which functions?

Who wrote this particular function first? who subsequently?

How many of those include expletives? Curious minds need to know.

If I want to answer these earth shattering questions, could I just grab the entire git repo and go from there? is it that simple? is it text exportable without too much "other" scary?

sega_sai4y ago

Here is the top 20 files by the number of commits

  12846 MAINTAINERS
   4167 drivers/gpu/drm/i915/intel_display.c
   3330 drivers/gpu/drm/i915/i915_drv.h
   2360 drivers/gpu/drm/i915/i915_gem.c
   2328 arch/arm/Kconfig
   2118 arch/x86/kvm/x86.c
   2079 sound/pci/hda/patch_realtek.c
   2019 Makefile
   2001 fs/btrfs/inode.c
   1927 include/linux/sched.h
   1903 net/core/dev.c
   1888 drivers/gpu/drm/i915/i915_reg.h
   1824 arch/arm/boot/dts/Makefile
   1801 drivers/gpu/drm/i915/intel_pm.c
   1781 include/linux/fs.h
   1763 arch/x86/Kconfig
   1737 mm/page_alloc.c
   1643 kernel/sched.c
   1628 fs/btrfs/extent-tree.c

It's quite interesting how present intel gpus are in the list, which probably tells something about the code on them...

NullPrefix4y ago

>which probably tells something about the code on them...

Million ways to interpret this.

Lots of commits could mean that it was constantly improved or there was a constant stream of bugs.

A single commit could mean that it was a perfect masterpiece on first try or it was simply forgotten.

sdesol4y ago

Here is what I got with additional information

   commits | authors |       first_commit       |      last_commit       |                 path
  ---------+---------+--------------------------+------------------------+--------------------------------------
     12846 |    2438 | 16 years 16 days         | 00:00:00               | MAINTAINERS
      4167 |     205 | 12 years 4 mons 4 days   | 1 year 10 mons 13 days | drivers/gpu/drm/i915/intel_display.c
      3330 |     205 | 12 years 9 mons 19 days  | 2 days                 | drivers/gpu/drm/i915/i915_drv.h
      2360 |     146 | 12 years 6 mons 16 days  | 1 mon 9 days           | drivers/gpu/drm/i915/i915_gem.c
      2328 |     429 | 16 years 16 days         | 2 days                 | arch/arm/Kconfig
      2118 |     315 | 13 years 3 mons 3 days   | 1 day                  | arch/x86/kvm/x86.c
      2079 |     216 | 16 years 16 days         | 4 days                 | sound/pci/hda/patch_realtek.c
      2019 |     300 | 16 years 16 days         | 1 day                  | Makefile
      2001 |     186 | 13 years 10 mons 20 days | 5 days                 | fs/btrfs/inode.c
      1927 |     367 | 16 years 16 days         | 2 days                 | include/linux/sched.h
      1903 |     388 | 16 years 16 days         | 6 days                 | net/core/dev.c
      1888 |     177 | 12 years 6 mons 16 days  | 1 mon 9 days           | drivers/gpu/drm/i915/i915_reg.h
      1824 |     494 | 8 years 7 mons 18 days   | 6 days                 | arch/arm/boot/dts/Makefile
      1801 |     101 | 9 years 14 days          | 4 days                 | drivers/gpu/drm/i915/intel_pm.c
      1781 |     306 | 16 years 16 days         | 2 days                 | include/linux/fs.h
      1763 |     392 | 13 years 5 mons 20 days  | 2 days                 | arch/x86/Kconfig
      1737 |     391 | 16 years 16 days         | 2 days                 | mm/page_alloc.c
      1643 |     258 | 16 years 16 days         | 9 years 3 mons 27 days | kernel/sched.c
      1628 |     121 | 12 years 4 mons 4 days   | 1 year 8 mons 12 days  | drivers/gpu/drm/i915/intel_drv.h
      1628 |     118 | 14 years 2 mons 4 days   | 13 days                | fs/btrfs/extent-tree.c

Note: Authors may and probably is incorrect since a single user could have committed with different emails.

sfgweilr4f4y ago

Personally I'd like NVidia to be somewhere in the Top 50 of commits, or Top 100, Top 200. At least for a few weeks. Just saying.

Is wanting them in the top 1000 being too cynical or just wishful thinking?

1 more reply

mobilemidget4y ago

After you calculated all that and let us know :)

https://gource.io

To create visuals of the gitrepo history/commits.

sfgweilr4f4y ago

what a fascinating tool.

19h4y ago

I’d find a list of files that have the least changes more interesting (excluding text files et al.).

And then check if the lack of activity is related to the stability of the code, lack of use or its complexity.

bogota4y ago

You could. But I would recommend downloading it to a RAM disk to make generating that a bit faster. The which function sees the most use would likely take a bit of work to figure out.

DecoPerson4y ago

This really speaks to the reliability of Git.

Are there any examples of projects with 1kk+ commits that use SVN, Mercurial, Perforce, or some other SCM?

Cyph0n4y ago

Mercurial was used at Facebook afaik, and I would guess they ended up exceeding 1 million.

frob4y ago

When I left, the diff number was in the 15 million range. Not all diffs are landed, but I would assume >60% are, so FB's repo is almost certainly above 10M commits

1 more reply

eru4y ago

And Google uses a hacked up Perforce.

1 more reply

Calzifer4y ago

Apache had a single SVN repository for all projects in the past. That reached 1889412 commits.

https://svn.apache.org/viewvc

jcranmer4y ago

http://hg.mozilla.org/try appears to have over 3M commits, and probably in excess of 100k heads (effectively git branches, although I don't think git has any proper term for a commit with no children that isn't referred to by a branch).

Strictly speaking, it's not actually the main project repository (which has closer to 600k commits), but the repository that contains what is effectively all of the pull requests for the past several years (more specifically, all the changes you want to test in automation).

The closed-source monorepos of Google (perforce IIRC), Facebook (Mercurial), and Microsoft (Git) are all going to be far larger than any open-source repository, of which Linux is in the largest size class but not the largest (I believe Chromium's the largest open-source repo I've found).

elteto4y ago

> although I don't think git has any proper term for a commit with no children that isn't referred to by a branch

I think this would be one case of a “detached head”.

mfateev4y ago

Google is mimicking perforce command line. The backend is 100% proprietary.

Microsoft is based on Git, but with a lot of engineering on top of it: https://devblogs.microsoft.com/bharry/scaling-git-and-some-b...

jeffbee4y ago

Google announced they had 35 million commits to their monorepo, five years ago.

WanderPanda4y ago

Do they have a quick response team to incarcerate newbies who commit binaries to their giant monster?

3 more replies

neurocline4y ago

Epic’s Unreal Perforce repo is >1.5 million at this point.

jedimastert4y ago

>1kk

That's going in the ol' geek toolbox

the84724y ago

just use the other SI prefixes. 1M.

https://en.wikipedia.org/wiki/Metric_prefix#List_of_SI_prefi...

1 more reply

maccard4y ago

Epic Games' p4 depot has well over 1mm changelists. Many of those numbers are taken up by developer changes that never get submitted, and many are automated merges though

kingsuper204y ago

Isn't OpenBSD at the 500k'ish mark using CVS?

kibwen4y ago

I thought I read that Linux jettisons old history every few years for the sake of practicality, and that if you want the full history you have to look at special archive repos. Am I wrong? I wouldn't blame them; git is fast, but it's not that fast, and cloning becomes a bear after only a few hundred thousand commits (and I would be surprised if that's the only operation that scales poorly).

CorrectHorseBat4y ago

No they don't do that. Only happened once when moving to git: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/lin...

1 more reply

tannhaeuser4y ago

Now, the question is when systemd surpasses Linux in terms of commits/LOCs ;)

tofof4y ago

Surpassed, not bypassed.

bpodgursky4y ago

Lot of people view 1mm as a huge number of commits. Which maybe it is... if your team has a habit of big PRs + rebasing.

On the other hand, if your team is used to making quick iterative commits, throwing them in a PR, never rebasing, and pulling in merge commits all over the place, uh, I can attest that you can get to a million commits pretty fast.

LeegleechN4y ago

Closing in on a nice, round 2^20!

DecoPerson4y ago

I wonder if the Git project has tests for over 2^20 commits.

globular-toast4y ago

Git's data model doesn't care how many commits there are. It doesn't care how many objects there are and commits are just one type of object. The git repo probably has tens to hundreds of millions of objects. The limiting factor is probably the filesystem, but I think most can handle it just fine.

kibwen4y ago

How do other open source projects compare? I'll admit, I would have figured that Linux had passed one million commits some time ago, and I feel like web browsers might give Linux a run for its money here.

aexl4y ago

The "mozilla-unified" repository seems to be at 646.5k commits and the "chromium" repository at 999.5k commits.

muterad_murilax4y ago

Honestly expected it to be way more.

andrewclunn4y ago

Time to Squash it down to a few hundred thousand...

bitcharmer4y ago

It truly is the greatest open source project of all time. I just wish they moved away from email as the only way of contributing. It doesn't really scale well and definitely doesn't make contributing for newbies easier.

otabdeveloper44y ago

> It doesn't really scale well

Evidence? One million commits seems to indicate otherwise.

bitcharmer4y ago

Have the Linux foundation ever tried anything else?

1 more reply

adwn4y ago

> I just wish they moved away from email as the only way of contributing.

What alternative do you suggest, and in what way is it better than email?

> It doesn't really scale well [...]

There are few things that scale better than email...

> [...] doesn't make contributing for newbies easier.

That's a good thing: If you haven't even mastered sending a plaintext email, I really wouldn't expect you to be able to constructively contribute to kernel development. Feel free to experiment with your local copy, though – it's open source after all.

bitcharmer4y ago

> If you haven't even mastered sending a plaintext email

Surprise, I've had a few of my patches accepted and this statement is just thinly veiled insult, if I ever saw one.

Have you ever heard of pull requests? Pull requests are 10x easier than adhering to lkml's email rituals.

2 more replies

williamdclt4y ago

> If you haven't even mastered sending a plaintext email, , I really wouldn't expect you to be able to constructively contribute to kernel development.

That's entirely BS. To contribute to these projects (my experience is contributing to Git), you need to respect a dozen conventions that seem to come from another age. Just subscribing to the mailing list is not trivial for somebody in their twenties that never had to do something like that: it's the sort of things that's easy in retrospect, but the UX is hard to discover and the lack of parallel with other tools we regularly use (such mailing lists aren't a thing that most devs use) adds a huge amount of friction.

And then you need to find somewhere that explain the conventions to try and contribute and figure out how to configure your email client, how to get a patch for your commits, how to insert your patch in your email, how to write an acceptable email subject and an acceptable email body and how it relates to your commit message, who you should CC, how to handle multi-commits contributions, how to answer emails (while respecting another half-dozen conventions)...

It's not impossible, but there's a dozen things you need to figure out, half of them you don't even _know_ you need to figure out, so it's a lot of friction. This friction might be a good thing (that's another debate), but saying "you just need to be able to send a plaintext email" is completely false and dismissive.

2 more replies

tester7564y ago

I always wondered - how do people read those emails?

in just plain text with no colors for code?

it must be painful as hell

2 more replies

jorvi4y ago

I do think committing via Github would significantly lower the friction of contributing to the Linux kernel for a chunk of the developer population, and there might actually be a decent well of untapped potential there.

Of course there will also be a lot of pull request noise, so I can't say if that tradeoff is ultimately worth it.

Edit: well I guess OSS people are extremely hostile to even the discussion of more people contributing to the Linux kernel. My bad.

2 more replies

gspr4y ago

> I just wish they moved away from email as the only way of contributing. It doesn't really scale well

Bullshit. What you really means is the second half of your comment:

> and definitely doesn't make contributing for newbies easier.

That's fair enough. I would guess that the scalability and workflow of the actual developers is more important.

throwaway36994y ago

This push for every project to be as 'easy' as possible to contribute to is just weird to me. Like entering a room and immediately uplifting their entire productivity flow.

mch824y ago

I’m curious if you use the Linux email list and archive?

I’ve been thinking about how Linux and Wikipedia use an email list that is archived to a website. The archive can be browsed like an issue tracker. Many people spend their day in their email app. I wonder if maybe most projects aren’t using email correctly…

ziml774y ago

If I were contributing to Linux I would prefer some nicer methods of submitting patches, but email seems to be working out just fine for the project. Also I wonder if doing it by email creates a bit of quality filter by adding that little speed bump. It's not as simple to contribute as clicking "fork" and then "submit pull request", but also not something that discriminates since email is a system open to anyone.

DestruKaneda4y ago

Bypassed? As in dropped 1 million commits?

caffeinatedgoat4y ago

There's really nothing significant about this, other than people like big round numbers.

macksd4y ago

The millionth commit in isolation? Sure. The scale it takes to get into that order of magnitude? Quite an achievement.

Ambolia4y ago

There's nothing significant about anything, other than people liking it.

agons4y ago

This can definitely be significant as part of a trend

1 more reply

anothernewdude4y ago

We really do.

cryptica4y ago

Part of me wonders why nobody tries to make software like we make buildings... After some time, it's all done and nothing else needs to be added.

People will be quick to point out that "the hardware keeps changing so the software has to adapt". This is true, but why not design the software in such a way that different drivers can easily be substituted (so the drivers can change but the interface doesn't)?

I did this with my open source project. I haven't made any code changes for over a year and it still works perfectly and still relevant.

I don't understand why there is such a fetish in this industry for never finishing any project. I find the whole attitude very frustrating.

DC-34y ago

You know buildings also require maintenance, right?

cryptica4y ago

Hardly any compared to software. We're talking about 1 change every 20 years versus 1 to 100s of changes per day. That's a huge magnitude of difference. But that's not even important to my argument.

To suggest that software maintenance and building maintenance are anything alike is ignoring the entire context of the two activities.

With buildings, builders have very little control over the wear and tear caused by the environment. In software, developers collectively have total control over the software and hardware environment which determines whether or not software breaks. Most of the wear and tear in software is a direct result of people compulsively changing stuff in other software (or hardware) up the stack - It's all 100% avoidable. If the software never changed materially (aside from bug fixes), it would not break. Simple as that.

And most of the software changes are simply taking us round in circles. New generations of developers undoing the work of the previous generation, then later reversing direction again, surely we've all seen it happening at most of the software companies we've worked at...

1 more reply

sophacles4y ago

And they regularly get overhauls and additions.

uncomputation4y ago

A couple things:

> so the drivers can change but the interface doesn't

This is already how it is. Take write(sockfd, …). Sure there are some configuration options in the parameters but nothing compared to the real complexity of networking. This is the downside to abstraction; roughly, the least complex implementation wins. Eventually, we shift and add more and more standards, but it’s never cutting edge (and for good reason).

> I haven't made any code changes for over a year

Relatively speaking, a year is nothing in the timeline of software, so this is not surprising and it’s likely that even if you hadn’t written your software in a abstracted way (which kudos to you for doing so), it would still be fine after only one year. write() has been the same interface for over 40 years.

Also, not to rain on anybody’s parade, but OSS is - generally - not held to the same performance standards that proprietary code is. This makes sense intuitively right? “If I’m paying for it, it better work.” And the vast, vast majority of code is not OSS. We just get a false impression since, by definition, we only have access to OSS. The worst that can happen for bad OSS is lack of adoption or a tsunami of incoming GitHub issues. For proprietary code, you could lose your job if a product doesn’t take.

> I don't understand why there is such a fetish in this industry for never finishing any project

This is similar to the argument that a company, once it has a good product, should just stop. Why do we need updates?, I like the features we have, Don’t change it, it’s perfect. But to survive in the market - not just on GitHub or code coverage tests - requires constant competition and innovation. If Intel launches a new multi-register write feature, Chip Company X can’t just say “Well our project is done.” It’s not anymore! And if it, then Chip Company X might be done too…

nikanj4y ago

> Also, not to rain on anybody’s parade, but OSS is - generally - not held to the same performance standards that proprietary code is. This makes sense intuitively right? “If I’m paying for it, it better work.”

I've noticed the opposite to be true. Code from a proprietary vendor is buggy? Too bad, the corp just a faceless, nameless borg of an entity that doesn't care about your bugs. OSS has bugs? You can easily go rant at the poor coders who are working on it

2 more replies

fiddlerwoaroof4y ago

> This is similar to the argument that a company, once it has a good product, should just stop. Why do we need updates?, I like the features we have, Don’t change it, it’s perfect.

The annoying thing to me is that the actual software development rule seems to be “improve it until it breaks”: Slack was pretty great for a while, but at a certain point they started adding misfeatures (the new rich text input is still really annoying in a hundred little ways) and eventually the app just became really buggy: I’m still stuck with it for Reasons, but I’m constantly reloading/force-quitting it just to read messages and bits of the UI appear and disappear seemingly at random.

npteljes4y ago

A work of art is never finished, merely abandoned. Also, Why do you think building are finished at any time? The insides change a lot, living space expands to the attic, old wiring get replaced, walls get extra support, roof gets patched, replaced, or maybe they even build a new story.

j / k navigate · click thread line to collapse

127 comments

totorovirus4y ago

In next version of Sid Meier's Civilization there should be Linux Kernel in technology tree.

1_player4y ago

Wouldn't that make it available to only one civilization?

Should be a shared Wonder all civilizations would benefit from.

nicklecompte4y ago

Similar to every civ gets a library, but there’s only one Library of Alexandria, etc.

1 more reply

Waterluvian4y ago

The tech tree is researchable by everyone.

1 more reply

eru4y ago

Technologies are usually available for everyone to research.

1 more reply

Redoubts4y ago

Well, it could also be like the Manhattan Project. Once you do it, everyone can get nukes.

indy4y ago

And one of the disasters should be a craze for Bitcoin mining.

peterkelly4y ago

wheels4y ago

So, I like working remotely, but there are many fallacies in your statement:

Also, the existence of a thing doesn't show that its genesis was an optimal path to its current state.

u801e4y ago

Couldn't the same argument be made for those who thrive in an office environment and need to be around other people to work effectively as well as those who are not self-motivated?

I guess it really depends on what the majority prefers from a worker/contributor point of view.

whazor4y ago

readams4y ago

Just imagine how much easier it would have been if they were all in the same place.

GoOnThenDoTell4y ago

The core devs meet up all the time at conferences

NieDzejkob4y ago

Perhaps I'm blind, but I can't actually see the number of commits on this page.

dorianmariefr4y ago

visible on the github repo https://github.com/torvalds/linux

gigatexal4y ago

sfgweilr4f4y ago

Which files get the most commits?

Which sections of each files get more than usual commits? eg which functions?

Who wrote this particular function first? who subsequently?

How many of those include expletives? Curious minds need to know.

If I want to answer these earth shattering questions, could I just grab the entire git repo and go from there? is it that simple? is it text exportable without too much "other" scary?

sega_sai4y ago

Here is the top 20 files by the number of commits

  12846 MAINTAINERS
   4167 drivers/gpu/drm/i915/intel_display.c
   3330 drivers/gpu/drm/i915/i915_drv.h
   2360 drivers/gpu/drm/i915/i915_gem.c
   2328 arch/arm/Kconfig
   2118 arch/x86/kvm/x86.c
   2079 sound/pci/hda/patch_realtek.c
   2019 Makefile
   2001 fs/btrfs/inode.c
   1927 include/linux/sched.h
   1903 net/core/dev.c
   1888 drivers/gpu/drm/i915/i915_reg.h
   1824 arch/arm/boot/dts/Makefile
   1801 drivers/gpu/drm/i915/intel_pm.c
   1781 include/linux/fs.h
   1763 arch/x86/Kconfig
   1737 mm/page_alloc.c
   1643 kernel/sched.c
   1628 fs/btrfs/extent-tree.c

It's quite interesting how present intel gpus are in the list, which probably tells something about the code on them...

NullPrefix4y ago

>which probably tells something about the code on them...

Million ways to interpret this.

Lots of commits could mean that it was constantly improved or there was a constant stream of bugs.

A single commit could mean that it was a perfect masterpiece on first try or it was simply forgotten.

sdesol4y ago

Here is what I got with additional information

   commits | authors |       first_commit       |      last_commit       |                 path
  ---------+---------+--------------------------+------------------------+--------------------------------------
     12846 |    2438 | 16 years 16 days         | 00:00:00               | MAINTAINERS
      4167 |     205 | 12 years 4 mons 4 days   | 1 year 10 mons 13 days | drivers/gpu/drm/i915/intel_display.c
      3330 |     205 | 12 years 9 mons 19 days  | 2 days                 | drivers/gpu/drm/i915/i915_drv.h
      2360 |     146 | 12 years 6 mons 16 days  | 1 mon 9 days           | drivers/gpu/drm/i915/i915_gem.c
      2328 |     429 | 16 years 16 days         | 2 days                 | arch/arm/Kconfig
      2118 |     315 | 13 years 3 mons 3 days   | 1 day                  | arch/x86/kvm/x86.c
      2079 |     216 | 16 years 16 days         | 4 days                 | sound/pci/hda/patch_realtek.c
      2019 |     300 | 16 years 16 days         | 1 day                  | Makefile
      2001 |     186 | 13 years 10 mons 20 days | 5 days                 | fs/btrfs/inode.c
      1927 |     367 | 16 years 16 days         | 2 days                 | include/linux/sched.h
      1903 |     388 | 16 years 16 days         | 6 days                 | net/core/dev.c
      1888 |     177 | 12 years 6 mons 16 days  | 1 mon 9 days           | drivers/gpu/drm/i915/i915_reg.h
      1824 |     494 | 8 years 7 mons 18 days   | 6 days                 | arch/arm/boot/dts/Makefile
      1801 |     101 | 9 years 14 days          | 4 days                 | drivers/gpu/drm/i915/intel_pm.c
      1781 |     306 | 16 years 16 days         | 2 days                 | include/linux/fs.h
      1763 |     392 | 13 years 5 mons 20 days  | 2 days                 | arch/x86/Kconfig
      1737 |     391 | 16 years 16 days         | 2 days                 | mm/page_alloc.c
      1643 |     258 | 16 years 16 days         | 9 years 3 mons 27 days | kernel/sched.c
      1628 |     121 | 12 years 4 mons 4 days   | 1 year 8 mons 12 days  | drivers/gpu/drm/i915/intel_drv.h
      1628 |     118 | 14 years 2 mons 4 days   | 13 days                | fs/btrfs/extent-tree.c

Note: Authors may and probably is incorrect since a single user could have committed with different emails.

sfgweilr4f4y ago

Personally I'd like NVidia to be somewhere in the Top 50 of commits, or Top 100, Top 200. At least for a few weeks. Just saying.

Is wanting them in the top 1000 being too cynical or just wishful thinking?

1 more reply

mobilemidget4y ago

After you calculated all that and let us know :)

https://gource.io

To create visuals of the gitrepo history/commits.

sfgweilr4f4y ago

what a fascinating tool.

19h4y ago

I’d find a list of files that have the least changes more interesting (excluding text files et al.).

And then check if the lack of activity is related to the stability of the code, lack of use or its complexity.

bogota4y ago

You could. But I would recommend downloading it to a RAM disk to make generating that a bit faster. The which function sees the most use would likely take a bit of work to figure out.

DecoPerson4y ago

This really speaks to the reliability of Git.

Are there any examples of projects with 1kk+ commits that use SVN, Mercurial, Perforce, or some other SCM?

Cyph0n4y ago

Mercurial was used at Facebook afaik, and I would guess they ended up exceeding 1 million.

frob4y ago

When I left, the diff number was in the 15 million range. Not all diffs are landed, but I would assume >60% are, so FB's repo is almost certainly above 10M commits

1 more reply

eru4y ago

And Google uses a hacked up Perforce.

1 more reply

Calzifer4y ago

Apache had a single SVN repository for all projects in the past. That reached 1889412 commits.

https://svn.apache.org/viewvc

jcranmer4y ago

elteto4y ago

> although I don't think git has any proper term for a commit with no children that isn't referred to by a branch

I think this would be one case of a “detached head”.

mfateev4y ago

Google is mimicking perforce command line. The backend is 100% proprietary.

Microsoft is based on Git, but with a lot of engineering on top of it: https://devblogs.microsoft.com/bharry/scaling-git-and-some-b...

jeffbee4y ago

Google announced they had 35 million commits to their monorepo, five years ago.

WanderPanda4y ago

Do they have a quick response team to incarcerate newbies who commit binaries to their giant monster?

3 more replies

neurocline4y ago

Epic’s Unreal Perforce repo is >1.5 million at this point.

jedimastert4y ago

>1kk

That's going in the ol' geek toolbox

the84724y ago

just use the other SI prefixes. 1M.

https://en.wikipedia.org/wiki/Metric_prefix#List_of_SI_prefi...

1 more reply

maccard4y ago

Epic Games' p4 depot has well over 1mm changelists. Many of those numbers are taken up by developer changes that never get submitted, and many are automated merges though

kingsuper204y ago

Isn't OpenBSD at the 500k'ish mark using CVS?

kibwen4y ago

CorrectHorseBat4y ago

No they don't do that. Only happened once when moving to git: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/lin...

1 more reply

tannhaeuser4y ago

Now, the question is when systemd surpasses Linux in terms of commits/LOCs ;)

tofof4y ago

Surpassed, not bypassed.

bpodgursky4y ago

Lot of people view 1mm as a huge number of commits. Which maybe it is... if your team has a habit of big PRs + rebasing.

LeegleechN4y ago

Closing in on a nice, round 2^20!

DecoPerson4y ago

I wonder if the Git project has tests for over 2^20 commits.

globular-toast4y ago

kibwen4y ago

aexl4y ago

The "mozilla-unified" repository seems to be at 646.5k commits and the "chromium" repository at 999.5k commits.

muterad_murilax4y ago

Honestly expected it to be way more.

andrewclunn4y ago

Time to Squash it down to a few hundred thousand...

bitcharmer4y ago

otabdeveloper44y ago

> It doesn't really scale well

Evidence? One million commits seems to indicate otherwise.

bitcharmer4y ago

Have the Linux foundation ever tried anything else?

1 more reply

adwn4y ago

> I just wish they moved away from email as the only way of contributing.

What alternative do you suggest, and in what way is it better than email?

> It doesn't really scale well [...]

There are few things that scale better than email...

> [...] doesn't make contributing for newbies easier.

bitcharmer4y ago

> If you haven't even mastered sending a plaintext email

Surprise, I've had a few of my patches accepted and this statement is just thinly veiled insult, if I ever saw one.

Have you ever heard of pull requests? Pull requests are 10x easier than adhering to lkml's email rituals.

2 more replies

williamdclt4y ago

> If you haven't even mastered sending a plaintext email, , I really wouldn't expect you to be able to constructively contribute to kernel development.

2 more replies

tester7564y ago

I always wondered - how do people read those emails?

in just plain text with no colors for code?

it must be painful as hell

2 more replies

jorvi4y ago

Of course there will also be a lot of pull request noise, so I can't say if that tradeoff is ultimately worth it.

Edit: well I guess OSS people are extremely hostile to even the discussion of more people contributing to the Linux kernel. My bad.

2 more replies

gspr4y ago

> I just wish they moved away from email as the only way of contributing. It doesn't really scale well

Bullshit. What you really means is the second half of your comment:

> and definitely doesn't make contributing for newbies easier.

That's fair enough. I would guess that the scalability and workflow of the actual developers is more important.

throwaway36994y ago

This push for every project to be as 'easy' as possible to contribute to is just weird to me. Like entering a room and immediately uplifting their entire productivity flow.

mch824y ago

I’m curious if you use the Linux email list and archive?

ziml774y ago

DestruKaneda4y ago

Bypassed? As in dropped 1 million commits?

caffeinatedgoat4y ago

There's really nothing significant about this, other than people like big round numbers.

macksd4y ago

The millionth commit in isolation? Sure. The scale it takes to get into that order of magnitude? Quite an achievement.

Ambolia4y ago

There's nothing significant about anything, other than people liking it.

agons4y ago

This can definitely be significant as part of a trend

1 more reply

anothernewdude4y ago

We really do.

cryptica4y ago

Part of me wonders why nobody tries to make software like we make buildings... After some time, it's all done and nothing else needs to be added.

I did this with my open source project. I haven't made any code changes for over a year and it still works perfectly and still relevant.

I don't understand why there is such a fetish in this industry for never finishing any project. I find the whole attitude very frustrating.

DC-34y ago

You know buildings also require maintenance, right?

cryptica4y ago

Hardly any compared to software. We're talking about 1 change every 20 years versus 1 to 100s of changes per day. That's a huge magnitude of difference. But that's not even important to my argument.

To suggest that software maintenance and building maintenance are anything alike is ignoring the entire context of the two activities.

1 more reply

sophacles4y ago

And they regularly get overhauls and additions.

uncomputation4y ago

A couple things:

> so the drivers can change but the interface doesn't

> I haven't made any code changes for over a year

> I don't understand why there is such a fetish in this industry for never finishing any project

nikanj4y ago

2 more replies

fiddlerwoaroof4y ago

> This is similar to the argument that a company, once it has a good product, should just stop. Why do we need updates?, I like the features we have, Don’t change it, it’s perfect.

npteljes4y ago

j / k navigate · click thread line to collapse