If you took the 100 biggest code bases in the world. I would be surprised if more than 10% of them were open source.
[1] https://www.informationisbeautiful.net/visualizations/millio...
And again, that's for a middle size company you haven't even heard of. FAANG, Microsoft, Oracle, IBM for sure will have stuff dwarfing that.
https://devblogs.microsoft.com/oldnewthing/20180326-00/?p=98...
Bonus chatter: In a discussion of Windows source control, one person argued that git works great on large projects and gave an example of a large git repo: “For example, the linux kernel repository at roughly eight years old is 800–900MB in size, has about 45,000 files, and is considered to have heavy churn, with 400,000 commits.”
I found that adorable. You have 45,000 files. Yeah, call me when your repo starts to get big. The Windows repo has over three million files.
Four hundred thousand commits over eight years averages to around [130] commits per day. This is heavy churn? That’s so cute.
You know what we call a day with [130] commits? “Catastrophic network outage.”
Also, even if we normalized both cases for 'code files only' and/or 'kernel code only', there could still be architectural, code style, and development process differences that lead to different metrics for each project.
Most open-source projects have one to maybe five real contributors, outside the drive-by pull requests to fix some bug.