TDD from the Factorio Team (opens in new tab)

(factorio.com)

457 pointssorahn5y ago303 comments

303 comments

132 comments · 18 top-level

ramblerman5y ago· 36 in thread

I'd be curious to hear what kovarex thinks in 2-3 months.

TDD is often sold as a fix-all solution, which is incredibly appealing to mgmt and quite fun for most programmers as a new paradigm, allowing for quick adoption.

It also has its uses, especially in the enterprise space where requirements aren't often clear. But I don't know many good programmers that truly stick to the dogma after the honeymoon period. It becomes just another tool in your toolset.

Uncle bob is a salesman, not a "craftsman".

naikrovek5y ago

TDD and OOP as dogmas are very bad, for different reasons.

TDD encourages you to write many times the number of lines of code for your tests as the code you're testing, often 10X or more. So when you inevitably decide "there's a much better way to do this" (which happens to me 100% of the time) then you're not only changing your code, you're changing all of those tests. That's a lot of weight that you now have to deal with.

Most of the time, that's enough weight that the better design simply doesn't happen and becomes yet another chunk of tech debt that prevents certain things from happening in the future. I've seen that happen, and it's given me a very bad taste in my mouth for "TDD" because "TDD inertia" is the reason that better designs aren't implemented. If that piece of software lives for a while, and grows in scope, someone is going to have to deal with that, and eventually make the architectural change anyway.

By all means, test your code, of course. If you can find a way to easily generate test cases for an arbitrary code base, by all means, do that, because the test cases are no longer influencing your decision to change how your application works.

Otherwise, being "test-driven" is bad, IMHO. Software development is never as simple as the various dogmas would lead you to believe.

kllrnohj5y ago

I think kovarex is avoiding your TDD concerns by rejecting the notion that everything must be covered by minimal-scope unit tests. This means that there's much fewer (if any) mocks that need updating when the implementation changes, and fewer tests should end up needing to be touched as a result as well.

See specifically the "Fig. 5 - Test dependencies" section of the post.

Personally I think mocks are far over-used in tests, and I much prefer the solution kovarex outlines. I think dealing with mocks are the bigger source of test-update friction here than the tests themselves. I was already layering my tests instead of using mocks. As in having no clear line between "unit" and "integration" tests, everything just uses the "real" implementations of things and tests just get naturally more complex the higher up the stack it's testing. The idea of sorting by dependency depth is a cool idea I hadn't ever considered, though, I'll be borrowing that idea.

CharlesW5y ago

> So when you inevitably decide "there's a much better way to do this" (which happens to me 100% of the time) then you're not only changing your code, you're changing all of those tests.

Isn't the point of TDD that you can change implementation at will, safe in the knowledge that the tests will help guarantee that you're not inadvertently changing behavior?

4 more replies

pault5y ago

I always thought of TDD as a way to exercise your code while writing it, not a set in stone spec of your product. I consider unit tests disposable and basically noise; they're only useful when you're writing code. They are by nature a test of your implementation, not your specification. I usually throw away a lot of them after I'm done writing, and if I refactor I just delete the ones that are failing because I'm writing new ones as I go. There are places at boundaries where more stable unit tests can go, but even there a change to the interface is going to break all the tests anyway.

The better way to test your specification and behavior, IMO, is to use comprehensive E2E tests. The traditional test pyramid is upside down. If you have an E2E test for every user story in your spec, you can develop with confidence that your new code will not disrupt your users' activity. Unit tests are cattle, E2E tests are pets.

imiric5y ago

> TDD encourages you to write many times the number of lines of code for your tests as the code you're testing, often 10X or more.

If that happens I'd say you're doing TDD too early. TDD can be very useful, but early on in the process of transferring a design to code, all your interfaces are highly unstable and there's a lot of experimentation, so sticking strictly to TDD would naturally be frustrating. I would start a bit later when at least some of the API has stabilized and you don't have to do major changes.

TDD helps with creating user friendly APIs, since you experience it from the user's perspective. And it forces you to actually write testable code and not incur technical debt that's very costly to remove later (having to refactor code to make it testable).

> Most of the time, that's enough weight that the better design simply doesn't happen and becomes yet another chunk of tech debt that prevents certain things from happening in the future.

This reads like you're saying that tests themselves are technical debt...? Because you're going to run into this issue (or should run into it) regardless if you use TDD or not. Eventually you'll want to refactor parts of your codebase and, sure, you'll have to change some of your, hopefully mostly, unit tests. So in that sense you can say that doing TDD early creates a lot of technical debt that needs to be resolved quickly, but like I said above, that doesn't have to be the case.

TDD can be a dogma just like any practice (Agile is my favorite), but it doesn't mean that it's not useful if used correctly.

Kudos to the Factorio team for adopting it, which I think is rare in the gaming industry. The idea alone of testing the complexities of a video game is mind boggling to me as a web developer. Especially in an industry where it's popular to hype and sell broken products with the promise of patches and DLC.

1 more reply

wpietri5y ago

Nobody should stick to any dogma past the honeymoon period. The point of a dogma is to get you into the behavior space that you will learn how to do something well. It's like using a recipe in a cookbook. Once I had enough practice making scrambled eggs, I didn't need a recipe anymore.

I learned TDD ~20 years ago from Beck's "TDD by Example" book. For me it's far more than "just another tool". I'll certainly do exploratory work with throwaway code. And in the early stages of something, I might be a bit slack. But the lesson over and over for me is that if I don't get to significant test coverage soon, I end up wasting a lot of time on bugs. And TDD is the easiest way to get to test coverage. Test-first ends up feeling like a set of small successes with the red-great-refactor loop. With test-after, going back and adding tests feels more like drudgery, and I'm less likely to have written the code in a testable way.

So TDD is not dogma for me, just the inevitable place I end up if I want to maximize the amount of time getting things done while working on a non-trivial codebase.

wpietri5y ago

That said, I totally agree that Martin is a salesman. I knew him in the early days of the Agile movement and he did a lot of good then. But he's become consistently more strident, more dogmatic, more unpleasant. It's sad to see, really.

1 more reply

nightski5y ago

I had a similar trajectory but opposite experience. I felt like after doing heavy TDD and even FP after a while TDD felt more and more like a waste of time. I find that I naturally think and design in a test driven perspective without actually writing the tests due to many years of experience.

1.) I'd much rather lean on the type system to prove things than automated tests if possible. But of course depending on the language that often isn't possible.

2.) I find that only a small fraction of my code really benefits from automated testing. This is the logic/calculation parts of the code. The rest is slinging IO and SQL queries which all ends up being mocked out anyways and the tests just become secondary implementations of the original code.

1 more reply

alanfranz5y ago

My 2c: tdd is great in order to learn to create testable designs. You can’t tdd nontestable code.

Once you understand how to design testable code, tdd offers minimal benefits.

The real value comes from thorough, automated testing suites, whatever their origin is.

Chris_Newton5y ago

The real value comes from thorough, automated testing suites, whatever their origin is.

Automated testing is useful, for sure. However, TDD purists tend to focus on one very specific type of testing: automated unit testing, in the small, where you already know the expected output for given input and can easily specify that output using assertions in code. By its nature, TDD emphasises being testable in that specific sense above all else. I don’t think that is necessarily a good thing, partly because that type of universal unit testing may not be a good strategy for every software system, and partly because other useful properties of the code might be diminished because of the changes needed to make it “testable” in the TDD sense.

1 more reply

pydry5y ago

Sacrificing at the altar of unit testability doesnt necessarily make better code.

Unit tests' inability to handle state is way too often viewed as a problem with the code it can't properly test than the general crappiness of this form of test.

1 more reply

ashtonkem5y ago

Good observability standards and fast releases catch more bugs than meticulously maintained test suites, imho.

1 more reply

koonsolo5y ago

The thing is that all that extra test code also needs to be maintained, also contains bugs, etc.

It all comes down to return on investment. For example, I used to agree with TDD that for every bug, first write a test that fails, and then fix the bug. That way you prevent regression.

So I proposed this to my manager, and he responded: we tracked all bugs in our system for the last 10 years, and when you look at fixed bugs that get broken again, it only occured very rarely. So in the end, doing that was not the best investment of effort.

asddubs5y ago

One point uncle bob makes is that doing this for everything allows you to make far-reaching architectural changes with confidence that you haven't broken a bunch of things in places you don't even realize. so the tests are a tool to allow you to do refactorings you would otherwise be scared of

5 more replies

tikhonj5y ago

When I fix a bug, I have to reproduce it somehow to make sure my fix actually works. Adding a regression test is a way to save that work as code. Once I have a reasonable test infrastructure set up, the majority of the effort is in understanding and reproducing the bug; going from that to an automated test should not take significantly more effort.

In return, I get to have some extra confidence that a bug doesn't return (which would be embarrassing, even if it's infrequent!) and I get a more thorough test suite that lets me refactor more quickly and aggressively. And if an old bug does come up again, the advantage is not only that the test will catch it before a release, but also that the person fixing the bug won't have to go through the effort of figuring out how to reproduce it from scratch—it's reproduced right in the test suite!

So I am not sure that just looking at how often regressions actually happen in the existing codebase is sufficient to make any real conclusion by itself.

linspace5y ago

> The thing is that all that extra test code also needs to be maintained, also contains bugs, etc.

I have often observed an evolutionary behavior on tests: tests that pass easily survive

cbushko5y ago

> So I proposed this to my manager, and he responded: we tracked all bugs in our system for the last 10 years, and when you look at fixed bugs that get broken again, it only occurred very rarely. So in the end, doing that was not the best investment of effort.

That sounds very hand wavy.

It is making the assumption that:

  - your bug tracking system and people are so good that they have found all the duplicate tickets.  
  - they don't make mistakes and find all duplicates for tickets. 
  - your code is so good that it hasn't had any side effects that caused regressions. 
  - your boss is so good that he has a full grasp on 10 years worth of bugs.

edit: formatting

dgb235y ago

Regression tests are apparently an effective tool to maintain code stability.

peregren5y ago

Even if it's only rare, the test also shows clearly to a reviewer that the bug has been fixed.

In my opinion, as soon as a test suite finds a bug it has added a lot of value, even if it's rare.

drewcoo5y ago

Brushing _that_ tooth is a bad ROI because it rarely keeps a filled cavity from recurring.

cjfd5y ago

I learned TDD years ago and never looked back. It is true that there are some things where it is less suitable and therefore I would perhaps decide not to use it. E.g., user interfaces in case of changes that are mainly visual. Or code that tightly integrates with the system. E.g., file system manipulations. For all other code I think TDD is absolutely the best way to write anything more complicated than a throw-away script. Frankly, I am pretty much at the point where I consider anything less than TDD borderline unprofessional. I have come to expect code that was not TTD-ed to be buggy or more complicated than necessary or both.

sidlls5y ago

I take the opposite view. Code developed with TDD tends to be overly complicated and inefficient, and the test suites themselves are often bizarre labyrinthian hellscapes of mocks, facades, and workarounds. "They just did it wrong, then," you say? That statement ceases to be meaningful when it can applied routinely: it's the norm, and if it's the norm, it's not something "they just did it wrong" applies to.

1 more reply

serverholic5y ago

Have you considered that maybe TDD is just a really good fit for your brain?

I've worked at a TDD-focused company and it always felt like I was coding through molasses. Some of us weren't as strict about TDD and I didn't notice a difference in our code quality.

1 more reply

ramblerman5y ago

> Frankly, I am pretty much at the point where I consider anything less than TDD borderline unprofessional.

Would you feel that way about the linux kernel for example?

2 more replies

echelon5y ago

It also seems really ill suited for new systems and places where you're experimenting with new languages or frameworks. You don't know the shape well enough to be able to test the inputs and outputs before writing the thing, and learning via TDD seems highly suboptimal.

3 more replies

blacktriangle5y ago

You say that like its a bad thing. Sure dogmatic TDD can lead to issues, but going through a period of dogmatic TDD has, for me, resulted in becoming a far better programmer. And as time goes on I find myself drifting further back towards the dogmatic side of TDD as I know the difference how it feels working on tested portions of our code vs the untested portions of our code.

astrange5y ago

It is kind of a bad thing. Recently at work there was a quality program introduced by some training guy, but by quality they seemed to mean write more tests and do TDD, and our junior engineers read that as write a ton of unit tests (the only kind they'd heard of.)

I pointed out the 10 year old Norvig vs Ron Jeffries fight[1] which demonstrates that TDD is useless when you don't already know what you're writing, but they just looked confused.

The other problem here is that unit tests never break (since you've mocked everything that can break) and therefore aren't worth running; it might be more productive to write them for TDD and then just not commit them.

[1] https://news.ycombinator.com/item?id=3033446

4 more replies

JohnHaugeland5y ago

> You say that like its a bad thing.

That's because it is

tziki5y ago

TDD, when normalized for time spent writing tests, has not been found to be any better than the 'normal' way of writing tests afterwards. It's interesting how much of programming lore falls apart when you actually try to measure the impact.

fendy30025y ago

From my experience, tdd is useful if you already have solid specification, the code / part can be tested and you / your team understand how to develop unit / integrated tests. The problem arises because usually one or some of the points are unfulfilled.

And it's not without drawbacks. Increased development time and the needs to maintain the unit tests are costly, but rewarding.

Also I don't like too much interface and mocking only for the sake of testing. I find it usually breaks when integrated and makes code harder to maintain. Maybe I'm just inexperienced.

berkes5y ago

> The problem arises because usually one or some of the points are unfulfilled.

An important idea of TDD is that it allows you to discover those "unfulfilled points" in the tests. When writing code (the tests) that use an API, instead of when writing the actual API.

When writing code that uses objects, methods, interfaces and so on, you are in a mindset of writing "what a user of the code would wish there was". This is probably the best place, mindset and moment to define those specs in detail.

mbrodersen5y ago

I haven’t had a bug in production for 9+ years. Not a single one. And routinely do major refactorings, feature improvements, optimisations etc. I can do it because of 9000+ tests. 1800 hand written and the rest auto generated to detect changed behaviour. The test suite allows me to spend 95% of my time adding features instead of bug fixing. Most excellent.

skinnyarms5y ago

Am I missing something, it sounds like they are already doing this - not striking off on a new venture.

hatsuseno5y ago

> I have to admit, that I didn't know what TDD really was until recently.

They do already do this (hence the blogpost), but it's something kovarex hasn't explored before, so they're pretty new to TDD.

ashtonkem5y ago

Red/Green is a good technique for fixing bugs and extending existing functionality.

koonsolo5y ago

What is the chance of a fixed bug getting broken again? As it turned out in our analytics over 10 years: very very low.

So the effort you put into writing a test for a bug, has most likely a negative return on investment. That time could be better spend somewhere else.

3 more replies

dgb235y ago· 26 in thread

Two interesting takeaways:

> This is the beautiful thing about having a company that isn't on the stock market. Imagine you have a company that goes slower and slower every quarter, and then you confront the shareholders with the statement, that the way to solve it, is to do absolutely no new features for a quarter or two, refactor the code, learn new methodologies etc. I doubt that the shareholders would allow that. Luckily, we don't have any shareholders, and we understand the vital importance of this investment in the long run. Not only in the project, but also in our skill and knowledge, so we do better next time.

This is reassuring the notion of what I think actually matters, what the real essence is of developing a product, may that be a piece of art and entertainment (like here) or a productivity tool etc.

There are creators and there are consumers. We split them up by developers, designers, domain experts and so on, but what matters is that all the other participants, especially those who can exert power traditionally are not part of the essence and if not being careful and responsible, can easily add complexity and limitations that are entirely accidental and can even be harmful.

This reminds me of the agile manifesto, modern UX approaches and other processes that are driven by creators, but are often and very unfortunately being bent over backwards to fit into hierarchical power structures.

> TDD actually is the constant fast swithing between extending the tests and making them pass continously. So as you write tests, you write code to satisfy them basically at the same time. This allows you to instantly test what you write, and mainly use tests as specifiation of what the code should acctually do, which guides the thought process to make you think about where you are headed to, and to write code that is more structured and testable from the very beginning.

The important notion here is that TDD is not about tests and correctness, but about development. It continuously checks assumptions and explores the surrounding code, state and data until a sufficient solution is found.

If we squint a little we can see how closely related TDD with REPL Driven Development is. In essence it is the same thing and even has similar results, where the tests or REPL code can be left as an artifact for further, likely historical understanding.

We know now that neither is sufficient for a high degree of correctness, but they are certainly useful for understanding and development.

rpastuszak5y ago

> The important notion here is that TDD is not about tests and correctness, but about development.

Yup, writing tests helps me sleep at night. TDD helps me manage my mental resources and iterate.

(another reason is communication--we code for our colleagues first, then for the machine: https://sonnet.io/posts/code-sober-debug-drunk/)

IIRC smalltalk had a workflow where you'd debug and write your program at the same time. You'd just reach a path that has not been implemented yet, break, implement it and continue.

dgb235y ago

This „keeps working while it breaks“, does it have a name? It comes up in highly dynamic environments often like Smalltalk as you mentioned, but also Lisp, Erlang and others.

3 more replies

mulmboy5y ago

> You'd just reach a path that has not been implemented yet, break, implement it and continue.

This is exactly what I do with the PyCharm - hit a breakpoint, write the code as it should be, and execute it in the debug REPL to do basic initial testing [repeat]. Extremely productive.

wpietri5y ago

I appreciate this, but I still think it's a suboptimal approach:

> that the way to solve it, is to do absolutely no new features for a quarter or two, refactor the code, learn new methodologies

Even in a world without shareholders, we still are building things for users. 6 months without improvements has an effect on them, too. When possible, I think it's better to spread cleanup work out. Even if one spends 80% of the time on cleanup and 20% on features, that is much better for relationships than going dark for a quarter or two. And in my experience, continuing to do productive work during that period makes the behind-the-scenes improvements better.

mikewarot5y ago

I've both watched all of Uncle Bob's videos (parts more than once), and been a long time Reddit/Factorio reader/game player. (The factory must grow!)

They did exactly what you suggested! The relationship with the users never went dark. They kept up with bug fixes, and kept feeding new features at a more than acceptable pace. [Edit: Here's the pace of their status updates, as evidence at https://www.reddit.com/r/factorio/?f=flair_name%3A%22FFF%22 ]

I didn't quite believe that Uncle Bob's lessons worked in the real world, but if the Wube team is sold on them, that's about as good as an endorsement as I'll ever get.

1 more reply

scotty795y ago

In this specific case, Factorio is such beloved product with such great development history that 6 months without updates is nothing.

3 more replies

indigochill5y ago

Regarding the power structure observation, I've lately been weighing the merits of co-ops in the tech industry. Intuitively, it seems like a tech product owned by its users would lead to more product-centric decision-making. Leaving governance to shareholders who aren't necessarily directly involved in the product feels weird by comparison.

dgb235y ago

I think so too. A lot of that free energy is in the FOSS movement it seems, and there are plenty freelancers and single consultants in tech. This is a strong indicator that there is enough developers who primarily want to create, while having agency, responsibility and a direct communication channel. So why not start businesses that share these values and bring consumers and producers closer together?

wpietri5y ago

Yes, you're exactly right about REPL-driven development and TDD being the same spirit.

I took to TDD pretty easily because I was already used to doing short run-it-and-see-if-it-works cycles. The main difference was that instead of checking via eyeball, I talk the computer to check it. This is slightly slower early on, but so much faster once a program is big enough that manually checking everything would take a while.

bjornjajayaja5y ago

I’d love to see a language where you write the tests and then the compiler creates the application code.

kilburn5y ago

This is an active field of research. Search for "program synthesis".

We are advancing, but the current state is... not mind-blowing yet (albeit somewhat cool!). See [1] for an example interactive demo and [2] for the corresponding presentation.

[1] http://comcom.csail.mit.edu/comcom/#Synquid

[2] https://www.youtube.com/watch?v=HnOix9TFy1A

1 more reply

oats5y ago

Is this not almost how logic programming (prolog, etc.) works? You tell the language some things which are true, and then it'll be able to infer answers to "questions" you ask:

https://wiki.c2.com/?LogicProgramming

3 more replies

bregma5y ago

It's incredibly easy. First, you start with a customer that actually knows exactly what they want.

nyberg5y ago

This sounds a lot lke like minikanren (https://github.com/webyrd/Barliman) where you give test cases and Idris2 (https://github.com/idris-lang/Idris2) where you give type constraints as a tool for building programs.

dgb235y ago

We typically write _sample_ tests in boolean logic, which isn't quite expressive enough for this.

But if you look at logic programming with more expressive systems you can have something like what you propose. We describe what we expect to have and the system deduces a result. Not quite what you want but its closer.

Now there is also an ubiquitous logic system that many use: static typing. In a sense you are describing the general properties of something and the compiler infers optimizations based on your assertions. The concrete program is not a line by line translation from your code to machine code, but perhaps looked at in its entirety.

I agree that there is a lot of merit in pushing these things further and further. Right now we're kind of in a stage of patching things together. But I hope and assume that programming becomes more holistic in the future. Ironically we have to look at the past first, there was a lot of momentum in this direction up until the 80's roughly.

sidlls5y ago

How would this even work? At best you might get a set of class/function stubs with some minimal logic. At worst what you'd have is a compiler that is actually a complicated, truly AGI brain, which could produce some truly awful code. In which case you've simply reproduced TDD's normal result: truly awful code.

Aside from that, to produce a program that did what is actually required would require test cases and functions that cover the set of inputs and outputs. This is trivial for mathematical functions, but impractical (or impossible) for more general applications (e.g. anything dealing with human inputs).

mikewarot5y ago

You could just use a fuzzer to generate code, and run tests on the output. Each new test would approximately double the run time until a new "correct" output was found.

This doesn't become practical until you can do it on a quantum computer with millions of cubits.

iamwil5y ago

This is kinda what logic programming is, like in prolog. You tell the computer what you want, and it finds the answer for you.

TDD is where you both write what you want, and you do the implementation also.

phtrivier5y ago

I would like to see this language handle some lawmaker-specified code.

Show examples of French retirement pension computation, and watch if a computer can actually commit petit-suicide.

astrange5y ago

Well, that's how machine learning works.

1 more reply

CraigJPerry5y ago

For those curious about REPL driven development, this is a good example that i ran across recently:

https://gist.github.com/daveray/1441520

Even if you don’t follow along and try it, you can probably get the gist of how experimental / exploratory you can be.

adamkl5y ago

For those who want to see some REPL driven development in action:

https://vimeo.com/230220635

truetraveller5y ago

TDD is basically automated and "named" REPL-driven development. Which is very nice!

infogulch5y ago

Oh that's an interesting take, and feels right to me. So the thing that holds back TDD is ergonomics (compile time/cached results) and maybe marketing.

lugged5y ago

Its almost like Test Driven Development is the practice of using tests to drive development.

dgb235y ago

Much of TDD mantra is less about development but about (perceived) correctness, the article in question emphasizes the development part, where they describe their AHA moment.

3 more replies

tobyhinloopen5y ago· 17 in thread

I wonder how common TDD is in game development land, especially when you’re using things like Unity or Unreal.

I feel like testing your behaviors is pretty hard, and even if you unit-test your behaviors, there’s still integration tests.

I only write games as a hobby and never use TDD, even if I’d like to, since the tooling is just either poorly documented or too slow, or both.

Usually this ends with me being frustrated with the slow development cycle and pushes me towards more unconventional methods of developing games in Javascript using Mocha to run the tests directly in the browser.

jayd165y ago

So there's a few reasons tests aren't as ubiquitous in games as they are in non-game dev.

You need a large QA team (relative to your team size) to test for fun anyway. The game is constantly getting tested and bugs will get logged. The marginal benefit of automated tests is less than other places because of this.

Games have no specification. You have almost no idea of even the genre of game you'll end up with at the end of the dev cycle unless you're making a sequel that has to fit into a mold. Sure you can write tests as you go along and test that enemies with negative health die. The next day someone will suggest "what if they stay alive for a period of time and then explode!" The definition of correct is constantly changing.

Tests ossify functionality. It makes it harder to change things because at the very least you need to also change the test. If you're just changing tests whenever you want to suit your new desires then its hard to build trust in the tests.

Games don't need to be correct. They just need to be fun. This also decreases the marginal benefit of tests compared to other industries.

That said, it would be natural to unit test some data structure or some well defined system. Also, once your game is done, a la Factorio, you can go back and write tests for some refactor because you know the full design specs.

indeedmug5y ago

I don't know if "correct" and "fun" are at odds with each other. There are very famous examples of games failing at the start because of bugs like Cyberpunk. There is a point where the game is too broken to enjoy. You want games where the correct behavior is the fun behavior. (However, there are counter examples like Goat Simulator.)

To be fair, I don't pretend to know how CDProject developed their games. They might already have testing and the timetable was the problem.

mewse5y ago

In twenty years in game development I have never worked on a game which had real unit tests or even integration tests.

I’ve seen engines which used them, but not games. The rationale was that it was just too hard, which always felt like a cop-out to me.

I would dearly love to have more automated tests in the game I’m working on now, but I’ve never seen a model of it working well which I could copy, and part of me suspects that it’d be a huge investment of time to figure it out entirely on my own when I’m already vastly overworked as it is.

If anybody has references to indepth case studies of making game engines more friendly toward automated tests, I’d be super interested to see whether there were lessons I could apply toward my own situation!

teamonkey5y ago

It's not exactly what you mean, but automated gameplay testing is fairly common in the AAA space, although maybe not taken seriously enough.

A simple example might be simply starting and closing the engine after the automated build & package process to make sure the game actually runs, but I've seen things like using bots to emulate player behaviour to smoke test gameplay functionality. No Man's Sky used automated tools to evaluate the procedural-generation algorithms[1]. Here's a more comprehensive example of automated gameplay testing[2].

[1] https://youtu.be/sCRzxEEcO2Y?t=3100 [2] https://www.youtube.com/watch?v=VVq_hgaX8MQ

1 more reply

dsego5y ago

Not TDD but here is a talk about automated testing at Croteam

Continuous integration and testing pipelines in games - case studies of The Talos Principle and Serious Sam https://www.youtube.com/watch?v=YGIvWT-NBHk

tarcon5y ago

Interesting. I always assumed that to get the balancing feel right, a game would have to run huge parameterized test-suits to make sure win/lose results to user inputs are in balance.

1 more reply

CodeGlitch5y ago

I was in the industry from early 2000 to early 2010. It was only towards the end that Unit Testing was a thing. At the start we didn't even do code reviews or use a sensible source-control system.

Yeah it was a painful experience, but I survived.

Danieru5y ago

Factorio is perhaps the only game I am aware of using TDD. Lots of engine teams use extensive automated testing. Only Factorio is applying it to game logic of any major game I know.

mywittyname5y ago

Factorio seems especially well suited to TDD. The core gameplay loop involves automating away manual tasks. So I have to imagine that the test cases leverage blueprints a lot, i.e., create a blueprint for a feature, pipe resources into it via conveyor belt, test output rates on conveyor belts.

Glowbox5y ago

https://twitter.com/playartifact/status/1051964775658217473?...

Artifact did it too (which is no longer being worked on though).

exdsq5y ago

Anecdotal but I think it's pretty rare - my friend worked as a game developer for Epic and didn't know what TDD (or SQL for that matter) actually meant.

tobyhinloopen5y ago

Given how common it is for bugs to reappear in Fortnite, I’m pretty confident their testing suite is either incomplete or not present at all

Thaxll5y ago

Video game client don't have tests.

ashtonkem5y ago

I feel like game dev is probably an area where TDD has some value.

My team writes distributed systems, which drastically reduces the value of a TDD approach. There’s only so far you can take the technique with a database backed api before it just becomes absurd.

mschuster915y ago

Testing? In games? There is no such thing on a wide scale, not anymore since the cost of distributing patches essentially became a small budget line for a CDN.

Modern games are notorious for using the first, most loyal customers as beta testers (hello Fallout 76...).

The reason is two-fold... while you absolutely can test some parts of the engine (e.g. collision detection, networking) you can't really "test" stuff that needs a human eye to see if it's working as intended (anything that's rendered) or involves randomness (e.g. fire, fog, water, opponent spawning, loot). That means you have to hire lots of skilled (!) humans, provide them with expensive rigs, and give them time. Which is incredibly expensive.

gmueckl5y ago

Just to give you some perspective: my current employer's main product isn't called a game, but it has an engine at its core that is a game engine in all aspect except its name. And we test the sh*t out of it. We have thousands of automated and very sensitive tests on that stuff. Our test suite goes as far as testing for pixel perfect output. And this involves stuff that is "random". It took us some effort to be random in a perfectly reproduceable way, but we got there.

Game QA is more involved than that, of course. Content needs to go through a signoff and QA process that involves humans (we do that, too).

kempbellt5y ago

This is definitely true for some games, and Early Access is increasingly popular, but for others, QA and testing is definitely a part of the development process.

I interviewed at a game company a few years ago where one of my daily tasks would be to spend an hour just playing the game and seeing if I spot any bugs. I didn't end up taking the job so I didn't see how involved their actual code testing process was, but it was apparent that they actually cared a bit about quality control.

blindmute5y ago· 11 in thread

I'm not sure I understand why they're committing to such a long term refactor for a game which has already reached the tail end of its sales curve. As far as I know there are no internal monetization schemes in Factorio, and I really doubt further updates will boost sales anywhere near enough to justify the dev salaries.

trollied5y ago

They're working on an expansion. See: https://factorio.com/blog/post/fff-365

colonwqbang5y ago

I'm also surprised. Factorio feels like a finished game. It's more polished than most games I've played.

Maybe the devs haven't found a worthy new project yet?

naikrovek5y ago

They're working on an expansion for Factorio, and this refactor may have something to do with that.

Maybe they just want to leave their code in good shape so they (or someone else) can come back to it at a later time and pick it up relatively quickly.

1 more reply

EndXA5y ago

Worth pointing out something that they said in the post (emphasis in italics is mine, the full quote is given for context)

> Imagine you have a company that goes slower and slower every quarter, and then you confront the shareholders with the statement, that the way to solve it, is to do absolutely no new features for a quarter or two, refactor the code, learn new methodologies etc. I doubt that the shareholders would allow that. Luckily, we don't have any shareholders, and we understand the vital importance of this investment in the long run. Not only in the project, but also in our skill and knowledge, so we do better next time.

This isn't necessarily the full explanation, but it's certainly something to keep in mind.

fooey5y ago

I would assume they're working on either an expansion or another game using the same engine

If you're building a new game and your current GUI paradigm sucks, overhauling it first makes a lot of sense.

shepherdjerred5y ago

They're planning to release a paid expansion.

robryan5y ago

As he says, they don't have external shareholders that are demanding the everything they do maxmise profit.

mattmanser5y ago

Because the worst case scenario is that they get half-way through the refactor, the game's a buggy mess, and then they all move on.

And that's got a non-trivial chance of happening.

1 more reply

ashtonkem5y ago

There are some mentions of a DLC in the works, probably that.

js85y ago

Yes. I don't know what the DLC will be (personally I hope for water- and air-borne structures, vehicles and enemies), but I am sure I will pay for it.

1 more reply

ranger2075y ago

Maybe just for fun? This kind of thing is half the game itself after all.

achairapart5y ago· 8 in thread

Warning: This page almost crashed my browser (FireFox on MacOS) and put my CPU on fire.

DizzyDoo5y ago

My poor 2015 MacBook Air with FireFox went full 100% CPU on this page, I think it's the gifs.

I think Factorio itself actually runs better on this laptop than that Factorio blog post does.

kart235y ago

I have a dual-core 2015 Macbook Pro running FF too. htop showed my cpu pinned at 100%, VTDecoderXPCService was taking the lions share, there probably is something weird about the gifs.

1 more reply

Metacelsus5y ago

I'm also using Firefox on Mac and had no issues. I'm blocking their Javascript though.

kllrnohj5y ago

There doesn't seem to be any meaningful JS on the page. A single google analytics script and a tiny[0] little toy script for doing a silly animation when you click on the #rocket element.

Possibly the google analytics is doing something heavy (although it doesn't look to be when spot checking with a profiler), but there's otherwise nothing JS that runs continuously.

0: https://factorio.com/static/js/factorio.js

Diggsey5y ago

Strange. I'm using firefox on windows and didn't notice any problems.

Aachen5y ago

Firefox @ Linux, also no problems (and a crappy cpu at that). I've noticed some of their very-gif-heavy posts slowing down this laptop before, but not this post.

Smaug1235y ago

Likewise (Firefox 89.0.1 on macOS 10.14.6) I had to close the page once I'd scrolled down to the layout of the various building interfaces; ended up just reading the HTML.

diimdeep5y ago

Same, this page hangs entire Firefox on macOS

truncate5y ago· 4 in thread

> (1) no new features for a quarter or two, refactor the code, learn new methodologies etc

> (2) This allows you to instantly test what you write, and mainly use tests as specification

> (3) the problem comes when you break something and a lot of tests start to fail suddenly

My favorites. Don't expect to give away entire quarter, but at-least sometime would definitely be nice. All three so fundamental, and often ignored. In my experience, you get these right, it makes developer life so much easier. As someone earlier mentioned in thread, TDD is kind of like REPL driven development.

I think, one immediate benefit of companies focusing on good code is that engineers can aim for much more ambitious projects, and they can be more brave with the codebase. Instead we often end up with 100 over-engineered components with no well defined/enforced contracts, and a set of monolithic tests which runs the entire stack to test the most basic case.

sidlls5y ago

On the other hand, with TDD we often end up with code that has been butchered in the name of "testability," and which is both less efficient and more complex than necessary.

fendy30025y ago

Which IMO, a bad practice. Too much interface, abstraction, mocking do not reflect real process. I find it often break when integrated with services.

leprechaun10665y ago

This usually happens when the developers in this situation are focusing (or are being forced to focus) on the tests over focusing on the solution to the actual problem in the product. TDD is just a development methodology which is a means to an end, not the goal.

2 more replies

fendy30025y ago

This should be the way. I really like three interation approach: research, stable, enforce.

First you develop fast and breaking things with alpha / beta versions. Then you make it stable with bug fixes and minor enhancements. Finally enforce the code with review, unit tests and code coverage, etc.

Theoretically they already have a good game engine (long lasting product) and interested in developing it further. Without enforcement, any future changes have potential to break things. Unit tests (enforcement) reduce that risks and make any changes / refactoring closer with specification.

happyweasel5y ago· 3 in thread

You can TDD as much as you want to once the initial game mechanics are in place and a gameprotoype shows enough promise to be realized until completion. Because then most of the core stuff/ideas/principles won't wildly change and won't be thrown away.. The core mechanics are in place. But would TDD help you reach that stage? I guess it is simply too much overhead. So yeah, this is TDD after the fact ;-). I still love TDD :)

chii5y ago

> But would TDD help you reach that stage?

if your game has a lot of interactions, and you want to make sure that your changes are not causing unintended interactions, tests like these would help a lot during development.

marcosdumay5y ago

There is always a comment with that claim on a TDD thread. Just to clarify, writing tests is not TDD. There are plenty of ways you can have a codebase full of tests, TDD is only one.

But anyway, I doubt tests help at all in the prototype phase (by any procedure you want to get them). My guess is that they are incredibly harmful.

meheleventyone5y ago

> You can TDD as much as you want to once the initial game mechanics are in place and a gameprotoype shows enough promise to be realized until completion. Because then most of the core stuff/ideas/principles won't wildly change and won't be thrown away.

If only game development worked this way!

ineedasername5y ago· 2 in thread

Every time a Factorio thread makes it to HN I feel like a fully recovered meth addict who suddenly has their old dealer knocking at their door:

Dealer: "Hey? You there? I got some meth for you."

Me: "Go away! I don't want any!"

Dealer: "Oh now don't say that. You remember how good it is? I know you do."

Me: "I can't, I can't afford it, the price is too high."

Dealer: "What? Come on, it's free! You already paid for it!.

Me: "I'll lose my job, I can't, just go away!"

Dealer: "Your JOB? This IS your job. Open the damn door, THE FACTORY MUST GROW"

Me: ::cowers in the closet chanting please leave please leave please leave::

Also it was never good. It was more like a mind virus. Like the sort of problem or project at work you can't stop thinking about until it's done. Only with Factorio, it's never done. Never.

My best defense against it are other video games I can stop & start when needed. Or booting up my VPN connection and picking a work task from my back log until the cravings go away.

hatsuseno5y ago

Factorio is just personal project for people who are too tired after work to actually do one. Like me. It scratches the itch do "build" something, even if it's only an outlet and not intrinsically productive.

LegitGandalf5y ago

I always say, never start with a zero vacation balance!

IMTDb5y ago· 2 in thread

> now there are 9 programmers

Companies on the stock market don't have "9 programmers". They have a lot of teams of 9 programmers. So while it's true that it would probably completely be impossible for a stock market company to completely freeze for a quarter or two, individual teams can still do that.

If the factorio team grows to tens of programmers (it probably won't and probably shouldn't), I would be very surprised if they find the need - and if they manage to - freeze all teams together for a big refactoring round. I am also unsure that it would be the right approach. That observation holds wether they go public or stay private.

Aditya_Garg5y ago

Okay imagine you are a small startup backed with VC money. The same situation can still arise.

BowBun5y ago

Except that as the author of the article says, they don't owe investors any explanations. VC money will demand results which you cannot just ignore for a quarter.

harryf5y ago· 2 in thread

Came here hoping they’d turned Factorio into a tool of creating tests in other codebaes. Like literal gamification of work.

dgb235y ago

You might have looked at Flow Based Programming?

It has certain characteristics that align with a game like Factorio or Oxygen Not Included etc. such as visual programming, backpressure, common interfaces, local retention etc.

I can imagine this being applied to distributed/cloud computing as a way to reason about high level interactions and perhaps functional/integrated testing.

ashtonkem5y ago

I’ve done a bit in my home automation; it’s no replacement for a scripting language.

kevmo3145y ago· 1 in thread

> Which is a big improvement already, as adding and maintaining the new logic only requires you to look at one place instead of several, and it makes it generally more readable and less prone to errors.

It's interesting to think about the other HN thread discussion about comments vs one-time-call function abstractions in this light: https://news.ycombinator.com/item?id=27546135

I'm a big fan of "put code in one place" too. It was the biggest factor that convinced me that JSX was a great idea compared to separating the templating logic out.

adflux5y ago

Agreed, which is why I love vue components so much

tgtweak5y ago· 1 in thread

Damn I misread TTD and got excited that they were building it into factorio... Great article though, was not dissapointed.

dvgt5y ago

Exactly the same thing happened to me.

Aardwolf5y ago· 1 in thread

I used to follow friday facts until it stopped being weekly. I'm glad that every friday fact now gets posted on hacker news, that serves as my notification for new ones :)

I also misread TDD as TTD (related to the trains in factorio) first

depaya5y ago

The trains are my favorite part of Factorio. I would love "TTD but in Factorio"

ashtonkem5y ago

Honestly, I think the factorio team probably now knows more than Uncle Bob does, based on their blog posts.

nanis5y ago

This is a neat article. I do have comments about testing in general though.

IME most developer do not understand each test has four possible outcomes:

* Code is good and test passes

* Code is bad and test fails

These are the only two possible outcomes developers focus on: When I ask what they should do if a test that used to pass now fails, they always tell me stories about how to debug the code under test.

There are two additional possibilities in test:

* Code is bad yet test passes (false negative)

* Code is good yet test fails (false positive)

Again, IME, most people do not look at the test again once it passes for the first time.

As a result, tests which are themselves code, become the largest untested part of the code base. You get these thousands and thousands of lines of untested code yet you have 100% code coverage.

Some of my blog posts on testing:

* Deception in tests considered harmful https://www.nu42.com/2017/02/deception-in-tests-harmful.html

* Know what you are testing: The case of the test for median in Boost.Accumulators C++ Library <https://www.nu42.com/2016/12/cpp-boost-median-test.html>

* Who is testing the tests? https://www.nu42.com/2015/05/who-is-testing-the-tests.html

* Slashing one's feet with tests, or, how to fix 2,950 test failures in one fell swoop https://www.nu42.com/2015/08/fix-2950-test-failures.html

bluGill5y ago

It is called restructuring and big companies do it all the time. Investors allow it, though they are rightly suspicious - sometimes it is good, but often it is change for the sake of change and not change for better.

AwaAwa5y ago

While I'm ambivalent on TDD, there seems to be an attempt at cancellation brewing for his invocation of Uncle Bob.

swiley5y ago

Factorio convinced me that some people still write good closed commercial games. I wish the best for the authors and hope they don't stop any time soon.

j / k navigate · click thread line to collapse

303 comments

132 comments · 18 top-level

ramblerman5y ago· 36 in thread

I'd be curious to hear what kovarex thinks in 2-3 months.

TDD is often sold as a fix-all solution, which is incredibly appealing to mgmt and quite fun for most programmers as a new paradigm, allowing for quick adoption.

Uncle bob is a salesman, not a "craftsman".

naikrovek5y ago

TDD and OOP as dogmas are very bad, for different reasons.

Otherwise, being "test-driven" is bad, IMHO. Software development is never as simple as the various dogmas would lead you to believe.

kllrnohj5y ago

See specifically the "Fig. 5 - Test dependencies" section of the post.

CharlesW5y ago

> So when you inevitably decide "there's a much better way to do this" (which happens to me 100% of the time) then you're not only changing your code, you're changing all of those tests.

Isn't the point of TDD that you can change implementation at will, safe in the knowledge that the tests will help guarantee that you're not inadvertently changing behavior?

4 more replies

pault5y ago

imiric5y ago

> TDD encourages you to write many times the number of lines of code for your tests as the code you're testing, often 10X or more.

> Most of the time, that's enough weight that the better design simply doesn't happen and becomes yet another chunk of tech debt that prevents certain things from happening in the future.

TDD can be a dogma just like any practice (Agile is my favorite), but it doesn't mean that it's not useful if used correctly.

1 more reply

wpietri5y ago

So TDD is not dogma for me, just the inevitable place I end up if I want to maximize the amount of time getting things done while working on a non-trivial codebase.

wpietri5y ago

1 more reply

nightski5y ago

1.) I'd much rather lean on the type system to prove things than automated tests if possible. But of course depending on the language that often isn't possible.

1 more reply

alanfranz5y ago

My 2c: tdd is great in order to learn to create testable designs. You can’t tdd nontestable code.

Once you understand how to design testable code, tdd offers minimal benefits.

The real value comes from thorough, automated testing suites, whatever their origin is.

Chris_Newton5y ago

The real value comes from thorough, automated testing suites, whatever their origin is.

1 more reply

pydry5y ago

Sacrificing at the altar of unit testability doesnt necessarily make better code.

Unit tests' inability to handle state is way too often viewed as a problem with the code it can't properly test than the general crappiness of this form of test.

1 more reply

ashtonkem5y ago

Good observability standards and fast releases catch more bugs than meticulously maintained test suites, imho.

1 more reply

koonsolo5y ago

The thing is that all that extra test code also needs to be maintained, also contains bugs, etc.

It all comes down to return on investment. For example, I used to agree with TDD that for every bug, first write a test that fails, and then fix the bug. That way you prevent regression.

asddubs5y ago

5 more replies

tikhonj5y ago

So I am not sure that just looking at how often regressions actually happen in the existing codebase is sufficient to make any real conclusion by itself.

linspace5y ago

> The thing is that all that extra test code also needs to be maintained, also contains bugs, etc.

I have often observed an evolutionary behavior on tests: tests that pass easily survive

cbushko5y ago

That sounds very hand wavy.

It is making the assumption that:

  - your bug tracking system and people are so good that they have found all the duplicate tickets.  
  - they don't make mistakes and find all duplicates for tickets. 
  - your code is so good that it hasn't had any side effects that caused regressions. 
  - your boss is so good that he has a full grasp on 10 years worth of bugs.

edit: formatting

dgb235y ago

Regression tests are apparently an effective tool to maintain code stability.

peregren5y ago

Even if it's only rare, the test also shows clearly to a reviewer that the bug has been fixed.

In my opinion, as soon as a test suite finds a bug it has added a lot of value, even if it's rare.

drewcoo5y ago

Brushing _that_ tooth is a bad ROI because it rarely keeps a filled cavity from recurring.

cjfd5y ago

sidlls5y ago

1 more reply

serverholic5y ago

Have you considered that maybe TDD is just a really good fit for your brain?

I've worked at a TDD-focused company and it always felt like I was coding through molasses. Some of us weren't as strict about TDD and I didn't notice a difference in our code quality.

1 more reply

ramblerman5y ago

> Frankly, I am pretty much at the point where I consider anything less than TDD borderline unprofessional.

Would you feel that way about the linux kernel for example?

2 more replies

echelon5y ago

3 more replies

blacktriangle5y ago

astrange5y ago

I pointed out the 10 year old Norvig vs Ron Jeffries fight[1] which demonstrates that TDD is useless when you don't already know what you're writing, but they just looked confused.

[1] https://news.ycombinator.com/item?id=3033446

4 more replies

JohnHaugeland5y ago

> You say that like its a bad thing.

That's because it is

tziki5y ago

fendy30025y ago

And it's not without drawbacks. Increased development time and the needs to maintain the unit tests are costly, but rewarding.

Also I don't like too much interface and mocking only for the sake of testing. I find it usually breaks when integrated and makes code harder to maintain. Maybe I'm just inexperienced.

berkes5y ago

> The problem arises because usually one or some of the points are unfulfilled.

An important idea of TDD is that it allows you to discover those "unfulfilled points" in the tests. When writing code (the tests) that use an API, instead of when writing the actual API.

mbrodersen5y ago

skinnyarms5y ago

Am I missing something, it sounds like they are already doing this - not striking off on a new venture.

hatsuseno5y ago

> I have to admit, that I didn't know what TDD really was until recently.

They do already do this (hence the blogpost), but it's something kovarex hasn't explored before, so they're pretty new to TDD.

ashtonkem5y ago

Red/Green is a good technique for fixing bugs and extending existing functionality.

koonsolo5y ago

What is the chance of a fixed bug getting broken again? As it turned out in our analytics over 10 years: very very low.

So the effort you put into writing a test for a bug, has most likely a negative return on investment. That time could be better spend somewhere else.

3 more replies

dgb235y ago· 26 in thread

Two interesting takeaways:

This is reassuring the notion of what I think actually matters, what the real essence is of developing a product, may that be a piece of art and entertainment (like here) or a productivity tool etc.

We know now that neither is sufficient for a high degree of correctness, but they are certainly useful for understanding and development.

rpastuszak5y ago

> The important notion here is that TDD is not about tests and correctness, but about development.

Yup, writing tests helps me sleep at night. TDD helps me manage my mental resources and iterate.

(another reason is communication--we code for our colleagues first, then for the machine: https://sonnet.io/posts/code-sober-debug-drunk/)

IIRC smalltalk had a workflow where you'd debug and write your program at the same time. You'd just reach a path that has not been implemented yet, break, implement it and continue.

dgb235y ago

This „keeps working while it breaks“, does it have a name? It comes up in highly dynamic environments often like Smalltalk as you mentioned, but also Lisp, Erlang and others.

3 more replies

mulmboy5y ago

> You'd just reach a path that has not been implemented yet, break, implement it and continue.

This is exactly what I do with the PyCharm - hit a breakpoint, write the code as it should be, and execute it in the debug REPL to do basic initial testing [repeat]. Extremely productive.

wpietri5y ago

I appreciate this, but I still think it's a suboptimal approach:

> that the way to solve it, is to do absolutely no new features for a quarter or two, refactor the code, learn new methodologies

mikewarot5y ago

I've both watched all of Uncle Bob's videos (parts more than once), and been a long time Reddit/Factorio reader/game player. (The factory must grow!)

I didn't quite believe that Uncle Bob's lessons worked in the real world, but if the Wube team is sold on them, that's about as good as an endorsement as I'll ever get.

1 more reply

scotty795y ago

In this specific case, Factorio is such beloved product with such great development history that 6 months without updates is nothing.

3 more replies

indigochill5y ago

dgb235y ago

wpietri5y ago

Yes, you're exactly right about REPL-driven development and TDD being the same spirit.

bjornjajayaja5y ago

I’d love to see a language where you write the tests and then the compiler creates the application code.

kilburn5y ago

This is an active field of research. Search for "program synthesis".

We are advancing, but the current state is... not mind-blowing yet (albeit somewhat cool!). See [1] for an example interactive demo and [2] for the corresponding presentation.

[1] http://comcom.csail.mit.edu/comcom/#Synquid

[2] https://www.youtube.com/watch?v=HnOix9TFy1A

1 more reply

oats5y ago

Is this not almost how logic programming (prolog, etc.) works? You tell the language some things which are true, and then it'll be able to infer answers to "questions" you ask:

https://wiki.c2.com/?LogicProgramming

3 more replies

bregma5y ago

It's incredibly easy. First, you start with a customer that actually knows exactly what they want.

nyberg5y ago

dgb235y ago

We typically write _sample_ tests in boolean logic, which isn't quite expressive enough for this.

sidlls5y ago

mikewarot5y ago

You could just use a fuzzer to generate code, and run tests on the output. Each new test would approximately double the run time until a new "correct" output was found.

This doesn't become practical until you can do it on a quantum computer with millions of cubits.

iamwil5y ago

This is kinda what logic programming is, like in prolog. You tell the computer what you want, and it finds the answer for you.

TDD is where you both write what you want, and you do the implementation also.

phtrivier5y ago

I would like to see this language handle some lawmaker-specified code.

Show examples of French retirement pension computation, and watch if a computer can actually commit petit-suicide.

astrange5y ago

Well, that's how machine learning works.

1 more reply

CraigJPerry5y ago

For those curious about REPL driven development, this is a good example that i ran across recently:

https://gist.github.com/daveray/1441520

Even if you don’t follow along and try it, you can probably get the gist of how experimental / exploratory you can be.

adamkl5y ago

For those who want to see some REPL driven development in action:

https://vimeo.com/230220635

truetraveller5y ago

TDD is basically automated and "named" REPL-driven development. Which is very nice!

infogulch5y ago

Oh that's an interesting take, and feels right to me. So the thing that holds back TDD is ergonomics (compile time/cached results) and maybe marketing.

lugged5y ago

Its almost like Test Driven Development is the practice of using tests to drive development.

dgb235y ago

Much of TDD mantra is less about development but about (perceived) correctness, the article in question emphasizes the development part, where they describe their AHA moment.

3 more replies

tobyhinloopen5y ago· 17 in thread

I wonder how common TDD is in game development land, especially when you’re using things like Unity or Unreal.

I feel like testing your behaviors is pretty hard, and even if you unit-test your behaviors, there’s still integration tests.

I only write games as a hobby and never use TDD, even if I’d like to, since the tooling is just either poorly documented or too slow, or both.

jayd165y ago

So there's a few reasons tests aren't as ubiquitous in games as they are in non-game dev.

Games don't need to be correct. They just need to be fun. This also decreases the marginal benefit of tests compared to other industries.

indeedmug5y ago

To be fair, I don't pretend to know how CDProject developed their games. They might already have testing and the timetable was the problem.

mewse5y ago

In twenty years in game development I have never worked on a game which had real unit tests or even integration tests.

I’ve seen engines which used them, but not games. The rationale was that it was just too hard, which always felt like a cop-out to me.

teamonkey5y ago

It's not exactly what you mean, but automated gameplay testing is fairly common in the AAA space, although maybe not taken seriously enough.

[1] https://youtu.be/sCRzxEEcO2Y?t=3100 [2] https://www.youtube.com/watch?v=VVq_hgaX8MQ

1 more reply

dsego5y ago

Not TDD but here is a talk about automated testing at Croteam

Continuous integration and testing pipelines in games - case studies of The Talos Principle and Serious Sam https://www.youtube.com/watch?v=YGIvWT-NBHk

tarcon5y ago

Interesting. I always assumed that to get the balancing feel right, a game would have to run huge parameterized test-suits to make sure win/lose results to user inputs are in balance.

1 more reply

CodeGlitch5y ago

I was in the industry from early 2000 to early 2010. It was only towards the end that Unit Testing was a thing. At the start we didn't even do code reviews or use a sensible source-control system.

Yeah it was a painful experience, but I survived.

Danieru5y ago

Factorio is perhaps the only game I am aware of using TDD. Lots of engine teams use extensive automated testing. Only Factorio is applying it to game logic of any major game I know.

mywittyname5y ago

Glowbox5y ago

https://twitter.com/playartifact/status/1051964775658217473?...

Artifact did it too (which is no longer being worked on though).

exdsq5y ago

Anecdotal but I think it's pretty rare - my friend worked as a game developer for Epic and didn't know what TDD (or SQL for that matter) actually meant.

tobyhinloopen5y ago

Given how common it is for bugs to reappear in Fortnite, I’m pretty confident their testing suite is either incomplete or not present at all

Thaxll5y ago

Video game client don't have tests.

ashtonkem5y ago

I feel like game dev is probably an area where TDD has some value.

My team writes distributed systems, which drastically reduces the value of a TDD approach. There’s only so far you can take the technique with a database backed api before it just becomes absurd.

mschuster915y ago

Testing? In games? There is no such thing on a wide scale, not anymore since the cost of distributing patches essentially became a small budget line for a CDN.

Modern games are notorious for using the first, most loyal customers as beta testers (hello Fallout 76...).

gmueckl5y ago

Game QA is more involved than that, of course. Content needs to go through a signoff and QA process that involves humans (we do that, too).

kempbellt5y ago

This is definitely true for some games, and Early Access is increasingly popular, but for others, QA and testing is definitely a part of the development process.

blindmute5y ago· 11 in thread

trollied5y ago

They're working on an expansion. See: https://factorio.com/blog/post/fff-365

colonwqbang5y ago

I'm also surprised. Factorio feels like a finished game. It's more polished than most games I've played.

Maybe the devs haven't found a worthy new project yet?

naikrovek5y ago

They're working on an expansion for Factorio, and this refactor may have something to do with that.

Maybe they just want to leave their code in good shape so they (or someone else) can come back to it at a later time and pick it up relatively quickly.

1 more reply

EndXA5y ago

Worth pointing out something that they said in the post (emphasis in italics is mine, the full quote is given for context)

This isn't necessarily the full explanation, but it's certainly something to keep in mind.

fooey5y ago

I would assume they're working on either an expansion or another game using the same engine

If you're building a new game and your current GUI paradigm sucks, overhauling it first makes a lot of sense.

shepherdjerred5y ago

They're planning to release a paid expansion.

robryan5y ago

As he says, they don't have external shareholders that are demanding the everything they do maxmise profit.

mattmanser5y ago

Because the worst case scenario is that they get half-way through the refactor, the game's a buggy mess, and then they all move on.

And that's got a non-trivial chance of happening.

1 more reply

ashtonkem5y ago

There are some mentions of a DLC in the works, probably that.

js85y ago

Yes. I don't know what the DLC will be (personally I hope for water- and air-borne structures, vehicles and enemies), but I am sure I will pay for it.

1 more reply

ranger2075y ago

Maybe just for fun? This kind of thing is half the game itself after all.

achairapart5y ago· 8 in thread

Warning: This page almost crashed my browser (FireFox on MacOS) and put my CPU on fire.

DizzyDoo5y ago

My poor 2015 MacBook Air with FireFox went full 100% CPU on this page, I think it's the gifs.

I think Factorio itself actually runs better on this laptop than that Factorio blog post does.

kart235y ago

I have a dual-core 2015 Macbook Pro running FF too. htop showed my cpu pinned at 100%, VTDecoderXPCService was taking the lions share, there probably is something weird about the gifs.

1 more reply

Metacelsus5y ago

I'm also using Firefox on Mac and had no issues. I'm blocking their Javascript though.

kllrnohj5y ago

There doesn't seem to be any meaningful JS on the page. A single google analytics script and a tiny[0] little toy script for doing a silly animation when you click on the #rocket element.

Possibly the google analytics is doing something heavy (although it doesn't look to be when spot checking with a profiler), but there's otherwise nothing JS that runs continuously.

0: https://factorio.com/static/js/factorio.js

Diggsey5y ago

Strange. I'm using firefox on windows and didn't notice any problems.

Aachen5y ago

Firefox @ Linux, also no problems (and a crappy cpu at that). I've noticed some of their very-gif-heavy posts slowing down this laptop before, but not this post.

Smaug1235y ago

Likewise (Firefox 89.0.1 on macOS 10.14.6) I had to close the page once I'd scrolled down to the layout of the various building interfaces; ended up just reading the HTML.

diimdeep5y ago

Same, this page hangs entire Firefox on macOS

truncate5y ago· 4 in thread

> (1) no new features for a quarter or two, refactor the code, learn new methodologies etc

> (2) This allows you to instantly test what you write, and mainly use tests as specification

> (3) the problem comes when you break something and a lot of tests start to fail suddenly

sidlls5y ago

On the other hand, with TDD we often end up with code that has been butchered in the name of "testability," and which is both less efficient and more complex than necessary.

fendy30025y ago

Which IMO, a bad practice. Too much interface, abstraction, mocking do not reflect real process. I find it often break when integrated with services.

leprechaun10665y ago

2 more replies

fendy30025y ago

This should be the way. I really like three interation approach: research, stable, enforce.

happyweasel5y ago· 3 in thread

chii5y ago

> But would TDD help you reach that stage?

if your game has a lot of interactions, and you want to make sure that your changes are not causing unintended interactions, tests like these would help a lot during development.

marcosdumay5y ago

There is always a comment with that claim on a TDD thread. Just to clarify, writing tests is not TDD. There are plenty of ways you can have a codebase full of tests, TDD is only one.

But anyway, I doubt tests help at all in the prototype phase (by any procedure you want to get them). My guess is that they are incredibly harmful.

meheleventyone5y ago

If only game development worked this way!

ineedasername5y ago· 2 in thread

Every time a Factorio thread makes it to HN I feel like a fully recovered meth addict who suddenly has their old dealer knocking at their door:

Dealer: "Hey? You there? I got some meth for you."

Me: "Go away! I don't want any!"

Dealer: "Oh now don't say that. You remember how good it is? I know you do."

Me: "I can't, I can't afford it, the price is too high."

Dealer: "What? Come on, it's free! You already paid for it!.

Me: "I'll lose my job, I can't, just go away!"

Dealer: "Your JOB? This IS your job. Open the damn door, THE FACTORY MUST GROW"

Me: ::cowers in the closet chanting please leave please leave please leave::

Also it was never good. It was more like a mind virus. Like the sort of problem or project at work you can't stop thinking about until it's done. Only with Factorio, it's never done. Never.

My best defense against it are other video games I can stop & start when needed. Or booting up my VPN connection and picking a work task from my back log until the cravings go away.

hatsuseno5y ago

LegitGandalf5y ago

I always say, never start with a zero vacation balance!

IMTDb5y ago· 2 in thread

> now there are 9 programmers

Aditya_Garg5y ago

Okay imagine you are a small startup backed with VC money. The same situation can still arise.

BowBun5y ago

Except that as the author of the article says, they don't owe investors any explanations. VC money will demand results which you cannot just ignore for a quarter.

harryf5y ago· 2 in thread

Came here hoping they’d turned Factorio into a tool of creating tests in other codebaes. Like literal gamification of work.

dgb235y ago

You might have looked at Flow Based Programming?

It has certain characteristics that align with a game like Factorio or Oxygen Not Included etc. such as visual programming, backpressure, common interfaces, local retention etc.

I can imagine this being applied to distributed/cloud computing as a way to reason about high level interactions and perhaps functional/integrated testing.

ashtonkem5y ago

I’ve done a bit in my home automation; it’s no replacement for a scripting language.

kevmo3145y ago· 1 in thread

It's interesting to think about the other HN thread discussion about comments vs one-time-call function abstractions in this light: https://news.ycombinator.com/item?id=27546135

I'm a big fan of "put code in one place" too. It was the biggest factor that convinced me that JSX was a great idea compared to separating the templating logic out.

adflux5y ago

Agreed, which is why I love vue components so much

tgtweak5y ago· 1 in thread

Damn I misread TTD and got excited that they were building it into factorio... Great article though, was not dissapointed.

dvgt5y ago

Exactly the same thing happened to me.

Aardwolf5y ago· 1 in thread

I used to follow friday facts until it stopped being weekly. I'm glad that every friday fact now gets posted on hacker news, that serves as my notification for new ones :)

I also misread TDD as TTD (related to the trains in factorio) first

depaya5y ago

The trains are my favorite part of Factorio. I would love "TTD but in Factorio"

ashtonkem5y ago

Honestly, I think the factorio team probably now knows more than Uncle Bob does, based on their blog posts.

nanis5y ago

This is a neat article. I do have comments about testing in general though.

IME most developer do not understand each test has four possible outcomes:

* Code is good and test passes

* Code is bad and test fails

These are the only two possible outcomes developers focus on: When I ask what they should do if a test that used to pass now fails, they always tell me stories about how to debug the code under test.

There are two additional possibilities in test:

* Code is bad yet test passes (false negative)

* Code is good yet test fails (false positive)

Again, IME, most people do not look at the test again once it passes for the first time.

As a result, tests which are themselves code, become the largest untested part of the code base. You get these thousands and thousands of lines of untested code yet you have 100% code coverage.

Some of my blog posts on testing:

* Deception in tests considered harmful https://www.nu42.com/2017/02/deception-in-tests-harmful.html

* Know what you are testing: The case of the test for median in Boost.Accumulators C++ Library <https://www.nu42.com/2016/12/cpp-boost-median-test.html>

* Who is testing the tests? https://www.nu42.com/2015/05/who-is-testing-the-tests.html

* Slashing one's feet with tests, or, how to fix 2,950 test failures in one fell swoop https://www.nu42.com/2015/08/fix-2950-test-failures.html

bluGill5y ago

AwaAwa5y ago

While I'm ambivalent on TDD, there seems to be an attempt at cancellation brewing for his invocation of Uncle Bob.

swiley5y ago

Factorio convinced me that some people still write good closed commercial games. I wish the best for the authors and hope they don't stop any time soon.

j / k navigate · click thread line to collapse