Every time I've reviewed these sort of code tests I've never even run the code. I just looked if the approach looked alright, and to be honest I don't even care if it gives a slightly wrong answer because everyone can make an error; this is why we got code reviews and such. It's like physics and math classes at school; at least at my school an error in arithmetic was only a small minus if your general approach was okay (one reason I'm not in favour of computerized testing/ratings, because they can only look at the answer).
I don't know what your specific code test was about so I can't really judge, but generally I've considered overcomplicated solutions to be a minus. "I asked you to write a simple CLI and I get some container environment" is not something I would consider positive anyway.