I added this to my benchmark of models looking for Mythos-reported security bugs. Unsurprisingly, it found 0. There is, after all, a lower bound on how small a model can be and still find security bugs.
https://swelljoe.com/post/will-it-mythos/

It can seemingly reliably write working Python code though, which is impressive for such a little guy.