yes, single gpu open models exist. Now show me the one that can keep up with a SOTA api model on more than short code block evals.