I know that, I'm in this game. I was comparing the API throughput/ttft/ttbt of DeepSeek's own R1 API,
before it went viral in the West, against o3-mini.
I remain unconvinced that DeepSeek themselves failed to optimize their own V3 inference well enough and left another 2x~3x improvement on the table.