To address this, I developed YPerf—a simple webpage designed to monitor the performance of inference APIs. I hope it helps you select better models and discover new trending ones as well.
The data is sourced from OpenRouter, an excellent provider that aggregates LLM API services.