If the metric is COINCIDENT with the goal, then it's not a problem. You can't get better at winning races without improving your race time (well... you can degrade everyone else's times I suppose).
But if the metric is a PROXY for the goal, then the metric becomes the objective (not the actual goal).
In this case, whatever this AI is measuring is going to be exactly what every dev drops into their Github Copilot prompt instructions.
Making a metric an objective is an effective if often very costly way of testing the hypothesis that it is coincident rather than a proxy (which in cases more complex than racing is frequently a matter of dispute.)