You don't need to understand anything about the math to run a random, or grid, or bayesian optimization, or whatever search of the hyperparameter space.
I feel like the lack of connection from all the math requires oneself to understand all the math, which is very difficult to do.
Is there any way to explain gradient boosting via category theory?