> Generative models can certainly create midi, but no one has done it yet.
Note sequence generation from statistical models has a long history, at least as long if not longer than text generation.
Have a look at section 2.1 of this survey paper [0] that cites a paper from 1957 as the first work that applies Markov models to music generation.
And, of course, plenty of follow-up work 6 decades later on GANs, LSTMs, and transformers.
[0]: https://www.researchgate.net/publication/345915209_A_Compreh...