You can't do that without commiting to
not supporting pathological text, otherwise you're stuck adding new special cases to the layout engine every update
anyway.
I do have some ideas for a better encoding (like, I assume, anyone competent with sufficient free time and interest in text encoding), but there's a lot of reluctance to put effort into something that's already completely eclipsed by a technically inferior but not completely unusable alternative, so I've had it mostly shelved.