If I were going to guess (not the parent), it's because human parsing is quite localised, and Markov models tend to optimise well for localised coherence.
Their failure mode is long sentences which don't go anywhere (or mean anything), but that's less grating than a blatant parse failure, especially if you're skim reading.