Large was not obviously better than medium when I tried it. My impression was that it leaned more on its language model than on the sounds actually heard, which corrected some errors and introduced others. I didn't try many songs, though, because large won't run on my GPU.