It's the deep learning equivalent of a souped-up Honda Civic. Sure, with enough tweaking it'll eventually be competitive, but you could have just bought a racecar.
Variant calling doesn't look like it needs to be turned into an image in the first place. You'd probably be better off feeding a regular, non-convolutional network some tabular data.
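
To make the tabular idea concrete, here's a rough sketch of what that could look like: summarize each candidate site as a handful of numbers and push them through a small fully connected network that outputs genotype probabilities. Everything here is an assumption for illustration: the feature names, the scaling constants, the layer sizes, and the three genotype classes are all made up, and the weights are random and untrained, so the printed probabilities mean nothing. It only shows the shape of the approach, not how any real variant caller works.

```python
import math
import random

# Hypothetical per-site features summarizing a read pileup (illustrative only).
FEATURES = ["ref_reads", "alt_reads", "mean_base_qual", "mean_map_qual"]
GENOTYPES = ["hom_ref", "het", "hom_alt"]


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for one dense layer with small random weights."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer):
    """Apply a fully connected layer: one weighted sum plus bias per output unit."""
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(v):
    """Numerically stable softmax: subtract the max before exponentiating."""
    m = max(v)
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]


def featurize(site):
    """Turn one site's summary stats into a scaled feature row (scales are assumed)."""
    depth = max(site["ref_reads"] + site["alt_reads"], 1)  # avoid division by zero
    return [
        site["ref_reads"] / depth,
        site["alt_reads"] / depth,
        site["mean_base_qual"] / 40.0,
        site["mean_map_qual"] / 60.0,
    ]


def predict(site, hidden, output):
    """Forward pass: features -> hidden ReLU layer -> softmax over genotypes."""
    h = relu(dense(featurize(site), hidden))
    return softmax(dense(h, output))


rng = random.Random(0)
hidden = init_layer(len(FEATURES), 8, rng)   # untrained weights
output = init_layer(8, len(GENOTYPES), rng)  # untrained weights

site = {"ref_reads": 14, "alt_reads": 12, "mean_base_qual": 33.0, "mean_map_qual": 58.0}
probs = predict(site, hidden, output)
print(dict(zip(GENOTYPES, probs)))
```

In practice you'd train this on labeled sites with a real framework; the point is just that the input is one row of numbers per site rather than a rendered pileup image.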