I'm not an optics expert, but I expect that the physical system's spatial bandwidth is much higher than the sensor's bandwidth. That is to say, if there were more light sensing elements I think the spatial bandwidth would be higher.
I don't think the artifacts are directly from aliasing but rather an artifact of software interpolation.
It anyone knows better please correct me.