"Table 2 compares timings on the four scenes in Figure 1 for our
unoptimized RenderFormer (a pure PyTorch implementation without
DNN compilation, but with pre-cached kernels) and for Blender
Cycles at 4,096 samples per pixel (matching RenderFormer’s training
data), both at 512 × 512 resolution on a single NVIDIA A100 GPU."
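
As an illustration of what timing with pre-cached kernels can look like in practice, the following minimal sketch runs a few untimed warm-up calls before measuring, so that one-time setup costs are excluded from the reported time. The helper name `time_with_warmup`, its parameters, and the warm-up and run counts are hypothetical and are not taken from the RenderFormer code. For a PyTorch model on a GPU, one would typically pass `torch.cuda.synchronize` as `sync` so that asynchronous kernels finish before the clock stops.

```python
import time
from statistics import mean


def time_with_warmup(fn, warmup=3, runs=10, sync=None):
    """Return the mean wall-clock seconds per call of `fn`.

    Hypothetical helper: `warmup` untimed calls run first so that one-time
    costs (for example kernel loading or caching) do not enter the timing.
    `sync` is an optional callable that blocks until pending device work is
    done (an assumption: e.g. torch.cuda.synchronize for a CUDA workload).
    """
    def call():
        fn()
        if sync is not None:
            sync()  # wait for asynchronous work before reading the clock

    # Untimed warm-up calls.
    for _ in range(warmup):
        call()

    # Timed calls.
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        call()
        samples.append(time.perf_counter() - start)
    return mean(samples)


if __name__ == "__main__":
    # Stand-in CPU workload; a real benchmark would render a scene here.
    seconds = time_with_warmup(lambda: sum(i * i for i in range(100_000)))
    print(f"mean time per call: {seconds:.6f} s")
```

Averaging over several timed runs, rather than reporting a single run, reduces the influence of scheduling noise; the specific counts used here are arbitrary defaults.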