This seems easily forgotten by a large number of people. I try to remind myself to step back from the hype and explore the lesser travelled paths.
> I'll give some examples that are easier to read[0-2]
I need to reach ResNet strikes back, it was one the first networks I implemented and it is cool to see it still being worked on.
I'll check out [3]. I've wondered recently how you could get a GAN to generate things out of distribution but that still look like the training data, if that even makes sense.
> the StyleGAN code is not the easiest to read lol
Yup, even the official PGGAN code was quite hard to understand. I'll try out the PyTorch compile I've heard a lot about it recently. I had thought TensorRT was for LLMs I suppose it's applicable in other areas too?
> so recognize this as a hyper-parameter
Okay that makes sense. I'll reread this after exploring Diffusion models too in the future.
> carefully study Goodfellow's original paper
This is something I have not done, my current workflow is just to understand how best to implement what is written. I think deep exploration is the next step, no matter how many "I know nothings" I will experience. This side of GANs I had not considered (the theoretical, it looked interesting but very complex).
> I hope this can help provide direction
It certainly will, I imagine I'll come back to this comment many times. Thanks for taking the time to read my posts and provide so much material for further study.
> if unfortunately hard to gain
I agree it is rewarding and I hope I can purvey some of this knowledge in my blog for others too! That was why I started it, so much knowledge is locked away and hard to access or understand without some guidance.