l33tman 2y ago

Research into the internals of these networks has shown that they work out a correct 2.5D representation of the scene internally before they produce the RGB textures. So yes, it seems they have an internal representation of the scene, and they can do enough inference from it to make shadows and lighting look natural.

I guess it's not that far-fetched, since your brain has to do the same thing to notice when a scene (AI-generated or not) has some weird issue that should pop out.

Chirono 2y ago

Interesting! Do you have a link to that research?

l33tman (OP) 2y ago

Certainly: https://arxiv.org/abs/2306.05720

It's a very interesting paper.

"Even when trained purely on images without explicit depth information, they typically output coherent pictures of 3D scenes. In this work, we investigate a basic interpretability question: does an LDM create and use an internal representation of simple scene geometry? Using linear probes, we find evidence that the internal activations of the LDM encode linear representations of both 3D depth data and a salient-object / background distinction. These representations appear surprisingly early in the denoising process - well before a human can easily make sense of the noisy images."
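
For anyone wondering what a "linear probe" looks like in practice, here is a rough sketch. This is not the paper's code: the "activations" are synthetic stand-ins in which a depth value is planted along one hypothetical direction, and names like `train_linear_probe` are made up for illustration. The idea is just that if a plain linear map can predict depth from activations on held-out data, the information is linearly encoded.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def train_linear_probe(activations, targets, lr=0.02, epochs=50):
    """Fit targets ~ w . activation + b with plain stochastic gradient descent."""
    w = [0.0] * len(activations[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(activations, targets):
            err = dot(w, x) + b - y  # prediction error on this sample
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    return w, b

def r_squared(y_true, y_pred):
    """Fraction of target variance explained by the predictions."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Synthetic stand-in for real model activations (an assumption, not real data):
# each "activation" is depth * direction plus a little Gaussian noise.
rng = random.Random(0)
dim = 16
direction = [rng.gauss(0, 1) for _ in range(dim)]
depths = [rng.uniform(0.0, 1.0) for _ in range(400)]
acts = [[d * u + rng.gauss(0, 0.05) for u in direction] for d in depths]

# Train on the first 300 samples, evaluate on the remaining 100.
w, b = train_linear_probe(acts[:300], depths[:300])
predictions = [dot(w, x) + b for x in acts[300:]]
score = r_squared(depths[300:], predictions)
print(f"held-out R^2: {score:.3f}")
```

With real activations you would swap the synthetic `acts` for features captured from the network at some layer and denoising step, and `depths` for depth labels of the corresponding pixels or patches.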

nojvek 2y ago

What does 2.5D mean?

l33tman (OP) 2y ago

You usually say 2.5D when something is 3D but seen only from a single vantage point, with no information about the back-facing sides of objects. It's like the representation you get from the depth sensor on a mobile phone, or from trying to extract depth from a single photo.
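
A small sketch of the idea, assuming a simple pinhole camera: a depth map stores one distance per pixel, so back-projecting it gives exactly one 3D point per pixel, which is the visible surface and nothing behind it. The function name `backproject` and the intrinsics (`fx`, `fy`, `cx`, `cy`) are hypothetical values chosen for illustration.

```python
def backproject(depth_map, fx, fy, cx, cy):
    """Turn a depth map into camera-space 3D points, one per pixel."""
    points = []
    for v, row in enumerate(depth_map):
        for u, z in enumerate(row):
            # Pinhole model: pixel (u, v) at depth z maps to (x, y, z).
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points

# A 3x3 depth map: a near object (depth 2) in front of a far wall (depth 4).
depth_map = [
    [4.0, 4.0, 4.0],
    [4.0, 2.0, 4.0],
    [4.0, 4.0, 4.0],
]

points = backproject(depth_map, fx=1.0, fy=1.0, cx=1.0, cy=1.0)
print(len(points))  # one point per pixel
print(points[4])    # the centre pixel, on the near object
```

Nothing in `points` describes the back of the near object or the part of the wall it hides, which is the "half" dimension that is missing.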

shsbdncudx 2y ago

It means you should be worried about the guy she told you not to worry about
