>Gaussian splats are 3d, but currently rendered as 2d cards ordered on centre, not per pixel like they should be.
>So as centres reorder, they "shuffle". This is a showstopper for general adoption in VR
>The solution? Ordering per pixel will solve this, as would stochastic methods which write depth correctly (though those have other problems for VR).
>For it to be viable you need to cull much more aggressively but also can't have any warp divergence when processing samples.
>Unfortunately those things are in direct conflict. Paper's renderer is at extreme end of "weak cull, perfect warp convergence".
Apparently we need new Order-Independent Transparency (OIT) algorithms that handle depth layers.
[1] https://x.com/charshenton/status/1710207169407447396?s=20
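To make the "shuffle" concrete, here's a toy sketch (my own made-up depths, not from the linked thread): with tilted, elongated splats, sorting by centre depth can disagree with the true per-pixel depth order, so the blend order at a pixel flips as centres reorder.

```python
def order_mismatch(center_depth, pixel_depth):
    """True when sorting splats by centre depth disagrees with the
    true near-to-far order at a particular pixel."""
    centre_order = sorted(center_depth, key=center_depth.get)
    pixel_order = sorted(pixel_depth, key=pixel_depth.get)
    return centre_order != pixel_order

# Two tilted splats: A's centre is nearer than B's, but A is stretched
# away from the camera, so at this pixel A's surface is behind B's.
center_depth = {"A": 1.0, "B": 1.2}   # depth of each splat's centre
pixel_depth = {"A": 1.4, "B": 1.1}    # depth of each splat at one pixel
print(order_mismatch(center_depth, pixel_depth))  # True: this pixel "pops"
```

Per-pixel ordering (or OIT) resolves exactly this disagreement, at the cost of the extra sorting/culling work described above.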
The next step I wanted to try was to super-sample (e.g. render at double resolution) and linearly downsample. I imagine this would give a slightly noisy result that I think would be pretty visually pleasing.
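For what it's worth, the downsampling half of that idea is just a box filter over 2x2 pixel blocks. A minimal numpy sketch (toy data, not tied to any particular renderer):

```python
import numpy as np

def downsample_2x(img):
    """Linearly downsample a 2x-supersampled frame (H, W, C) by
    averaging each 2x2 pixel block."""
    h, w = img.shape[:2]
    # Crop to even dimensions, then group pixels into 2x2 blocks and average.
    return img[:h // 2 * 2, :w // 2 * 2].reshape(h // 2, 2, w // 2, 2, -1).mean(axis=(1, 3))

# Pretend this 4x4 single-channel image was rendered at double resolution.
hi = np.arange(16, dtype=np.float64).reshape(4, 4, 1)
lo = downsample_2x(hi)
print(lo[..., 0])  # 2x2 result: [[2.5, 4.5], [10.5, 12.5]]
```

Each output pixel is the mean of its four source pixels, which is what averages the per-pixel sorting noise down.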
* https://aras-p.info/blog/2023/09/05/Gaussian-Splatting-is-pr...
* https://aras-p.info/blog/2023/09/13/Making-Gaussian-Splats-s...
* https://aras-p.info/blog/2023/09/27/Making-Gaussian-Splats-m...
Gsplat gets a mention:
Aside: the excellent gsplat.tech by Jakub Červený (@jakub_c5y) seems to also be using some sort of VQ/clustering for the data. Seriously, check it out, it's probably the nicest gaussian splatting thing right now w.r.t. usability – very intuitive camera controls, nicely presented file sizes, and works on WebGL2. Craftsmanship!

The problem is "how do you do deep learning 3D scene reconstruction", aka "how to make a 3D equivalent of Stable Diffusion".
Voxels are a poor fit: a dense grid takes too much memory, and the resolution is uniform (each voxel is e.g. 5x5x5 cm, regardless of how much detail is there).
Gaussians are kind of like "variable-sized voxels" that also have an orientation and a view-dependent color, and can stretch.
Imagine 3D blobs, basically (3D capsules, or more like 3D density bubbles with transparency).
So a scene can be represented in 3D much more efficiently using Gaussian splats, which is why they are used for "3D Stable Diffusion".
And now that this has come into use, we also need an efficient way of rendering them.
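A back-of-the-envelope comparison (my own illustrative numbers, not from this thread) of why uniform grids hurt: to capture fine detail anywhere, a uniform grid must be fine everywhere.

```python
# A 10 m x 10 m x 3 m room at 5 mm uniform resolution, 4 bytes (RGBA) per voxel:
voxels = round(10 / 0.005) * round(10 / 0.005) * round(3 / 0.005)
voxel_bytes = voxels * 4
print(f"{voxels:,} voxels, {voxel_bytes / 2**30:.1f} GiB dense")

# Versus roughly 1M Gaussians at ~60 bytes each (position, covariance,
# opacity, view-dependent colour coefficients -- a rough per-splat budget):
splat_bytes = 1_000_000 * 60
print(f"{splat_bytes / 2**20:.0f} MiB of splats")
```

The splats spend their budget only where there is surface detail, which is the whole point of "variable-sized voxels".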
> It's for deep learning, not that much "for graphics".
No, while it is true that there is some overlap between the techniques and concepts used, gaussian splatting isn't necessarily "for deep learning". The library provides a differentiable rasterizer for gaussian splats. This basically means that you can ask it "if I want my output image to change in this and this direction, in what direction should I change the position / orientation / color / ... of my splats?". This enables users to plug it into other software (that is also commonly used for deep learning) and to optimize the parameters of the splats to represent a particular scene.
Since it's primarily a differentiable rasterizer for splats I think it's fair to say that it is "for graphics".
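As a toy illustration of what "differentiable rasterizer" buys you (a 1D sketch with hand-derived gradients, nothing like the real CUDA implementation): render one Gaussian splat onto a pixel row, compare against a target image, and gradient-descend on the splat's centre so the rendered image matches the target.

```python
import numpy as np

xs = np.arange(32, dtype=np.float64)       # pixel coordinates along one row
SIGMA = 3.0                                # fixed splat width for this sketch

def render(mu):
    """'Rasterise' a single 1D Gaussian splat centred at mu."""
    return np.exp(-0.5 * ((xs - mu) / SIGMA) ** 2)

target = render(20.0)                      # the image we want to reproduce
mu = 10.0                                  # bad initial guess for the centre
for _ in range(200):
    img = render(mu)
    resid = img - target                   # dL/d(img) for L = 0.5 * sum(resid**2)
    dimg_dmu = img * (xs - mu) / SIGMA**2  # analytic derivative of the renderer
    mu -= 0.5 * np.sum(resid * dimg_dmu)   # one gradient-descent step on mu
print(round(mu, 2))                        # the splat slides onto the target centre
```

The real library answers the same "which way should I nudge each splat parameter?" question for millions of 3D splats at once, which is why it plugs into autograd frameworks so naturally.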
> The problem is "how do you do deep learning 3D scene reconstruction", aka "how to make a 3D equivalent of Stable Diffusion".
That it uses gradient descent doesn't mean that it is "deep learning". There are no neural networks or layers here.
It's not an "equivalent of stable diffusion". The way it's used now is to learn a representation of a single scene, not unlike photogrammetry. Sure, there may be other use cases for this library, but this is primarily what gaussian splatting is about.
And like another person explained, there's no deep learning. There's not even a neural net. In the wider NeRF literature there are neural nets, but they usually aren't deep. RegNeRF uses a deep neural network alongside a shallow net for regularization.
It seems like it took off after this SIGGRAPH paper: https://repo-sam.inria.fr/fungraph/3d-gaussian-splatting/
Anyway, this doesn't seem to allow any sort of dynamic lighting whatsoever, but for scans of existing places, such as house tours, street views and so on, this seems super promising.
So this method can only really be used for static scenes made from a point cloud?
I remember 10 years ago, a company called Euclideon revealed a demo[1] of a rendering technique (they called it "Unlimited Detail Real-Time Rendering") that put nearly every other rasterization technique of the time to shame. However, the major issue with the demo was that it only showed a static scene with zero dynamic lighting. I don't think they ever revealed the "secret sauce" of how it works, but it seems to have similar limitations to this new process, or at least that's what I expect considering it can't handle something as simple as a dynamic mesh.
It was extremely hyped up in certain areas, but over a decade later this technology isn't really used in any mainstream rendering engines because it only works for static scenes.
I hate to disparage new rendering techniques, but this really feels like a repeat of what happened back then. What exactly is different in 3D Gaussian Splatting? Can it be used in any dynamic scenes in real-time at all? Does it provide any significant advantages over this older system?
The new thing is fast regression of a real light field into Gaussian splats, which can then be rendered at reasonable rates. Dynamic lighting requires full inverse rendering as a preprocess, which is way beyond the scope of the technique. Technically Gaussian splats could form part of an inverse rendering pipeline, and also be the final target representation, but I'm not sure there would be any benefit over alternative rendering primitives.
https://www.youtube.com/watch?v=kShNYOuDnlI&t=224s
I got it recommended because I watched the following video:
https://www.youtube.com/watch?v=HVv_IQKlafQ&t=64s
Both are worth watching if you didn't know this existed.
You can manipulate the splats in a viewer, kind of like a point cloud, something to the effect of https://twitter.com/parazar/status/1713618668788109357 but there are a bunch around social media.
This has an entrypoint to the research and various projects so far: https://github.com/MrNeRF/awesome-3D-gaussian-splatting