In this case, it’s fair to say the machine, by analyzing pixels, can’t figure out perspective very well. The human can do that just fine, given an interface mechanism.
The machine is good at detecting edges and seeing similarity between pixels. Given hints from the human ("this point is within an object", "here is the perspective"), the machine can infer the limits of the object from edges/colors and project it into three dimensions. Amazing.
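If it helps make that idea concrete, here's a toy sketch (definitely not the paper's actual algorithm) of how a single "this point is inside the object" hint can let the machine grow a region out to the nearest color/edge boundary. I'm using plain OpenCV flood fill as a stand-in, and the image path and click coordinates are made up:

```python
# Toy illustration (not the paper's method): given one human hint
# ("this point is inside the object"), grow the region outward based on
# color similarity, stopping where the colors change sharply (an edge).
import cv2
import numpy as np

img = cv2.imread("photo.jpg")              # hypothetical input photo
seed = (240, 180)                          # hypothetical user click (x, y)

# floodFill requires a mask 2 pixels larger than the image in each dimension.
h, w = img.shape[:2]
mask = np.zeros((h + 2, w + 2), np.uint8)

# loDiff/upDiff bound how far colors may drift from the seed before the
# fill stops, i.e. where we treat the pixels as the object's boundary.
cv2.floodFill(img, mask, seed, newVal=(0, 0, 255),
              loDiff=(10, 10, 10), upDiff=(10, 10, 10),
              flags=4 | cv2.FLOODFILL_MASK_ONLY | (255 << 8))

object_mask = mask[1:-1, 1:-1]             # rough per-pixel estimate of the object's extent
```

The real system obviously does much more (the sweep gesture, the perspective fit, the 3D projection), but the human-hint-plus-local-similarity loop is the same basic division of labor.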
This is just in case you want to throw a few upvotes their way for being first. It also illustrates that late-night (PDT/UTC-8) posts don't get a whole lot of votes, and that timing is crucial to getting traction.
Personally, I'm just glad to see this video finally getting traction. It really is such a cool demo. It even stands out in the field of consistently high-quality SIGGRAPH demos. Can't wait to read the paper!
It's weird that it has received quite a few votes each time and never made it to the front page. Was it a timing issue (late night, early morning, non-American hours) or is YouTube "weighted down" somehow?
This is indeed magic. I'm so happy to live in this age, and be part of the "Sorcerers' Guild".
Also, with the shiny objects, could you specify the material properties and have it "back out" the reflection, so that the reflection is recomputed as you move the shape around?
Forget the Photoshop stuff, this needs to be integrated with 3D printing immediately.
Spit out a design file into Tinkercad[1] for some minor adjustments and BAM, you've made a printable 3D model.
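For what it's worth, the "extract a profile, revolve it, print it" pipeline isn't far off. Here's a rough Python sketch, with a made-up vase profile and the trimesh library standing in for whatever the tool would actually export; the result is an STL you could pull into Tinkercad or a slicer:

```python
# Sketch: build a vase-like surface of revolution and write it as an STL.
# The profile values are invented for illustration; a real pipeline would
# take the profile traced/extracted from the photo instead.
import numpy as np
import trimesh

heights = np.linspace(0.0, 10.0, 30)                  # z positions along the profile
radii = 2.0 + 1.5 * np.sin(heights / 10.0 * np.pi)    # hypothetical radius at each height
segments = 48                                          # angular resolution of the revolve

angles = np.linspace(0.0, 2.0 * np.pi, segments, endpoint=False)
verts = [(r * np.cos(a), r * np.sin(a), z)
         for z, r in zip(heights, radii)
         for a in angles]

faces = []
for i in range(len(heights) - 1):
    for j in range(segments):
        a = i * segments + j
        b = i * segments + (j + 1) % segments
        c = (i + 1) * segments + (j + 1) % segments
        d = (i + 1) * segments + j
        faces.append((a, b, c))       # each quad of the revolve becomes
        faces.append((a, c, d))       # two triangles

mesh = trimesh.Trimesh(vertices=np.array(verts), faces=np.array(faces))
mesh.export("vase.stl")               # import into Tinkercad or a slicer
```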
This technology is awesome. If it's as user friendly as they make it look, I could see a lot of applications for it!
For example, I have only tried my hand at 3D modelling once or twice (and sucked at it enough to give up), but just watching this I feel like I could model vases and lamp posts with a bit of practice.
These guys/girls know what they're doing.
Indeed, and it's very impressive work.
It makes sense that this is the case, because this system is doing edge detection with fairly strict constraints: the edges must match the outline of a simple shape whose size and orientation you roughly know. That seems inherently going to yield better results than the completely unconstrained edge detection you get in Photoshop...
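You can see the difference even in a trivial OpenCV example (just an analogy, not what this system does): raw Canny gives you every edge in the photo, while a Hough search that's told "look for a circle of roughly this size" snaps onto the one outline you care about. The file name and radius range here are made-up assumptions:

```python
# Unconstrained vs. constrained edge finding, as a rough analogy.
import cv2

gray = cv2.imread("photo.jpg", cv2.IMREAD_GRAYSCALE)   # hypothetical input

# Unconstrained: every edge in the image, clutter included.
edges = cv2.Canny(gray, 50, 150)

# Constrained: only edges consistent with a circle in a known size range
# contribute to the vote, so the result is far more robust.
circles = cv2.HoughCircles(gray, cv2.HOUGH_GRADIENT, dp=1.5, minDist=50,
                           param1=150, param2=40,
                           minRadius=40, maxRadius=80)
```

Knowing the shape family and rough size up front is doing most of the work, which is exactly the advantage this system gets from the user's sweep gesture.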
I wonder if it's just a coincidence, or whether the mega-bucketloads of money the film industry throws at CGI are a major factor in funding related research even in academia?
Sure, understood.
The thing is, I imagine film VFX guys are already doing this kind of task—making 3D versions of real objects from the movie and doing CGI additions from them—and tools like this (with, as you say, refinements) could be a great help in speeding up that process...
Edit: I am aware that Photoshop has some of this available. I've not played with it so I don't know how they compare.
The impressive thing here, imho, is the seemingly effortless and seamless transition and replacement. The background is fixed and the surface texture is stretched in what seems like real time.
I vote for this to be used with a 3D printer.
However, it seems strange in the first example how mountain ranges appear where none were before... how did the algorithms know to put them there?