TransparentHMD: Revealing the HMD User’s Face to Bystanders (2017) [pdf]

(medien.ifi.lmu.de)

128 points · ramboldio · 3y ago · 120 comments

AndrewKemendo · 3y ago · 16 in thread

Having patented technology for see through AR display in 2016 that is cited by Apple [1] and knowing how crazy hard it is, it's a little bit refreshing to know that Apple recognized pass-through HMD AR as too hard and decided to invest in compensatory technology instead of trying to solve the hard see through AR problems.

[1] https://patents.google.com/patent/US10757400B2/en

CharlesW · 3y ago

(Caveat: I don't know a lot about pass-through HMD AR, and I assume you're incredibly smart and that the patent is innovative.)

> …it's a little bit refreshing to know that Apple recognized pass-through HMD AR as too hard and decided to invest in compensatory technology…

I understand the framing as "compensatory technology", but is it possible that what Apple's doing is the simpler and better way to solve the problem? Pass-through AR strikes me as an old-school analog approach, like optical printing for special effects. But a 100% digital vision pipeline seems like it could unlock interesting capabilities like "night vision", new ways of highlighting interesting objects, etc.

gjsman-1000 · 3y ago

I think the most prominent attempt at see-through AR was the Microsoft HoloLens. But if you've actually tried the HoloLens, the field of view is atrociously, tragically small. The first HoloLens had a field of view of 30° × 17.5°. The second HoloLens improved to 43° × 29°, but it's still best described as "cramped." Couple that with almost all of the compute budget for the device going into vision processing, leaving very little compute for actually running apps (the first HoloLens having a 1 GHz Intel Atom from 2015, the second a superior... Snapdragon 850).

The other problem, of course, is that nothing can be truly solidly-colored. Everything has some opacity - which, combined with the FOV issue, is why HoloLens was never marketed as having anything to do with VR.

jolmg · 3y ago

> is it possible that what Apple's doing is the simpler and better way to solve the problem? Pass-through AR strikes me as an old-school analog approach, like optical printing for special effects.

I think they're different and not one better than the other. If I wanted to drive with an HMD on, or otherwise be in a situation where it could be deadly to have my sight turned off for even a second or to even have some lag or stutter or other glitch in my eyesight, I'd much rather have a pass-through AR HMD. One's sense of sight seems much more reliable with it by its very nature. You simply don't have those modes of failure with transparent plastic, no matter what's going on in the hardware/software.

AndrewKemendo · 3y ago

In theory the wavefront emulator is the simplest display - all of the "pixels" are rendered in your brain so you don't have to build an actual "display"

HOWEVER, if someone can get the input -> photon production pipeline to be less than ~10ms, then that does solve a lot of the rendering issues. It doesn't solve all of the other long-term problems that come with that amount of hardware, though, including weight and complexity.

That said, there are a lot of known unknowns that need to be solved; for example, I don't have a solution for the micropiezo resonance issues that I'm sure will crop up.

alfalfasprout · 3y ago

Agreed. In fact, this approach is also likely better for military applications, which I hope they explore.

Currently, NVGs that are fielded by soldiers are already displaying images in a way that's not pass-through (using classic image intensification tubes). Something like the Apple Vision headset in a lighter and more durable form factor would allow for, e.g., fusion imagery (fusing visible, thermal, and night vision).

warning26 · 3y ago

I'm quite certain they're still trying to solve the hard see-through AR problems, and are hoping to release a future Vision headset with a true see-through display.

But otherwise I agree, it makes sense for them to focus on what they can do best with current tech as a stopgap. With current see-through HMD tech, AR ends up incredibly disappointing. (See also: HoloLens & Magic Leap's limited FOV)

samwillis · 3y ago

My understanding is that occlusion in "see through" AR is an unsolved problem: everything looks ghostly and somewhat semitransparent. Until someone solves that, I suspect the re-projection method is the only viable option.

AndrewKemendo · 3y ago

Right, I think it's going to come eventually but it's really really really hard

mensetmanusman · 3y ago

They are still asking suppliers to solve the technology, yes.

birdyrooster · 3y ago

Hopefully we are not forgetting about Bosch's Retinal Projection which solves many problems with lensing HMD AR. If I was a betting man I would say Bosch came up with this tech almost specifically for Apple to integrate it into interactive glasses.

nickelbob · 3y ago

Just curious - why is it so hard?

slg · 3y ago

I'm not aware of all the details around the technical complications, but from a physics perspective, you can't make something darker and more opaque by adding more light. Therefore, an AR headset needs to be able to at least partially block light in order to make convincing images. It seems a lot easier just to go with Apple's approach of blocking all light rather than try to develop tech that will only selectively block the light behind the AR objects while allowing other light through.

The blocking all light approach also allows you to hide other potential weaknesses of a device. For example, a lower field of view is much more distracting in a pass-through AR device as you still have your full peripheral vision. VR devices will generally black out the light outside of the FOV making it easier to ignore.
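
A rough way to see the "you can't darken by adding light" point is a toy per-pixel model. This is only a sketch under simplifying assumptions: brightness is a single number in [0, 1], an optical see-through display can only add its emitted light to the real-world light, and a video passthrough display replaces the pixel with an alpha blend. The function names are made up for illustration and do not come from any headset SDK.

```python
def optical_see_through(background: float, emitted: float) -> float:
    """Optical see-through: display light is added on top of real-world light."""
    return min(1.0, background + emitted)


def video_passthrough(background: float, virtual: float, alpha: float) -> float:
    """Video passthrough: the screen shows a blend, so a virtual pixel can fully replace the camera pixel."""
    return alpha * virtual + (1.0 - alpha) * background


bright_wall = 0.8   # real-world brightness behind the virtual object
black_pixel = 0.0   # the virtual object wants to be black

see_through_result = optical_see_through(bright_wall, black_pixel)
passthrough_result = video_passthrough(bright_wall, black_pixel, alpha=1.0)

print(see_through_result)  # the wall still shows through at full brightness
print(passthrough_result)  # the black pixel fully covers the wall
```

In this model the see-through result can never be lower than the background, whatever the display emits, which is why black virtual objects come out transparent unless the optics can also block incoming light.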

AndrewKemendo · 3y ago

Off the top of my head:

1. Field of view is limited by existing optics miniaturization

2. Subtractive shading (rendering black) might not be solvable

3. Variable focus objects in the same scene require projecting n>2 significantly different wavefronts - not solved how to do this with a single vibrating element

jacobn · 3y ago

It's one of those "small, low energy, bright, pick 1-2, definitely not 3" type situations.

Moore's law works great for semiconductors, but Maxwell doesn't negotiate ;)

Animats · 3y ago

Nobody has a good way to draw dark while keeping it in focus.

mshockwave · 3y ago

Another follow-up, slightly tangential question: how does the F-35's helmet do that without blocking all the light?

billconan · 3y ago · 12 in thread

I prefer no outward-facing display if that can make Vision Pro cheaper.

haswell · 3y ago

I think that more than anything, the inclusion of this feature is a hint about where Apple intends to take this product, and that they want to send a crystal clear message that this device is meant for interacting with other people.

And it seems this is such an important aspect of the product that they're willing to reduce the addressable market from a cost perspective.

This, to me, is what makes this product intriguing. And it makes me think that Apple's real goal is something closer to a pair of glasses, and they just know they can't get there without a long series of iterations.

JohnFen · 3y ago

> that they want to send a crystal clear message that this device is meant for interacting with other people.

But that display increases the "creepy factor" by orders of magnitude.

drcode · 3y ago

in apple's eyes, that would make you a VR zombie

apple has decided VR zombies hurt their brand & they won't allow it

billconan · 3y ago

the world is biased against us introverts by forcing us to socialize.

you see, we wear a noise cancelling headset to pretend to be working, in order to avoid unwanted socialization.

layer8 · 3y ago

This is funny, because to my eyes their main marketing image for the Vision Pro (dark-skinned woman) makes a face like a dazed zombie.

ramboldio (OP) · 3y ago

I have to try it out to see whether it's worth it. But since the display can be fairly low-resolution, I don't expect that it adds a lot of cost. Weight would be a bigger concern to me.

mickdarling · 3y ago

My concern is primarily power draw. For a device with only a 2 hour battery, every erg matters.

dangus · 3y ago

How heavy is an OLED panel? Isn't it like a piece of flexible plastic?

Early reviewers seem to say that the metal construction of the Vision Pro seems to be contributing a lot to its weight. Most other headsets are all plastic.

dangus · 3y ago

I think it's one of the most important features of the design. Apple is trying to get VR/AR users out in public so that it can be a mainstream device.

In the long run, adding a second screen isn't that expensive, and the cameras that capture the video of your eyes already have to be inside the system to perform eye tracking. If smartphone manufacturers can make folding phones with second screens for under $1000 I think that the outward-facing display is not the lowest hanging fruit for cost reduction.

woah · 3y ago

Apple isn't generally in the habit of cutting corners

ec109685 · 3y ago

The first iPhone only had 2G, the first iPad was super slow, and the first Apple Watch had unusable apps.

They’re always evaluating if they can cut a corner.

jackmott42 · 3y ago

They cut headphone jacks to save space. No reason you can't cut a pointless outer display to save space too!

ladberg · 3y ago · 7 in thread

This is not the same as the Vision Pro's display. This paper describes tracking a single other person and displaying a perspective-correct rendering for them, but the Vision Pro displays a perspective-correct rendering for many viewpoints at once using a lenticular screen.

Apple's solution works for >1 people at the same time and doesn't require any external tracking (though it's already doing the external tracking regardless), at the cost of lower resolution and only being correct in one dimension vs two.

ramboldio (OP) · 3y ago

Adding multiple viewpoints is actually something Meta first proposed, based on the paper from above:

https://research.facebook.com/blog/2021/08/display-systems-r...

jessriedel · 3y ago

More detail from that post:

> There are several established ways to display 3D images. For this research, we used a microlens-array light field display because it’s thin, simple to construct, and based on existing consumer LCD technology. These displays use a tiny grid of lenses that send light from different LCD pixels out in different directions, with the effect that an observer sees a different image when looking at the display from different directions. The perspective of the images shift naturally so that any number of people in the room can look at the light field display and see the correct perspective for their location.

> As with any early stage research prototype, this hardware still carries significant limitations: First, the viewing angle can’t be too severe, and second, the prototype can only show objects in sharp focus that are within a few centimeters of the physical screen surface. Conversations take place face-to-face, which naturally limits reverse passthrough viewing angles. And the wearer’s face is only a few centimeters from the physical screen surface, so the technology works well for this case — and will work even better if VR headsets continue to shrink in size, using methods such as holographic optics.

jessriedel · 3y ago

Do you know where one could read more about Apple's technique? I don't know much about lenticular displays or why the trick only works in one direction (presumably the horizontal one).

ladberg · 3y ago

Think of it like those movie posters or bookmarks that change as you move from side to side, but with a screen behind it.

The Wikipedia article might explain it better: https://en.wikipedia.org/wiki/Lenticular_lens

It could work in both dimensions, but you're sacrificing even more resolution by doing it that way. For example, imagine you have a 1000x1000 pixel display (I just made this resolution up) and you stick a 1D lenticular screen on top with a pitch of 10 pixels. You've effectively split the display into 10 separate 100x1000 displays that are each viewed from a different angle. You could instead use a 2D lenticular screen and split it up into 100 separate 100x100 displays, each viewable from a different angle in a 10x10 grid, at virtually no extra $ cost. However, each view then has 1/10th the resolution of the 1D case, just to be able to support perspective-correct views from above or below, which are way less common than from the side.
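
The arithmetic above can be checked with a few lines of Python. This is just a sketch of the made-up 1000x1000 example, assuming the pitch divides the panel evenly and each lens spreads exactly `pitch` pixels across distinct viewing angles; `lenticular_views` is a hypothetical helper, not anything from Apple's or Meta's implementation.

```python
def lenticular_views(width: int, height: int, pitch: int, two_d: bool = False):
    """Return (number_of_views, per_view_width, per_view_height) for a lenticular panel."""
    if width % pitch or (two_d and height % pitch):
        raise ValueError("pitch must divide the panel dimensions evenly")
    if two_d:
        # A 2D lens grid trades resolution on both axes for views on both axes.
        return pitch * pitch, width // pitch, height // pitch
    # A 1D lens sheet only trades horizontal resolution for horizontal views.
    return pitch, width // pitch, height


one_d = lenticular_views(1000, 1000, pitch=10)
two_d = lenticular_views(1000, 1000, pitch=10, two_d=True)

pixels_per_view_1d = one_d[1] * one_d[2]
pixels_per_view_2d = two_d[1] * two_d[2]

print(one_d)   # (10, 100, 1000)
print(two_d)   # (100, 100, 100)
print(pixels_per_view_2d / pixels_per_view_1d)  # 0.1
```

The total pixel count is the same either way; the 2D version just spends it on ten times as many views, so each individual view gets a tenth of the pixels.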

ramboldio (OP) · 3y ago

Facebook's paper on their technology is quite amazing: https://dl.acm.org/doi/10.1145/3450550.3465338

I'm guessing (?) Apple's approach is similar.

cubefox · 3y ago

Please provide a source which says that Apple's solution works for more than one person. I'm pretty sure they didn't say anything about that.

ladberg · 3y ago

https://www.youtube.com/live/GYkq9Rgoj8E?t=6729

CharlesW · 3y ago · 3 in thread

Hey @ramboldio, as one of the authors of the paper, do you have insider knowledge that Apple got the idea from your paper vs. Facebook's "bizarre 'reverse passthrough'"¹ prototype from 2021? Is there a licensing arrangement? (Just curious, it's a really interesting idea in any case!)

¹ https://www.laptopmag.com/news/facebooks-bizarre-reverse-pas...

ramboldio (OP) · 3y ago

No insider knowledge, I just know that the work from Facebook cited our work: https://research.facebook.com/blog/2021/08/display-systems-r...

They also added a display that works from different viewing angles. So it looks like maybe Apple implemented Meta's research. The timeline could work.

For completeness, there is also another paper "FrontFace" proposing a similar idea that was published around the same time: https://dl.acm.org/doi/10.1145/3098279.3098548

ladberg · 3y ago

If you're suggesting that Apple implemented Meta's research starting as it was published on 8/2021, then that timeline absolutely does not work.

CharlesW · 3y ago

Thanks for the info! It must feel great to see this becoming a reality, and I hope it benefits you and your partners professionally.

Demmme · 3y ago · 1 in thread

Is this verified?

I do like the LMU (after all, I'm in Munich), but this thought is more obvious than magic.

ramboldio (OP) · 3y ago

What I can verify:

To the best of my knowledge, this is the first work that proposes putting a photorealistic, perspective-corrected face on a VR headset.

"FrontFace" (https://dl.acm.org/doi/10.1145/3098279.3098548) is the first work that proposes putting eyes on a display on a VR headset to "lower the communication barrier".

DonHopkins · 3y ago · 1 in thread

It should be touch sensitive so it can detect when somebody pokes you in the eye.

ramboldio (OP) · 3y ago

Like this? https://gugenheimer.com/?portfolio=facetouch-enabling-touch-...

TastyLamps · 3y ago

Google did something similar in 2017 (although it's overlaying your WHOLE FACE on the headset, and only when viewed through a camera): https://blog.google/products/google-ar-vr/google-research-an...

deanCommie · 3y ago

I have extremely mixed feelings about the fact that Apple seems to have "solved" AR by faking it through VR, and that means that with some iteration, a battery improvement, and a price drop, this could truly be the next revolutionary device that people can't help but want to use.

And I am scared of what it means for our society when "eye contact" no longer means a direct connection in person, but an indirect one through 2 cameras and 2 screens.

Obviously this is old hat for Facetime, and all remote collaboration. But in-person too?

gfodor · 3y ago

Anyone who thought reprojection was the solve for AR considered this solution since you'd have to find a way to simulate glass. The first consumer grade passthrough VR was the GearVR in 2015 or so, so I don't think the idea was originally conceived this late.

marcell · 3y ago

How does this technology handle multiple bystanders looking at the display from different angles?
