Imagine the possibilities if Google Photos integrated voice assisted editing like AAIELA! Alongside Magic Eraser and other AI tools, editing with audio instruction could revolutionize how we interact with our photos.
The biggest reason we should be adding conversational UI to everything is the harm done by RSI and sedentary keyboard and mouse interfaces. We're crippling entire generations of people by sticking to outdated hardware. The good news is we can break free of this now that we have huge improvements in LLMs and AR hardware. We'll be back to healthy levels of activity in 5 to 10 years. Sorry Keeb builders, it's time to join the stamp collectors and typewriter enthusiasts. We'll be working in the park today.
Before that, I'm certain we'll all be spending a lot more time reviewing work, trying out prototypes and tweaking prompts or specifications than we do typing or talking.
If you've got the technology to enable you to seamlessly transition from working in your home to working while sitting outside at a cafe to working while sitting on a blanket under a tree in the park to working wherever you feel like it then there will be enough brave people that say "fuck what other people think" and just do it so they can enjoy being active and getting fresh air and eventually more and more people will join them. Eventually we'll reach the point where sitting inside at a desk for 8-12 hours will be the weird thing.
----
[0] : https://scifiinterfaces.com/2020/04/29/deckards-photo-inspec...
[1] 'mirror reality' image / TERI[2] : https://www.hackster.io/news/blade-runner-s-image-enhancemen...
[2] : TERI, almost IRL blade runner move image enhancement tool : https://news.ycombinator.com/edit?id=40844595 / https://github.com/iscilab2020/TERI-3DNLOS/tree/TERI
[3] : Gest : https://news.ycombinator.com/edit?id=40844704
[0] : Intel CPU with OCI Chiplet Demoed with 4Tbps of Bandwidth and 100M Reach : https://news.ycombinator.com/item?id=40844616
This is why usually when you're doing this sort of traditional inpainting in automatic1111 you generate several iterations with various mask blurs, whole picture vs only masked section, padding and of course the optimal inpainting checkpoint model to use depends on whether or not the original images is photorealistic versus illustrated, etc.
Check out the Research section for more complex instructions.