undefined | Better HN

0 pointsRandomInteger49y ago0 comments

By non-verbal do you mean like ambient sound? Dogs barking, child yelling, garbage truck garbage trucking? I don't know. If they can do voice, then it might be possible to do ambient sounds of there is a separate nets trained with a library of ambient sounds where it's tuned not to be the same every time the sound plays like how when you have tiled graphics, there are algorithms that remove the unnatural sameness from one tile to the next.

This could have interesting implications for Foley-artists of the 21st century.

How likely would such a tech help lower budget companies who want to implement voice communication within their software, say for video games or similar?

Hmm, now this has me wondering what implications this has for voice acting as well.

EDIT: We can call the ambient sound symbols sent over the wire "Soundmojis" or "amojis" or "audiomojis"

0 comments

2 comments · 1 top-level

dest9y ago· 1 in thread

I was thinking about the voice intonation. For example the sentences "this is really great" or "how do you do? -> I'm fine, thank you" can have opposite meanings depending on the intonation. This explains a lot of the misunderstandings on written forums.

It should be possible to train a neural network to catch those special intonations, but it is IMHO substantially harder than the initial project, with uncertain results.

RandomInteger4OP9y ago

Oh, right. I can't believe I forgot about intonation ... I should really get out and talk to people via voice more ...

j / k navigate · click thread line to collapse