These all sounded to me like people auditioning for a role in film and television or something.
So if I'm asking it a question on a neutral tone it will answer in the same way.
It really sounded like people from central casting. The fact that is was exactly that is hilarious to me.
Your point is well advised!
There were already AI products with exactly this behavior (perhaps sans multi-modality) prior to OpenAI's demo. Some of them were even open source. And these products will cater to the things we care about: local, niche, privacy-respecting, with bespoke domain expertise.
I hope eventually they can pick up language, dialect and accent from youtube content or something like that, because voice casting will never scale to the thousands of variations they need to have to support a global audience. Instead of picking a voice from a handful of choices it should do a q&a where it learns your locale and intonation preferences and generates a tailored voice on the fly.
Still, I am impressed with the quality of the voices they have and with how well they manage dutch. It is just the case that there still is a lot of room for growth.
There is nothing intrinsically neutral about the British accent, it's more a matter of diction from the speaker. I would also not consider the OpenAI accents as neutral American, feels more like "Young Californian adult woman" accent to me.
The UK has more distinct accents than the US. However RP English, which was popular in an American form as “mid Atlantic” last century, is almost impossible for any English speaker to misconstrue thanks to its emphasis on clear diction. British actors have been proving that in the US for over a century. No one needs subtitles to understand Hugh Grant. Standard US accents frequently fail the merry, marry, Mary test and are worse overall.
Even us Americans know this. The classic, original, beloved voice of the NY subway was a British guy faking a “flat” American accent.
If you're interested in her voice
The person in question is transsexual. You're saying that regular men and women aren't neutral but transsexuals are?
> Computers better not have genders imo
This doesn't make any sense. Gender is an intrinsic part of many languages which cannot be ripped out. It's impossible to talk in some languages without picking a way of speaking which is either masculine or feminine.
It's from 4 years ago so I assume the quality could be much better, but it still feels a lot less natural than to talk with a "gendered" voice.
I also don't really understand the problem of having only gendered voices. It's not YOUR voice, GPT-4 isn't speaking in your place, it's another entity that has its own voice.
Do you ask people to speak with a gender neutral voice? If not how is that different?
The why question is easily answered if you see how many negative reactions their choice of voice caused. A gender-neutral voice would have just avoided annoying a certain percentage of the population, including me.
I'm happy if advertising stops hitting the sexy/cliched stereotypes.
Sometimes the Guardian goes a bit far but OpenAI could have avoided this kind of article: https://www.theguardian.com/commentisfree/article/2024/may/1...
(Edit: I guess I was being slightly inflammatory with my first sentence. I think the default voice should be gender-neutral and then let the user choose what makes them happy. I don't think it was clever of OpenAI to use a sexy female as the default voice in their demos as evidenced by us having this discussion)
The goal of a tool is to be used by someone, and if the interface is the voice that the user can interact with it makes sense that it should ultimately be up to the user's preferences how the voice will sound like.
I see the fact that they're aiming for gender-neutral voice as yet another ludicrous attempt to advertise their advocation for inclusiveness which, while I'm in favor, I think has manifestations that go well past benefiting the original intention. Examples: Main over Master branch on git repositories, Latinx, removing "blind playthrough" on Twitch.tv because it indicates ableism, and so on.
I don't mind having some voice selections out of the box, but if they're gonna restrict my options and ability to change them to fit my preferences then I do mind. Our primal brain (lizard/monkey, or whatever tag you feel like assigning) will always perceive voice interaction as "talking to someone else", so why not just let the user choose who they talk to? It's a tool, for the user's needs. There's no need with appropriate ascribing of a gender to a tool, because it's not a human or anything living.
ChatGPT, a system designed to deal with human language and answer in a way similar to a human, will surely be anthropomorphized, and many people will want it to have male or female voices.
Gender neutral should definitely exist as an option, it makes a lot of sense, but I don't see why it should be the only option.
They would have annoyed less people if the accents weren't so American and quirky. Google Maps has had a female voice since like forever and I don't remember any outrage.
To be honest, this HN thread was the first time I saw someone complaining about male and female voices in ChatGPT.
>A gender-neutral voice would have just avoided annoying a certain percentage of the population, including me.
The voice actors and actresses hired by OpenAI use their natural voices for training, I don't understand how that could be annoying to anyone, is the problem that they didn't hire a trans person (I imagine they have a more neutral voice)?
For whatever historical reason, the majority of Indo-European languages assign genders to nouns, so apparently it does make some sense.
Also, I'm offended on behalf of all Cylons.
I don't know if you speak a language with genders, I assume you don't. A computer personality needs a gender in some languages in order to talk naturally about itself.
For example, if it needs to answer a question with "I'm not sure." In Romanian it would say "Nu-s sigur." for a male personality or "Nu-s sigură." for female. There is no other option.
> it’s 2024
You know a position is bad when it's justified by "it's current year".
How do you manage to be annoyed when they offer both male and female voices?
I think you're the problem here buddy.
I'm not opposed to computers using emotion at all, mind you. But I don't like that arguably the AI company furthest ahead is choosing this gaudy hollywood approach to marketing.
I personally hated the female voice, it wasn’t pleasant to listen to. I found it quite grating, making the demo hard to watch.
I doubt the net result is support. Expanding creative community - yes. But support - not quite; established creators will likely suffer because of openai, most of them already having a harder time.
I love how they have to go out of their way to explain that, rather than train a model on Scarlett Johansson's voice, they came about her voice honestly, by just finding someone who's natural voice sounded the most like Scarlett Johansson... as, clearly, no one is questioning that this voice was chosen to mimic Scarlett Johansson.
(Note that if you select that voice today, you might not actually be using it anymore. I noticed earlier today I was shunted off to Juniper, which I didn't like as much.)
> 400 people did some unpaid work for us, only 5 of which eventually got paid, after 5 months of not knowing will they or won't they.
Fair enough, that's how things work for other voice acting jobs as well, I just doubt it usually takes almost half a year. I wonder how many of the shortlisted 14 have just given up somewhere along the way.