story

Mycroft – open source voice assistant (opens in new tab)

mycroft.ai

286 pointskitebive3y ago140 comments

140 comments

I backed on IndieGoGo (in late 2018 I think), then helped crowdfund on StartEngine. I want so badly to speak well of these guys, and since I invested I want them to succeed, but it's been getting harder to do so.

They deserve credit for being reasonably transparent about their processes though. Their blog has been very interesting to follow. They had a ton of issues trying to work out their original design, then the pandemic hit. So in a way, timing has been bad. In the meantime they spent a lot of time on the software side, then on the fundraising side, then deck shuffling at the top, and now they're finally shipping something that looks nothing like what the Kickstarter showed. Instead of being close to the $200 price point they wanted to be at, it's currently $349, then will go to $500. They're at a point where if they don't get sales at those prices, they're not going to be able to deliver to backers; at least that's how they framed it when they discussed rollout.

They are actually shipping, though! Which didn't look like a guarantee for awhile. However, the reviews have been a bit lukewarm; [this][1] being an example. They had a way to make skills and they apparently changed it in order to better accommodate their final product, so it seems like there's been a bit of fracturing of the ecosystem as a result.

In conclusion: I'd love for you to support them! I still believe this segment needs a player like Mycroft, Mycroft can be that player, and I really want to not have my investments go to waste. But I 100% would not blame you if you looked at the company's arc and said "no thanks".

[1]: https://old.reddit.com/r/Mycroftai/comments/yitzzk/mycroft_m...

GekkePrutser3y ago

Those prices are really insane tbh. No way it will take off like that, it's just a non-starter. 200 was already the upper limit of what's doable.

I think they're stretching too much to satisfy the original backers and it's commendable not giving up on them but if it's going to be similar to Alexa, Siri or Google it's just not good enough.

If it were an actual assistant I could talk to, then yes. It would be worth it.

Imagine this.. I'm doing my laundry and mycroft pipes up.

"hey Alice is looking for you on telegram"

"Tell her I'll get back to her after I finish the laundry"

"Ok!" ... "She says it's urgent, are you sure"?

"Ok call her please on speaker"

Or another scenario.

"hey mycroft I'm going out to the zoo"

"Ok make sure you bring an umbrella because it's going to rain in 2 hours "

Stuff like this. Right now assistants have zero short term memory, don't remember any of my preferences and can only understand one thing at a time. They're also not proactive at all. They don't know my life and habits and don't warn me when things are happening that I should know about. Yet most of those things are easily identified from notifications on my phone! It's not a stretch to expect this IMO. It's all thing I'd expect from a real assistant. All this low-hanging fruit turn on the bedroom light stuff is not worth money.

It's just that there's not much sales to link to it other than the service price (which I'd definitely pay for!!).

I don't think these scenarios are too far-fetched with the current state of AI tbh.

Ps blaming the pandemic is a bit rich. Their project was already down the drain for years before that. First they canned the original plan and then they had this DIY raspberry addon board and that was in huge trouble well before the pandemic. I'm sure it made matters worse but if they'd managed it properly it would all have been fulfilled years before corona meant anything other than beer.

ElijahLynn3y ago

I'm not sure that price is going to be a blocker. The audience may not think of price as the main thing here, they may think of openness and non-vendor lockin and privacy and security as pretty amazing things for that price.

I myself am getting tired AF of Google Assistant and their devices. So so tired of saying "Hey Google". And I have a Lenovo device with Google on it that was decent, the Google pushed an update to it and all video calls have an echo on them now making it entirely unusable. There are multiple threads on the internet about it too and Lenovo support says "contact Google" and Google says nothing on those threads, zilch. I don't trust Google devices anymore, they get abandoned routinely. I wish they didn't but I feel more empowered with an open device than I do with closed source, no public response Google stuff.

I'm definitely curious about trying a Mycroft now, and I think there may be others too. It may not be the masses but it might be enough to keep the project thriving.

lannisterstark3y ago

>I'm not sure that price is going to be a blocker.

Eh, idk if I want to spend $350 on what is arguably a half baked, subpar Google-Assistnat/Alexa.

aartav3y ago

Kudos to them for shipping a product.

I looked into them a few weeks ago because I am tired of the "Did you know.." and "Can I add that to your cart?" from Alexa. When I saw the device I was really disappointed. I can just never see getting one - it doesn't need a screen (cause it should be hidden) and why did they make it look like a 1960s sci-fi movie thing? IMHO it looks terrible.

Brendinooo3y ago

The original design was great, but it just didn't work. They had too many issues trying to source hardware and decided to pivot to more off-the-shelf components.

That's the short version; here are some highlighted blog posts that document the trials and tribulations:

https://mycroft.ai/blog/mark-ii-update-delivery-timeline-and...

https://mycroft.ai/blog/mark-ii-update-january-2019-current-...

https://mycroft.ai/blog/mark-ii-architecture-change/

https://mycroft.ai/blog/mark-ii-update-revised-architecture/

https://mycroft.ai/blog/real-companies-ship-product/ (here's the pivot to Raspberry Pi)

https://mycroft.ai/blog/mark-ii-update-january/

https://mycroft.ai/blog/mycroft-mark-ii-july-2020/ (here's where they decide that the thing has to be a box instead of a cylinder)

https://mycroft.ai/blog/mark-ii-update-october-2020/

https://mycroft.ai/blog/redesigning-the-mark-ii-part-1/ (here's the final design)

For what it's worth I'm intrigued by a screen. Theoretically it shouldn't be needed but if they want to have a reference device that can be used in multiple industries, it's better to have it than to not have it.

But that series of posts is why I'm still cheering for Mycroft despite everything: Clearly this stuff is hard, they've been out there trying hard, taking lumps, fighting off patent trolls, putting in the work. If they don't succeed, I'm not sure who else will pick up the reins and do any better.

nshm3y ago

There is also a strange story of speech developer leaving them a week ago https://community.rhasspy.org/t/rhasspy-is-joining-nabu-casa...

SEJeff3y ago

That's great news for Home Assistant however!

rexreed3y ago

The voice assistant space is dying: https://arstechnica.com/gadgets/2022/11/amazon-alexa-is-a-co...

Some of the comments below are part of the explanation why. It doesn't work as well as people were hoping, and it's a solution in search of a problem with limited application and it seems little monetization. The above article sums it up better from the big tech company's perspective.

rolenthedeep3y ago

Voice assistants which are trying to force engagement to squeeze money out of you are dying.

Most people only use the voice assistants for a few simple tasks, which is perfect for an open source project like mycroft. It is, however, very, very bad for Amazon and Google, because those tasks don't make them money. That's why they're all going so aggressive on "you asked for the time, but by the way here's a 5 minute speech on all the easily monetizable tasks I can do instead"

People like the idea of voice assistants, but by and large they don't like all the problems associated with a voice assistant run by Amazon, Google, and Microsoft.

horsawlarway3y ago

Yup - I think this is the truth. I'm willing to spend several hundred dollars right this second for a simple voice assistant for things like weather, time, timers, unit conversion, alarms, and home assistant control (mainly lights).

I've actually pre-ordered the Mycroft Mark 2, although no chance to evaluate it yet.

I'm very interested in devices that can do this locally.

I'm not interested in Alexa/Google home anymore AT ALL - I've gone that route, they both work, but they want my dollars all the time, and it's become increasingly clear that if they can't get me making purchases through those devices - they will kill them off, or become ever more scummy in the attempt (Alexa is now including ads in the "did you know" section - "did you know" was already a fucking terrible decision to include, since it's going to marginally increase interaction at the expense of huge user dissatisfaction. But putting ads there has made me leave.)

So basically - I think if anything, we're seeing a speed run of 90s/2000s tech company boom/bust. A huge amount of money poured into the space with no real idea of how to sustainably profit, but the space itself doesn't feel like it's going anywhere.

It's really, really compelling to allow voice control in all sorts of interactions - but it needs to be very clearly working with me, and not trying to subvert my intent for profit. That might even mean it needs to fall back to something like "if this command, then that action" style usage. No more changing commands, no more bullshit ads, no more subversion of what I'm asking it to do.

It needs to obey me, not google or amazon. Otherwise it's a sales rep and not a digital assistant.

cupofpython3y ago

>they both work,

I got an echo (alexa) for free and use it for home assistant. It only works when I have an internet connection. So when my internet is out, I cannot turn my lights on/off with it. I understand why, but i too would REALLY like to just have all functionality dependencies for home automation to be local.

1 more reply

kuberlog3y ago

I worked on a project in 2016 that told me all I needed to know about the space. It was an online voice assistant and I couldn't find myself wanting to interact with it. Even though I spent a lot of time on the project, I scrapped it, because it was just lame. It looked kind of cool, but was lame.

I personally don't think there is enough cybernetica to control with voice. At some point there may be, but right now, the internet is just one giant consumption stream with a few searches and purchases now and then.

That digital daemon experience taught me that I care more about physical intelligence than verbal intelligence when it comes to my technology. I'm verbally intelligent myself, I don't need an AI who can't even speak correctly, let alone understand me, be my verbal interface to the world.

1 more reply

shagie3y ago

> Most people only use the voice assistants for a few simple tasks, which is perfect for an open source project like mycroft.

I'll certainly grant that... but the price point where Mycroft is, is certainly not near what I'd pay for doing those few simple tasks.

Apple is at the upper end of what I'd spend for such a device (the HomePod mini is $99) - and that's because I'm fairly invested into the Apple ecosystem and thus it can make use of the iTunes library, home automation, calendar items, etc...

If I wasn't invested in Apple, then none of the home assistants other than Amazon (because of the price point for the echo) would be particularly interesting.

I've got a echo show - because its a very nice simple clock/weather interface (that's got Alexa behind it) too (I really liked the Ambient 7 day weather clock when it was available). I've got an echo wall clock that is paired with the echo in the kitchen - it makes timers nicely visible (a sibling of mine has an echo wall clock because its an analog dial that doesn't have any sound with it).

The problems with Alexa of suggesting by the way ("Alexa, stop by the way" - give it a try and yes, it is routineable) are tolerable for how much I'm paying for them and the functionality that I use it for.

dublin3y ago

The Ars article on Alexa's financial crash-and-burn inside Amazon missed a lot of the reason people aren't willing to engage with Alexa as much as they could or would, if things were different. First, the privacy aspects are significant. Secondly, the value proposition is just not there - worse, Amazon has deliberately broken one of the most useful things you could do with Echo products: using them for distributed networked audio, a la Sonos: The new generation Echo Show products ELIMINATED the audio output jack, so you can't even plug the output into a stereo or speaker now!

On top of that, the Echo products are just not well built, not well thought out, and have NOT been upgraded to make them better: They update, but with NO visible benefit to the owner. One example: The Echo Show 8 Cannot and will not keep its display off all night, even if you explicitly command "display off" before going to bed (yes, it does understand and temporarily obey this command!) But sometime during the night, something will wake it up, and the damn thing turns into a lighthouse in your bedroom, waking one of us up.

I'd really like to find Alexa more useful, but like most folks I know with one, it's mostly just useful as a glorified voice-controlled radio - I'd use it more to control lights and such, if I could get the damn thing to actually realize waht lights are in what rooms, and that dimmer switches and smart lights can indeed share a location that should be controlled together. (Yes, this is supposed to work, but it doesn't...)

I would pay $500 to outfit the house with a central voice recognition processor that would be capable of supporting a dozen or so very secure listeners on the local LAN. Mycroft isn't that solution.

vineyardmike3y ago

> The problems with Alexa of suggesting by the way ("Alexa, stop by the way" - give it a try and yes, it is routineable) are tolerable for how much I'm paying for them and the functionality that I use it for.

So cold comfort since it’s annoying as hell, but it slowly learns you don’t like it and will back off its frequency. Amazon unsurprisingly tracks “dissatisfaction” responses and adapts rate of things (globally and individually) so you do actually have to cuss out Alexa to change it. It’s slow because obviously it’s profitable but it does happen.

smoldesu3y ago

> That's why they're all going so aggressive on "you asked for the time, but by the way here's a 5 minute speech on all the easily monetizable tasks I can do instead"

This is a word-for-word description of how Siri originally functioned. "You asked for the top 5 romantic resturaunts nearby; here are the top results from Google Search:"

zamadatix3y ago

GP isn't talking about bad fallback answers where it punts you to a search page more like when you ask "Hey Alexa, what is the time" and it says "The time is 5:45 PM. By the way did you know you can buy ribbons for the holidays on Amazon by saying..." i.e. things that are blatantly unrelated to answering your question and often trying to sell you something.

1 more reply

jm43y ago

I don't think the entire space is dying. Amazon is having problems because Alexa has little benefit to them outside of direct monetization. They wanted people to use Alexa to buy things and no one wants to shop like that. So they have these devices and all this infrastructure to run people's kitchen timers, lights and play music. People will buy Alexa devices on Prime Day, use them dozens of times a day for years, and never make a dime for Amazon.

Apple isn't necessarily in the same boat. Siri isn't particularly good, but it does all those things well. Most importantly, it keeps people on iPhones and in the Apple ecosystem, which does make money.

You already have the phone and probably have a device that works with HomeKit so why not try it out. Next, you buy some new lights. Before you know it, you're controlling most of the lights in your house, streaming Apple Music and setting kitchen timers from your Apple Watch. Next time you need a new phone, you're not even going to think about anything else because if you change you won't be able to turn on your lights anymore.

Apple has a plan that works and Amazon doesn't.

smoldesu3y ago

This makes no sense to me. Apple's plan works because... their lock-in is better? I can "control most of the lights in my house, stream Apple music and set kitchen timers" with Alexa, Google Assistant, Cortana and even Bixby. What is Apple's actual advantage here? How is Apple making money from this when Amazon does not?

Tagbert3y ago

Apple makes money from the devices it sells. Siri makes those devices more convenient to use.

Amazon is selling the devices at or below cost and hoping that Alexa will make the money (which it doesn’t)

1 more reply

shagie3y ago

Apples devices are smarter. This makes them cost more for the "same" hardware, but costs less for the computation.

Apple isn't trying to make money with Siri. It's using Siri to make its ecosystem of Apple Music and similar more valuable to its customers.

The limits that Apple puts on what it can do makes that cloud side computation less expensive.

---

Consider that bit - less expensive. Apple doesn't run its own cloud in the way that Google, Amazon, or Microsoft do. So what does Alexa cost? It costs for AWS cloud time. That's the expense that it's running. Those skills that people use run on AWS compute time rather than a phone's local cpu and battery.

A google search "costs" about 1 KJ of energy. Alexa has similar costs somewhere just for energy and other costs for the maintenance of the additional software and content. It costs something to maintain that joke database.

1 more reply

twobitshifter3y ago

I think the point is that apple produces software that doesn't directly make them money as part of their business model. Pages, Numbers, iMovie, Maps are all given away for “free” with devices. Siri is just another example of that. As you say, being able to talk to your phone is table stakes, not an advantage. But having that offering keeps sales of Apple hardware moving.

theptip3y ago

Seems odd right? Just charge more than break-even for Alexa and you have a business?

jm43y ago

Not necessarily because the cost increases through increased use by the customer. The idea was to sell hardware at close to cost and then monetize the customer. The likely assumption was that Alexa users would buy more similar to how Prime customers buy more. Ideally, Alexa customers were supposed to use services like "subscribe and save" and then randomly tell Alexa to order more toilet paper or laundry detergent. Amazon wanted all the household stuff on recurring subscriptions. It would have been great. Increased revenue, better ability to bundle shipments together to cut costs, customers who don't even look at prices anymore. Instead, they sell a device for which they incur an operating cost while producing little to zero increased revenue and subscriptions.

1 more reply

ncallaway3y ago

Charge more than break even for Alexa and they won’t sell enough.

vineyardmike3y ago

> Apple isn't necessarily in the same boat.

Also importantly, Siri mostly runs off your iPhones processor, and apple doesn’t have to pay a big cloud bill for it, unlike Alexa.

cupofpython3y ago

Apple and the garden of eden. Dont take a bite unless you're going all in

Brendinooo3y ago

The "voice assistant space" also includes Siri, Google Assistant, and Cortana, so it's not going anywhere.

I'd contend that it's absolutely not a solution in search of a problem; it's much more of an unsolved problem, and a big part of the "why" is

- voice recognition/assistance tech still maturing

- major players are insisting that the tech supports their walled gardens

- price points are still a problem

The last two creates a conundrum: a lot of times tech prices come down by selling expensive stuff to rich people until the hardware becomes commoditized. But for a good voice assistant, you need a lot of up-front investment at scale. Unfortunately, the companies that are able to do this are also controlling the hardware that can use it, which limits its ability to spread and be useful.

This is why I think Mycroft is important to support:

1. If you can make voice assistant software open-source and plug-and-play, then it frees people up to tinker with form factors

2. Part of Mycroft's pitch to businesses is that they can make custom solutions. There are probably a thousand big businesses that might want to get into this space but don't want to rely on Amazon because they want to control the experience and not give up their data. Maybe Target wants to stick virtual assistants around their stores, or maybe a hospital wants to give tools for surgeons.

I also think there's an opportunity for voice control in home stereo, where someone decouples the speakers from everything else. It's still annoying to work with Bluetooth in 2022, and Sonos is still pricey, and another walled garden. I'd love to have a simple controller that connects a dumb speaker to Wi-Fi and lets me voice-control it to play music from a library of my choosing. That's not a thing yet, right?

liotier3y ago

> custom solutions

Especially interesting as voice recognition is much easier, cheaper and more efficient within the limited space of a specific usage.

tjohns3y ago

> 1. If you can make voice assistant software open-source and plug-and-play, then it frees people up to tinker with form factors

For what it's worth, Google Assistant does have an open API to create new devices. It's not open source, but you can certainly experiment with your own custom form factors. There's even a tutorial:

https://medium.com/google-cloud/how-to-build-your-own-smart-...

ghaff3y ago

>- price points are still a problem

Really? An Echo Dot is $25 on Amazon right now. Which, if you use it at all, is pretty reasonable. (To be sure, if I were using it for music to any degree, I'd probably get a model with better speakers.)

For music, I have an old phone connected to a stereo receiver. So it has voice control although I mostly pick a playlist or album manually.

Brendinooo3y ago

>An Echo Dot is $25

Yup, price point is solved for the "just timers and some music" people, but now you're stuck in Amazon's orbit.

1 more reply

horsawlarway3y ago

It's not dying at all - it's an incredibly useful interaction style.

Those companies are failing to profit because they don't understand that a digital assistant needs to be working with me, locally, and not subverting my intent.

It just needs to be my device, and not a sales rep for google/amazon. I use voice controls all the time at home - it's astoundingly useful in all sorts of situations, and I'm not even disabled (where it's literally life changing in some cases).

voakbasda3y ago

A truly open platform stands a chance in the voice assistant space, as it could be adapted into forms that are useful beyond their current limited designs. Such useful forms probably are not as monetizeable as the current incarnations that invasively collect information about you and your family, so I very much doubt the big tech players will ever attempt to build these useful systems directly.

Unfortunately, Mycroft is not very open itself. Sure, most of the code is open and available, but I tried to contribute and found my PRs ignored for weeks. When they were finally ready to merge them, their poor response cause me to lose interest in the project. At that time, they did not seem interested in cultivating a strong developer community around their core technology components; they were doing their thing, and they wanted the community to implement “skills”. I got the impression that community could either get on board or stand aside and watch them work. For that reason alone, I feel fairly certain that this project will fail eventually as well, and their hardware will become yet another high-tech relic of a paperweight.

As a formerly enthusiastic kickstarter backer, I cannot recommend the Mycroft project as the basis for a product; you don’t own and can’t control the platform on any meaningful way (short of forking it). It might be a better choice than a closed platform, but not enough to make me want to put any money in it.

gigatree3y ago

Why do you need to control it yourself in order for it to be valuable as an open-source project? Maybe the team has a specific vision for the product, and reading through PRs from random people online takes away from their limited resources.

thomastjeffery3y ago

...but you can fork it. Is the build process really too painful for that to be enough?

cloudking3y ago

We use Google home assistant devices throughout the house, and find them quite useful. Use cases:

- controlling smart devices (thermostats, TVs, speakers)

- broadcasting messages

- reminders / tasks

- asking questions

However, none of these use cases generate any revenue for Google afaik.

VikingCoder3y ago

Ours is a voice-driven music player for our kids. They love it. We have a YouTube Premium subscription just because of the Nest Minis we have in every room. Sometimes we ask it "What's the animal of the day?" or "Tell a story" or "What year was Abraham Lincoln born?" but mostly the Nest Hub Maxes we have are just photo slideshows, which we love, and sometimes we ask "What's the weather?"

That set of functionality alone makes them well worth the money for us.

veidr3y ago

With all due respect, I find that thesis absurd.

Maybe you meant it like, "the voice assistant space isn't going to generate huge profits, and thus giant corporations will lose interest".

But even that is absurd. They will still have to do it as a loss leader. Maybe not Amazon — because they just ship us our toilet paper and protein bars and shit. They don't have an "ecosystem" (although they gave it a halfhearted try a few times).

But the chance that in 2032 people just like... don't have voice assistants? It's literally zero, barring an actual WWIII cataclysm reversion-to-barbarism event.

> doesn't work as well as people were hoping

Nothing does, until it does...

> little monetization

Yep, that might be right. But it doesn't necessarily mean the space is "dying". Just that it might not be amenable to oligopolization.

voakbasda3y ago

I will never have a voice assistant unless a completely open and self-hosted solution appears on the market. And with current patent landscape, that seems incredibly unlikely to happen before 2032.

veidr3y ago

OK, I can't keep reading this website any more tonight, but for fuck's sake you do realize that the submission you are commenting on is a completely open and self-hosted solution that is on the market, right?

C2H4O23y ago

Maybe one of the reasons behind this is that people use voice assistants, and search engines too in general, to look for information. Today, all that these products do is suggest instead of catering results. I believe this is one of the reasons people, or at least I, do not wish to use assistants. It feels that a computer is controlling my likes, dislikes and wishes while it should actually be me who controls computers.

aartav3y ago

Its certainly not a solution looking for a problem. Its a great way to deal with a number of minor daily tasks. Checking the time, setting timers, checking the weather/AQI, playing music, checking news headlines, etc.

Theres a lot of things its really not good for and people have tried them all I'm sure. But where it fails the hardest is being able to increase sales volume for Amazon or increase ad revenue for Google - the only path to monetization seems to be to force it in - and THAT is what is dying.

And who the heck wants it in a ring or glasses??

Zuiii3y ago

I always wanted a voice assistant but there's no way I'm having big tech listen in on me and my family 24/7 just to have one. Most non-tech family members I talked with about this share my opinion. THAT is why these assistants are failing.

On the other hand, Mycrodt sounds like something people would actually want to use provided that it can operate locally and doesn't send any data outside the home.

melling3y ago

It works a lot better than nothing. I use Siri and Alexa every day. If Alexa goes away, I’ll use Siri more, or find another. Siri was a little slow to catch up.

I think the story that you read simply says that it’s hard to monetize. You are inferring more than what the story says.

Voice assistants are here to stay.

I eagerly await the day when I can simply say respond to this post then begin writing with my voice.

genewitch3y ago

"Okay, navigating to the nearest post office"

giancarlostoro3y ago

I want a voice assistant that passes my AI turing test if you will. I want it open sourced like Mycroft too though. I don't care for having 17 speakers that start talking when they think I was talking to them. I wasn't.

bluGill3y ago

My needs for a smart speaker are not really passing a turing test. I want them to be automations. They need to get some things that an AI would do right, but there is a large step between an assistant that can do specific things and an AI that can talk about anything.

veidr3y ago

Would love to read about experiences actually using this (I mean Mycroft in general) — good, bad, or otherwise.

Also, though: why don't we have "text assistants"? Seems to me the process of deciphering spoken text is (or should be) entirely orthogonal to performing the actual task — changing the lighting, cranking up the AC/heat, arming the security perimeter, or whatever.

I think the reason is that voice recognition is hard and so far only the "BIGASS TECH!!!" corporations have been able to make it "mom or granny ready" — and they have no incentive to do that for free and let us make our own mash ups. They want to wall us into their ecosystems.

So from that standpoint, this looks pretty cool to me — even if the voice recognition isn't as good as the big three.

OTOH, to rebut my own point: I got the new Apple Watch Ultra and I noticed that I can map the side button to a "shortcut" (the Apple term for a script you create yourself to automate something) that just transcribes whatever I say, and sends it as text over SSH to any host I want. On my local LAN, the delivery time is well under 1000ms.

So that's getting pretty close to being able to use Siri as a generic voice recognizer, and then piping the input into whatever arbitrary/homebrew system I want.

To do it purely with voice though you have to be like "Hey Siri, do the funky chicken" (after naming the shortcut "do the funky chicken"). And then say the actual command phrase you want your home automation to do.

criddell3y ago

> why don't we have "text assistants"

I used to use a text-based assistant service called I want Sandy and it was great. Then Twitter bought the company and they went away.

http://boingboing.net/2007/11/14/i-want-sandy-perfect.html

toqy3y ago

> why don't we have "text assistants"

Siri has this as an accessibility feature since iOS 11, but might not be exactly what you're looking for

veidr3y ago

That is right, and I did try it, but they made it so that if you enable that then voice input no longer works. (T_T)

rickoooooo3y ago

I played with Mycroft about two years ago. I had been using a couple Google home minis for a while for the usual things (play spotify, set timers, ask the weather, control lights around the hose). They worked perfectly for that. At the time I decided to de-Google my life and take back my privacy so I went looking for something open source that would provide me more control of my data. I found Mycroft and played with it for a few months.

I was pretty excited about it. I bought a ReSpeaker 2.0, which is an embedded device that can run Linux and has a six microphone array. I designed a custom 3d-printed case to hold the ReSpeaker and a small speaker to make my own little "Jarvis" box (Iron-man reference).

My favorite part about the whole thing was the customization. I wrote a couple of skills to do some other things for me. For example, I could say "Where can I watch X?" and it would use an API to search for a TV show or movie to see where it was available on Netflix, Amazon Prime, Disney+, etc and let me know. It's always been annoying to go Google and try to figure out where I can watch something streaming online, but limited to only the services I currently subscribe to. I wrote another skill that tied into my couchpotato instance so I could say "Download the movie X" and it would go find it and download it. If it found multiple matches, it would read off the top few matches and let me choose the correct one. I even tied those skills together so if the first skill couldn't find a movie at one of my streaming services it would ask if I wanted to download it and I could simply say "yes". I also modified the code to use a custom text to speech API so I could configure Mycroft to use a custom voice.

It was all really cool and I had a lot of fun playing with it. The biggest problem I ran into was the wake word recognition. It worked mostly OK for me on the ReSpeaker from close range but I found as I moved away it went downhill. It was especially bad if I had my device playing music, which is possibly the most common thing I was using my Google Home mini for. I had hoped that the ReSpeaker would help with this, because it had the six microphone array and some built-in loopback hardware to try and cancel out any noise that that was being generated by the ReSpeaker. So any sound output to the speakers would be looped back into the ReSpeaker and could be subtracted from the microphone's input. I found that I just couldn't get it to work well, though. I think the music was causing vibrations that were overloading the microphone array and causing it to be unable to hear me through the music. It's possible it could be improved with a better hardware design to help reduce vibration caused by the device's own speaker. Maybe it works better now, two years later. I think I had configured Mycroft to use Snowboy for wake-word recognition so I could name my Mycroft something else (Jarvis).

One day the Mycroft installation just stopped working on my device after I hadn't touched it in a week or more and I never went back to figure out what was wrong. It's still sitting on the corner of my desk unplugged. If I could have got the wake-word recognition working reliably with music playing I think I would have used it a lot, but I wasn't able to at the time.

I just recently bought a smart watch with a built in "Alexa" app that allows you to send voice commands to your phone which get processed through the watch's official app. I'm instead using Gadgetbridge on Android to interface to the watch. Some kind hacker updated Gadgetbridge to add very basic support for my watch's microphone, allowing you to send the raw voice data to an external application. I'm hoping I'll be able to use this to revive my Mycroft instance and I'll just send voice commands to Mycroft from my watch/phone via a custom Android app/service. In theory, I'll be wearing the watch all the time anyway and having the microphone on my person and right next to my face should hopefully help with the speech-to-text and I won't have to worry about a wake word at all. I've only just barely started working on this, though.

adam10283y ago

I gave up on mycroft after a long wait and built my own with respeaker and picovoice. i have 2 of them with different wake words. imo it's way better and easier than snowboy. i dont understand why people give their data to amazon to set a timer :)

check their free stuff: https://picovoice.ai/pricing/

rickoooooo3y ago

You are using picovoice as the assistant? Is it en entire solution for that? Or are you running a DIY Mycroft device with picovoice as the wake word detector? I'll have to check this out but I've been trying to stick with open source technologies where I can. I don't trust that a free tier will remain free forever, but it may be worth testing out.

1 more reply

moffkalast3y ago

> why don't we have "text assistants"

We do, it's called typing things into Google.

HankB993y ago

Will that initiate actions like the voice thing does? I thought it just returned search results.

rolenthedeep3y ago

Google assistant on your phone can accept text input. If you're on a relatively recent version of Android you should be able to long-press the home button, then tap the keyboard icon in the popup. Works the same as a voice prompt

moffkalast3y ago

A lot of assistant functionality is just getting data from the internet, which search engines already know how to present and format in a useful way.

If you need to go to a specific spot in the house to write some text that turns on a light it seems easier to just walk to an actual light switch? For general automation then I think there are some visual block-based configurators to set up triggers for smart appliances otherwise.

1 more reply

EntropyIsAHoax3y ago

This is actually how Mycroft handles it, more or less.

The wakeword ("hey Mycroft") is done on-device, but everything you say after that is sent to a speech-to-text API. That text is then routed to the appropriate skill to handle. So when you're writing the skill you only worry about the content of that text

https://mycroft-ai.gitbook.io/docs/mycroft-technologies/over...

voakbasda3y ago

The recent comments on the Mycroft kickstarter [0], which was funded four years ago, indicate that the company is shipping preorders. However, only 10% of the units are going to their backers. Instead they are selling the units to new customers. If you are backer 2000, they might not fulfill your order for years to come, based on the production rate quoted there by their new CEO.

This is not a viable way to treat your original, most ebthusiatic customers. They will go on forums like HN and bitterly complain, warning other potential customers not to invest in a company that clearly does not respect its users.

[0] https://www.kickstarter.com/projects/aiforeveryone/mycroft-m...

wolczek3y ago

They use Google cloud speech-to-text API, one of the key technology. https://mycroft-ai.gitbook.io/docs/mycroft-technologies/over...

¯\_(ツ)_/¯

arbol3y ago

The pi isn't really fast enough to process the speech in real time. deepspeech by mozilla was cited as an offline alternative to the Google speech API but it's difficult to set up with Mycroft and doesn't work very well (lack of data and lag - https://mycroft.ai/voice-mycroft-ai/). Because of this, Mozilla set up Common Voice (https://commonvoice.mozilla.org/en) to help build open datasets of voice recordings.

shagie3y ago

> The pi isn't really fast enough to process the speech in real time.

If you've got an iPhone... put it in to airplane mode so that it is local only. You'll note that Siri no longer works when you do this. However... open up the notes app and tap the microphone. Do some interesting text...

> Mister Smith said that he wanted a two by four and half of a pie.

(if you don't have an iDevice, it transcribes this as:

> Mr. Smith said he wanted a 2 x 4 and 1/2 of a pie

That is without a network and done in real time. We can compare the relative processing capabilities of an iPhone and the RPi, but offline speech to text is feasible on a device of limited capabilities.

Additionally, you can do a limited vocabulary speech to text on chip ( https://www.imagesco.com/articles/hm2007/SpeechRecognitionTu... - https://www.amazon.com/HM2007-Speech-Recognition-Integrated-... ). This can handle the specific incantation common tasks (think closer to how a car voice control works - say exactly these words in this order), but that can help with performance for things that are often done.

arbol3y ago

Yeah, but this is the closed source Apple implementation of speech to text versus Mozilla's abandoned deepspeech. I'm sure its possible to get it working well on a pi but I don't have the time to create and maintain a personalised speech training set and then optimise the resultant models.

one-another-dev3y ago

> If you've got an iPhone... put it in to airplane mode so that it is local only. You'll note that Siri no longer works when you do this

This is not true anymore. Latest iPhone models have offline Siri working to some extent

1 more reply

nshm3y ago

DeepSpeeech is very old software. Vosk works just fine https://github.com/alphacep/vosk-api. People even run tiny Whisper on Pi, though they have to wait ages.

arbol3y ago

Thanks, I might try to get this working with picroft

xrd3y ago

I'm comfortable with their approach to anonymizing the interaction, and assume they will find a way to remove that dependency.

walterbell3y ago

Is there a good, affordable, open-ish dev platform with an array of far-field microphones? If not, would it be possible to teardown an Amazon Echo Dot and attach their microphone array to an ODROID Arm SBC? Might depend on whether the Alexa microphone array is using a dedicated audio processor chip for echo cancellation.

A web search finds this 4-mike array for $63, https://www.robotshop.com/en/seeedstudio-respeaker-mic-array...

Google had an audio dev kits for schools based on their TPU and RPi, https://aiyprojects.withgoogle.com/voice/

mtlmtlmtlmtl3y ago

Why is this so damn expensive? I have plenty of good hardware available to run the compute for this thing. Just make a daemon/app architecture where I can use my phone as a microphone and run daemons on whatever hardware I need to control.

I just don't see this being worth the money. Hundreds of $ to make switching music slightly more convenient just seems like a colossal waste of money to me.

Firmwarrior3y ago

I think we're spoiled by Google and Amazon losing so much money for so long

For example, I noticed a couple years ago at the store that a regular featureless analog wall clock was more expensive than an Echo Dot

These guys need to pay for their software dev out of hardware sales and can't hope for a runaway success yet, so of course it'll cost a painfully lot more

systemicdanna3y ago

Agreed. I don’t need a screen or a loud speaker on a voice assistant box. Just make it small and cheap, as promised.

theCrowing3y ago

I use Mycroft for around 2-3 years on a raspberry pi with an microphone array and the quality is still not nowhere near the level were I would give it to my mom or granny.

nshm3y ago

Try Sepia https://sepia-framework.github.io, it is an open source assistant, works very good.

EntropyIsAHoax3y ago

Depends on what it's used for imo. I've got the same setup and--apart from some sexism in the voice recognition--the basics work flawlessly. Once it's up and running it's trivial to set timers, check the weather, do basic unit conversions, etc...

Maybe once a month it freezes and it just needs to be restarted, anyone can do that who can get used to talking to a robot in the first place.

The more complicated stuff, I agree is not fit for people who aren't comfortable with a terminal. I also use it to control my lights and play spotify, but I'm only able to do that because I'm comfortable messing around with it and have the skills (and desire) to debug it when it breaks every other week.

It's nowhere near as polished as Alexa or similar, but it's good for the basics or for hobbyists who don't want to be spied on.

veidr3y ago

> sexism in the voice recognition

What does this mean? Mycroft prefers taking orders from dudes?

EntropyIsAHoax3y ago

Yes. It reliably responds to the wakeword ("hey Mycroft") from men, and only responds about 50% of the time to women.

When my sister visits, it almost never responded to her for example.

And in my personal experience, I used to have a very deep voice and it always responded to me. I decided I would rather have a more androgynous voice and did some voice training to accomplish that and use a pretty neutral pitch. Now it only responds to my normal voice about 50% of the time, so I intentionally drop my voice an octave whenever I speak to it.

But somehow it's not just about pitch either. I have friends who are trans men, and speak in a deep voice but they still have trouble getting Mycroft to respond.

2 more replies

vagrantJin3y ago

Been looking for an open source tool like this for a while now - but to automate some home security stuff. All I need is good basic functionality anyways. So its good to know it at least works.

The sexism part is horse manure though. A strong accusation on a likely small team on tight deadlines and budgets who cannot cater to everyone all at once, like Big Tech and their massive resources and teams.

EntropyIsAHoax3y ago

I don't mean to say the devs have any ill intent. Just pointing out the reality it has trouble with feminine voices. Like veidr points out in response to my comments, implicit bias is a big issue. Probably they originally trained the wakeword model entirely on American cis men, so naturally it has trouble recognizing any voice outside of that norm. It's not an accusation, this is a very well documented pitfall of machine learning.

I'm very grateful to the Mycroft team because I love smart speakers but am not willing to sacrifice my privacy to such a degree as I would have to to use a google home or Alexa or anything like that. That does not mean I won't point out its flaws.

1 more reply

acidburnNSA3y ago

I installed their self-hosted minic3 tts the other day to add a voice to my home assistant on prem smart home. It sounds unbelievably good compared to the picotts crap I was using before. Pretty stoked. Now I want to try getting the voice assistant hooked up too.

https://mycroft.ai/mimic-3/

dividedbyzero3y ago

Sadly their German voices sound broken, as if they stress and lengthen/shorten random syllables. And that's with their demo text.

synesthesiam3y ago

Sorry about that. I'm going to be working more closely with native German speakers to get it right!

karencarits3y ago

I really liked the idea from an earlier post with continuous recording + Whisper for transcription + keyword based actions. The drawback is asynchronous execution of your actions, but that setup seems very flexible!

https://news.ycombinator.com/item?id=33608437

incomingpain3y ago

$350 for a 4.5" CRT looking thing?

$90 for echo show 8" which does many more things. *Including government surveillance.

I wonder how many people have preordered.

srmarm3y ago

There was another post[0] on here today about Amazon losing $10B on Alexa this year. The only other big player is Google who I assume must also run the division at a big loss - at least on the hardware side of things as I've got loads of their devices dotted around the house most were given to me free or at a stupidly low cost (£20 each). Even the ones like this with a full colour screen I've only paid £49 for.

It's an interesting market that I don't think either has figured out too well. Anecdotally, I've not really seen them used for shopping or even really shopping lists and the search model doesn't seem quite as lucrative as if someone used their phone or computer to search.

The one thing they do seem to do well in is as an introduction to - and hub for - the smarthome but I struggle to see how that will make these big subsides viable.

Which brings us back to this. A bit ahead of it's time perhaps but if Amazon/Google pull back their subsidies then this kind of thing might be where we have to go. I'd be happier not using Google Home and have mostly moved to zigbee switches with home assistant to control now anyway. Maybe voice control was a bit of a flash in the pan?

[0] https://news.ycombinator.com/item?id=33700792

joseda-hg3y ago

Google to some degree can justify it better considering that Assistant is ALSO present in Android

They need to have some sort of version existing to compete with Siri, and can also shoehorn integrations that wouldn't quite work for Amazon

Were not for this[0] ever expanding page, Assistant seems like the more reliable option

[0] https://killedbygoogle.com/

UltraViolence3y ago

Which is essentially a Raspberry Pi 4 with an LCD display.

However, not wanting to spend time and money integrating the hardware and the software and building an enclosure around it I'd say it's still a fair deal.

Amazon is merely selling devices at cost, which is why they're losing $10 billion a year on Alexa. All this is probably going to stop soon so people end up with useless devices. Hopefully hackers will be able to run MyCroft on them.

glenstein3y ago

Under the specs, all I saw for display was:

>4.3″ IPS wide-viewing angle, full color, touchsceen

Were you able to find more information somewhere else about it being crt?

moffkalast3y ago

He didn't say it's a CRT, he said it's a 'CRT looking thing', which it definitely is with the thick extension behind the monitor.

Personally I do like the design but I'm quite fond of Fallout/Alien style retro displays so YMMV. As a personal assistant type thing the original vertical prototype with eyes seemed better though.

dspillett3y ago

He said CRT looking, I assume to mean “old-fashioned and bulky/boxy”, not that it actually was a CRT based display.

Brendinooo3y ago

They actually have some data for you here to give you a sense of scale:

https://mycroft.ai/mark-ii-status/

JosephRedfern3y ago

This isn't a CRT.

incomingpain3y ago

I realize it's not an actual CRT, it looks like a CRT.

kitebiveOP3y ago

Sorry if I ask, but what does CRT mean?

Brendinooo3y ago

Cathode Ray Tube, the kind of monitor/television that existed before LCD screens became popular.

splitrocket3y ago

The key feature I haven't seen any of these opensource projects implement is microphone response coordination: If you have multiple microphones and speakers, which one responds?

My google home's are terrible at this: often one in another room responds, but at least it's only one. When I tried to run Genie (https://genie.stanford.edu/) I had multiple devices responding simultaneously. It was a disaster.

For me, this is the core feature that will enable me to swap out my corporate listening devices for an opensource, cloud-free alternative.

1 more reply

borissk3y ago

I hope this project makes into people's homes, but I really doubt it. Google invests so much resources into their Assistant (a recruiter recently told me they have over 1000 vacancies). Given that Google has it's own very advanced and very efficient cloud infrastructure, their own ML processors and an army of devs and AI scientists, their assistant will always be cheaper and "smarter" than a device build by a small company on top of an open source project.

xrd3y ago

I have one of these devices. I'm still a bit mystified about what I want to use it for. But, after reading these threads, I've now got a few ideas that make me very excited.

I have come to despise the Google/Alexa/Siri devices. I'll explain why.

I hate that Google Home devices always give you a direct answer when you ask a question. The reason I hate this is because my kids use it to get an answer, without any work, without any thinking, without any consideration that there might be context to the answer. If they were to research and read about the question, they would learn so much more. But, they, like all people, want a simple and compact answer. And, I'm sure Google engineers have their RSUs tied to some KPI that says "make answers as simple and compact" so it will never come out any way other than this from Google.

I hate that Google permits my kids to play the same damn song over and over again. (Cue sentimental music...). In my day, we listened to the radio and it might have been bad for my dad for five minutes and he scowled the whole way as enjoyed some utterly awful pop song, but then that song ended and he didn't have to listen to it for a few hours. Modern radio is worse, but at least you can take a break for an hour before they play (and are paid to play) the same song over and over.

I hate the surveillance aspect of Google. I don't want to have profiles generated of my kids such that when Google revenues dip in a few years they are enticed by an offer from that shady insurance conglomerate that really wants to know whether any of them discussed depression or racism.

So, if I can use a Mycroft device to:

  * Permit them to ask questions, but give them answers in a way they have to dig and think and explore, that would be really cool. I'm sure this isn't easy, but it will never happen with Google/Alexa/Siri because they only care about MONETIZING those interactions.
  * Give me more control over how media is consumed. The people working at YouTube will never have a KPI for "make sure you can only play one song per hour" and Google Home will never have that KPI, so it will never happen. That will never be something they can MONETIZE. It seems like it will be a lot more challenging to get music onto my Mycroft, but I prefer to play Jazz radio and because there still are live streams, I think you could get off the YouTube/Spotify/Amazon music train anyway. I got rid of so much of my music, but you can play shared files: https://mycroft-ai.gitbook.io/mark-ii/basic-commands#jukebox
  * Forget the worries about surveillance. Mycroft right now uses Google for text to speech, but it can anonymize it enough for me not to worry as much.

nottorp3y ago

> I hate that Google Home devices always give you a direct answer when you ask a question.

So does StackOverflow. Teaching people how to do things instead of giving them code ready to copy paste is frowned upon there as well.

cjtrowbridge3y ago

Mycroft was the first thing I thought of when I read the first press releases from OpenAI about Whisper! Mycroft has historically used Google for voice recognition. Exciting to see self-hosted alternatives like Whisper coming out.

laputan_machine3y ago

I love the idea, but I think this isn't going to be "the one".

It's VC funded, the VCs are going to want to get a RoI.

And we all know where that leads...

cjtrowbridge3y ago

So fork it; it's FOSS.

NikkiA3y ago

Huh, didn't expect to see a Mycroft Holmes reference today. Picroft looks interesting.

UltraViolence3y ago

I like it. Now they simply have to make it fully functional and add some intelligent AI.

ccn0p3y ago

Mark II sounds like he'd rather be sleeping than helping the person in the demo video.

j / k navigate · click thread line to collapse

140 comments

Brendinooo3y ago

[1]: https://old.reddit.com/r/Mycroftai/comments/yitzzk/mycroft_m...

GekkePrutser3y ago

Those prices are really insane tbh. No way it will take off like that, it's just a non-starter. 200 was already the upper limit of what's doable.

I think they're stretching too much to satisfy the original backers and it's commendable not giving up on them but if it's going to be similar to Alexa, Siri or Google it's just not good enough.

If it were an actual assistant I could talk to, then yes. It would be worth it.

Imagine this.. I'm doing my laundry and mycroft pipes up.

"hey Alice is looking for you on telegram"

"Tell her I'll get back to her after I finish the laundry"

"Ok!" ... "She says it's urgent, are you sure"?

"Ok call her please on speaker"

Or another scenario.

"hey mycroft I'm going out to the zoo"

"Ok make sure you bring an umbrella because it's going to rain in 2 hours "

It's just that there's not much sales to link to it other than the service price (which I'd definitely pay for!!).

I don't think these scenarios are too far-fetched with the current state of AI tbh.

ElijahLynn3y ago

I'm definitely curious about trying a Mycroft now, and I think there may be others too. It may not be the masses but it might be enough to keep the project thriving.

lannisterstark3y ago

>I'm not sure that price is going to be a blocker.

Eh, idk if I want to spend $350 on what is arguably a half baked, subpar Google-Assistnat/Alexa.

aartav3y ago

Kudos to them for shipping a product.

Brendinooo3y ago

The original design was great, but it just didn't work. They had too many issues trying to source hardware and decided to pivot to more off-the-shelf components.

That's the short version; here are some highlighted blog posts that document the trials and tribulations:

https://mycroft.ai/blog/mark-ii-update-delivery-timeline-and...

https://mycroft.ai/blog/mark-ii-update-january-2019-current-...

https://mycroft.ai/blog/mark-ii-architecture-change/

https://mycroft.ai/blog/mark-ii-update-revised-architecture/

https://mycroft.ai/blog/real-companies-ship-product/ (here's the pivot to Raspberry Pi)

https://mycroft.ai/blog/mark-ii-update-january/

https://mycroft.ai/blog/mycroft-mark-ii-july-2020/ (here's where they decide that the thing has to be a box instead of a cylinder)

https://mycroft.ai/blog/mark-ii-update-october-2020/

https://mycroft.ai/blog/redesigning-the-mark-ii-part-1/ (here's the final design)

nshm3y ago

There is also a strange story of speech developer leaving them a week ago https://community.rhasspy.org/t/rhasspy-is-joining-nabu-casa...

SEJeff3y ago

That's great news for Home Assistant however!

rexreed3y ago

The voice assistant space is dying: https://arstechnica.com/gadgets/2022/11/amazon-alexa-is-a-co...

rolenthedeep3y ago

Voice assistants which are trying to force engagement to squeeze money out of you are dying.

People like the idea of voice assistants, but by and large they don't like all the problems associated with a voice assistant run by Amazon, Google, and Microsoft.

horsawlarway3y ago

I've actually pre-ordered the Mycroft Mark 2, although no chance to evaluate it yet.

I'm very interested in devices that can do this locally.

It needs to obey me, not google or amazon. Otherwise it's a sales rep and not a digital assistant.

cupofpython3y ago

>they both work,

1 more reply

kuberlog3y ago

1 more reply

shagie3y ago

> Most people only use the voice assistants for a few simple tasks, which is perfect for an open source project like mycroft.

I'll certainly grant that... but the price point where Mycroft is, is certainly not near what I'd pay for doing those few simple tasks.

If I wasn't invested in Apple, then none of the home assistants other than Amazon (because of the price point for the echo) would be particularly interesting.

dublin3y ago

I would pay $500 to outfit the house with a central voice recognition processor that would be capable of supporting a dozen or so very secure listeners on the local LAN. Mycroft isn't that solution.

vineyardmike3y ago

smoldesu3y ago

> That's why they're all going so aggressive on "you asked for the time, but by the way here's a 5 minute speech on all the easily monetizable tasks I can do instead"

This is a word-for-word description of how Siri originally functioned. "You asked for the top 5 romantic resturaunts nearby; here are the top results from Google Search:"

zamadatix3y ago

1 more reply

jm43y ago

Apple has a plan that works and Amazon doesn't.

smoldesu3y ago

Tagbert3y ago

Apple makes money from the devices it sells. Siri makes those devices more convenient to use.

Amazon is selling the devices at or below cost and hoping that Alexa will make the money (which it doesn’t)

1 more reply

shagie3y ago

Apples devices are smarter. This makes them cost more for the "same" hardware, but costs less for the computation.

Apple isn't trying to make money with Siri. It's using Siri to make its ecosystem of Apple Music and similar more valuable to its customers.

The limits that Apple puts on what it can do makes that cloud side computation less expensive.

---

1 more reply

twobitshifter3y ago

theptip3y ago

Seems odd right? Just charge more than break-even for Alexa and you have a business?

jm43y ago

1 more reply

ncallaway3y ago

Charge more than break even for Alexa and they won’t sell enough.

vineyardmike3y ago

> Apple isn't necessarily in the same boat.

Also importantly, Siri mostly runs off your iPhones processor, and apple doesn’t have to pay a big cloud bill for it, unlike Alexa.

cupofpython3y ago

Apple and the garden of eden. Dont take a bite unless you're going all in

Brendinooo3y ago

The "voice assistant space" also includes Siri, Google Assistant, and Cortana, so it's not going anywhere.

I'd contend that it's absolutely not a solution in search of a problem; it's much more of an unsolved problem, and a big part of the "why" is

- voice recognition/assistance tech still maturing

- major players are insisting that the tech supports their walled gardens

- price points are still a problem

This is why I think Mycroft is important to support:

1. If you can make voice assistant software open-source and plug-and-play, then it frees people up to tinker with form factors

liotier3y ago

> custom solutions

Especially interesting as voice recognition is much easier, cheaper and more efficient within the limited space of a specific usage.

tjohns3y ago

> 1. If you can make voice assistant software open-source and plug-and-play, then it frees people up to tinker with form factors

For what it's worth, Google Assistant does have an open API to create new devices. It's not open source, but you can certainly experiment with your own custom form factors. There's even a tutorial:

https://medium.com/google-cloud/how-to-build-your-own-smart-...

ghaff3y ago

>- price points are still a problem

For music, I have an old phone connected to a stereo receiver. So it has voice control although I mostly pick a playlist or album manually.

Brendinooo3y ago

>An Echo Dot is $25

Yup, price point is solved for the "just timers and some music" people, but now you're stuck in Amazon's orbit.

1 more reply

horsawlarway3y ago

It's not dying at all - it's an incredibly useful interaction style.

Those companies are failing to profit because they don't understand that a digital assistant needs to be working with me, locally, and not subverting my intent.

voakbasda3y ago

gigatree3y ago

thomastjeffery3y ago

...but you can fork it. Is the build process really too painful for that to be enough?

cloudking3y ago

We use Google home assistant devices throughout the house, and find them quite useful. Use cases:

- controlling smart devices (thermostats, TVs, speakers)

- broadcasting messages

- reminders / tasks

- asking questions

However, none of these use cases generate any revenue for Google afaik.

VikingCoder3y ago

That set of functionality alone makes them well worth the money for us.

veidr3y ago

With all due respect, I find that thesis absurd.

Maybe you meant it like, "the voice assistant space isn't going to generate huge profits, and thus giant corporations will lose interest".

But the chance that in 2032 people just like... don't have voice assistants? It's literally zero, barring an actual WWIII cataclysm reversion-to-barbarism event.

> doesn't work as well as people were hoping

Nothing does, until it does...

> little monetization

Yep, that might be right. But it doesn't necessarily mean the space is "dying". Just that it might not be amenable to oligopolization.

voakbasda3y ago

I will never have a voice assistant unless a completely open and self-hosted solution appears on the market. And with current patent landscape, that seems incredibly unlikely to happen before 2032.

veidr3y ago

C2H4O23y ago

aartav3y ago

And who the heck wants it in a ring or glasses??

Zuiii3y ago

On the other hand, Mycrodt sounds like something people would actually want to use provided that it can operate locally and doesn't send any data outside the home.

melling3y ago

It works a lot better than nothing. I use Siri and Alexa every day. If Alexa goes away, I’ll use Siri more, or find another. Siri was a little slow to catch up.

I think the story that you read simply says that it’s hard to monetize. You are inferring more than what the story says.

Voice assistants are here to stay.

I eagerly await the day when I can simply say respond to this post then begin writing with my voice.

genewitch3y ago

"Okay, navigating to the nearest post office"

giancarlostoro3y ago

bluGill3y ago

veidr3y ago

Would love to read about experiences actually using this (I mean Mycroft in general) — good, bad, or otherwise.

So from that standpoint, this looks pretty cool to me — even if the voice recognition isn't as good as the big three.

So that's getting pretty close to being able to use Siri as a generic voice recognizer, and then piping the input into whatever arbitrary/homebrew system I want.

criddell3y ago

> why don't we have "text assistants"

I used to use a text-based assistant service called I want Sandy and it was great. Then Twitter bought the company and they went away.

http://boingboing.net/2007/11/14/i-want-sandy-perfect.html

toqy3y ago

> why don't we have "text assistants"

Siri has this as an accessibility feature since iOS 11, but might not be exactly what you're looking for

veidr3y ago

That is right, and I did try it, but they made it so that if you enable that then voice input no longer works. (T_T)

rickoooooo3y ago

adam10283y ago

check their free stuff: https://picovoice.ai/pricing/

rickoooooo3y ago

1 more reply

moffkalast3y ago

> why don't we have "text assistants"

We do, it's called typing things into Google.

HankB993y ago

Will that initiate actions like the voice thing does? I thought it just returned search results.

rolenthedeep3y ago

moffkalast3y ago

A lot of assistant functionality is just getting data from the internet, which search engines already know how to present and format in a useful way.

1 more reply

EntropyIsAHoax3y ago

This is actually how Mycroft handles it, more or less.

https://mycroft-ai.gitbook.io/docs/mycroft-technologies/over...

voakbasda3y ago

[0] https://www.kickstarter.com/projects/aiforeveryone/mycroft-m...

wolczek3y ago

They use Google cloud speech-to-text API, one of the key technology. https://mycroft-ai.gitbook.io/docs/mycroft-technologies/over...

¯\_(ツ)_/¯

arbol3y ago

shagie3y ago

> The pi isn't really fast enough to process the speech in real time.

> Mister Smith said that he wanted a two by four and half of a pie.

(if you don't have an iDevice, it transcribes this as:

> Mr. Smith said he wanted a 2 x 4 and 1/2 of a pie

arbol3y ago

one-another-dev3y ago

> If you've got an iPhone... put it in to airplane mode so that it is local only. You'll note that Siri no longer works when you do this

This is not true anymore. Latest iPhone models have offline Siri working to some extent

1 more reply

nshm3y ago

DeepSpeeech is very old software. Vosk works just fine https://github.com/alphacep/vosk-api. People even run tiny Whisper on Pi, though they have to wait ages.

arbol3y ago

Thanks, I might try to get this working with picroft

xrd3y ago

I'm comfortable with their approach to anonymizing the interaction, and assume they will find a way to remove that dependency.

walterbell3y ago

A web search finds this 4-mike array for $63, https://www.robotshop.com/en/seeedstudio-respeaker-mic-array...

Google had an audio dev kits for schools based on their TPU and RPi, https://aiyprojects.withgoogle.com/voice/

mtlmtlmtlmtl3y ago

I just don't see this being worth the money. Hundreds of $ to make switching music slightly more convenient just seems like a colossal waste of money to me.

Firmwarrior3y ago

I think we're spoiled by Google and Amazon losing so much money for so long

For example, I noticed a couple years ago at the store that a regular featureless analog wall clock was more expensive than an Echo Dot

These guys need to pay for their software dev out of hardware sales and can't hope for a runaway success yet, so of course it'll cost a painfully lot more

systemicdanna3y ago

Agreed. I don’t need a screen or a loud speaker on a voice assistant box. Just make it small and cheap, as promised.

theCrowing3y ago

I use Mycroft for around 2-3 years on a raspberry pi with an microphone array and the quality is still not nowhere near the level were I would give it to my mom or granny.

nshm3y ago

Try Sepia https://sepia-framework.github.io, it is an open source assistant, works very good.

EntropyIsAHoax3y ago

Maybe once a month it freezes and it just needs to be restarted, anyone can do that who can get used to talking to a robot in the first place.

It's nowhere near as polished as Alexa or similar, but it's good for the basics or for hobbyists who don't want to be spied on.

veidr3y ago

> sexism in the voice recognition

What does this mean? Mycroft prefers taking orders from dudes?

EntropyIsAHoax3y ago

Yes. It reliably responds to the wakeword ("hey Mycroft") from men, and only responds about 50% of the time to women.

When my sister visits, it almost never responded to her for example.

But somehow it's not just about pitch either. I have friends who are trans men, and speak in a deep voice but they still have trouble getting Mycroft to respond.

2 more replies

vagrantJin3y ago

Been looking for an open source tool like this for a while now - but to automate some home security stuff. All I need is good basic functionality anyways. So its good to know it at least works.

EntropyIsAHoax3y ago

1 more reply

acidburnNSA3y ago

https://mycroft.ai/mimic-3/

dividedbyzero3y ago

Sadly their German voices sound broken, as if they stress and lengthen/shorten random syllables. And that's with their demo text.

synesthesiam3y ago

Sorry about that. I'm going to be working more closely with native German speakers to get it right!

karencarits3y ago

https://news.ycombinator.com/item?id=33608437

incomingpain3y ago

$350 for a 4.5" CRT looking thing?

$90 for echo show 8" which does many more things. *Including government surveillance.

I wonder how many people have preordered.

srmarm3y ago

The one thing they do seem to do well in is as an introduction to - and hub for - the smarthome but I struggle to see how that will make these big subsides viable.

[0] https://news.ycombinator.com/item?id=33700792

joseda-hg3y ago

Google to some degree can justify it better considering that Assistant is ALSO present in Android

They need to have some sort of version existing to compete with Siri, and can also shoehorn integrations that wouldn't quite work for Amazon

Were not for this[0] ever expanding page, Assistant seems like the more reliable option

[0] https://killedbygoogle.com/

UltraViolence3y ago

Which is essentially a Raspberry Pi 4 with an LCD display.

However, not wanting to spend time and money integrating the hardware and the software and building an enclosure around it I'd say it's still a fair deal.

glenstein3y ago

Under the specs, all I saw for display was:

>4.3″ IPS wide-viewing angle, full color, touchsceen

Were you able to find more information somewhere else about it being crt?

moffkalast3y ago

He didn't say it's a CRT, he said it's a 'CRT looking thing', which it definitely is with the thick extension behind the monitor.

Personally I do like the design but I'm quite fond of Fallout/Alien style retro displays so YMMV. As a personal assistant type thing the original vertical prototype with eyes seemed better though.

dspillett3y ago

He said CRT looking, I assume to mean “old-fashioned and bulky/boxy”, not that it actually was a CRT based display.

Brendinooo3y ago

They actually have some data for you here to give you a sense of scale:

https://mycroft.ai/mark-ii-status/

JosephRedfern3y ago

This isn't a CRT.

incomingpain3y ago

I realize it's not an actual CRT, it looks like a CRT.

kitebiveOP3y ago

Sorry if I ask, but what does CRT mean?

Brendinooo3y ago

Cathode Ray Tube, the kind of monitor/television that existed before LCD screens became popular.

splitrocket3y ago

The key feature I haven't seen any of these opensource projects implement is microphone response coordination: If you have multiple microphones and speakers, which one responds?

For me, this is the core feature that will enable me to swap out my corporate listening devices for an opensource, cloud-free alternative.

1 more reply

borissk3y ago

xrd3y ago

I have one of these devices. I'm still a bit mystified about what I want to use it for. But, after reading these threads, I've now got a few ideas that make me very excited.

I have come to despise the Google/Alexa/Siri devices. I'll explain why.

So, if I can use a Mycroft device to:

  * Permit them to ask questions, but give them answers in a way they have to dig and think and explore, that would be really cool. I'm sure this isn't easy, but it will never happen with Google/Alexa/Siri because they only care about MONETIZING those interactions.
  * Give me more control over how media is consumed. The people working at YouTube will never have a KPI for "make sure you can only play one song per hour" and Google Home will never have that KPI, so it will never happen. That will never be something they can MONETIZE. It seems like it will be a lot more challenging to get music onto my Mycroft, but I prefer to play Jazz radio and because there still are live streams, I think you could get off the YouTube/Spotify/Amazon music train anyway. I got rid of so much of my music, but you can play shared files: https://mycroft-ai.gitbook.io/mark-ii/basic-commands#jukebox
  * Forget the worries about surveillance. Mycroft right now uses Google for text to speech, but it can anonymize it enough for me not to worry as much.

nottorp3y ago

> I hate that Google Home devices always give you a direct answer when you ask a question.

So does StackOverflow. Teaching people how to do things instead of giving them code ready to copy paste is frowned upon there as well.

cjtrowbridge3y ago

laputan_machine3y ago

I love the idea, but I think this isn't going to be "the one".

It's VC funded, the VCs are going to want to get a RoI.

And we all know where that leads...

cjtrowbridge3y ago

So fork it; it's FOSS.

NikkiA3y ago

Huh, didn't expect to see a Mycroft Holmes reference today. Picroft looks interesting.

UltraViolence3y ago

I like it. Now they simply have to make it fully functional and add some intelligent AI.

ccn0p3y ago

Mark II sounds like he'd rather be sleeping than helping the person in the demo video.

j / k navigate · click thread line to collapse