Show HN: Ermine.ai – Record and transcribe speech, 100% client-side (WASM) (opens in new tab)

(ermine.ai)

236 pointsvishnumenon3y ago68 comments

68 comments

63 comments · 21 top-level

viraptor3y ago· 17 in thread

I'm after something that can transcribe medical notes and unfortunately it does not work well for that case. (almost nothing does though) There's quite a few people interested in something that doesn't turn "laparoscopic" into "leper as cop it".

Maybe the current progress will help though. Models adjusted by your own dictionary or from postprocessing fixes would be amazing.

Void_3y ago

OpenAI Whisper is really good.

Here’s an iOS app to play with it: https://whispermemos.com

It even formats recording as paragraphs by running through GPT.

dgorges3y ago

This site is using Whisper:

> Built using transformers.js and the whisper-tiny.en model.

kinkopop3y ago

I've been using Whisper Memos for some time now. Simple but useful app to quickly save an idea or memory when you don't have time to type. Speech recognition is much better than native one, especially with languages which are not widely supported.

philiplitassy3y ago

This app has been my go-to solution for efficiently recording thoughts or memories without the need to spend time typing. Its proficiency in speech recognition is notably outstanding.

kerinin3y ago

I really like the accuracy of Whisper, but I feel like it operates at roughly real-time on my machine.

1 more reply

moneywoes3y ago

How is the latency? This is whisper running on the iPhone?

kkielhofner3y ago

I had an application for radiology and whisper large-v2 with beam size five was essentially 100% across multiple different types of dictated radiology reports.

ChrisK913y ago

We are using Nuance Dragon Medical at work which is intuitive to use and surprisingly accurate even with very fast dictation. I have yet to come across a solution, that is as accurate, although I'm not sure if they offer solutions for end users directly.

anonymouse0083y ago

Have you tried Siri Dictation? The transcription process doesn’t leave iOS 14 and on - and I’ve been pleasantly surprised.

Laparoscopic - that worked fine ;)

Edit: if you have one available, mind sending over a sample deidentified note to my email in profile? I’m working on something.

viraptor3y ago

> Have you tried Siri Dictation?

No, there's no Apple hardware available in my scenario.

Also in medical context, if I can't tell where the data goes, the solution is not usable.

johtso3y ago

Have you tried https://speechmatics.com/ ? I think they have a specially tuned medical version, and quite a generous free allowance.

viraptor3y ago

I have not, not will do tomorrow. Thanks for the link.

leetharris3y ago

I work at Rev.AI, we have a model tuned for medical and we are HIPAA compliant across the board. We do human and AI transcription and our ASR is #1 accuracy in the world right now

cpeth3y ago

I took at a look at the Rev.AI website and didn't see any mention of a medical-specific model nor HIPAA compliance. It would be nice if this information was presented in your marketing!

dilek3y ago

picovoice processes on the device and you can fine-tune the models https://picovoice.ai/platform/cat/

machiaweliczny3y ago

Have you tried Whisper and simply saying to GPT the context of conversation and to fix it. I think it should work

giovannibonetti3y ago

You'll probably need a custom model tailored to medical content.

wdb3y ago· 6 in thread

Doesn't seem to work in Safari