undefined | Better HN

0 pointsbiomcgary3y ago0 comments

Since you work on podcasts, do any open source transcription tools currently identity the speaker in the output? This would be particularly helpful for interviews.

0 comments

1 comments · 1 top-level

nico3y ago

Not sure about open source, but in general, automated transcription systems need a separate track for each different speaker. So for example, for a phone call with one person on each end, you need two separate channels (recording systems usually split them left/right on one stereo file).

j / k navigate · click thread line to collapse