We've decided to support Hacktoberfest by creating an open-source catalog of datasets in the audio domain. The idea is to have a bunch of audio datasets, which will be completely open-source, with the ability to view, visualize (waveform, spectrograms, etc), and download to use in your projects. Check out this dataset that I created as an example: https://dagshub.com/DagsHub/Librispeech-ASR-corpus/src/master/dev-clean/84/121123/84-121123-0000.flac.
You can read the full guidelines here: https://dagshub.com/blog/hacktoberfest-x-dagshub-2/ Would be happy to answer questions, but I think if you're passionate about open-source ML, this is a great opportunity to contribute.
No comments yet.