Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
pabs3
11mo ago
0 comments
Share
It is definitely doable to get openly licensed data, you just have to do it via voluntary participation of crowdsourced data acquisition programs. For example the RNNoise model was retrained from such crowdsourced data.
0 comments
default
newest
oldest
tedivm
11mo ago
IBM did it with their Granite models.
pabs3
OP
11mo ago
The data used for training Granite doesn't sound like it would be under FOSS licenses.
https://en.wikipedia.org/wiki/IBM_Granite
j
/
k
navigate · click thread line to collapse