0 points
pabs3
1y ago
It is definitely doable to get openly licensed data; you just have to collect it through voluntary, crowdsourced data acquisition programs. For example, the RNNoise model was retrained on such crowdsourced data.
2 comments · 1 top-level
tedivm
1y ago
IBM did it with their Granite models.
pabs3
OP
1y ago
The data used for training Granite doesn't sound like it would be under FOSS licenses.
https://en.wikipedia.org/wiki/IBM_Granite