Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
story
0 points
sheepdestroyer
1y ago
0 comments
Share
They could easily list the data used though. These datasets are mostly known and floating around. When they are constructed, instructions for replication could be provided too
0 comments
default
newest
oldest
coliveira
1y ago
They could, but even if they give this list the detractors will still say it is not open source.
rvnx
1y ago
yes and as a bonus they may get sued, which in the long-term, makes free / offline models to not be viable
It would be so much better if all models were trained with LibGen.
j
/
k
navigate · click thread line to collapse