Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
intrasight
1mo ago
0 comments
Save
Share
Is that true? If the distillation is not lossy and the model runs much faster due to less resource consumption, then it may outperform.
0 comments
2 comments · 1 top-level
top
newest
oldest
mwigdahl
1mo ago
· 1 in thread
One of those conditionals is a pretty huge assumption.
intrasight
OP
1mo ago
It's an assumption and it can be tested
j
/
k
navigate · click thread line to collapse