Better HN
0 points
smlacy
2mo ago
Post actual results, make a blog post. Don't just say "this sucks" without tangible evidence.
Otherwise you're doomed to "sample size of one" level of relevance.
3 comments · 3 top-level
thorum
2mo ago
I have the opposite experience: random HN/Reddit comments saying “this sucks” or “whoa this is a huge improvement” are the only benchmark that means anything. Standard benchmarks are all gamed and don’t capture the complexity of the real world.
titanomachy
2mo ago
Then your internal benchmarks will be in the post-training set and you’ll have to make new ones.
_2d30
2mo ago
I may already have, but I'm pseudonymous on this website.