Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
rfoo
1y ago
0 comments
Share
IMO the relevant benchmark for now is a mixed stream of requests with 50 (20%), 500 (50%), 2000 (10%) and 50k (20%) input tokens, ignore EOS and decode until you get around 300 output tokens.
0 comments
default
newest
oldest
SuchAnonMuchWow
1y ago
I'm really interested, do you have a source for those percentages ?
I tried to look for some service provider to publish this kind of metrics, but haven't found any.
rfoo
OP
1y ago
Sorry, I can't. My employer doesn't publish this kind of metrics, either. What I posted was definitely just some very rough number off my brain.
j
/
k
navigate · click thread line to collapse