Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
dietr1ch
1y ago
0 comments
Save
Share
I guess that if the bulk of the computation goes into the multiplications, you can work in the log-space and simply sum, and when the time comes to actually do a sum on the original space you can go back and sum.
0 comments
1 comments · 1 top-level
top
newest
oldest
a-loup-e
1y ago
Not sure how well that would work if you're often adding bias after every layer
j
/
k
navigate · click thread line to collapse