Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Annotated Implementation of DeepNet: Scaling Transformers to 1k Layers | Better HN
Annotated Implementation of DeepNet: Scaling Transformers to 1k Layers
(opens in new tab)
(nn.labml.ai)
3 points
vpj
4y ago
0 comments
Share
0 comments
No comments yet.