Annotated Implementation of DeepNet: Scaling Transformers to 1k Layers | Better HN

(nn.labml.ai)

3 points | vpj | 4y ago | 0 comments

No comments yet.