Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
bitL
6y ago
0 comments
Save
Share
e.g. XLNet:
https://arxiv.org/abs/1906.08237
0 comments
2 comments · 1 top-level
top
newest
oldest
phreeza
6y ago
· 1 in thread
XLnet is Bert with a bunch of additional training tricks.
bitL
OP
6y ago
BERT is a Transformer with a bunch of additional training tricks. Transformer is self-attention with a bunch of additional training tricks...
j
/
k
navigate · click thread line to collapse