Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
karmasimida
4y ago
0 comments
Save
Share
If they use BPE dropout, then the split can be different and not unique.
And for the record, they use BPE dropout for DALLE-1, see
https://arxiv.org/pdf/2102.12092.pdf
0 comments
2 comments · 1 top-level
top
newest
oldest
DalasNoin
4y ago
· 1 in thread
I believe they only apply it during training.
karmasimida
OP
4y ago
right, that is my point. It is hard to know which combination triggers the current tokenization to be interpreted as bird.
j
/
k
navigate · click thread line to collapse