Better HN
0 points
derac
2y ago
It's one model with text/audio/image input and output.
1 comment · 1 top-level
jacobsimon
2y ago
Very exciting, would love to read more about how the image generation architecture works. Is it still a diffusion model that has somehow been integrated with a transformer, or an entirely new architecture that is not diffusion-based?