Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
0 points
axiom92
2y ago
0 comments
Share
Right, but no separate image encoder + half the size could be very helpful for many applications.
undefined | Better HN
0 comments
default
newest
oldest
GaggiX
2y ago
The 7B LLaVa model is smaller, even considering the image encoder (CLIP-L).
j
/
k
navigate · click thread line to collapse