undefined | Better HN

0 pointsnealabq3y ago0 comments

> Image inputs are still a research preview and not publicly available.

Will input-images also be tokenized? Multi-modal input is an area of research, but an image could be converted into a text description (?) before being inserted into the input stream.

0 comments

teruakohatu3y ago

My understanding is thta the image embedding is included, rather than converting to text.

1 more reply

j / k navigate · click thread line to collapse

0 comments

teruakohatu3y ago

My understanding is thta the image embedding is included, rather than converting to text.

1 more reply

j / k navigate · click thread line to collapse