1
Ask HN: Can anybody get ChatGPT to make an image of a cow jumping over the moon?
We’ve been trying for a while now and it seems to always place the cow in front of the moon, but not above.
There are 100k+ images which are categorized with generic tags (such as conference room, reception area) as well as specific tags identifying the particular manufacturer / product visible in the image (Herman Miller Aeron Chair). The product-specific tags also contain X&Y coordinates for where the products are in the photo (there are roughly 50k of these).
If you were going to create your own detection model, what would you use?