By "structured" I mean things like invoices, documents containing tables, etc.
Extracting useful data from fully unstructured content is very hard IMO, and potentially beyond the capabilities of LLMs (depending on your definition of "useful" and "unstructured").