Sure, you could build an index of attribution <> latent space coords, but it would not be clear whether a generated document near several index entries would require compliance.
I guess this is where the threshold comes from. Choose a generous margin and over-attribute rather than under-attribute.