ML based linkage has evolved a lot over the past years, and a few people are trying out LLM based ones - https://arxiv.org/pdf/2403.06434v1
https://sbert.net/docs/cross_encoder/usage/usage.html
to "predict" if two entities are the same?