RAG still needs model training, if the models were to go stale and the context drifts sufficiently, the RAG mechanism collapses.
Sure, those models are cheaper, but we also don’t really know how an ecosystem with a stale LLM and up to date RAG would behave once context drifts sufficiently, because no one is solving that problem at the moment.