"RSS lies. Your process might not be using that memory. The allocator might be hoarding it."
Interesting writeup, but:
No, anonymous AI author. Your process is using that memory, for its allocator. Features like lower-latency allocations don't come for free, even when they turn out severely suboptimal for your particular case. Your code isn't using that memory, but a support library is. It is very much in your process.