Direct addressing is the preferred configuration. Only if your existing code's working set does not fit in MCDRAM does the cache configuration make sense.
It might sound pedantic on my part, but 'it can act as cache' is very different in practice from 'It is a cache'.