I'm also doing embedded work, but I don't really see - or expect - performance from the RISC-V cores above similar CPU designs that consume about the same area/gate-equivalents; did I miss some recent results?
Of course, freely-available and well-supported CPU IP can be very cool!