5FP32 TFLOPs, if not doing sparse low precision inference it seems to be about in line with mid-high end 2014 Nvidia consumer card performance (gtx 980), one decade old.
For running sparsified/quantized llama2 it might be good, not sure about for fine tuning. I didn't see any FP16 numbers.