Its very fast on Vulkan, and from what I understand fast on metal, but its not as feature packed as llama.cpp yet.