If anyone has faster than 12tkps on Air's let me know.
I'm using the LM Studio GUI over llama.cpp with the "Apple Metal GPU" option. Increasing CPU threads seemingly does nothing either without metal.
Ram usage hovers at 5.5GB with a q5_k_m of Mistral.