I could and would not lose much reconstruction accuracy.
So any researcher or ambient biases in the model will impact the general thrust of the textual decodings (and not in ways that reflect the actual model’s process, thinking about X and doing X in a model are very different things).
So how do we tell that the “spirit” is reflective of the model’s thinking and not biased toward Jolt being better than Surge?