For a body, with bones, since they can't stretch. All you need is rotation at each joint. You can get away with 10/11 bits per axis.
So for a full body you should be able to compress to 200 bytes per frame for 50 joints. That would mean 300k for 1 minute of animation at 30fps. Interpolate to get 60fps. That doesn't include faces.
If you do faces like Apple does, which IIUC is just N morph targets where N is like 15? Those are 1 weight each and you could easily make those 1 byte per weight or less so that's 27k for 1 minute of animation
Both of those could probably easily be compressed by storing deltas like draco or fit to curves for lots more compression.