400B parameters would need about 18 chips just for the weights. Then you need a bit more RAM for other stuff.
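For anyone who wants to check the arithmetic, here is a rough sketch. The numbers are my assumptions, not something stated in this thread: 16-bit weights (2 bytes per parameter) and roughly 44 GB of on-chip SRAM per chip. The helper name `chips_needed` is made up for illustration.

```python
import math

def chips_needed(params, bytes_per_param, sram_bytes_per_chip):
    """Return (exact ratio, whole chips) needed to hold the weights alone in on-chip SRAM."""
    weight_bytes = params * bytes_per_param
    ratio = weight_bytes / sram_bytes_per_chip
    return ratio, math.ceil(ratio)

# Assumed inputs (not from the thread): 400B params, 2 bytes each, 44 GB SRAM per chip.
ratio, whole = chips_needed(400e9, 2, 44e9)
print(f"{ratio:.1f} chips' worth of SRAM, so {whole} whole chips")
```

Under those assumptions the ratio comes out a little over 18, so rounding up gives 19 chips before counting activations, KV cache, or anything else. Change `bytes_per_param` to 1 for 8-bit weights and the count roughly halves.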
CSE systems also come with off-chip memory, comparable to a GPU's memory, but usually in the TB range.
Here [1] they imply they can reach 1.2Tbps (allegedly, I know), and that's the previous generation ...
1: https://f.hubspotusercontent30.net/hubfs/8968533/Virtual%20B...
Of course they're using the on-chip SRAM, why wouldn't they?
This is a press release from Cerebras about a Cerebras chip... of course they are using a Cerebras chip!
Is that not obvious?