However, the claim made by Arm: "the Arm AGI CPU, for agentic AI infrastructure, delivering more than 2x performance per rack compared with x86 platforms" is obviously false.
The new Intel Clearwater Forest Xeon processors use Darkmont cores, which have approximately the same performance per core, the same die area per core and the same power consumption per core as the Neoverse V3, but Intel offers 288 cores per socket and 576 cores per board, in comparison with only 136 cores per socket for Arm.
Therefore there is no chance that these new Arm processors can provide more performance per rack than Intel Clearwater Forest.
For applications that benefit from array operations, the AMD Zen 5 compact cores have much more performance per core than Neoverse V3 and AMD has provided 192 cores per socket for a long time. There is no chance for the new processors to exceed the performance per rack of Zen 5, but for those applications that do not benefit from array operations, these new Arm CPUs should have better performance per watt than Zen 5. But by the end of the year AMD should have Zen 6 Epyc CPUs, with more cores per socket, enhanced performance per core and improved performance per watt, so then there would be even less opportunities for these Arm CPUs to be better at something.
The only way how the claim of Arm can be true is if they have compared their new CPUs with antiquated CPUs like the Intel Granite Rapids Xeon CPUs, instead of comparing with state-of-the-art Intel Clearwater Forest and AMD Zen 5.
IIRC Granite Rapids is also not _that_ old, and either current or a single generation behind. (Has its successor landed yet? IIRC GNR is the same generation as Sierra Forest).
V4 cores should be out this year using X925 and C1 Ultra-based V5 will probably be 2027-2028.
I suspect that X4 is already fast enough to beat EPYC in per-core performance when using the whole chip. ARM caught up/passed x86 in IPC all the way back around A77/78 in 2019-2020. They are now much faster per clock and hitting about the same all-core clockspeeds as standard EPYC (let alone zen5c EPYC).
The big issue is that Graviton5 is already starting to hit the market and uses the same v3 cores. A lot of marketshare for this chip will probably come from taking Ampere customers.
However, Neoverse V3 has a lower die area, so you could implement more cores per socket than with Zen 5, but this has not been done yet, as these new CPUs have only 136 cores per socket versus 192 cores per socket for Zen 5.
For programs that do not use array operations, i.e. which do not use AVX/AVX-512 instructions, Neoverse V3 has better performance per watt than Zen 5. But that changes for programs that benefit from AVX/AVX-512, where Zen 5 has better performance per watt.
Moreover, Zen 5 is already old. By the end of the year there will be Zen 6, which will be the real competitor for these new Arm CPUs, and Zen 6 will have better performance per watt, even more cores per socket and even more performance per core.
SB likes 10x buys.
I'm a chip designer and a chip this complicated takes about 3 years from start to actual silicon so it would have started well before Softbank started their acquisition process of Ampere.
The press release says it was co-developed with Meta who has a growing custom chip team. Normally these chips like Amazon's Graviton or Google's Axion are designed for their own data center use only and rented to customers. This ARM chip sounds like Meta and other companies will all be able to buy chips for their own data centers.
I'm guessing Softbank will get ARM and Ampere to align on future chips or just merge Ampere completely into ARM.
I can only imagine that their boardroom minutes included heated exchanges between their legal and marketing teams.