AWS Graviton 3 Instances (opens in new tab)

(aws.amazon.com)

174 pointsTrisell4y ago110 comments

110 comments

72 comments · 14 top-level

ghshephard4y ago· 12 in thread

So - for those deeper into security - is this useful?

"Graviton3 processors also include a new pointer authentication feature that is designed to improve security. Before return addresses are pushed on to the stack, they are first signed with a secret key and additional context information, including the current value of the stack pointer. When the signed addresses are popped off the stack, they are validated before being used. An exception is raised if the address is not valid, thereby blocking attacks that work by overwriting the stack contents with the address of harmful code. We are working with operating system and compiler developers to add additional support for this feature, so please get in touch if this is of interest to you"

spijdar4y ago

Very useful, depending on the implementation and potential trade-offs. If the performance is good, this is a nice extra layer that makes return-oriented programming more difficult. Combined with NX bits, it really raises the difficulty in developing/using many types of exploits.

(it's not impossible to bypass, I'm vaguely aware it's been done on Apple's new chips that implement a similar (the same?) ARM extension, but there's no perfect security)

pram4y ago

Yup arm64e in general has pointer authentication, iOS and MacOS already implement it.

1 more reply

Accujack4y ago

Performance is what I wonder about. The idea sounds good, but what crypto scheme can perform encryption of a signature both securely and fast enough to keep up with every pointer pushed on the stack?

What's the trade-off?

1 more reply

ComputerGuru4y ago

These slides should help: https://llvm.org/devmtg/2019-10/slides/McCall-Bougacha-arm64...

saagarjha4y ago

Note that those slides are slightly LLVM/Apple biased.

ljhsiung4y ago

Pointer authentication has been around for several years already. As with many things in hardware, though, it takes time for the software ecosystem around it to mature. Still, I've found it to be quite influential.

Here are a couple "real world" examples--

Project Zero had a blogpost about some of the weaknesses on the original Pointer Auth spec [0], and even had a follow up [1].

Here is an example of what some mitigation might look like, showing how gets(), which is a classically trivially vulnerable primitive, becomes not-so-trivial (but still feasible enough to do in a blogpost, obviously) [2].

Cost-wise, in terms of both hardware and software, it's rather cheap. The hardware to support this isn't too expensive, about on par with a multiplier. On the software end, like I said, it's taken some time to mature and gotten to a pretty good state IMO, with basically all compilers providing simple usage since 2019-- just turn on a flag!

ARM also did a performance vs. ROP gadget reduction analysis [3]. The takeaway is, as others have mentioned, while it doesn't completely mitigate, it does heavily increase the complexity for rather cheap.

In fact, I'm rather annoyed Amazon didn't include this feature on Graviton2, and to claim it as new or innovative on their end feels just like marketing speak. Any CPU that claims to be ARMv8.5-a compliant *must* have this feature, and that's been around for quite a few years now.

[0]: https://googleprojectzero.blogspot.com/2019/02/examining-poi...

[1]: https://bazad.github.io/presentations/BlackHat-USA-2020-iOS_...

[2]: https://blog.ret2.io/2021/06/16/intro-to-pac-arm64/

[3]: https://developer.arm.com/documentation/102433/0100/Applying...

my1234y ago

> In fact, I'm rather annoyed Amazon didn't include this feature on Graviton2

Arm-designed licensable cores didn't have it back then, and that's what AWS uses.

Graviton2 used the Neoverse N1 core.

pjmlp4y ago

So far, I think Solaris SPARC ADI is the more mature version of it on use, however given it is Solaris SPARC, not many are aware of it.

MaxBarraclough4y ago

This isn't something I know a lot about but it sounds like the idea of a shadow stack [0] but implemented with crypto. See also Intel CET (for Control-flow Enforcement Technology). [1]

[0] https://en.wikipedia.org/wiki/Shadow_stack

[1] https://www.intel.com/content/www/us/en/developer/articles/t...

tyingq4y ago

It would take the heat off for mitigating buffer overflow CVEs in a rushed way. There are many of those that give remote code execution, so typically a frenzied patching exercise. A little more time to do the patching in a more deliberate way would be nice.

staticassertion4y ago

I think in some cases this would effectively mitigate a vulnerability entirely. If you require control over the return address you're basically shit out of luck. A buffer overflow at that point is going to have to target some other function pointer or data, which may not be feasible in a given function.

staticassertion4y ago

I've heard very promising things about pointer authentication.

talawahtech4y ago· 11 in thread

I was wondering how they were going to manage the fact that AMDs Zen3 based instances would likely be faster than Graviton2. Color me impressed. AWS' pace of innovation is blistering.

dragontamer4y ago

I don't know. I prefer it when companies actually give the technical details.

> While we are still optimizing these instances, it is clear that the Graviton3 is going to deliver amazing performance. In comparison to the Graviton2, the Graviton3 will deliver up to 25% more compute performance and up to twice as much floating point & cryptographic performance. On the machine learning side, Graviton3 includes support for bfloat16 data and will be able to deliver up to 3x better performance.

This means nothing to me. Why is there more floating point and cryptographic performance? Did Amazon change the Neoverse core? Is this N1 cores still? Did they tweak the L1 caches?

I don't think Amazon has the ability to change the core design unfortunately. This suggests to me that maybe Amazon is using N2 cores now?

But it'd be better if Amazon actually said what the core design changes are. Even just saying "updated to Neoverse N2" would go a long way to our collective understanding.

talawahtech4y ago

AWS re:Invent is this week. This was announced as part of the CEO's keynote. I am sure we will get more details throughout the week in some of the more technical sessions.

ksec4y ago

>Why is there more floating point and cryptographic performance?

You can infer this to N2 which ARM gave their own results [1], N2 uses SVE2 256bit.

[1] https://community.arm.com/arm-community-blogs/b/architecture...

1 more reply

Rafuino4y ago

Aren't the Zen3 instances still faster than Graviton 3? DDR5 is interesting, and while lower power is nice, the customers don't benefit from that much, mostly AWS itself with its power bill. I haven't seen pricing yet, but assume AWS will price their own stuff lower to win customers and create further lock-in opportunities (and even take a loss like with Alexa).

tyingq4y ago

>Aren't the Zen3 instances still faster

I would guess price/performance matters more than peak performance for a lot of use cases. With prior Graviton releases, AWS has made it so they are better price/performance. Keep in mind that a vCPU on Graviton is a full core rather than SMT/Hyperthread (half a core).

inkyoto4y ago

> Aren't the Zen3 instances still faster than Graviton 3?

Irrelevant.

The vast majority of applications running in the cloud are business applications that struggle to saturate the CPU and waste most of the CPU cycles idlying by in epoll/select loops. Unless you need HPC, you do not need the fastest CPU, either.

> create further lock-in opportunities

Don't like AWS/Graviton? Take your workload to the Oracle cloud and run it on Oracle ARM.

Don't like ARM? If your app is interpreted/JIT'd (e.g. Python/NodeJs) or byte code compiled (JVM), lift and shift it to the IBM cloud and run it on a POWER cloud instance – as long as IBM offers a price-performance ratio comparable to that of AWS/Graviton or you are willing to pay for it.

2 more replies

staticassertion4y ago

How does Graviton create lock-in? It's ARM.

3 more replies

ksec4y ago

Where is this narrative that Zen 3 instances are faster than Graviton 3 instances coming from?

I am pretty sure Zen 3 doesn't bring 25% ST performance improvement compared to Zen 2.

2 more replies

b9a2cab54y ago

On a per thread basis - yes. On a per dollar basis on AWS - no. Per instance basis - depends a lot.

thinkafterbef4y ago

Nice wording with 'per core performance'. We had difficulties properly conveying this point in our product when comparing our CI runners[1] to GitHub Actions CI Runner. I will be using it in our next website update. Tack

[1]https://buildjet.com/for-github-actions

wayoutthere4y ago

Honestly, they’re not innovating so much as forcing a product market fit that everyone knew existed, but didn’t have the business case to develop without a hyperscalar anchor customer (like AWS!)

If anything, this is just another data point that shows how truly commoditized tech is. I just worry what happens when Amazon decides to “differentiate” after they lock you in.

amelius4y ago· 11 in thread

I really hate it that big companies are rolling their own CPU now. Soon, you're not a serious developer if you don't have your own CPU. And everybody is stuck in some walled garden.

I mean, it's great that the threshold to produce ICs is now lower, but is this really the way forward? Shouldn't we have separate CPU companies, so that everybody can benefit from progress, not only the mega corporations?

ksec4y ago

>I really hate it that big companies are rolling their own CPU now. Soon, you're not a serious developer if you don't have your own CPU. And everybody is stuck in some walled garden.

It is still just ARM. You can buy ARM chip everywhere. There is no walled garden.

> Shouldn't we have separate CPU companies, so that everybody can benefit from progress,

You are benefiting the same CPU design from ARM, and same Fab improvement from TSMC. Amortised across the whole industry. Doesn't get any better than that.

amelius4y ago

> It is still just ARM. You can buy ARM chip everywhere.

Only large companies can build CPUs based on ARM. Also, now companies might rely on everything in vanilla ARM, but soon they will be adding parts of their own ISA, improvements to the memory hierarchy, a GPU, or perhaps even their own management engine to keep an eye on things or to keep things locked down.

> There is no walled garden.

There is huge potential for walled gardens, just look at Apple.

3 more replies

glogla4y ago

ARM in general is available. But good ARM is only in proprietary walled gardens.

Compare RaspberryPi or Snapdragon with Apple M1.

2 more replies

minedwiz4y ago

I mean, it's just ARM - pretty standard architecture these days. If the big companies want to compete on chip design, I don't see it as all that different from AMD, Intel (and Via if you count them) competing on x86-compatibles.

zamadatix4y ago

AMD/Intel/Via/IBM/ARM are in horizontal competition on chip design, Amazon/Google/Microsoft/Apple are in vertical competition on chip design. Vertical competition typically results in far less ability for the market to optimize.

Compare for instance the Zen 3 upset vs the M1 upset. Zen 3 allowed the market to pick what they thought was the best CPU, the M1 allowed the market to pick if they wanted to buy an entire computer, OS, and software set because the CPU was good. Similarly with Graviton and Amazon, you can't just say Amazon is competing the same as Via, their interest is in selling the AWS ecosystem not in providing the best individual components. Same with Google and their custom chips and Microsoft with theirs now. Yes many are "just ARM" but due to custom extensions/chips and (in some cases) lack of standard ARM features that doesn't mean they are the same ARM.

Of course that's not to argue it's wrong because it's vertical integration, many will think that's the better way to make complicated products, but that's not the point - the way big companies are competing on chip design is very different than if one acted like an AMD/Intel/Via competitor to actually compete in the chip space instead of a larger space.

1 more reply

lizthegrey4y ago

You can get Ampere chips which are not proprietary to any specific cloud provider. Both Packet/Equinix Metal and Oracle use them.

zokier4y ago

I do point out that vendor-specific architectures were the norm for long time in the history and the sky didn't fall down. The x86 dominance was relatively short anomaly more than anything.

amelius4y ago

The sky didn't fall down because business folks didn't yet figure out how to build the most effective walled gardens.

1 more reply

Terry_Roll4y ago

Microcode is your friend. https://hackaday.com/2017/12/28/34c3-hacking-into-a-cpus-mic...

Blobs can also be reverse engineered.

amelius4y ago

Until they start using encryption, which is relatively easy.

pjmlp4y ago

Lets not pretend using a Z80, 6502 or 68000 made the remaining hardware differences go away.

qbasic_forever4y ago· 8 in thread

Where are Google and Azure with ARM instances? It's been nothing but crickets for years now... this is starting to get silly that their customers can't at least start getting workloads on a different architecture, nevermind get better performance per dollar, etc. too. The silence is deafening.

ksec4y ago

>The silence is deafening

Remember Amazon brought Annapurna Labs in 2015. And only released their first Graviton instances in 2018. The lead time for a Server CPU product is at least a year even when you have blueprints. That is ignoring fab capacity booking and many other things like testing. And without scale ( AWS is bigger than GCP and Azure combined ) it is hard to gain competitive advantage ( which often delays management decision making ).

I think you should see Azure and GCP ARM offering in late 2022. Marvel's exit statement on ARM server SoC pretty much all but confirmed Google and Microsoft are working on their own ARM offering.

pdelgallego4y ago

And if I recall correctly, AWS needed to develop Nitro before offering Graviton.

baybal24y ago

> And without scale ( AWS is bigger than GCP and Azure combined )

Azure cloud is bigger than AWS

https://cloudwars.co/microsoft/microsoft-q2-cloud-revenue-st...

In terms of physical servers. I believe Chinese cloud providers, and their monster CNDs (China has terribly slow Internet, and superlocal CDNs are the only way out there) overtook AWS quite a few years ago.

2 more replies

baybal24y ago

> Where are Google and Azure with ARM instances?

I think they are bound by long term supply agreements with Intel. They will just bargain for better prices with Intel.

Not an easy task it will be, given that Intel is capacity jammed.

MangoCoffee4y ago

https://interestingengineering.com/microsoft-will-build-adva...

"The second phase will see these entities develop custom integrated chips and System on a Chip (SoC) with "lower power consumption, improved performance, reduced physical size, and improved reliability for application in DoD systems."

maybe its coming?

lostmsu4y ago

Funnily, IBM offers ARM instances. Even on free tier.

Uehreka4y ago

Doesn’t IBM also offer a bunch of weird architectures that can’t be found anywhere else? One day I was looking up some old PowerPC and s390 architectures that are supported by a lot of docker images, trying to figure out why anyone would want them, and it appears the answer is that they’re used in IBM mainframes.

1 more reply

tyingq4y ago

Same for Oracle.

1 more reply

aclelland4y ago· 4 in thread

So that's the c6g and c7g but x86 instances are still on c5. Will AWS ever release an x86 computing instance again or is this just a sign that x86 has reached peak performance on AWS?

pella4y ago

(29 NOV 2021) "New – Amazon EC2 M6a Instances Powered By 3rd Gen AMD EPYC Processors"

https://aws.amazon.com/blogs/aws/new-amazon-ec2-m6a-instance...

"Up to 35 percent higher price performance per vCPU versus comparable M5a instances, up to 50 Gbps of networking speed, and up to 40 Gbps bandwidth of Amazon EBS, more than twice that of M5a instances."

"Larger instance size with 48xlarge with up to 192 vCPUs and 768 GiB of memory, enabling you to consolidate more workloads on a single instance. M6a also offers Elastic Fabric Adapter (EFA) support for workloads that benefit from lower network latency and highly scalable inter-node communication, such as HPC and video processing."

"Always-on memory encryption and support for new AVX2 instructions for accelerating encryption and decryption algorithms"

aclelland4y ago

Missed this one too. Awesome, thanks.

messe4y ago

They did. A month ago.

https://aws.amazon.com/about-aws/whats-new/2021/10/amazon-ec...

aclelland4y ago

I missed this. Thanks!

ksec4y ago· 3 in thread

>Graviton3 will deliver up to 25% more compute performance and up to twice as much floating point & cryptographic performance. On the machine learning side, Graviton3 includes support for bfloat16 data and will be able to deliver up to 3x better performance.

>First in the cloud industry to be equipped with DDR5 memory.

Quite hard to tell whether this is Neoverse V1 or N2. Since the description fits both . But this SVE extensions will move a lot of workload that previously wont suitable for Graviton 2

Edit: Judging from Double floating point performance it should be N2 with SVE2. Which also means Graviton 3 will be ARMv9 and on 5nm. No wonder why TSMC doubled their 5nm expansion spending. It will be interesting to see how they price G3 and G2. And much lowered priced G2 instances will be very attractive.

Uehreka4y ago

I’m going to look up what SVE extensions are, but before I do, how much work (as a proportion of all work done on EC2) couldn’t be done on G2? I generally go off the assumption that most EC2 instances are hosting web servers and database servers, along with a handful, relatively, of CI servers and perhaps a sprinkling of video transcoders, 3D renderers and ML trainers. How much of that work can’t be done with the operations supported by G2? Is it just the long tail?

ksec4y ago

>I’m going to look up what SVE extensions are,

SIMD instructions, basically in the Intel x86 world that is like SSE4.

> Can’t be done on G2

Probably close to zero? Assuming your code compiles and run on ARM. It is just a matter of whether that operation is fast or slow, or in AWS terms whether it is cost effective since those EC2 instances are priced differently. And that cost includes porting and testing your software on ARM. For a lot of Web Server workload, G2 nearly offer 50% reduction in cost at the same or better performance. At the scale of twitter it absolutely makes sense to move those operation over. There are some workloads that dont like well things like 3D Renderers, or software that has too many x86 specific optimisation and takes too. much man power to port. So yes in that sense it will be a long tail of x86 instances. ( Assuming that is what you are referring to long tail )

ksec4y ago

Cant edit it now, It should be V1, not N2.

Alex39174y ago· 3 in thread

How long until we get T5g with Graviton3?

croddin4y ago

Graviton2 was announced at re:invent 2019 and t4g came out in September 2020 so my guess is we will see t5g instances by September 2022.

shaicoleman4y ago

They don't always update the T series instances for every generation, so I wouldn't hold my breath.

mwcampbell4y ago

If I'm not mistaken, they have updated the T series for every generation since the introduction of their Nitro virtualization.

haukem4y ago· 2 in thread

Does Graviton3 use the Neoverse V1? The Graviton2 used the Neoverse N1.

The features listed here match the core: https://developer.arm.com/ip-products/processors/neoverse/ne...

The N2 misses the bfloat, but it could be that the ARM marketing named it differently: https://developer.arm.com/ip-products/processors/neoverse/ne...

dragontamer4y ago

N2 has BFloat16 instructions. EDIT: I probably should have a citation: https://developer.arm.com/documentation/PJDOC-466751330-1825...

Page 50 of 92 shows off BFCVTN, BFDOT, BFMMLA (matrix multiply and accumulate), BFCVT, and other BF16 instructions on the N2.

I'd assume this Graviton 3 is a N2 core. But that's just me assuming.

haukem4y ago

Thanks for the document, so the ARM marketing just confused me. ;-)

Yes N2 is more likely than V1. N2 has the better PPA ratio. Own CPU core is very unlikely as I am not aware of any rumors which we would have notice before.

N2 also supports ARMv9 which is nice.

2-718-281-8284y ago· 2 in thread

are ARM (Advanced RISC Machine) and Arm the same?

count4y ago

Yes.

2-718-281-8284y ago

Why does Amz write it as "Arm" when it is an abbreviation?

1 more reply

lostmsu4y ago· 1 in thread

Any benchmarks? I'd like to see Geekbench 5 results from a full-sized one socket instance.

lizthegrey4y ago

No benchmarks yet, but I can get 30% faster latency, on 2/3 the number of instances compared to c6g for the same uncompress/compress/write to Kafka workload.

sabujp4y ago· 1 in thread

yeah signed stack pointers is a thing since at least 2017 : https://lwn.net/Articles/718888/

sydthrowaway4y ago

aka arm64e

kloch4y ago

DDR5? Can't wait to see how the memory bandwidth performs. I have a project that is memory bandwidth limited on c6gd instances.

givemeethekeys4y ago

Side-note: Polly's intonation makes for a fairly confusing listening session.

jfbaro4y ago

This seems to be a good fit Lambda as well. Looking forward to seeing more information about the CHIP design and capabilities.

j / k navigate · click thread line to collapse

110 comments

72 comments · 14 top-level

ghshephard4y ago· 12 in thread

So - for those deeper into security - is this useful?

spijdar4y ago

(it's not impossible to bypass, I'm vaguely aware it's been done on Apple's new chips that implement a similar (the same?) ARM extension, but there's no perfect security)

pram4y ago

Yup arm64e in general has pointer authentication, iOS and MacOS already implement it.

1 more reply

Accujack4y ago

Performance is what I wonder about. The idea sounds good, but what crypto scheme can perform encryption of a signature both securely and fast enough to keep up with every pointer pushed on the stack?

What's the trade-off?

1 more reply

ComputerGuru4y ago

These slides should help: https://llvm.org/devmtg/2019-10/slides/McCall-Bougacha-arm64...

saagarjha4y ago

Note that those slides are slightly LLVM/Apple biased.

ljhsiung4y ago

Here are a couple "real world" examples--

Project Zero had a blogpost about some of the weaknesses on the original Pointer Auth spec [0], and even had a follow up [1].

[0]: https://googleprojectzero.blogspot.com/2019/02/examining-poi...

[1]: https://bazad.github.io/presentations/BlackHat-USA-2020-iOS_...

[2]: https://blog.ret2.io/2021/06/16/intro-to-pac-arm64/

[3]: https://developer.arm.com/documentation/102433/0100/Applying...

my1234y ago

> In fact, I'm rather annoyed Amazon didn't include this feature on Graviton2

Arm-designed licensable cores didn't have it back then, and that's what AWS uses.

Graviton2 used the Neoverse N1 core.

pjmlp4y ago

So far, I think Solaris SPARC ADI is the more mature version of it on use, however given it is Solaris SPARC, not many are aware of it.

MaxBarraclough4y ago

This isn't something I know a lot about but it sounds like the idea of a shadow stack [0] but implemented with crypto. See also Intel CET (for Control-flow Enforcement Technology). [1]

[0] https://en.wikipedia.org/wiki/Shadow_stack

[1] https://www.intel.com/content/www/us/en/developer/articles/t...

tyingq4y ago

staticassertion4y ago

I've heard very promising things about pointer authentication.

talawahtech4y ago· 11 in thread

I was wondering how they were going to manage the fact that AMDs Zen3 based instances would likely be faster than Graviton2. Color me impressed. AWS' pace of innovation is blistering.

dragontamer4y ago

I don't know. I prefer it when companies actually give the technical details.

This means nothing to me. Why is there more floating point and cryptographic performance? Did Amazon change the Neoverse core? Is this N1 cores still? Did they tweak the L1 caches?

I don't think Amazon has the ability to change the core design unfortunately. This suggests to me that maybe Amazon is using N2 cores now?

But it'd be better if Amazon actually said what the core design changes are. Even just saying "updated to Neoverse N2" would go a long way to our collective understanding.

talawahtech4y ago

AWS re:Invent is this week. This was announced as part of the CEO's keynote. I am sure we will get more details throughout the week in some of the more technical sessions.

ksec4y ago

>Why is there more floating point and cryptographic performance?

You can infer this to N2 which ARM gave their own results [1], N2 uses SVE2 256bit.

[1] https://community.arm.com/arm-community-blogs/b/architecture...

1 more reply

Rafuino4y ago

tyingq4y ago

>Aren't the Zen3 instances still faster

inkyoto4y ago

> Aren't the Zen3 instances still faster than Graviton 3?

Irrelevant.

> create further lock-in opportunities

Don't like AWS/Graviton? Take your workload to the Oracle cloud and run it on Oracle ARM.

2 more replies

staticassertion4y ago

How does Graviton create lock-in? It's ARM.

3 more replies

ksec4y ago

Where is this narrative that Zen 3 instances are faster than Graviton 3 instances coming from?

I am pretty sure Zen 3 doesn't bring 25% ST performance improvement compared to Zen 2.

2 more replies

b9a2cab54y ago

On a per thread basis - yes. On a per dollar basis on AWS - no. Per instance basis - depends a lot.

thinkafterbef4y ago

[1]https://buildjet.com/for-github-actions

wayoutthere4y ago

Honestly, they’re not innovating so much as forcing a product market fit that everyone knew existed, but didn’t have the business case to develop without a hyperscalar anchor customer (like AWS!)

If anything, this is just another data point that shows how truly commoditized tech is. I just worry what happens when Amazon decides to “differentiate” after they lock you in.

amelius4y ago· 11 in thread

I really hate it that big companies are rolling their own CPU now. Soon, you're not a serious developer if you don't have your own CPU. And everybody is stuck in some walled garden.

ksec4y ago

>I really hate it that big companies are rolling their own CPU now. Soon, you're not a serious developer if you don't have your own CPU. And everybody is stuck in some walled garden.

It is still just ARM. You can buy ARM chip everywhere. There is no walled garden.

> Shouldn't we have separate CPU companies, so that everybody can benefit from progress,

You are benefiting the same CPU design from ARM, and same Fab improvement from TSMC. Amortised across the whole industry. Doesn't get any better than that.

amelius4y ago

> It is still just ARM. You can buy ARM chip everywhere.

> There is no walled garden.

There is huge potential for walled gardens, just look at Apple.

3 more replies

glogla4y ago

ARM in general is available. But good ARM is only in proprietary walled gardens.

Compare RaspberryPi or Snapdragon with Apple M1.

2 more replies

minedwiz4y ago

zamadatix4y ago

1 more reply

lizthegrey4y ago

You can get Ampere chips which are not proprietary to any specific cloud provider. Both Packet/Equinix Metal and Oracle use them.

zokier4y ago

I do point out that vendor-specific architectures were the norm for long time in the history and the sky didn't fall down. The x86 dominance was relatively short anomaly more than anything.

amelius4y ago

The sky didn't fall down because business folks didn't yet figure out how to build the most effective walled gardens.

1 more reply

Terry_Roll4y ago

Microcode is your friend. https://hackaday.com/2017/12/28/34c3-hacking-into-a-cpus-mic...

Blobs can also be reverse engineered.

amelius4y ago

Until they start using encryption, which is relatively easy.

pjmlp4y ago

Lets not pretend using a Z80, 6502 or 68000 made the remaining hardware differences go away.

qbasic_forever4y ago· 8 in thread

ksec4y ago

>The silence is deafening

I think you should see Azure and GCP ARM offering in late 2022. Marvel's exit statement on ARM server SoC pretty much all but confirmed Google and Microsoft are working on their own ARM offering.

pdelgallego4y ago

And if I recall correctly, AWS needed to develop Nitro before offering Graviton.

baybal24y ago

> And without scale ( AWS is bigger than GCP and Azure combined )

Azure cloud is bigger than AWS

https://cloudwars.co/microsoft/microsoft-q2-cloud-revenue-st...

2 more replies

baybal24y ago

> Where are Google and Azure with ARM instances?

I think they are bound by long term supply agreements with Intel. They will just bargain for better prices with Intel.

Not an easy task it will be, given that Intel is capacity jammed.

MangoCoffee4y ago

https://interestingengineering.com/microsoft-will-build-adva...

maybe its coming?

lostmsu4y ago

Funnily, IBM offers ARM instances. Even on free tier.

Uehreka4y ago

1 more reply

tyingq4y ago

Same for Oracle.

1 more reply

aclelland4y ago· 4 in thread

So that's the c6g and c7g but x86 instances are still on c5. Will AWS ever release an x86 computing instance again or is this just a sign that x86 has reached peak performance on AWS?

pella4y ago

(29 NOV 2021) "New – Amazon EC2 M6a Instances Powered By 3rd Gen AMD EPYC Processors"

https://aws.amazon.com/blogs/aws/new-amazon-ec2-m6a-instance...

"Always-on memory encryption and support for new AVX2 instructions for accelerating encryption and decryption algorithms"

aclelland4y ago

Missed this one too. Awesome, thanks.

messe4y ago

They did. A month ago.

https://aws.amazon.com/about-aws/whats-new/2021/10/amazon-ec...

aclelland4y ago

I missed this. Thanks!

ksec4y ago· 3 in thread

>First in the cloud industry to be equipped with DDR5 memory.

Quite hard to tell whether this is Neoverse V1 or N2. Since the description fits both . But this SVE extensions will move a lot of workload that previously wont suitable for Graviton 2

Uehreka4y ago

ksec4y ago

>I’m going to look up what SVE extensions are,

SIMD instructions, basically in the Intel x86 world that is like SSE4.

> Can’t be done on G2

ksec4y ago

Cant edit it now, It should be V1, not N2.

Alex39174y ago· 3 in thread

How long until we get T5g with Graviton3?

croddin4y ago

Graviton2 was announced at re:invent 2019 and t4g came out in September 2020 so my guess is we will see t5g instances by September 2022.

shaicoleman4y ago

They don't always update the T series instances for every generation, so I wouldn't hold my breath.

mwcampbell4y ago

If I'm not mistaken, they have updated the T series for every generation since the introduction of their Nitro virtualization.

haukem4y ago· 2 in thread

Does Graviton3 use the Neoverse V1? The Graviton2 used the Neoverse N1.

The features listed here match the core: https://developer.arm.com/ip-products/processors/neoverse/ne...

The N2 misses the bfloat, but it could be that the ARM marketing named it differently: https://developer.arm.com/ip-products/processors/neoverse/ne...

dragontamer4y ago

N2 has BFloat16 instructions. EDIT: I probably should have a citation: https://developer.arm.com/documentation/PJDOC-466751330-1825...

Page 50 of 92 shows off BFCVTN, BFDOT, BFMMLA (matrix multiply and accumulate), BFCVT, and other BF16 instructions on the N2.

I'd assume this Graviton 3 is a N2 core. But that's just me assuming.

haukem4y ago

Thanks for the document, so the ARM marketing just confused me. ;-)

Yes N2 is more likely than V1. N2 has the better PPA ratio. Own CPU core is very unlikely as I am not aware of any rumors which we would have notice before.

N2 also supports ARMv9 which is nice.

2-718-281-8284y ago· 2 in thread

are ARM (Advanced RISC Machine) and Arm the same?

count4y ago

Yes.

2-718-281-8284y ago

Why does Amz write it as "Arm" when it is an abbreviation?

1 more reply

lostmsu4y ago· 1 in thread

Any benchmarks? I'd like to see Geekbench 5 results from a full-sized one socket instance.

lizthegrey4y ago

No benchmarks yet, but I can get 30% faster latency, on 2/3 the number of instances compared to c6g for the same uncompress/compress/write to Kafka workload.

sabujp4y ago· 1 in thread

yeah signed stack pointers is a thing since at least 2017 : https://lwn.net/Articles/718888/

sydthrowaway4y ago

aka arm64e

kloch4y ago

DDR5? Can't wait to see how the memory bandwidth performs. I have a project that is memory bandwidth limited on c6gd instances.

givemeethekeys4y ago

Side-note: Polly's intonation makes for a fairly confusing listening session.

jfbaro4y ago

This seems to be a good fit Lambda as well. Looking forward to seeing more information about the CHIP design and capabilities.

j / k navigate · click thread line to collapse