AWS Graviton Weekly # 144: re:Invent 2025 Edition


Issue # 144: November 28, 2025, to December 12, 2025

Brought to you by CAST AI

Hey Reader.

Welcome to Issue # 144 of AWS Graviton Weekly, which covers everything related to AWS Silicon that happened over the past two weeks: from November 28, 2025, to December 12, 2025.

After some crazy weeks, I let the dust settle, and now I can share the most interesting announcements, talks, and resources from AWS re:Invent 2025.

Every year, thousands of people from all over the world gather in Las Vegas to get the most exciting news from the AWS crew and AWS partners in one place.

There is a lot to review and discuss, but my job here is to give you a condensed version of it all.

So, enjoy, and if you went to the event, share your thoughts in a reply. I'd love to hear about your experiences.


Recommendation of the week: CAST AI: 2025 GPU Price Report

Discover the availability and pricing of A100 and H100 GPU-based compute instances across AWS, Azure, and Google Cloud Platform.

AWS announces Graviton5

For me, this was the most exciting announcement here: Graviton5. Why? The features:

  • 192 cores
  • 5x larger L3 cache
  • Up to 25% higher performance than the previous generation
  • The Nitro Isolation Engine, an enhancement to the Nitro System that uses formal verification to provide mathematical certainty that your workloads are isolated from each other and from AWS operators

I expect this new generation to bring more enterprise customers to AWS, especially for compute-intensive workloads. We will keep monitoring the use cases driving its adoption, so stay tuned.

Graviton5 is available in the M9g instances.

Read more here:

https://www.aboutamazon.com/news/aws/aws-graviton-5-cpu-amazon-ec2

https://aws.amazon.com/ec2/instance-types/m9g/

Trainium3 UltraServers are out

This was another announcement that could be very beneficial for companies doing compute-intensive AI work. Why? Just read what AWS shared in the announcement posts:

Trn3 UltraServers scale up to 144 Trainium3 chips, delivering up to 362 FP8 PFLOPs with 4x lower latency to train larger models faster and serve inference at scale.

Crazy scale, right? Another aspect that pleasantly surprised me was the focus on lowering costs as well:

Customers are already seeing significant value from Trainium, with companies like Anthropic, Karakuri, Metagenomi, NetoAI, Ricoh, and Splash Music reducing their training costs by up to 50% compared to alternatives. Amazon Bedrock, AWS's managed service for foundation models, is already serving production workloads on Trainium3, demonstrating the chip's readiness for enterprise-scale deployment.

If you have worked with this kind of system, you know it can get very expensive, so seeing AWS think hard about making AI operations more efficient is very refreshing.

Amazon EC2 X8aedz instances powered by 5th Gen AMD EPYC processors

AWS and AMD keep collaborating on many fronts, and this is one of my favorites. The AMD EPYC series has been a huge success for AMD, and AWS knows the power behind these chips, so this new generation had to be part of the AWS ecosystem.

The specs for these instances are very, very interesting.

Read more:

https://aws.amazon.com/blogs/aws/introducing-amazon-ec2-x8aedz-instances-powered-by-5th-gen-amd-epyc-processors-for-memory-intensive-workloads/


VIDEOS

Of the videos and talks from the event, these were the ones I found most interesting (at least for me)

That's it for today.

See ya next week for the last issue of the year.

Have a great weekend.


Marcos Ortiz

I'm a Data Engineer by day at Riot Games (via X-Team), and by night I curate the latest news, product announcements, and resources about AWS Silicon (AWS Graviton, Nitro, Inferentia, and Trainium).
