AWS Graviton Weekly # 82


Issue # 82: April 5, 2024 to April 12, 2024

Hey Reader.

Welcome to Issue # 82 of AWS Graviton Weekly, which will be focused on sharing everything that happened in the past week related to AWS Silicon: from April 5, 2024, to April 12, 2024.

This week has been very quiet related to AWS Silicon because most headlines were occupied by Google Next, especially the new Arm-based CPU called Axion, which looks very interesting and promising.

But one thing I noticed was that more and more times we will see AWS Silicon present in Amazon's shareholders' letters by Andy Jassy

"Shifting to AWS, we started 2023 seeing substantial cost optimization, with most companies trying to save money in an uncertain economy. Much of this optimization was catalyzed by AWS helping customers use the cloud more efficiently and leverage more powerful, price-performant AWS capabilities like Graviton chips (our generalized CPU chips that provide up to ~40% better price-performance than other leading x86 processors)".

"The past year was also a significant delivery year for AWS. We announced our next generation of generalized CPU chips (Graviton4), which provides up to 30% better compute performance and 75% more memory bandwidth than its already-leading predecessor (Graviton3)."

Arm is becoming ubiquitous to cloud-focused CPUs. The future looks bright for them and the whole ecosystem.

This is why I'm investing a lot of time creating an amazing Notion-based content library focused on ARM custom chips on the cloud (Graviton, AmpereOne, Trainium, Inferentia, Cobalt, and now Axion).

It hasn't been easy, dedicating more than 780 hours to curate, review, organize, and tag more than 2,500 resources related to ARM custom chips.

But unfortunately, it's not ready yet.

A sneak peek?

Stay tuned for the next week and you will be the first to know.

Back to biz


Tool of the week: CAST AI

If you are using Kubernetes on the cloud (AWS, GCP, or Azure), according to a new report from CAST AI, perhaps you are overspending on Kubernetes; and right now, every penny overspent; it's a penny that you can't use to what truly matters: growing your company and services.

CAST AI is the "magic wand" your company needs to save up to 60 percent of your Kubernetes-related cloud bills. Believe me: you will thank me later.


NEWS


ARTICLES AND TUTORIALS


Newsletter recommendation of the week

Image for Eugene Yan

Eugene Yan

Applied Scientist @ Amazon, Writer & Speaker, Operator-Angel

I build ML, RecSys, and LLM systems that serve customers at scale, and write about what I learn along the way. Join 7,500+ subscribers!


SLIDES, AUDIO AND VIDEO


EVENTS

Cost Optimization for Compute Workshop
(Monday 15th April 08:00 - 11:00 GMT-5)

At AWS, saving customers money is part of our DNA. It was a founding principle of AWS back in 2006 and continues to shape everything we do, from how we develop products to how we build greater efficiency into our operations. AWS is relentless in inventing new ways to increase performance while optimizing costs and today offers the broadest and deepest functionality for computing.

Speakers:


Berlin AWS User Group Meetup - April 2024

Are you ready for an exciting evening of learning and networking? We're delighted to invite you to our upcoming event on April 16th, for another great AWS User Group session in Berlin. The session is sponsored by Rackspace Technology and will feature a great talk, followed by an AMA.

19:00 - 19:45 - Cristian Măgherușan-Stanciu // RDS and ElastiCache Optimization Strategies

In this session, Cristian will share how you can cut costs and reduce the carbon footprint of your Amazon RDS and ElastiCache clusters without giving up on performance.

We'll cover everything from choosing the right compute options — like selecting the best instance types, including Graviton where supported, good metrics for rightsizing, etc. — to picking the best EBS volume types for database storage.

We'll also cover scenarios where it makes sense to use the available Serverless options for RDS and Elasticache.


From the ARM Ecosystem

Marcos Ortiz

I'm a Data Engineer by day at Riot Games (via X-Team ) and by night, I curate the last news/product announcements/resources about AWS Silicon (Graviton, AWS Nitro, Inferentia, and Trainium).

Read more from Marcos Ortiz
AWS Graviton Weekly # 97: How Amazon’s New CPU Fights Cybersecurity Threats?

Issue # 97: July 19, 2024 to July 26, 2024 Hey Reader. Welcome to Issue # 97 of AWS Graviton Weekly, which will be focused on sharing everything that happened in the past week related to AWS Silicon: from July 19, 2024 to July 26, 2024. Enjoy. Recommendation of the week: CAST.AI NEWS AWS Step Functions now supports Customer Managed Keys Llama 3.1 models from Meta are now available on AWS, offering more options for building generative AI applications AWS Lambda now supports Amazon MQ for...

Issue # 96: July 11, 2024 to July 19, 2024 Hey Reader. Welcome to Issue # 96 of AWS Graviton Weekly, which will be focused on sharing everything that happened in the past week related to AWS Silicon: from July 11, 2024 to July 19, 2024. Before continuing with the regular content: #hugops for CrowdStrike for the Microsoft Windows BSOD issue today. BTW, Cristian Măgherușan-Stanciu worked on some Terraform automation tool to fix this issue on AWS. He is looking for testers. Back to business now...

Issue # 95: July 5, 2024 to July 11, 2024 Hey Reader. Welcome to Issue # 95 of AWS Graviton Weekly, which will be focused on sharing everything that happened in the past week related to AWS Silicon: from July 5, 2024 to July 11, 2024. Enjoy. Recommendation of the week: CAST.AI NEWS Amazon EC2 R8g instances powered by AWS Graviton4 now generally available AWS Neuron introduces Flash Attention kernel enabling high performance and large sequence lengths Announcing availability of AWS Outposts in...