AWS EC2 Instance Types
Overview
AWS has a huge catalog of EC2 Instance Types for you to choose from depending on your needs. These instance types vary by the number of CPU cores, memory, network bandwidth, and so many other factors. Although it may seem daunting to choose the correct instance type, there are a few points you use to make this process easier and more cost effective.
Amazon EC2 Instance
Amazon EC2 is a virtualization service provided by AWS. Amazon EC2 allows you to create virtual machines, known as "EC2 Instances". AWS gives you full control over choosing which operating system to run, and provides a list of pre-configured hardware specifications which are called EC2 Instance Types. We will be taking a look at these EC2 Instance Types in the next section.
EC2 Instance Types
When you are creating an Amazon EC2 Instance, AWS will provide you with a list of hardware configurations (number of CPU cores, memory, disk space, etc.), which are called AWS EC2 Instance Types. As of writing this article, there are over 500 instance types to choose from!
Choosing an EC2 Instance Type is similar to shopping for a laptop. Imagine you are trying to buy a laptop for your college or workplace. You need to consider which Operating System you want to use - Windows or Linux or Mac OS. Then you also need to choose the number of CPU Cores (2, 4, 6 etc.) and amount of Memory/RAM (4 GB, 8 GB etc.). And maybe you want some extra configurations like a Graphics Card/GPU to play games. Once you know what your requirements are you will shop around and pick a laptop that fits your Budget. Let's look at the different AWS EC2 Instance Types and understand how to pick the right instance type as per your needs.
General Purpose EC2 Instance
General Purpose EC2 Instances provide a balanced distribution of CPU cores, memory, and networking resources. These instances can be used for workloads that do not require high utilization of any one particular resource.
M5 Instance
- Covers most "balanced" use cases and can be used for a wide range of applications
- Can be used for small to midsize databases, backend servers
- Examples: m5.large (2 CPU cores, 8 GB of memory), m5.xlarge (4 CPU cores, 16 GB of memory)
T3 / T3a Instance
- Entry-level instances for "simple" use cases
- Can be used for websites, testing/staging environments
- Part of AWS's Free Tier (t3.micro)
- Examples: t3.micro (2 CPU cores, 1 GB of memory), t3.medium (2 CPU cores, 4 GB of memory)
- T3a Instances use AMD EPYC Processors whereas T3 Instances use the standard Intel Xeon Processors
A1 Instance
- Uses AWS's custom ARM Processor Graviton
- Can be used instead of T3/T3a instances when using ARM is a requirement
- Examples: a1.medium (1 CPU core, 2 GB of memory), a1.large (2 CPU cores, 4 GB of memory)
Compute Optimized
Computer OptimizedEC2Instances provide a higher number of CPU cores compared to other resources. These instances can be used for workloads that require high compute or CPU utilization.
C5 / C5n / C5a Instance
- Cost-effective, high performance for running compute-intense workloads
- Can be used for batch processing, media transcoding, and high performance web servers
- Examples: c5.xlarge (4 cores, 8 GB of memory), c5.4xlarge (16 cores, 32 GB of memory)
- C5n provide increased memory and network bandwidth (up to 100 Gbps) compared to their C5 counterparts
- C5a Instances use AMD EPYC Processors whereas C5/C5n Instances use the standard Intel Xeon Processors C6g Instance
- Uses AWS's custom ARM Processor Graviton2
- Can be used instead of C5 instances when using ARM is a requirement - also provide 40% better price to performance compared to C5 instances
- Examples: c6g.xlarge (4 cores, 8 GB of memory), c6g.4xlarge (16 cores, 32 GB of memory)
Memory Optimized
Memory Optimized EC2 Instances provide a higher allocation of memory compared to other resources. These instances can be used for workloads that require high memory or RAM utilization.
R5 / R5n / R5a Instance
- Designed to deliver fast performance for workloads that process large data sets in memory
- Can be used for different types of databases - relational (MySQL), NoSQL (MongoDB, Cassandra), or in-memory (Redis)
- Also can be used for real time processing of big unstructured data along with Apache Hadoop/Spark
- Examples: r5.xlarge (4 cores, 32 GB of memory), r5.4xlarge (16 cores, 128 GB of memory)
- R5n boosts the available network bandwidth to up to 100 Gbps and can be used where both memory and network performance is critical
- R5a Instances use AMD EPYC Processors whereas R5/R5n Instances use the standard Intel Xeon Processors
R6g Instance
- Uses AWS's custom ARM Processor Graviton2
- Can be used instead of R5 instances when using ARM is a requirement - also provide 40% better price to performance compared to R5 instances
- Examples: r6g.xlarge (4 cores, 32 GB of memory), r6g.4xlarge (16 cores, 128 GB of memory)
X1 / X1e Instance
- Instances offering up to a whopping 3904 GB of memory
- Can be used for large-scale enterprise applications and in-memory databases like SAP HANA
- Examples: x1.32xlarge (128 cores, 1952 GB of memory), x1e.32xlarge (128 cores, 3904 GB of memory)
Accelerated Computing
Accelerated Computing EC2 Instances are a unique set of instance types that use specialized hardware accelerators or co-processors. These instances are used for a wide range of operations like graphics processing, machine learning, or high performance computing.
P3 / P2 Instance
- Designed for general purpose GPU computing using the CUDA or OpenCL programming models or through a machine learning framework.
- Use NVIDIA Tesla V100 GPUs (P3) and NVIDIA K80 GPUs (P2) which provide powerful half, single, and double-precision floating-point capabilities
- Can be used for high performance computing, computational fluid dynamics, and autonomous vehicles
- Examples: p3.16xlarge (64 CPU cores, 488 GB of memory, 8 GPU cores, 128 GB of GPU memory)
G4 / G3 Instance
- Offer high performance for graphics-intensive applications such as remote workstations, video rendering, and deep learning models.
- Feature up to 8 NVIDIA T4 Tensor Core GPUs
- Can be used for high fidelity cloud gaming, GPU intensive video rendering, and training for deep learning models
- Examples: g3.16xlarge (64 CPU cores, 488 GB of memory, 4 GPU cores, 32 GB of GPU memory)
Inf1 Instance
- These instances are designed to accelerate machine learning using AWS Inferentia, a custom AI/ML chip from Amazon that provides high performance and low latency machine learning inference.
- Can be used for deploying deep learning models, image/video analysis, and recommendation engines
- Deliver up to 2.3x higher throughput and up to 70% lower cost per inference than comparable current generation GPU-based Amazon EC2 instances
- Examples: inf1.6xlarge (24 CPU cores, 48 GB of memory, 4 Inferentia Chips)
F1 Instance
- Provide customizable hardware acceleration with Field Programmable Gate Arrays (FPGAs), which are required for highly parallel operations not suited for general purpose CPUs
- F1 instances use Xilinx UltraScale+ VU9P FPGA, which each has approximately 2.5 million logic elements
- Can be used for genomics and financial analysis
- Examples: f1.16xlarge (64 CPU cores, 976 GB of memory, 8 FPGAs)
Storage Optimized
Storage Optimized EC2 Instances provide a higher sequential read/write access to local storage compared to other resources. These instances can be used for workloads that depend on highly performant read/write operations with large storage devices.
I3 / I3en Instance
- Designed for workloads requiring high read/write transactions and low latency, up to 60 of NVMe SSD storage
- These instances typically include a Non-Volatile Memory Express (NVMe) SSD-based storage, attached as a secondary storage volume
- Can be used to host in-memory databases like Aerospike or Redis
- Also can be used as a cache or a distributed file system
- Examples: i3.xlarge (2 cores, ~16 GB of memory, 475 GB NVMe SSD), i3.2xlarge (4 cores, ~32 GB of memory, 1900 GB NVMe SSD)
- I3en instances come with up to 100 Gbps networking bandwidth
D2 Instance
- Specialized for workloads that require massive scale while at the same time being distributed, up to 48 TB of HDD storage
- Can be used for large data lakes or distributed file systems (HDFS)
- Examples: d2.8xlarge (36 cores, 244 GB of memory, 48 TB of HDD)
H1 instance
- For workloads that require low cost, high disk throughput to very large data sets - offers the best price/performance for instances with HDDs
- Can be used instead of I3/D3 instances where pricing is a higher priority than performance
- Examples: h1.2xlarge (8 cores, 32 GB of memory, 2000 GB HDD), h1.8xlarge (32 cores, 128 GB of memory, 8000 GB HDD).
Choosing EC2 Instance Types
Choosing an EC2 Instance Type comes down to two steps, which have been outlined below.
1) Make a Selection As Per Your Requirements
The first step in choosing the appropriate EC2 Instance Type is listing down your hardware requirements:
- Number of CPU cores
- Amount of Memory (RAM)
- Network Bandwidth
- Is any extra Storage required? SSD or HDD?
- Are any co-processors required? GPU or FPGAs?
Then pick the EC2 Instance Type from the family of instance types provided by AWS. You can first pick the Type based on your overall needs, and then choose a Sub-Type that is closest to the requirements you have outlined. AWS provides a comprehensive list of all the available instance types here.
Don't worry if your choice is the most cost effective or not, we will take care of that in the next section.
AWS provides over 500 different instance types! Here's a quick reference to the different AWS EC2 Instance Types we discussed earlier:
Type | Hardware Characteristics | Sub-Types |
---|---|---|
General Purpose | Balanced distribution of hardware | T3, M5 |
Compute Optimized | Higher distribution of CPU cores | C5, C6g |
Memory Optimized | Higher distribution of Memory | R5, X1 |
Storage Optimized | Includes high throughput storage | I3, D3 |
Accelerated Computing | Includes co-processors like GPUs | G3, Inf1 |
2) Apply Cost Optimizations With The Help of AWS
Now that you have chosen your instance type, let's see how you can optimize your choice to be cost effective and reduce your monthly bill by the most amount. All of these optimizations can be done within the AWS Console, so you don't need to depend on external services to save costs.
AWS Cost Optimizer AWS Cost Optimizer is a service provided by AWS that provides recommendations based on resource utilization. This service observes the usage patterns of an EC2 instance and provides recommendations on alternate instance types that perform similarly for a lower cost.
If you have over-provisioned your EC2 instance(s), you can change the instance type using the AWS Console. Be careful of the caveats of changing instance types though!
AWS Pricing Models AWS provides multiple purchasing options to enable you to optimize your costs.
By default, you pay for every second the moment you launch an EC2 instance, which is known as On-Demand Pricing. But if you know your application's usage patterns you can use the below alternate pricing models to save on costs.
Savings Plans: Reduce your Amazon EC2 costs by making a commitment to a consistent amount of usage, in USD per hour, for a term of 1 or 3 years.
Reserved Instances: Reduce your Amazon EC2 costs by making a commitment to a consistent instance configuration, including instance type and Region, for a term of 1 or 3 years.
Spot Instances: Request unused EC2 instances, which can reduce your Amazon EC2 costs significantly. Spot instances offer the most lucrative discounts, and you can choose to run Spot Instances even for your Production workloads, depending on how your application handles interruptions.
Optimize Storage and Elastic IPs Usage An optimization you can apply is picking only the right storage specification for your EC2 instance's root volume and secondary volume(s).
AWS Elastic Block Storage (EBS) is the storage solution used for EC2 instances. EBS is charged separately and even gets charged when your EC2 instance is shut down! Make sure you have chosen only the required size for your volume(s), and also choose more cost-effective options like "gp3" instead of "gp2".
Another optimization you can apply is de-attaching any Elastic IP Addresses if your EC2 instances do not need to be part of an internet-facing network/subnet. Elastic IP addresses also are charged separately by the hour.
Conclusion
- In this article, we learned about the different AWS EC2 Instance Types, and how to choose the right instance type for your use case.
- Similar to choosing a laptop, you need to make a list of your hardware requirements - like the number of CPU cores, amount of memory, etc.
- Then pick a Type that broadly matches your usage goals from General Purpose Instances, Compute Optimized, Memory Optimized, Storage Optimized, or Accelerated Computing.
- Once you have selected a Type, pick a Sub-Type that is closest to your requirements. AWS provides over 500 of these Sub-Types!
- Finally, we learned how to make cost optimizations using AWS Cost Optimizer, picking a better Pricing Model, or making other optimizations on Storage/Elastic IPs.