AWS Compute Services

AWS Compute Services Overview

Applications require Compute, Storage, and Network resources. AWS offers a comprehensive portfolio of compute services that can be broadly classified into categories based on their operational models.

Service Category	Management Level	Use Case	Learning Curve	Cost Model
IaaS (EC2)	Full server control	Custom applications, legacy systems	Moderate to High	Pay for instance hours
PaaS (Elastic Beanstalk)	Platform managed	Web applications, rapid deployment	Low to Moderate	Pay for underlying resources
Containers (ECS/EKS)	Container orchestration	Microservices, modern applications	Moderate to High	Pay for compute + orchestration
Serverless (Lambda)	Function managed	Event-driven, short-running tasks	Low	Pay per execution

🖥️ IaaS Use Cases

Enterprise Applications: ERP, CRM systems
Legacy Modernization: Lift-and-shift migrations
Development Environments: Custom configurations
High-Performance Computing: Scientific computing
Game Servers: Multiplayer gaming infrastructure

🔧 PaaS Use Cases

Web Applications: Focus on code, not infrastructure
API Development: RESTful services
Rapid Prototyping: Quick deployment cycles
Small to Medium Applications: Standard architectures
Developer Productivity: Reduce operational overhead

📦 Container Use Cases

Microservices Architecture: Service decomposition
DevOps Pipelines: CI/CD automation
Multi-environment Consistency: Dev/test/prod parity
Application Modernization: Containerized legacy apps
Hybrid Cloud: Portable workloads

⚡ Serverless Use Cases

Event Processing: S3 triggers, API Gateway
Data Processing: ETL pipelines
IoT Backends: Device data ingestion
Scheduled Tasks: Cron job replacements
Image/Video Processing: Media transformations

Compute Service Decision Tree

1

Do you need full OS control?

Yes: Consider EC2 (IaaS)
No: Continue to step 2

2

Is your workload event-driven or short-running?

Yes: Consider Lambda (Serverless)
No: Continue to step 3

3

Do you use containers or need microservices?

Yes: Consider ECS/EKS (Containers)
No: Consider Elastic Beanstalk (PaaS)

Amazon EC2 Deep Dive

Amazon Elastic Compute Cloud (EC2) is AWS's foundational Infrastructure as a Service (IaaS) offering, providing resizable virtual compute capacity in the cloud. EC2 uses Type 1 virtualization (bare-metal hypervisor) for optimal performance.

⚠️ Important: EC2-Classic Retirement

As announced in Werner Vogels' blog, EC2-Classic was officially retired on August 15, 2022. All instances now launch in VPC (Virtual Private Cloud) by default, providing enhanced security and networking capabilities.

EC2 Tenancy Models

🏢 Shared Tenancy

Default Option - Most cost-effective

Characteristics

Multiple customers share physical hardware
AWS manages security isolation
Optimal performance for most workloads
Standard pricing model

Best for: General workloads, cost optimization

🔒 Dedicated Instance

Hardware Isolation - Single customer hardware

Characteristics

Runs on dedicated physical hardware
AWS controls instance placement
Higher cost than shared tenancy
Compliance and isolation benefits

Best for: Compliance requirements, data isolation

🖥️ Dedicated Host

Full Server Control - Entire physical server

Characteristics

Full control over instance placement
Socket and core visibility
Bring Your Own License (BYOL) support
Most expensive option

Best for: BYOL, regulatory compliance

⚙️ Bare Metal

Direct Hardware Access - No hypervisor

Characteristics

Direct access to physical hardware
No virtualization overhead
Custom hypervisor support
Specialized use cases

Best for: VMware on AWS, custom solutions

EC2 Instance Components

💾 Amazon Machine Image (AMI)

A template containing the operating system, pre-installed software, and configurations. Think of it as a "golden image" for launching instances.

Storage: AMIs are stored in Amazon S3
Sources: AWS provided, Marketplace, Community, Custom
Content: OS, applications, configurations, user data

🔧 Instance Type/Family

Defines the hardware specifications including CPU, RAM, network performance, and storage options.

T-shirt Sizes: Fixed hardware packages (t2.micro, m5.large, etc.)
Instance Families: Optimized for different workloads
Generations: Newer generations offer better price/performance

EC2 Instance Types Complete Guide

Understanding EC2 instance families is crucial for selecting the right compute resources for your applications. Each family is optimized for specific use cases and workload patterns. This section includes memory techniques to help you remember each family.

🎓 Instance Type Naming Convention

Instance types follow the format: [Family][Generation][Additional Capabilities].[Size]

Example: m5a.large = M family, 5th generation, AMD processors, large size

General Purpose Instances

Balanced compute, memory, and networking for diverse workloads. These are a good place to start, particularly if you're not sure what type to use.

T

T3/T4g - Burstable Performance

Baseline CPU with ability to burst when needed

💭 Mnemonic: T is for Tiny or Turbo

The t3 family is a burstable instance type. If you have an application that needs to run with some basic CPU and memory usage, you can choose t3. It also works well if you have an application that gets used sometimes but not others. When the resource is idle, you'll generate CPU credit, which you'll utilize when the resource is used.

Use Cases:

Websites and small applications
Development and testing environments
Microservices
Small to medium databases

💳 Important: T3 vs T2 Differences

Make sure you understand the difference between t3 and the older t2 — t3 are in "unlimited mode" by default, so instead of throttling down to baseline CPU when your instance runs out of credits, you pay for overages.

M

M5/M6i - Balanced

Well-balanced compute, memory, and network resources

💭 Mnemonic: M is for Main choice or happy Medium

The m5 instance type is similar to t3, but for more consistent workloads. It has a nice balance of CPU, memory, and disk. It's not hard to see why almost half of EC2 workloads are on "m" instances. If you aren't sure what to choose, m5 is the most versatile of all the Amazon instance types.

Use Cases:

Web applications and API backends
Enterprise applications
Microservices
Small to mid-size databases

Available variants: m6g (AWS Graviton2), m5a (AMD), m5n (enhanced networking), m4 (previous generation)

A

A1 - ARM-based

ARM-based processors for cost-effective scale-out workloads

💭 Mnemonic: A is for ARM processor — or as light as A1 steak sauce

The a1 instance type was announced in late 2018 and can be a less expensive option than other EC2. They are suited for scale-out workloads such as web servers, containerized microservices, caching fleets, distributed data stores, and development environments.

Use Cases:

Web servers and containerized applications
Distributed data stores
Development environments
Microservices architectures

Compute Optimized Instances

High-performance processors for CPU-intensive applications.

C

C5/C6i - High Performance CPU

High ratio of compute/CPU versus memory

💭 Mnemonic: C is for Compute (at least that one's easy!)

The c5 instance type has a high ratio of compute/CPU versus memory. If you have a compute-intensive application — maybe scientific modelling, intensive machine learning, or multiplayer gaming — these instances are a good choice.

Use Cases:

High-performance computing (HPC)
Scientific modeling and simulation
CPU-intensive machine learning
Multiplayer gaming servers
Video encoding

Available variants: C5d (local NVMe SSD), C5n (enhanced networking up to 100 Gbps), C4 (previous generation)

Memory Optimized Instances

Designed for workloads that process large data sets in memory.

R

R5/R6i - Memory Optimized

High memory-to-vCPU ratio for memory-intensive applications

💭 Mnemonic: R is for RAM

The r instance family is memory-optimized, which you might use for in-memory databases, real-time processing of unstructured big data, or Hadoop/Spark clusters. You can think of it as a kind of midpoint between the m5 and the x1e.

Use Cases:

In-memory databases (Redis, Memcached)
Real-time processing of unstructured data
Hadoop/Spark clusters
High-performance relational databases

Available variants: R5a (lower cost per GiB), R5n (higher bandwidth)

X

X1e/X2iezn - Extreme Memory

Highest memory-to-vCPU ratio, up to 4TB RAM

💭 Mnemonic: X is for eXtreme memory

The x1 family has a much higher ratio of memory, so this is a good choice if you have a full in-memory application or a big data processing engine like Apache Spark or Presto. X1e are optimized for high-performance databases, in-memory databases, and other memory intensive enterprise applications.

Use Cases:

Large in-memory applications (SAP HANA)
Big data processing engines (Apache Spark)
High-performance databases
Memory-intensive enterprise applications

💾

High Memory - Extreme Scale

Up to 24TB of memory for largest workloads

💭 Mnemonic: High memory is for... high memory

We're not sure why these didn't get an alphabet soup name like the rest of the AWS instances, but at least it's easy to remember and understand. As you might guess, high memory instances run large in-memory databases, including production deployments of SAP HANA.

Use Cases:

SAP HANA production deployments
Extremely large in-memory databases
High-performance analytics

Z

z1d - High Frequency + Memory

High compute performance with high memory and local NVMe storage

💭 Mnemonic: Z is for Zippy (high frequency)

The z1d instances combine high compute capacity with a high memory footprint. They have a sustained core frequency of up to 4.0 GHz, the fastest of AWS's offerings. These are best for electronic design automation (EDA) and some relational database workloads with high per-core licensing costs.

Use Cases:

Electronic Design Automation (EDA)
Relational databases with high per-core licensing costs
CPU-intensive distributed computing

Accelerated Computing Instances

Hardware accelerators or co-processors for specialized workloads.

P

P3/P4 - GPU for ML Training

NVIDIA Tesla GPUs optimized for machine learning training

💭 Mnemonic: P is for Pictures (graphics)

If you need GPUs on your instances, p3 instances are a good choice. They are useful for video editing, and AWS also lists use cases of "computational fluid dynamics, computational finance, seismic analysis, speech recognition, autonomous vehicles" — so it's fairly specialized.

Use Cases:

Machine learning model training
High-performance computing
Computational fluid dynamics
Seismic analysis
Video processing

G

G4/G5 - GPU for Graphics Workloads

GPUs optimized for graphics workstations and streaming

💭 Mnemonic: G is for Graphics or GPU

The g instance type uses Graphics Processing Units (GPUs) to accelerate graphics-intensive workloads, and also designed to accelerate machine learning inference. This could include adding metadata to an image, automated speech recognition, and language translation, as well as graphics workstations, video transcoding, and game streaming in the cloud.

Use Cases:

Graphics workstations
Video transcoding
Game streaming
ML inference
Remote desktop solutions

F

F1 - Field Programmable Gate Arrays

Customizable hardware acceleration with FPGAs

💭 Mnemonic: F is for FPGA or feel as in hardware

f1 instances offer customizable hardware acceleration with field programmable gate arrays (FPGAs) — hence the "f". Applications could include genomics research, financial analysis, and real-time video processing.

Use Cases:

Genomics research
Financial risk analysis
Real-time video processing
Custom hardware acceleration

Inf

Inf1 - ML Inference

AWS Inferentia chips optimized for ML inference

💭 Mnemonic: Inf is for Inference

The Inf1 instances are a specialized EC2 type for machine learning inference applications, such as recommendation engines, forecasting, image and video analysis, advanced text analytics, document analysis, voice, conversational agents, translation, transcription, and fraud detection.

Use Cases:

Natural language processing
Image and video analysis
Speech recognition
Recommendation engines
Document analysis

Storage Optimized Instances

Designed for workloads requiring high sequential read/write access to large datasets on local storage.

I

I3/I4i - High Random I/O

NVMe SSD-backed instance storage for high IOPS

💭 Mnemonic: I is for IOPS

The i3 instance type is similar to h1, but it is SSD backed, so if you need an NVMe drive, choose this type. Use it for NoSQL databases, in-memory databases, Elasticsearch, and more. The i3en option has higher network bandwidth with Elastic Network Adapter (ENA)-based enhanced networking.

Use Cases:

NoSQL databases (Cassandra, MongoDB)
In-memory databases
Distributed file systems
Data warehousing
Search engines (Elasticsearch)

D

D2/D3en - Dense HDD Storage

High disk throughput with HDD-backed storage

💭 Mnemonic: D is for Dense storage

d2 instances have an even higher ratio of disk to CPU and memory, which makes them a good fit for Massively Parallel Processing (MPP), MapReduce and Hadoop distributed computing, and similar applications.

Use Cases:

MapReduce workloads
Distributed computing frameworks
Data warehousing
Massively Parallel Processing (MPP)

H

H1 - Balanced Compute and Storage

Balance of compute and memory with HDD storage

💭 Mnemonic: H is for HDD

The h1 type is HDD backed, with a balance of compute and memory. You might use it for distributed file systems, network file systems, or data processing applications.

Use Cases:

Distributed file systems
Network file systems
Data processing applications
Big data workload

🎯 What EC2 instance types should you use?

As AWS has continued to add options to EC2, there are now EC2 instance types for almost any application. If you have comparison questions around pricing, run them through the AWS pricing calculator. And if you don't know, then generally starting with t3 or m5 is the way to go.

Instance Comparison Tools: instances.vantage.sh and ec2instances.github.io
Previous Generation: AWS Previous Generation Instances are still available but newer generations offer better price/performance

EC2 Pricing Evolution

Understanding how EC2 pricing has evolved helps you optimize costs and take advantage of the latest billing improvements. This timeline shows the major changes in EC2 pricing models.

2021

Pricing is per instance-hour consumed for each instance, from the time an instance is launched until it is terminated or stopped. Each partial instance-hour consumed will be billed per-second for Linux, Windows, Windows with SQL Enterprise, Windows with SQL Standard, and Windows with SQL Web Instances, and as a full hour for all other instance types.

Improvement: Extended per-second billing to Windows and SQL Server instances

2018

Pricing is per instance-hour consumed for each instance, from the time an instance is launched until it is terminated or stopped. Each partial instance-hour consumed will be billed per-second for Linux Instances and as a full hour for all other instance types.

Major Change: Introduction of per-second billing for Linux instances (originally announced in 2017)

2016

Pricing is per instance-hour consumed for each instance, from the time an instance is launched until it is terminated or stopped. Each partial instance-hour consumed will be billed as a full hour.

Original Model: Hourly billing only - even 1 minute of usage charged as full hour

💰 Cost Optimization Impact

The evolution to per-second billing provides significant cost savings for:

Development/Testing: Frequently started/stopped instances
Batch Processing: Short-running workloads
Auto Scaling: Dynamic capacity adjustments
CI/CD Pipelines: Build and deployment tasks

Hands-on Lab: EC2 Instance Deployment

Now let's put theory into practice. We'll launch an EC2 instance, configure it with a web server using User Data, create a custom AMI, and deploy a new instance from that AMI.

💡 AWS Free Tier Information

The AWS Free Tier provides eligible users with 750 hours of t2.micro or t3.micro instances per month for the first 12 months after account creation. Always check the current Free Tier terms on the AWS website, as offerings may change over time. For accounts that have exceeded the Free Tier period, standard on-demand pricing applies.

Exercise 1: Launch EC2 Instance with User Data

Deploy an EC2 instance with automated web server installation using User Data scripts.

Launch Configuration Steps

1

Navigate to EC2 Console

In the AWS Management Console, search for "EC2" and click on the EC2 service.

2

Launch Instance

Click the "Launch instances" button to start the instance creation wizard.

3

Name and Tags

Set the instance name to web-server-demo for easy identification.

4

Choose AMI

Select Amazon Linux 2023 from Quick Start AMIs (current generation) or Amazon Linux 2 (extended support available).

5

Instance Type

Select t2.micro (eligible for Free Tier during first 12 months) or t3.micro (newer generation, also Free Tier eligible).

6

Key Pair Configuration

Choose an existing key pair or create a new one for SSH access. Download and securely store the private key.

7

Network Settings

Click Edit to modify network settings
VPC: Select the default VPC
Subnet: Choose a default public subnet
Auto-assign Public IP: Enable
Security Group: Create new with name web-server-sg

8

Security Group Rules

Configure the following inbound rules:

SSH (Port 22): Source = My IP (recommended) or specific IP ranges
HTTP (Port 80): Source = Anywhere (0.0.0.0/0) for public web access

9

User Data Script

In Advanced Details, add this User Data script:

#!/bin/bash
# Update system packages
sudo yum update -y

# Install Apache HTTP server
sudo yum install -y httpd

# Start Apache service
sudo systemctl start httpd

# Enable Apache to start on boot
sudo systemctl enable httpd

# Create a simple HTML page
echo "<h1>Hello World from $(hostname -f)</h1>" > /var/www/html/index.html
echo "<p>Instance ID: $(curl -s http://169.254.169.254/latest/meta-data/instance-id)</p>" >> /var/www/html/index.html
echo "<p>Availability Zone: $(curl -s http://169.254.169.254/latest/meta-data/placement/availability-zone)</p>" >> /var/www/html/index.html

10

Launch Instance

Review all settings and click "Launch instance" to create your EC2 instance.

Verification

Once your instance reaches "Running" state with "2/2 checks passed":

Copy the Public IPv4 address
Open a browser and navigate to http://<public-ip>
You should see "Hello World" with instance details

Important: Use http:// not https:// as we only configured HTTP (port 80).

Exercise 2: Create Custom AMI

Create a custom AMI from your configured instance for rapid future deployments.

1

Select Instance

In the EC2 console, select your web-server-demo instance.

2

Create Image

Click Actions → Image and templates → Create image

3

Configure AMI

Image name: custom-web-server-ami
Description: "Pre-configured web server with Apache"
Leave other settings as default
Click Create image

4

Monitor AMI Creation

Navigate to AMIs in the left sidebar under Images. Wait for status to change from "pending" to "available" (typically takes 2-5 minutes).

Exercise 3: Deploy from Custom AMI

Launch a new instance using your custom AMI to demonstrate rapid deployment.

1

Launch from Custom AMI

In AMIs section, select custom-web-server-ami → Actions → Launch instance from AMI

2

Quick Configuration

Name: web-from-custom-ami
Instance type: t2.micro or t3.micro
Key pair: Use existing key pair
Security group: Select existing web-server-sg
Enable auto-assign public IP

3

Launch and Verify

Launch the instance and verify it's immediately accessible via HTTP without waiting for User Data execution - this demonstrates the power of custom AMIs for rapid deployment.

🎉 Success!

You've successfully:

Deployed an EC2 instance with automated configuration
Created a custom AMI with pre-installed software
Rapidly deployed a new instance from your custom AMI

🧹 Cleanup Resources

Important: Clean up resources to avoid unnecessary costs.

1

Terminate Instances

Select both instances → Instance state → Terminate instance

2

Deregister AMI

In AMIs section: Select custom AMI → Actions → Deregister AMI (also deletes associated snapshots)

3

Delete Security Group

In Security Groups: Select web-server-sg → Actions → Delete security group (only after instances are terminated)

EC2 Pricing Models & Cost Optimization

Understanding EC2 pricing models is crucial for cost optimization. AWS offers several pricing options to match different usage patterns and business requirements.

💳 On-Demand Instances

Pay-as-you-go with no commitments

Characteristics

No upfront payment required
No long-term commitment
Per-second billing (Linux/Windows)
Full control over start/stop
Highest per-hour cost

Best For

Unpredictable workloads
Development and testing
Short-term projects
Applications with irregular usage patterns
First-time AWS users

📋 Reserved Instances

Up to 75% savings with 1 or 3-year commitment

Payment Options

No Upfront: Monthly payments, lowest discount
Partial Upfront: Some upfront + monthly
All Upfront: Highest discount

Types

Standard: Highest discount, fixed instance family
Convertible: Change instance family, lower discount

Best For

Steady-state applications
Predictable workloads
Production environments
Applications with consistent usage patterns

⚡ Spot Instances

Up to 90% savings using spare AWS capacity

How It Works

Bid on unused EC2 capacity
2-minute termination notice
Price fluctuates with supply/demand
Can be interrupted anytime
Request persistent or one-time

Best For

Fault-tolerant applications
Big data analysis
Batch processing jobs
CI/CD pipelines
Stateless workloads
Image/video processing

⚠️ Not Suitable For

Critical applications
Databases requiring high availability
Long-running processes
Applications requiring guaranteed uptime

🏢 Dedicated Hosts

Compliance & Licensing - Physical server isolation

Features

Dedicated physical server
Socket and core visibility
Instance placement control
Bring Your Own License (BYOL) support
Regulatory compliance capabilities

Use Cases

Regulatory compliance requirements
Server-bound software licenses
Corporate security policies
Multi-tenant concerns

Licensing Benefits

Windows Server licenses
SQL Server licenses
Oracle database licenses
VMware vSphere
Red Hat Enterprise Linux

Cost Optimization Strategies

Instance Right-Sizing

🎯 Right-Sizing Principles

Choose the optimal instance type and size for your workload to avoid over-provisioning or under-provisioning.

Monitoring Metrics

CPU Utilization: Target 70-80% average
Memory Usage: Monitor with CloudWatch agent
Network Performance: Match workload requirements
Storage I/O: Monitor IOPS and throughput
Cost per Performance: Use AWS Cost Explorer

Right-Sizing Tools

AWS Compute Optimizer: ML-powered recommendations
CloudWatch Insights: Historical analysis
AWS Trusted Advisor: Cost optimization checks
Third-party tools: instancetyp.es, ec2instances.github.io
Load testing: Validate performance under load

1

Baseline Performance

Monitor your application for at least 2 weeks to establish performance patterns and identify peak usage periods.

2

Analyze Utilization

Use CloudWatch metrics to identify over-provisioned or under-provisioned resources. Look for consistent low utilization (under 20%) or high utilization (over 90%).

3

Test and Validate

Before making changes to production, test new instance sizes in staging environments to validate performance and functionality.

Mixed Pricing Strategy

Combine different pricing models to optimize costs while maintaining performance and availability.

📊 Example Savings

Web Application:

Base: 10 Reserved instances (70%)
Variable: 3 On-Demand (20%)
Burst: 2 Spot instances (10%)
Potential savings: ~60-65% vs. all On-Demand

🔄 Auto Scaling Integration

Smart Scaling:

Reserved instances always running
On-Demand for predictable growth
Spot instances for elastic capacity
Automatic failover to On-Demand

⚖️ Balance Considerations

Risk vs. Reward:

Monitor Spot interruption rates
Maintain minimum On-Demand buffer
Implement graceful Spot handling
Regular cost-benefit analysis

Advanced Cost Optimization Techniques

🕐 Scheduling & Automation

Instance Scheduling: Stop non-prod during off-hours
Lambda Automation: Automated start/stop scripts
Tags for Automation: Environment-based scheduling
Hibernation: Preserve state while saving costs

# Example: Lambda function for instance scheduling
import boto3
import json

def lambda_handler(event, context):
    ec2 = boto3.client('ec2')
    
    # Stop instances tagged for nighttime shutdown
    response = ec2.describe_instances(
        Filters=[
            {'Name': 'tag:Schedule', 'Values': ['NightlyStop']},
            {'Name': 'instance-state-name', 'Values': ['running']}
        ]
    )
    
    instance_ids = []
    for reservation in response['Reservations']:
        for instance in reservation['Instances']:
            instance_ids.append(instance['InstanceId'])
    
    if instance_ids:
        ec2.stop_instances(InstanceIds=instance_ids)
        return f'Stopped {len(instance_ids)} instances'

📊 Monitoring & Analytics

Cost Allocation Tags: Track spending by project/team
AWS Cost Explorer: Analyze usage patterns
Budgets & Alerts: Proactive cost management
Reserved Instance Utilization: Monitor RI efficiency

Key Metrics to Track:

Cost per transaction/user
Reserved Instance utilization %
Spot instance interruption rate
Right-sizing opportunities

💰 Hidden Cost Factors

Don't overlook these often-missed cost factors:

Data Transfer: Cross-AZ and internet egress charges
EBS Volumes: Storage costs continue even when instances are stopped
Elastic IPs: Charges when not associated with running instances
Load Balancers: Hourly charges plus data processing
NAT Gateways: Hourly + data processing charges
CloudWatch Logs: Storage and ingestion costs

Advanced EC2 Features

Explore advanced EC2 capabilities that enable sophisticated architectures and optimize performance.

EBS Optimization and Multi-Attach

📦 EBS Multi-Attach

Attach a single EBS volume to multiple EC2 instances simultaneously for shared storage scenarios.

Supported volume types: io1, io2, gp3
Maximum instances: 16 per volume
Use cases: Shared file systems, clustered databases
Requirements: Cluster-aware file system (ext4/xfs not supported)

More info: EBS Multi-Attach Documentation

⚙️ CPU Optimization

Customize CPU options for specific workloads requiring particular core/thread configurations.

Core count: Reduce for software licensing optimization
Threads per core: Disable hyperthreading if needed
Use cases: High-performance computing, licensed software

More info: CPU Optimization Documentation

Mountpoint for Amazon S3

🗄️ S3 File System Interface

Mount S3 buckets as file systems for high-throughput access to S3 objects. This feature became generally available in 2023.

# Install Mountpoint for Amazon S3
curl -L https://s3.amazonaws.com/mountpoint-s3-release/latest/x86_64/mount-s3.tar.gz | tar -xz
sudo mv mount-s3 /usr/local/bin/

# Mount S3 bucket
mkdir /mnt/s3bucket
mount-s3 my-bucket /mnt/s3bucket

# Use like a regular file system
ls /mnt/s3bucket/
cp large-dataset.csv /mnt/s3bucket/data/

Benefits: Up to 100 GB/s throughput, optimized for read-heavy workloads, seamless S3 integration

Learn more: Mountpoint Announcement | AWS Blog

Performance Optimization

Compute Performance Tuning

CPU Optimization Techniques

Instance family selection: Match workload characteristics
CPU scaling: Vertical vs. horizontal scaling decisions
Processor selection: Intel vs. AMD vs. Graviton performance
Hyperthreading: Enable/disable based on workload

Memory Optimization

Memory-to-vCPU ratio: Match application requirements
NUMA awareness: Optimize for multi-socket instances
Memory bandwidth: Consider DDR4 vs. DDR5
Swap configuration: Minimize swap usage

# Performance monitoring commands
# CPU information
lscpu
cat /proc/cpuinfo

# Memory information  
free -h
cat /proc/meminfo

# Performance monitoring
htop
iostat -x 1
sar -u 1 5

Storage Performance Optimization

EBS Volume Type	Max IOPS	Max Throughput	Best For
gp3	16,000	1,000 MB/s	General purpose, cost-effective
io2	64,000	1,000 MB/s	Critical applications, databases
io2 Block Express	256,000	4,000 MB/s	Largest, most I/O intensive workloads
Instance Store	3.3M+	14,000+ MB/s	Temporary, high-performance storage

🔧 Storage Performance Best Practices

EBS-optimized instances: Enable for consistent performance
Volume size affects performance: Larger volumes = better performance
RAID configuration: RAID 0 for performance, RAID 1 for redundancy
File system tuning: ext4 vs. xfs performance characteristics
I/O scheduler: noop for SSD, deadline for HDD

Network Performance Optimization

🌐 Network Performance Features

Enhanced Networking: SR-IOV for higher bandwidth
Placement Groups: Reduce network latency
Instance types with high network: C5n, M5n, R5n families
Elastic Network Adapter: Up to 100 Gbps

⚡ Performance Tuning

Buffer sizes: Tune TCP send/receive buffers
Interrupt handling: Distribute across CPU cores
Jumbo frames: 9000 MTU within VPC
Multiple queues: Utilize multi-queue networking

# Network performance testing
# Install network testing tools
sudo yum install -y iperf3 netperf

# Test bandwidth between instances
# On server instance:
iperf3 -s

# On client instance:
iperf3 -c SERVER_IP -t 30 -P 4

# Monitor network utilization
watch -n 1 'cat /proc/net/dev'
nload eth0

Key Takeaways and Best Practices

🎯 What You've Learned

Compute Categories: IaaS, PaaS, Containers, and Serverless options
EC2 Fundamentals: Instance types, tenancy models, and components
Instance Types Mastery: All families with memory techniques
Pricing Evolution: From hourly to per-second billing improvements
Hands-on Skills: Launching instances, creating AMIs, and automation
Cost Optimization: Pricing models and optimization strategies
Advanced Features: EBS optimization, Mountpoint for S3, and performance tuning

🚀 Next Steps

Practice: Complete more hands-on labs with different instance types
Explore: Try other compute services like Lambda and Elastic Beanstalk
Optimize: Implement cost optimization strategies in practice
Monitor: Set up CloudWatch monitoring and cost alerts
Scale: Learn about Auto Scaling and Load Balancing

💡 Best Practices Summary

Security

Use IAM roles instead of access keys
Implement least privilege access
Regular security audits and updates
Proper security group configuration

Cost Management

Right-size instances based on actual usage
Use mixed pricing strategies
Implement automated scheduling
Monitor and optimize continuously

Copyright and Legal Information

📝 Educational Use

This educational content is designed for learning purposes and provides hands-on experience with AWS services. The guide combines theoretical knowledge with practical exercises to enhance understanding.

⚖️ Copyright Notice

Redistribution: Requires written permission
Educational Use: Permitted for learning purposes
Commercial Use: Contact for licensing terms

🏷️ Trademark Information

Product names and trademarks mentioned are property of their respective owners:

Amazon Web Services, AWS, EC2: Trademarks of Amazon.com, Inc.
Linux: Trademark of Linus Torvalds
Windows: Trademark of Microsoft Corporation
Other trademarks: Property of their respective owners

Use of these trademarks does not imply endorsement or affiliation.

AWS Compute Services

Learning Objectives