Lambda Cost Optimization: From Memory Tuning to ARM Migration

Key Takeaway

Lambda compute cost scales with "memory × execution time." The goal is the best memory-performance balance, not the lowest memory setting. Use AWS Lambda Power Tuning to find that setting, and consider switching to ARM (Graviton2) for a roughly 20% lower compute price.

Exam Tip

Exam Essential: "Lambda memory increase → CPU increases proportionally → Execution time decreases → Total cost can stay the same or decrease"

Understanding Lambda Pricing

Lambda costs consist of 3 components:

Component	Pricing	Free Tier
Requests	$0.20/million	1 million/month
Compute Time	$0.0000166667/GB-second	400,000 GB-seconds/month
Provisioned Concurrency	$0.0000041667/GB-second	-

Cost Calculation Formula

Lambda Cost = Request charge + Compute charge (+ Provisioned Concurrency charge, if enabled)

Compute charge = Memory(GB) × Execution time(seconds) × Unit price

Example: 256MB memory, 200ms execution, 1 million requests/month
- Requests: 1M × $0.0000002 = $0.20
- Compute: 0.25GB × 0.2sec × 1M × $0.0000166667 = $0.83
- Total: $1.03/month
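
The formula above can be sketched in a few lines of Python. This is a minimal estimate that uses the unit prices from the pricing table, ignores the Free Tier, and assumes 1 GB = 1024 MB; the function name `monthly_lambda_cost` is illustrative only.

```python
REQUEST_PRICE = 0.20 / 1_000_000   # USD per request, from the pricing table
GB_SECOND_PRICE = 0.0000166667     # USD per GB-second, from the pricing table


def monthly_lambda_cost(memory_mb: int, duration_ms: float, requests: int) -> dict:
    """Estimate the monthly charge before any Free Tier allowance."""
    memory_gb = memory_mb / 1024
    gb_seconds = memory_gb * (duration_ms / 1000) * requests
    request_charge = requests * REQUEST_PRICE
    compute_charge = gb_seconds * GB_SECOND_PRICE
    return {
        "requests": request_charge,
        "compute": compute_charge,
        "total": request_charge + compute_charge,
    }


# Same inputs as the worked example: 256MB, 200ms, 1 million requests
estimate = monthly_lambda_cost(memory_mb=256, duration_ms=200, requests=1_000_000)
for name, value in estimate.items():
    print(f"{name}: ${value:.2f}")
```

Changing `memory_mb` or `duration_ms` shows how the two inputs trade off against each other in the compute charge.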

Exam Tip

1ms billing granularity: Duration is billed in 1ms increments (previously 100ms), which significantly reduces costs for short-running functions.
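
To see why the billing granularity matters, the sketch below rounds a duration up to the next billing increment and compares the two schemes. The 12ms function at 128MB is a made-up example, and the helper names are illustrative.

```python
import math

GB_SECOND_PRICE = 0.0000166667  # USD per GB-second, from the pricing table


def billed_ms(duration_ms: float, granularity_ms: int) -> int:
    """Round the duration up to the next multiple of the billing increment."""
    return math.ceil(duration_ms / granularity_ms) * granularity_ms


def compute_cost(memory_mb: int, duration_ms: float, granularity_ms: int) -> float:
    """Compute charge for one invocation under a given billing increment."""
    billed_seconds = billed_ms(duration_ms, granularity_ms) / 1000
    return (memory_mb / 1024) * billed_seconds * GB_SECOND_PRICE


# Hypothetical short-running function: 12ms at 128MB
old_cost = compute_cost(128, 12, granularity_ms=100)  # billed as 100ms
new_cost = compute_cost(128, 12, granularity_ms=1)    # billed as 12ms
print(f"100ms increments: ${old_cost:.10f}")
print(f"1ms increments:   ${new_cost:.10f}")
```

The shorter the function, the larger the gap between the two results.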

Memory and CPU Relationship

In Lambda, increasing memory proportionally increases CPU performance.

Memory ↑ = CPU ↑ = Network bandwidth ↑

128MB  → Minimum CPU
256MB  → 2× CPU
512MB  → 4× CPU
1024MB → 8× CPU (approx. 0.5 vCPU)
1769MB → 1 vCPU
10240MB → 6 vCPU

Lower Memory Isn't Always Cheaper

Example: Data processing Lambda function

128MB (minimum memory):
- Execution time: 2000ms
- Cost: 0.125GB × 2.0sec × $0.0000166667 = $0.00000417

512MB:
- Execution time: 500ms (4× faster)
- Cost: 0.5GB × 0.5sec × $0.0000166667 = $0.00000417

→ Same cost per invocation, but 512MB responds 4× faster!
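
One way to reason about this trade-off is a break-even duration: the execution time at which a larger memory setting costs exactly the same as the baseline. The sketch below takes the 128MB / 2000ms baseline from the example. The helper name is illustrative, and the result depends only on the memory ratio, not on the unit price.

```python
def break_even_duration_ms(base_memory_mb: int, base_duration_ms: float,
                           new_memory_mb: int) -> float:
    """Duration at the new memory size that gives the same compute cost as the baseline."""
    return base_duration_ms * base_memory_mb / new_memory_mb


# Baseline from the example: 128MB running for 2000ms
for memory_mb in (256, 512, 1024):
    limit = break_even_duration_ms(128, 2000, memory_mb)
    print(f"{memory_mb}MB is cost-neutral if it finishes in {limit:.0f}ms")
```

If the measured duration at the larger setting is below the break-even value, the larger setting is both faster and cheaper; above it, you are paying for speed.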

Exam Tip

Core Principle: CPU-bound tasks may cost the same or less with higher memory. I/O-bound tasks spend most of their time waiting, so extra CPU rarely shortens them and lower memory is usually cheaper.

Strategy 1: Power Tuning

AWS Lambda Power Tuning is an open-source tool that automatically tests your function across various memory settings to find the optimal cost-performance balance.

How It Works

Runs via Step Functions:
┌──────────────────────────────────────────────┐
│  128MB → Measure                             │
│  256MB → Measure                             │
│  512MB → Measure       → Optimal value report│
│  1024MB → Measure         (cost vs speed)    │
│  2048MB → Measure                            │
│  3008MB → Measure                            │
└──────────────────────────────────────────────┘

Interpreting Results

Memory    Duration    Cost
128MB     2000ms    $0.00000417  ← Slowest
256MB     1000ms    $0.00000417
512MB     500ms     $0.00000417  ← Optimal (same cost, 4× faster than 128MB)
1024MB    480ms     $0.00000800  ← Cost increase, minimal perf gain

→ In this case, 512MB is optimal
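
The selection rule used above (lowest cost first, then fastest among near-ties) can be sketched as follows. This is not how the Power Tuning tool itself is implemented. It is only an illustration of the reasoning, with costs recomputed from memory and duration and a hypothetical 1% tolerance for "same cost."

```python
GB_SECOND_PRICE = 0.0000166667  # USD per GB-second, from the pricing table

# (memory in MB, measured duration in ms), taken from the results table
measurements = [(128, 2000), (256, 1000), (512, 500), (1024, 480)]


def invocation_cost(memory_mb: int, duration_ms: float) -> float:
    """Compute charge for a single invocation."""
    return (memory_mb / 1024) * (duration_ms / 1000) * GB_SECOND_PRICE


def pick_optimal(results, tolerance=0.01):
    """Among settings within `tolerance` of the cheapest cost, return the fastest."""
    cheapest = min(invocation_cost(m, d) for m, d in results)
    candidates = [(m, d) for m, d in results
                  if invocation_cost(m, d) <= cheapest * (1 + tolerance)]
    return min(candidates, key=lambda pair: pair[1])


best_memory, best_duration = pick_optimal(measurements)
print(f"Optimal: {best_memory}MB at {best_duration}ms")
```

Raising the tolerance expresses a willingness to pay slightly more for a faster response.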

Strategy 2: ARM (Graviton2) Migration

Lambda functions on the ARM architecture are priced 20% lower per GB-second than x86 and offer up to 34% better price-performance.

Item	x86	ARM (Graviton2)
Architecture	x86_64	arm64
Price	Baseline	20% cheaper
Price-performance	Baseline	Up to 34% better
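
Whether ARM actually saves money for a given function depends on its measured duration on each architecture. The sketch below applies the 20% discount from the table to the x86 unit price. The workload figures (512MB, 10 million invocations, 100ms) are hypothetical, and the function name is illustrative.

```python
X86_PRICE = 0.0000166667  # USD per GB-second, from the pricing table
ARM_DISCOUNT = 0.20       # ARM is priced 20% lower per GB-second


def arch_costs(memory_mb: int, x86_duration_ms: float,
               arm_duration_ms: float, invocations: int):
    """Return (x86_cost, arm_cost) for the compute charge only."""
    memory_gb = memory_mb / 1024
    x86 = memory_gb * (x86_duration_ms / 1000) * invocations * X86_PRICE
    arm = (memory_gb * (arm_duration_ms / 1000) * invocations
           * X86_PRICE * (1 - ARM_DISCOUNT))
    return x86, arm


# Hypothetical workload with identical duration on both architectures
x86_cost, arm_cost = arch_costs(512, 100, 100, 10_000_000)
print(f"x86: ${x86_cost:.2f}  ARM: ${arm_cost:.2f}")

# ARM stays cheaper until it runs more than 25% slower (1 / 0.8 = 1.25)
_, slower_arm = arch_costs(512, 100, 125, 10_000_000)
print(f"ARM at 125ms: ${slower_arm:.2f}")
```

Measuring the real duration on both architectures (for example with Power Tuning) replaces the assumed numbers here.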

ARM Migration Considerations

Native binaries: C/C++ and Rust code must be rebuilt for ARM
Interpreted and JVM languages: Python, Node.js, and Java are mostly compatible, but dependencies that bundle native code need ARM builds
Lambda Layers: May need ARM-specific layers

Exam Tip

Exam Point: Lambda Graviton2 (ARM) = 20% cost reduction + better price-performance vs x86

Strategy 3: Reduce Execution Time

Minimize Cold Starts

Strategy	Method	Cost Impact
Provisioned Concurrency	Pre-warm instances	Additional cost
SnapStart	JVM snapshot (Java)	Free
Minimal Package	Remove unnecessary SDKs	Free
Run Outside VPC	Remove VPC if not needed	Free
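
The "Additional cost" of Provisioned Concurrency can be estimated from the unit price in the pricing table. The sketch below covers only the charge for keeping instances warm; request and duration charges are billed on top of it. The configuration (512MB, 5 instances, 730 hours) is a hypothetical example.

```python
PROVISIONED_PRICE = 0.0000041667  # USD per GB-second, from the pricing table


def provisioned_concurrency_cost(memory_mb: int, instances: int, hours: float) -> float:
    """Charge for keeping `instances` environments warm for `hours`."""
    gb_seconds = (memory_mb / 1024) * instances * hours * 3600
    return gb_seconds * PROVISIONED_PRICE


# Hypothetical: 5 pre-warmed 512MB instances for a 730-hour month
monthly = provisioned_concurrency_cost(memory_mb=512, instances=5, hours=730)
print(f"Provisioned Concurrency: ${monthly:.2f}/month")
```

Because this charge accrues whether or not the function is invoked, it is worth enabling only for latency-sensitive paths or predictable peak hours.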

Code Optimization

# Bad: Initialize inside handler on every invocation
def handler(event, context):
    import boto3  # Import statement runs on every invocation (cached after the first, but unnecessary)
    client = boto3.client('dynamodb')  # New client created on every invocation
    return client.get_item(...)

# Good: Initialize outside handler (reuse across invocations)
import boto3
client = boto3.client('dynamodb')  # Created once per execution environment, during cold start

def handler(event, context):
    return client.get_item(...)  # Reuses the client on warm invocations

Strategy 4: Cost Management

Tiered Pricing

Discounts apply automatically to monthly compute usage at high volume:

Monthly usage tier	Price per GB-second
First 6 billion GB-sec	$0.0000166667
Next 9 billion GB-sec (6B - 15B)	$0.0000150000 (10% discount)
Over 15 billion GB-sec	$0.0000133334 (20% discount)
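
Tiered pricing is marginal: only the usage that falls inside a tier is charged at that tier's rate. A minimal sketch of that calculation, using the prices from the table (the function name is illustrative):

```python
# (tier size in GB-seconds, price per GB-second), from the table above
TIERS = [
    (6_000_000_000, 0.0000166667),   # first 6 billion
    (9_000_000_000, 0.0000150000),   # next 9 billion (6B to 15B)
    (float("inf"), 0.0000133334),    # everything over 15 billion
]


def tiered_compute_cost(gb_seconds: float) -> float:
    """Charge each slice of monthly usage at the rate of the tier it falls into."""
    remaining = gb_seconds
    total = 0.0
    for tier_size, price in TIERS:
        used = min(remaining, tier_size)
        total += used * price
        remaining -= used
        if remaining <= 0:
            break
    return total


# 7 billion GB-seconds: 6B at the first rate, 1B at the second
print(f"${tiered_compute_cost(7_000_000_000):,.2f}")
```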

Savings Plans

Compute Savings Plans can be applied to Lambda:

1-year or 3-year commitment
Up to 17% additional savings

Eliminate Unnecessary Invocations

Cost Reduction Checklist:
☑ Optimize CloudWatch log levels (DEBUG → INFO)
☑ Remove unnecessary API calls
☑ Use DynamoDB batch operations (PutItem → BatchWriteItem)
☑ SQS batch processing (receive 10 messages at a time)
☑ Utilize /tmp directory caching
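
As an illustration of the DynamoDB item in the checklist, the sketch below groups items into `BatchWriteItem` payloads of at most 25 put requests, which is the per-call limit. It only builds the payloads; the table name and items are hypothetical. In real code each payload would be passed as `RequestItems` to the DynamoDB client's `batch_write_item` call, and any `UnprocessedItems` in the response would need to be retried.

```python
from typing import Iterator

BATCH_WRITE_LIMIT = 25  # BatchWriteItem accepts at most 25 requests per call


def batch_write_payloads(table_name: str, items: list) -> Iterator[dict]:
    """Yield one RequestItems payload per chunk of up to 25 items."""
    for start in range(0, len(items), BATCH_WRITE_LIMIT):
        chunk = items[start:start + BATCH_WRITE_LIMIT]
        yield {table_name: [{"PutRequest": {"Item": item}} for item in chunk]}


# Hypothetical items and table name, for illustration only
items = [{"pk": {"S": f"user#{i}"}} for i in range(60)]
payloads = list(batch_write_payloads("example-table", items))
print(f"{len(items)} PutItem calls -> {len(payloads)} BatchWriteItem calls")
```

Fewer round trips shorten the function's execution time, which is where the Lambda saving comes from.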

Cost Monitoring

CloudWatch Metrics

Metric	Purpose
Duration	Track execution time
ConcurrentExecutions	Concurrent execution count
Throttles	Throttling occurrence count
Memory Size vs Max Memory Used	Detect memory over-provisioning (values appear in each invocation's REPORT log line)
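
Memory over-provisioning can be spotted by comparing configured and peak memory from the REPORT line that Lambda writes to CloudWatch Logs after each invocation. The sketch below parses one such line. The sample line and its RequestId are made up, and the function name is illustrative.

```python
import re
from typing import Optional

REPORT_PATTERN = re.compile(r"Memory Size: (\d+) MB\s+Max Memory Used: (\d+) MB")


def memory_utilization(log_line: str) -> Optional[float]:
    """Return used/configured memory as a fraction, or None if not a REPORT line."""
    match = REPORT_PATTERN.search(log_line)
    if match is None:
        return None
    configured_mb, used_mb = int(match.group(1)), int(match.group(2))
    return used_mb / configured_mb


# Made-up sample line in the REPORT format
sample = ("REPORT RequestId: 00000000-0000-0000-0000-000000000000\t"
          "Duration: 102.25 ms\tBilled Duration: 103 ms\t"
          "Memory Size: 1024 MB\tMax Memory Used: 180 MB")
usage = memory_utilization(sample)
print(f"Peak memory utilization: {usage:.0%}")
```

Low utilization alone does not mean memory should be reduced, because lowering memory also lowers CPU; it is a prompt to re-run Power Tuning.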

Using Cost Explorer

Lambda Cost Analysis:
1. Cost Explorer → Filter by service → Lambda
2. Analyze costs by tag (environment, project)
3. Check daily/weekly trends

SAA-C03 Exam Focus Points

✅ Memory-CPU relationship: "Memory increase → CPU proportionally increases"
✅ Cost optimization: "Power Tuning to find optimal memory"
✅ ARM migration: "Graviton2 for 20% cost reduction"
✅ Pricing structure: "Requests + (Memory × Execution time)"
✅ Cold start: "Provisioned Concurrency = extra cost, SnapStart = free"

Exam Tip

Sample Exam Question: "How to reduce Lambda function costs while maintaining performance?" → Answer: Switch to ARM (Graviton2) architecture (20% cheaper + better price-performance)
