Task Statement 4.3: Design cost-optimized database solutions.
📘AWS Certified Solutions Architect – (SAA-C03)
1. What is Backup and Retention?
Backup
A backup is a copy of your data stored separately so that it can be restored if data is lost, corrupted, or deleted.
Retention Policy
A retention policy defines:
- How long backups are kept
- When backups are deleted automatically
2. Why Backup and Retention Matter
In AWS architecture, backups are needed for:
- Data recovery (accidental deletion, corruption)
- Disaster recovery
- Compliance requirements
- Business continuity
However:
- Too many backups = high cost
- Too few backups = risk of data loss
👉 The goal is to balance cost and recovery requirements
3. Types of Backups in AWS
3.1 Automated Backups
- Managed by AWS
- Enabled for services like:
- Amazon RDS
- Amazon DynamoDB
- Includes:
- Daily snapshots
- Transaction logs (for point-in-time recovery)
Key Features:
- Continuous backup
- Point-in-time recovery (PITR)
- Retention period configurable (e.g., 1–35 days for RDS)
3.2 Manual Backups (Snapshots)
- Created manually by users
- Do not expire automatically unless deleted
- Used for long-term storage
Supported Services:
- Amazon RDS
- Amazon EBS
- Amazon Redshift
- Amazon Aurora
3.3 AWS Backup Service
A centralized backup solution for multiple services:
- RDS
- EBS
- DynamoDB
- EFS
- FSx
Features:
- Centralized backup policies
- Cross-region backups
- Lifecycle management
- Automation
4. Snapshot Frequency (VERY IMPORTANT FOR EXAM)
Snapshot frequency determines how often backups are taken.
Common Strategies:
1. Frequent Snapshots
- Taken every few minutes or hours
- Used for:
- Critical databases
- High transaction systems
✔ Pros:
- Minimal data loss
✘ Cons:
- Higher storage cost
2. Daily Snapshots
- Taken once per day
✔ Pros:
- Balanced cost and recovery
✘ Cons:
- Possible data loss between backups
3. Weekly/Monthly Snapshots
- Used for long-term retention
✔ Pros:
- Low cost
✘ Cons:
- Not suitable for frequent recovery
Exam Tip:
- High critical workload → frequent snapshots + PITR
- Low critical workload → less frequent snapshots to save cost
5. Retention Policies (VERY IMPORTANT)
Retention defines how long backups are stored before deletion.
Key Concepts:
5.1 Short-Term Retention
- Example: 7–35 days
- Used for:
- Operational recovery
- Recent data restoration
5.2 Long-Term Retention
- Example: months to years
- Used for:
- Compliance
- Auditing
5.3 Lifecycle Policies
Automatically:
- Move backups to cheaper storage (e.g., cold storage)
- Delete old backups
👉 This is critical for cost optimization
6. Point-in-Time Recovery (PITR)
Allows restoring data to any specific time within retention period.
Supported by:
- Amazon RDS
- Amazon DynamoDB
How it works:
- Uses:
- Continuous backups
- Transaction logs
✔ Very low data loss
✔ Higher storage cost
7. Backup Storage Optimization (Cost-Focused)
7.1 Incremental Backups
- Only changes since last backup are stored
- Used in:
- Amazon EBS snapshots
✔ Reduces storage cost significantly
7.2 Deduplication
- Avoids storing duplicate data blocks
7.3 Compression
- Reduces storage size
7.4 Cold Storage Tier
- Move old backups to cheaper storage
- Example:
- AWS Backup cold storage
✔ Much lower cost
✘ Higher retrieval time
8. Cross-Region and Cross-Account Backups
Cross-Region Backup
- Copies backup to another AWS region
- Used for:
- Disaster recovery
Cross-Account Backup
- Protects against account-level failure or compromise
9. Backup Policies Using AWS Backup
You can define:
- Backup frequency (e.g., daily at midnight)
- Retention period (e.g., 30 days)
- Lifecycle rules (move to cold storage after 7 days)
Benefits:
- Automation
- Consistency
- Cost control
10. Key AWS Services and Their Backup Features
Amazon RDS
- Automated backups (1–35 days)
- Manual snapshots (indefinite)
- PITR supported
Amazon DynamoDB
- PITR (up to 35 days)
- On-demand backups
Amazon EBS
- Snapshot-based backups
- Incremental storage
Amazon S3
- Versioning for backup
- Lifecycle policies
- Cross-region replication
11. Designing a Cost-Optimized Backup Strategy
To design properly, consider:
1. Data Criticality
- High → frequent backups + long retention
- Low → fewer backups + short retention
2. Recovery Objectives
- RPO (Recovery Point Objective)
→ How much data loss is acceptable - RTO (Recovery Time Objective)
→ How fast data must be restored
3. Storage Cost Optimization
- Use incremental backups
- Use lifecycle policies
- Move old backups to cold storage
- Delete unnecessary snapshots
12. Common Exam Scenarios
Scenario 1:
Need minimal data loss
👉 Use:
- Frequent snapshots
- PITR
Scenario 2:
Need to reduce cost
👉 Use:
- Less frequent backups
- Lifecycle policies
- Cold storage
Scenario 3:
Need long-term compliance
👉 Use:
- Manual snapshots
- Long retention
Scenario 4:
Need disaster recovery
👉 Use:
- Cross-region backups
13. Important Exam Tips (Must Remember)
- Automated backups = limited retention
- Manual snapshots = kept until deleted
- EBS snapshots = incremental (cost-efficient)
- Use AWS Backup for centralized management
- Always combine:
- Snapshot frequency + retention + lifecycle policies
14. Quick Summary
- Backup = copy of data
- Retention = how long backup is stored
- Snapshot frequency = how often backup is taken
- PITR = restore to any time
- Lifecycle policies = optimize cost
- Incremental backups = reduce storage cost
