Selecting the correct data lifecycle for storage

Task Statement 4.1: Design cost-optimized storage solutions.

📘AWS Certified Solutions Architect – (SAA-C03)


1. What is a Data Lifecycle in AWS Storage?

A data lifecycle is the journey of data from creation to deletion. In AWS, this is critical because different stages of data have different storage needs:

  1. Active / Frequently Accessed Data – Needs fast access (e.g., user uploads in S3 that apps read/write often).
  2. Infrequently Accessed Data – Still needed but rarely used (e.g., old reports).
  3. Archival / Long-Term Data – Rarely accessed, kept for compliance (e.g., audit logs).
  4. Expired / Deleted Data – No longer needed and can be removed to save cost.

Selecting the correct lifecycle stage ensures performance, availability, and cost optimization.


2. AWS Storage Services & Lifecycle Management

AWS has multiple storage options. The data lifecycle selection is about choosing the right service and moving data between them as it ages.

Amazon S3 Lifecycle

Amazon S3 is the main service where lifecycle policies are applied.

  • Lifecycle Policies: Rules you define to automatically transition or expire objects.
  • Key operations:
    1. Transition – Move data to a cheaper storage class.
    2. Expiration – Delete objects automatically after a defined time.

S3 Storage Classes

Understanding these is key to selecting the lifecycle:

Storage ClassUse CaseCostAccess Frequency
S3 StandardFrequently accessed dataHighFrequent
S3 Intelligent-TieringAutomatic cost optimization based on access patternsMediumVariable
S3 Standard-IA (Infrequent Access)Infrequently accessed data but needs rapid retrievalLowerRare
S3 One Zone-IAInfrequent data, stored in a single AZLower than Standard-IARare
S3 Glacier Instant RetrievalArchived data, immediate accessVery lowRare
S3 Glacier Flexible RetrievalArchived data, retrieval within minutes to hoursVery lowRare
S3 Glacier Deep ArchiveLong-term archival, rarely accessedCheapestHours to retrieve

Exam Tip: Know that S3 lifecycle policies automatically move objects to cheaper storage classes based on age or access pattern.


3. Lifecycle Rules Examples in IT Context

Here are exam-style IT examples:

  1. Transitioning logs
    • Web server logs are frequently accessed for 7 days.
    • Lifecycle rule: Move logs older than 7 days from S3 Standard → S3 Standard-IA.
    • After 365 days, move to S3 Glacier Deep Archive for long-term retention.
  2. Database backups
    • Daily backups are needed for the last 30 days.
    • Move backups older than 30 days to S3 Glacier Flexible Retrieval.
    • Delete backups older than 7 years (compliance) using expiration.
  3. Temporary files
    • Files uploaded by users but only needed for 1 day.
    • Lifecycle rule: Expire objects after 1 day → automatic deletion saves cost.

4. Best Practices for Lifecycle Selection

  1. Know Access Patterns: Identify which data is frequently accessed vs. rarely accessed.
  2. Automate Transitions: Use S3 lifecycle policies instead of manual migration.
  3. Balance Cost & Retrieval Time: Don’t put data you might need quickly into Glacier Deep Archive.
  4. Combine with Backup/Replication: Lifecycle policies complement AWS Backup or cross-region replication.
  5. Monitor Costs: Use S3 Storage Class Analysis to understand access patterns and optimize transitions.

5. Lifecycle Policy Components (Exam Focus)

A lifecycle policy in S3 has these components:

  • Rule ID – Name of the rule.
  • Scope – Which objects/folders the rule applies to (prefix or tags).
  • Transition Actions – Move objects to another storage class after X days.
  • Expiration Actions – Delete objects after X days.
  • Abort Incomplete Multipart Uploads – Optional to save storage from incomplete uploads.

Exam Tip: Lifecycle policies can have multiple transitions, e.g., Standard → Standard-IA → Glacier → Glacier Deep Archive → Expired.


6. Decision Checklist for Exam

When asked “which lifecycle to select?”, consider:

  1. How often is data accessed?
    • Frequent → Standard
    • Infrequent → Standard-IA / One Zone-IA
    • Rare → Glacier / Deep Archive
  2. How fast do you need to retrieve it?
    • Immediate → Standard / Intelligent-Tiering
    • Minutes → Glacier Flexible Retrieval
    • Hours → Deep Archive
  3. How long do you need to keep it?
    • Short-term → Expire after a few days/months
    • Long-term → Archive for years
  4. Cost optimization
    • Transition to cheaper storage over time.
    • Delete expired data automatically.

7. Key Exam Points to Remember

  • Lifecycle policies automate storage transitions and deletions.
  • S3 Intelligent-Tiering adjusts costs automatically based on usage patterns.
  • Expired objects save money by automatically deleting old data.
  • Glacier and Deep Archive are extremely low cost, but retrieval takes longer.
  • Lifecycle rules can have multiple stages — don’t just move once; think about a 3–4 stage lifecycle.

By understanding how data moves through lifecycle stages, students can answer SAA-C03 questions like:

  • “Where should you move logs that are accessed monthly but must be stored for compliance?” → Glacier Flexible Retrieval or Deep Archive.
  • “How can you automatically delete temporary files after 1 day?” → Lifecycle Expiration Rule.
  • “Which storage class minimizes cost but can adapt to changing access patterns?” → Intelligent-Tiering.
Buy Me a Coffee