Task Statement 4.1: Design cost-optimized storage solutions.
📘AWS Certified Solutions Architect – (SAA-C03)
When working in AWS, often you need to move data from one place to another, whether it’s from on-premises servers, other cloud providers, or within AWS itself. AWS provides different services to migrate data, and selecting the right one depends on size, speed, cost, and use case. Let’s go step by step.
1. AWS Storage Services Overview
Before migrating data, you need to know where you’re moving it. AWS has several storage services:
- Amazon S3 (Simple Storage Service)
- Object storage for files and backups.
- Examples: Logs, images, application data.
- Features: Scalable, durable, inexpensive.
- Amazon EBS (Elastic Block Store)
- Block storage for EC2 instances.
- Use for running operating systems, databases, and applications.
- Amazon EFS (Elastic File System)
- File storage for shared access across multiple EC2 instances.
- Ideal for applications needing real-time access to files.
- Amazon FSx
- Managed file systems: Windows File Server or Lustre (for high-performance workloads).
- Amazon Glacier / S3 Glacier / S3 Glacier Deep Archive
- For long-term backups and archives.
- Very low cost, slower retrieval times.
2. AWS Data Migration Services
AWS provides different tools depending on data location, size, and speed requirements.
A. For Large On-Premises to AWS Data Migration
- AWS Snow Family
- Physical devices that AWS ships to your location to load large amounts of data.
- Types:
- Snowcone – Small, portable (up to 8 TB).
- Snowball – Medium, rugged (up to 80 TB).
- Snowmobile – Extremely large (up to 100 PB in a shipping container).
- Use case: Huge datasets that would take too long to transfer over the internet.
- AWS Direct Connect
- Dedicated network link between on-premises and AWS.
- Use for continuous, high-speed data migration, like syncing databases or storage over time.
- AWS Storage Gateway
- Connects your on-premises storage with AWS cloud storage.
- Supports:
- File Gateway – For files to S3.
- Volume Gateway – For block storage to EBS.
- Tape Gateway – For virtual tape backups to Glacier.
B. For Online / Smaller Data Transfers
- AWS DataSync
- Managed service to transfer data over the network from on-premises storage to AWS.
- Works for S3, EFS, FSx.
- Efficient for daily or repeated migrations.
- S3 Transfer Acceleration
- Uses AWS Edge locations to speed up uploads to S3 globally.
- Good for smaller datasets spread across regions.
- AWS CLI / SDK
- Can directly copy files to S3 or EFS using command-line tools or scripts.
- Ideal for automated or scripted migrations.
C. Database-Specific Migration
- AWS Database Migration Service (DMS)
- Migrates databases from on-premises or other clouds to AWS RDS or Aurora.
- Supports continuous replication with minimal downtime.
- Example: Moving a MySQL database to Amazon Aurora.
3. How to Choose the Right Migration Method
Here’s a simple decision guide for the exam:
| Scenario | Recommended Service | Why |
|---|---|---|
| Small files (< 1 TB), fast transfer | AWS CLI / S3 Transfer Acceleration | Easy, cost-effective, network-based |
| Medium files (1–100 TB), offline option | AWS Snowball | Avoids long internet transfers |
| Extremely large files (>100 PB) | AWS Snowmobile | Physical transport of huge data |
| Continuous sync from on-prem | Storage Gateway | Hybrid storage integration |
| Large on-prem to AWS over network | DataSync | Fast, managed, automated |
| Database migration | Database Migration Service (DMS) | Supports replication & minimal downtime |
| Need fast uploads globally | S3 Transfer Acceleration | Uses AWS Edge locations |
Key exam tip: AWS often tests your ability to match data size, speed requirements, and cost considerations to the correct migration service.
4. Other Considerations for Migration
- Security
- Enable encryption in transit (TLS) and at rest (S3 SSE, EBS encryption).
- For Snow devices, AWS provides hardware encryption.
- Cost Optimization
- For large datasets, offline methods like Snowball can be cheaper than internet transfers.
- For frequent incremental transfers, DataSync reduces operational overhead.
- Performance
- High-speed or low-latency workloads: use Direct Connect.
- Distributed global users: use S3 Transfer Acceleration.
- Ease of Management
- Fully managed services like DataSync or DMS require minimal manual work.
- Physical devices like Snowball need shipment but are simpler than setting up high-speed networking.
5. Key Points to Remember for the Exam
- Snow Family: physical devices for large offline transfers.
- DataSync: online, automated data migration.
- Storage Gateway: hybrid on-prem + cloud integration.
- DMS: database migrations with minimal downtime.
- S3 Transfer Acceleration: faster global uploads to S3.
- Always consider data size, speed, cost, and security when selecting a service.
✅ Quick Memory Tip:
- “Snow is physical, DataSync is digital, Gateway is hybrid, DMS is for databases, Acceleration is for speed.”
