Task Statement 3.1: Determine high-performing and/or scalable storage solutions.
📘AWS Certified Solutions Architect – (SAA-C03)
1. What is Hybrid Storage?
Hybrid storage means combining:
- On-premises storage (your local data center, servers, NAS, SAN)
- AWS cloud storage
👉 The goal is to connect both environments so data can move between them seamlessly.
2. Why Use Hybrid Storage?
In real IT environments, organizations often:
- Already have existing storage systems
- Cannot move all data to cloud immediately
- Need low latency access to local data
- Want backup, disaster recovery, or scaling in AWS
Key Benefits:
- Gradual cloud migration
- Cost optimization
- Backup & disaster recovery
- Scalability
- Compliance (keep some data on-prem)
3. Key AWS Hybrid Storage Services (VERY IMPORTANT FOR EXAM)
You must understand these core services:
3.1 AWS Storage Gateway
What it is:
A service that connects on-premises storage to AWS cloud storage.
It runs as:
- Virtual machine (VMware/Hyper-V)
- Hardware appliance
Types of Storage Gateway (EXAM CRITICAL)
1. File Gateway
- Provides file-based access (NFS/SMB)
- Stores files in:
- Amazon S3
Key Features:
- Appears like a normal file server
- Files stored as objects in S3
- Supports caching locally
Use Case:
- File sharing across environments
- Application storing files locally but backed by S3
2. Volume Gateway
Provides block storage (iSCSI)
Two Modes:
a. Cached Volumes
- Frequently accessed data → stored on-prem
- Entire dataset → stored in S3
👉 Good for:
- Low local storage, large cloud storage
b. Stored Volumes
- Entire data stored on-prem
- AWS stores backups (snapshots) in:
- Amazon EBS snapshots
👉 Good for:
- Need full local access
- Backup in AWS
3. Tape Gateway
- Replaces physical tape backups
- Uses:
- Amazon S3 Glacier
Key Features:
- Virtual tape library (VTL)
- Long-term backup storage
Storage Gateway Summary Table
| Type | Storage Type | Stored In | Best Use |
|---|---|---|---|
| File Gateway | File | S3 | Shared files |
| Cached Volume | Block | Mostly AWS | Low local storage |
| Stored Volume | Block | Mostly local | Backup to AWS |
| Tape Gateway | Backup | Glacier | Archive |
3.2 AWS DataSync
What it is:
A service to move large amounts of data between:
- On-prem → AWS
- AWS → AWS
Key Features:
- Fast and automated transfer
- Data validation
- Encryption
- Incremental transfer (only changed data)
Supported Storage:
- NFS, SMB (on-prem)
- Amazon S3
- Amazon EFS
- Amazon FSx
Use Cases:
- Data migration
- Data replication
- Backup
3.3 AWS Snow Family
What it is:
Physical devices used to transfer data when:
- Internet is slow
- Data size is very large (TBs–PBs)
Types:
- Snowcone (small)
- Snowball (medium)
- Snowmobile (very large scale)
Key Features:
- Secure data transfer
- Offline migration
Use Case:
- Move large datasets to AWS
- Data center migration
3.4 Amazon EFS + On-Prem Access
Hybrid Setup:
- On-prem servers connect to EFS via:
- VPN or Direct Connect
Key Features:
- Shared file system
- Scales automatically
- Linux-based workloads
3.5 Amazon FSx
Supports hybrid workloads with:
- FSx for Windows File Server
- FSx for NetApp ONTAP
Hybrid Advantage:
- Can integrate with on-prem Active Directory
- Supports enterprise file systems
4. Network Connectivity for Hybrid Storage
Hybrid storage requires connection between on-prem and AWS:
1. AWS Site-to-Site VPN
- Encrypted connection over internet
- Quick to set up
- Lower cost
2. AWS Direct Connect
- Private dedicated connection
- High performance
- Low latency
Exam Tip:
- VPN = cheaper, internet-based
- Direct Connect = faster, dedicated, expensive
5. Common Hybrid Storage Architectures (EXAM IMPORTANT)
1. Backup to Cloud
- On-prem data → Storage Gateway → S3/Glacier
2. Cloud Burst
- Local system handles normal load
- AWS handles extra storage demand
3. Migration
- DataSync / Snow → move data to AWS
4. Disaster Recovery (DR)
- Local data backed up to AWS
- Restore during failure
6. Choosing the Right Hybrid Solution (EXAM LOGIC)
Scenario-Based Decision Guide:
Need file-based access (NFS/SMB)?
→ Use Storage Gateway (File Gateway)
Need block storage (iSCSI)?
→ Use Volume Gateway
Need backup/archive?
→ Use Tape Gateway
Need fast data transfer?
→ Use DataSync
No/slow internet + huge data?
→ Use Snow Family
Need shared file system across environments?
→ Use EFS or FSx
7. Security in Hybrid Storage
Key concepts:
- Encryption in transit (TLS)
- Encryption at rest (S3, EBS, etc.)
- IAM access control
- Secure network (VPN / Direct Connect)
8. Important Exam Tips
✅ Storage Gateway is the MOST IMPORTANT hybrid service
✅ Know differences between:
- File vs Volume vs Tape Gateway
✅ DataSync = online transfer
✅ Snow = offline transfer
✅ Direct Connect = high performance
✅ S3 = central storage layer in hybrid setups
9. Quick Revision Summary
- Hybrid storage = On-prem + AWS
- Core services:
- Storage Gateway
- DataSync
- Snow Family
- EFS / FSx
- Connectivity:
- VPN or Direct Connect
- Use cases:
- Backup
- Migration
- DR
- Scaling
