Hybrid storage options (for example, DataSync, Transfer Family, Storage Gateway)

Task Statement 4.1: Design cost-optimized storage solutions.

📘AWS Certified Solutions Architect – (SAA-C03)


Hybrid Storage in AWS

Hybrid storage refers to solutions that let you connect on-premises storage systems with AWS cloud storage. This allows organizations to use cloud storage without completely moving all their data to the cloud. It’s important for the exam because AWS often tests your understanding of connecting existing IT infrastructure to the cloud efficiently and cost-effectively.

The three main AWS services for hybrid storage are:

  1. AWS DataSync
  2. AWS Transfer Family
  3. AWS Storage Gateway

1. AWS DataSync

What it is:

AWS DataSync is a managed data transfer service. It moves large amounts of data between on-premises storage and AWS storage (like Amazon S3, Amazon EFS, or Amazon FSx) quickly, securely, and efficiently.

How it works:

  • You install a DataSync agent on your on-premises server or storage appliance.
  • DataSync connects to your local storage system and AWS storage.
  • It automatically copies files, preserves metadata, and can sync only changed files to save bandwidth.

Key Features for Exam:

  • Automated transfers: No manual scripts required.
  • High speed: Optimized for large data sets.
  • Secure: Data is encrypted in transit using TLS.
  • Incremental transfer: Only changes are synced after the first full copy.
  • Use cases:
    • Moving file server backups to Amazon S3
    • Migrating on-premises data to Amazon EFS for cloud applications
    • Continuous replication for disaster recovery

2. AWS Transfer Family

What it is:

AWS Transfer Family allows organizations to use standard file transfer protocols to move files directly into AWS storage (usually Amazon S3 or Amazon EFS).

Supported protocols include:

  • SFTP (Secure File Transfer Protocol)
  • FTPS (FTP Secure)
  • FTP (File Transfer Protocol)

How it works:

  • You create a Transfer Family endpoint in AWS.
  • External clients (users or partner systems) can connect using SFTP/FTPS/FTP just like they connect to a regular server.
  • Files are stored directly in Amazon S3 or EFS—no separate servers needed.

Key Features for Exam:

  • Managed service: No need to run your own SFTP server.
  • Supports existing workflows: Works with standard protocols your partners or legacy systems already use.
  • Secure: Supports encryption in transit and at rest.
  • Integrates with IAM: Controls user access easily.
  • Use cases:
    • Partner data uploads to your S3 buckets
    • Receiving log files or backups from remote systems
    • Replacing legacy SFTP servers with AWS-managed service

3. AWS Storage Gateway

What it is:

AWS Storage Gateway is a hybrid storage service that connects on-premises applications to AWS cloud storage in a seamless way. It makes cloud storage appear like local storage to your applications.

Gateway Types:

  1. File Gateway:
    • Provides file-based access to Amazon S3.
    • Stores files as objects in S3, but your apps see them as standard file shares (NFS or SMB).
  2. Volume Gateway:
    • Provides block storage to on-premises applications.
    • Can operate in cached mode (most data in S3, frequently accessed data cached locally) or stored mode (all data stored locally and backed up to S3).
  3. Tape Gateway:
    • Replaces physical backup tapes with virtual tapes stored in Amazon S3 and Glacier.
    • Works with existing backup software that expects tape drives.

Key Features for Exam:

  • Transparent cloud integration: Your apps don’t need to change.
  • Supports hybrid IT environments: Works alongside on-prem storage.
  • Secure: Encryption in transit and at rest.
  • Use cases:
    • Archiving old files to S3/Glacier
    • Using cloud for disaster recovery
    • Integrating legacy backup workflows with the cloud

Exam Tips for Hybrid Storage

  1. DataSync vs Storage Gateway:
    • DataSync = high-speed data migration or replication.
    • Storage Gateway = ongoing hybrid access (your apps continue using cloud as if it’s local).
  2. Transfer Family is for external file transfers using SFTP/FTP protocols, mainly when partners or legacy systems need to connect.
  3. Cost-Optimization Angle:
    • Use DataSync for bulk migration to reduce manual overhead.
    • Use Storage Gateway in cached mode to save on local storage hardware costs.
    • Use Transfer Family to avoid running dedicated SFTP servers, which reduces operational costs.
  4. Remember Protocols and Types:
    • Transfer Family supports SFTP, FTPS, FTP
    • Storage Gateway has File, Volume, and Tape gateways

Summary Table

ServicePurposeAccess TypeKey Use Case
DataSyncMigrate & sync large datasetsN/A (agent-based)Migrate file servers to S3/EFS
Transfer FamilyManaged file transferSFTP, FTPS, FTPPartner uploads directly to S3/EFS
Storage GatewayHybrid storage accessFile, Block, TapeConnect on-prem apps to cloud seamlessly

By understanding these three services and their use cases, protocols, and hybrid capabilities, you’ll be well-prepared for any exam question on hybrid storage.

Buy Me a Coffee