DEA-C01 Choosing Data Stores for Access Patterns Guide

Study DEA-C01 Choosing Data Stores for Access Patterns: key concepts, common traps, and exam decision cues.

Store choice is one of the highest-yield DEA-C01 decision types. The right store depends on access pattern, latency, schema flexibility, cost, and the difference between landing, serving, and warehouse-style analytics.

Access pattern: The way applications or analysts read, write, filter, update, or aggregate data in practice.

Lake storage: Durable object storage that keeps raw or curated files for broad analytical use.

Serving store: Store optimized for the way an application or workload consumes data, such as low-latency key lookups or relational transactions.

What AWS is really testing here

AWS wants you to separate:

  • lake storage from serving storage
  • warehouse analytics from transactional workloads
  • key-value access from relational joins
  • “where data lands first” from “where it should be queried for the actual use case”

High-yield chooser

Need Strongest first fit Why
object-based lake storage Amazon S3 The requirement is durable object storage for raw or curated data
warehouse analytics Amazon Redshift DEA-C01 expects a warehouse answer for large analytical SQL workloads
managed relational pattern Amazon RDS or Aurora The need is relational structure, transactions, or familiar SQL application behavior
key-value / low-latency access Amazon DynamoDB The access pattern is high-scale lookup, not relational join logic
search-oriented text retrieval Amazon OpenSearch Service The need is search and indexing rather than warehouse analytics

Read the access pattern before the product name

If the stem emphasizes… Think first Why this fits
raw or curated files, broad durability, low-cost object storage S3 This is lake storage, not necessarily the serving layer
complex SQL over modeled analytical data Redshift The workload is warehouse analytics
transactional rows, joins, and application consistency RDS or Aurora This is a relational serving workload
high-scale key lookups or access by primary key DynamoDB The workload is low-latency key-value access
text search, ranked results, or indexing OpenSearch This is search, not warehouse or OLTP access
    flowchart LR
	  A["Data workload"] --> B{"How is data consumed?"}
	  B -->|Files and lake-style storage| C["S3"]
	  B -->|Warehouse SQL analytics| D["Redshift"]
	  B -->|Relational transactions and joins| E["RDS / Aurora"]
	  B -->|Low-latency key access| F["DynamoDB"]
	  B -->|Search and indexing| G["OpenSearch"]

Store-selection reading

Question Better reading
“Will this mostly be queried as files in a lake?” Think S3-based storage plus the right query layer
“Do analysts need warehouse-style SQL performance on modeled data?” Think Redshift
“Does the application need relational consistency and joins?” Think RDS or Aurora
“Is the dominant need millisecond key-value access at scale?” Think DynamoDB
“Do users need full-text or indexed search behavior?” Think OpenSearch instead of forcing a warehouse to do search work

Landing, serving, and analytics can be different stores

Role Typical first thought
raw landing zone S3
application serving store DynamoDB or RDS/Aurora depending on the access pattern
analytical serving store Redshift or a lake query pattern

DEA-C01 often rewards answers that keep these roles separate instead of forcing a single store to handle every access pattern badly.

Common traps

Trap Better reading
“S3 is enough for every workload.” S3 is foundational lake storage, not always the final serving store.
“Redshift is just another database.” DEA-C01 expects you to read it as a warehouse analytics engine.
“DynamoDB is a relational replacement by default.” DynamoDB fits key-value and high-scale access patterns, not every relational workload.
“RDS and Aurora are the same as analytical warehouses.” They solve transactional relational needs, not the same warehouse-style problem as Redshift.

How strong DEA-C01 answers usually reason

  1. Decide whether the workload is lake, warehouse analytics, transactional relational serving, key-value serving, or search.
  2. Separate where data lands from where the application or analyst consumes it.
  3. Do not force Redshift into OLTP or DynamoDB into relational-join workloads.
  4. Use multiple stores when the access patterns are genuinely different.

Decision order that usually wins

When several stores could technically work, choose in this order:

  1. Identify the access pattern before the product name.
  2. If the workload is raw or curated file storage, start with S3.
  3. If the workload is repeated analytical SQL on modeled data, start with Redshift.
  4. If the workload is transactional relational serving, start with RDS/Aurora.
  5. If the workload is key-value or search-driven, move toward DynamoDB or OpenSearch instead of forcing a warehouse answer.

Harder scenario question

A company lands raw clickstream files in a lake, serves user-profile lookups to an application with low latency, and also needs modeled analytical SQL for business reporting. What is the strongest reading first?

  • A. Use only Amazon Route 53 for everything
  • B. Use Amazon S3 for the lake, DynamoDB for low-latency profile access, and Amazon Redshift for warehouse-style reporting
  • C. Use only Amazon EBS snapshots
  • D. Put dashboards directly on CloudTrail

Correct answer: B. DEA-C01 expects you to match different stores to different access patterns instead of forcing one store to solve unrelated workloads.

Quiz

Loading quiz…
Revised on Sunday, May 10, 2026