Study DEA-C01 Data Lifecycle, Loads and Retention: key concepts, common traps, and exam decision cues.
DEA-C01 expects you to think about what happens after data lands: how it is loaded, retained, versioned, archived, deleted, or replayed. Lifecycle choices affect cost, compliance, and operational recoverability.
Lifecycle policy: Rule that transitions or expires data based on age or other storage conditions.
Unload: Exporting data from a warehouse such as Amazon Redshift out to storage, commonly Amazon S3.
Retention boundary: Point at which data must be kept, archived, versioned, or deleted to satisfy business and legal requirements.
AWS wants you to separate:
The exam usually rewards answers that treat lifecycle as a design discipline, not as “keep everything until someone cleans it up later.”
| Requirement | Strongest first fit | Why |
|---|---|---|
| objects should move to a cheaper tier as they age | S3 Lifecycle transition rule | The need is age-based storage-tier management |
| old data should expire automatically after the retention window | S3 Lifecycle expiration rule | The requirement is rule-driven expiration |
| warehouse results should be exported to durable object storage | Amazon Redshift UNLOAD to Amazon S3 |
DEA-C01 expects Redshift load and unload patterns |
| S3 objects must remain recoverable after accidental overwrite | S3 versioning | The problem is object-history protection |
| DynamoDB items should disappear after a validity window | DynamoDB TTL | The requirement is age-based item expiration at the table layer |
| If the stem emphasizes… | Think first | Why this fits |
|---|---|---|
| moving older objects to cheaper storage | S3 Lifecycle transition | This is tier management, not deletion |
| deleting objects after a retention window | S3 Lifecycle expiration | This is rule-based removal |
| restoring previous object states | S3 versioning | The center of gravity is recoverability |
| aging out table rows automatically | DynamoDB TTL | This is table-native expiry behavior |
| exporting warehouse data for downstream storage or sharing | Redshift UNLOAD |
This is outbound movement from the warehouse |
| Situation | Stronger first pattern |
|---|---|
| files in S3 should populate warehouse tables | load from Amazon S3 into Amazon Redshift |
| query results or warehouse subsets should be exported for downstream use | unload from Amazon Redshift to Amazon S3 |
| data must stay replayable after transformation mistakes | keep a durable raw or historical copy before destructive cleanup |
| retention rules differ by zone or dataset class | manage lifecycle per bucket, prefix, or table design rather than one blanket rule |
flowchart LR
A["Data ages or changes role"] --> B{"What is needed?"}
B -->|Cheaper storage over time| C["Lifecycle transition"]
B -->|Delete after retention boundary| D["Lifecycle expiration"]
B -->|Recover previous object state| E["Versioning"]
B -->|Expire table items automatically| F["DynamoDB TTL"]
B -->|Export warehouse data| G["Redshift UNLOAD"]
| Question | Better reading |
|---|---|
| “Can we restore an older object version?” | Use S3 versioning when overwrite or delete recovery matters. |
| “Can the data age into lower-cost storage automatically?” | S3 Lifecycle transition is the strongest first answer. |
| “Should stale DynamoDB rows disappear eventually?” | TTL is the table-native expiration lane. |
| “Must data be deleted on a legal timeline?” | Retention and deletion requirements should be enforced deliberately, not left to ad hoc cleanup. |
When lifecycle controls look similar, use this order:
| Trap | Better reading |
|---|---|
| “Archive and delete mean the same thing.” | DEA-C01 distinguishes lower-cost retention from actual expiration or deletion. |
| “Versioning replaces lifecycle rules.” | Versioning protects object history, while lifecycle rules manage tiering and expiration. |
| “UNLOAD is just another load job.” | Load and unload solve opposite movement directions. |
| “Recoverability only matters for production databases.” | Raw zones, historical exports, and object history matter for data pipelines too. |
A company keeps daily Redshift aggregates for analysts, but legal policy says raw source files must remain restorable for 1 year while older raw data should shift to cheaper storage automatically. What is the strongest reading first?
UNLOAD when exports are needed, enable S3 versioning for recovery, and apply S3 Lifecycle transitions and expiration rules to the raw zoneCorrect answer: B. DEA-C01 expects you to combine movement, recoverability, and retention controls instead of forcing one feature to do every lifecycle job.