DVA-C02 Logs, Metrics, Alerts, and Health Checks Guide

April 1, 2026

Study DVA-C02 Logs, Metrics, Alerts, and Health Checks: key concepts, common traps, and exam decision cues.

On this page

This lesson is about building observability into the application instead of hoping infrastructure will explain everything later. DVA-C02 wants developers to understand logging strategy, custom metrics, alerts, tracing annotations, and the difference between liveness and readiness signals.

Structured logging: Logging approach where fields such as user ID, request ID, action, and status are emitted in a machine-readable format.

Health check: Probe or endpoint used to determine whether an application instance is alive, ready, or safe to receive traffic.

What AWS is really testing here

AWS wants you to know:

why logging, monitoring, and observability are related but not identical
when custom metrics are better than only raw log review
how alerts should point at actionable conditions instead of raw noise
why readiness and health checks influence traffic flow and diagnosis
why application-level instrumentation matters even when AWS already gives infrastructure metrics

Choose the right observability output

Need	Strongest first control	Why
search and correlate failures quickly	structured logging	Queryable fields outperform free-form text under pressure.
measure business or app-specific behavior	custom metrics	Infra defaults do not capture app semantics.
alert only when action is required	thresholds tied to meaningful symptoms	Actionable alerts reduce noise fatigue.
know whether the process is alive	liveness health check	This is not the same as traffic readiness.
know whether the instance can safely receive requests	readiness health check	Prevents sending traffic to a warming or broken dependency path.
connect application logs to metrics	EMF or deliberate structured emission	Bridges application events into CloudWatch metric workflows.

Logging, metrics, and health checks work together

    flowchart LR
	    A["Application code"] --> B["Structured logs"]
	    A --> C["Custom metrics / EMF"]
	    A --> D["Tracing annotations"]
	    A --> E["Health endpoints"]
	    C --> F["Dashboards and alerts"]
	    E --> G["Traffic decisions and deployment safety"]

Strong answers usually prefer:

structured logs instead of giant free-form debug blobs
business or app metrics for things infra metrics cannot express
alerts that map to a response, not just to any activity
readiness probes when traffic should wait for dependencies to be ready

Instrument for the question you will ask later

Developers often log too much text and too little structure. DVA-C02 prefers instrumentation that helps future diagnosis:

request IDs
tenant IDs where appropriate
operation names
status or result fields
latency or size values that can become metrics

Small Python example:

1import json
2import time
3
4def emit_order_metric(order_id: str, started_at: float) -> None:
5    print(json.dumps({
6        "orderId": order_id,
7        "latencyMs": int((time.time() - started_at) * 1000),
8        "status": "success",
9    }))

The point is not a logging framework choice. The point is that application instrumentation should preserve searchable context and measurable signals.

Common traps

writing verbose unstructured logs that are hard to search under pressure
emitting no business or application metrics beyond infrastructure defaults
alerting on every event instead of actionable thresholds
assuming a running process is automatically ready to serve traffic correctly

High-confusion pairs

Pair	How to separate them
logging vs monitoring	event detail vs aggregate signal and thresholds
custom metrics vs raw log search	measurable trends and alarms vs ad hoc investigation detail
liveness vs readiness	process alive vs safe to receive traffic
alerting vs dashboarding	proactive notification vs operator inspection surface

Decision order that usually wins

Separate business-specific signals, log structure, traffic-readiness gating, and alert quality.
If infrastructure metrics miss the real application behavior, emit custom application metrics.
If troubleshooting under pressure is hard, improve structured logging before adding more prose logs.
If a service should not receive traffic until dependencies are ready, think readiness checks.
If pages are noisy, redesign for actionable thresholds instead of adding more alerts.

Quiz

Loading quiz…

Revised on Monday, June 15, 2026

4.1 Root Cause

4.3 Performance & Tuning

Browse AWS Certification Guides