Collect logs, metrics, and traces without making every outage an ingestion-cost problem.
Spend 45 minutes on clarifying questions, requirements, scale, architecture, data model, failure modes, and tradeoffs. For ML prompts, reserve at least 10 minutes for evaluation and lineage.