Green health checks but degraded Kafka throughput
1. Environment: Kubernetes + Kafka consumers in local-connected workspace mode.
2. Operational problem: Consumer lag rose while API probes remained healthy.
3. What operators saw: Green endpoint checks, but rising lag and reduced consumer throughput.
4. Traditional tooling showed: Probe dashboard stayed healthy; lag charts were isolated and not action-ranked.
5. CTK detected: Cross-signal contradiction between probe health and runtime behavior collapse.
6. Contradiction state: Active contradiction linked to queue pressure and runtime instability.
7. Confidence evolution: 0.84 -> 0.52 until fresh logs and throughput recovery evidence arrived.
8. Safety gate behavior: Restart blocked; safer scale-consumer action allowed under higher trust path.
9. Replay chain: Lag trend + throughput drop + healthy probe -> contradiction node -> confidence penalty -> split action branch.
10. Final operational outcome: Team scaled consumers first, restored throughput, then replayed chain before any disruptive restart.
