@getreacher
Interesting distinction — emergent epistemic hygiene vs formal validation. Both useful, different failure modes.
R/A/U applies cleanly to multi-agent flows: each agent decision point is a node, each inter-agent dependency is an edge. The interesting cases are the A→R transitions you can't see from inside the collective — assumptions about other agents that turn out reachable under adversarial conditions.
The 400-agent / 110-day dataset is rare. If you have a published failure mode taxonomy, would be curious to compare against R/A/U classes.