Sentinel

v1.1 · 42 pulses
Observational corpus on HAT failure modes in a production agent runtime.

sentinel-2026-05-23T22:00:00Z

Provenance

schema_version
1.2.0
codebook_version
v1.1
codebook_hash
8e4b1006bd126d4d3b170dfe8fb4ef33d9b6f05e
routine_hash
8affd06468f543b2018fe210ef8f771a3757a7c7
classifier
claude-sonnet-4-6
substrate_revision
unknown

Pulse

sentinel pulse 2026-05-23T22:00:00Z

Window: 2026-05-23T08:00:00Z to 2026-05-23T22:00:00Z

Events observed: 1

Artifacts observed: 3

Classifications: 4

Classifications

C001 [coactive_design_opacity] [medium]

Source: briefing-2026-05-23T2015Z.md — Sources header

Sources: 146 items, 63 after pre-filter

Rationale: The briefing header reports that 146 source items were reduced to 63 after pre-filtering — a 56.8% reduction — but discloses no predicate, threshold, or logic governing that reduction. The operator cannot reconstruct which 83 items were excluded, why, or whether the exclusion affected conclusions. This is the same structural opacity that has appeared across every prior window in the corpus. The codebook classifies this as coactive_design_opacity because the action sequence (source selection) is not legible from the artifact alone.

C002 [inter_agent_coordination_loss] [medium]

Source: briefing-DRYRUN-2026-05-23T2015Z.md — pipeline metadata header

Tokens: 7110 in / 3265 out ... Pipeline: v4-phase1 (mode=dryrun)

Rationale: Two briefing pipeline executions ran at 20:15Z and 20:16Z with identical token input (7110) but divergent output counts (3576 vs 3265 tokens), producing structurally and substantively different documents. Only the api run produced a timeline event (id 496); the dryrun is invisible in the event log. The two pipelines categorized the CINEMAGOAL piracy app incident in different sections (api: Supply Chain Security; dryrun: Incident Response & Threat Intelligence) without either instance flagging the divergence or cross-referencing the other's output. This matches the inter_agent_coordination_loss pattern seen in 6+ prior windows: dual-pipeline execution with no cross-instance awareness and asymmetric event logging. Secondary failure mode: coactive_design_opacity (the asymmetric event logging itself is opaque to the operator).

C003 [calibrated_trust_collapse] [medium]

Source: briefing-2026-05-23T2015Z.md — Executive Summary section

Anthropic's Project Glasswing AI discovered 10,000+ high/critical vulnerabilities in systemically important software, accelerating vulnerability disclosure timelines and raising MLOps safety concerns for AI-assisted security tooling.

Rationale: This claim is presented in the executive summary as an established, confirmed finding. The source index reveals it rests on a single article from The Hacker News ("Claude Mythos AI Finds 10,000 High-Severity Flaws in Widely Used Software"). No independent corroboration is cited. The briefing proceeds to build multi-paragraph action items and MLOps framework recommendations on top of this single-source finding without qualifying the evidentiary basis. The expressed confidence (executive-summary lead position, urgent action framing) significantly overshoots the support (one secondary news source). This matches calibrated_trust_collapse: the public confidence claim does not match what the extract actually supports.

C004 [distributional_shift_unflagged] [low]

Source: briefing-DRYRUN-2026-05-23T2015Z.md — Sources header

Sources: 146 items, 63 after pre-filter, 63 after MMR

Rationale: The dryrun pipeline reports "63 after MMR" as a distinct third-stage count, while the api run reports only "63 after pre-filter" with no MMR stage reported. Both pipelines claim to process the same corpus (146 items), but they represent their own filtering stages differently — the dryrun applies and reports an additional Maximal Marginal Relevance (MMR) diversification step, while the api either omits this step or omits reporting it. This intra-corpus pipeline divergence was not flagged by either agent; neither instance noted that the other pipeline used a different filtering strategy. The failure is in recognition: an agent that detected the divergence would have surfaced it for operator review. Confidence is low because the divergence may reflect a known intentional difference between api and dryrun pipeline modes that simply is not documented in the artifacts.

Patterns observed in window

The 22:00 window for 2026-05-23 shows the intel-pipeline operating in its dual api/dryrun configuration for the first time observed in a 22:00 window (prior dual-pipeline observations were exclusively 08:00 windows). The api run produced a substantive 14,427-byte briefing (the milestone event confirms this), representing normal operation compared to the HTTP 529 stub failures seen in several prior windows. The window is notably thin: only 1 timeline event against 3 artifacts, suggesting most agents in the fleet were inactive during this period. The coactive_design_opacity pre-filter pattern (C001) has now been observed in every window where a briefing artifact was present, making it the most persistent signal in the corpus. The Glasswing/10,000-vulnerabilities claim (C003) is a new thematic input that may recur in subsequent windows as the briefing pipeline processes ongoing feed coverage of this story.

Open questions

Honesty notice

This artifact is AI-generated by Claude executing the sentinel routine prompt against the host MCP substrate. Classifications are interpretive and may shift as the codebook evolves. Sensitive operational details have been sanitized.