thejefflarson · thejefflarson · Jun 30, 2026 · Jun 30, 2026
diff --git a/docs/adr/0020-signature-continuity.md b/docs/adr/0020-signature-continuity.md
@@ -0,0 +1,135 @@
+# 0020. Supply-chain trust is signature continuity, not prefix-gated single-identity
+
+- Status: Accepted
+- Date: 2026-06-29
+
+## Context
+
+The admission webhook's [`SignaturePolicy`](../../engine/src/policies/signature.rs)
+answers one narrow question: *are **my** images signed by **me**?* It is configured
+with `PROTECTOR_GATED_PREFIXES` (the image-ref prefixes to check) and a single
+`PROTECTOR_IDENTITY_REGEXP` (one trusted cosign/Fulcio signer). An image whose ref
+matches a gated prefix must carry a signature from that one identity or it fails;
+**every other image is `NotApplicable`** — not checked, no opinion.
+
+That shape has three problems, and they compound:
+
+1. **No visibility.** Because anything outside the prefix is `NotApplicable`, the
+   operator cannot see the signing posture of the cluster at all. The Admission view
+   shows `n/a` for almost every workload (third-party images, and everything when
+   gating is unconfigured), and `n/a` + "would admit" reads like a green stamp when
+   in fact nothing was inspected. There is no inventory of *which images are signed,
+   and by whom*.
+
+2. **Single-identity is the wrong trust topology.** Real clusters run images signed
+   by many legitimate publishers — distroless by Google, linkerd by Buoyant,
+   Chainguard images by Chainguard, our images by our own workflows. A single
+   org-wide identity regexp can only ever vouch for *our* images; it has nothing to
+   say about the upstream dependencies that make up most of the attack surface, so
+   they all fall to `NotApplicable`.
+
+3. **It cannot catch the actual attack.** The supply-chain threat that matters is
+   **a repository that was serving signed images suddenly serving an unsigned one**
+   (or one signed by a *different* identity) — the signature of an attacker who
+   obtained push access to that registry/repo. A prefix-gated single-identity gate
+   is structurally blind to this: an upstream repo it never gated stays `n/a` before
+   and after the compromise; the signal — *loss of an established signature* — is
+   exactly what the model never records and therefore can never miss-or-catch.
+
+The current gate is a fine *enforcement primitive* but the wrong *model*. It treats
+signing as a static per-image allow-rule, when the security-bearing fact is a
+**change** in signing posture over time.
+
+This is the same shape as the rest of protector. ADR-0016 established the engine's
+thesis: don't alarm on static posture, **observe a baseline and treat the deviation
+as the signal** — reachability is potential, the runtime/enrichment *change* is what
+the model judges. Signing posture is no different: a signed→unsigned regression on an
+established repo is supply-chain *drift*, the direct analogue of the behavioral drift
+the runtime side already corroborates on.
+
+A technical enabler makes the broader model feasible without a hand-built trust map
+for the entire ecosystem: the `sigstore` crate already vendored for `CosignChecker`
+can fetch an image's signature layers and read the **Fulcio certificate subject**
+(signer identity + OIDC issuer) *without* committing to a trusted identity up front.
+We can therefore record "repo `R` signed by `<identity>` via `<issuer>`" for **any**
+image by observation, and let trust be *learned* (Trust On First Use) rather than
+pre-declared.
+
+## Decision
+
+**Supply-chain trust is modeled as signature continuity, observed for every image and
+learned per source — not as a prefix-gated check against one identity.** Three layers,
+each a strict superset of what the old gate did, rolled out audit-first to honor the
+shadow invariant (ADR-0016: the engine proposes, it never acts by surprise).
+
+1. **Observe every image (no trust config required).** For each image admitted *and*
+   already running (the engine watches Pods; existing workloads are swept through the
+   same observer, not just new admissions), record its signing posture: signed or not,
+   and — when signed — the signer identity + OIDC issuer read from the cert subject.
+   This is pure observation; it requires no `gated_prefixes` and no trusted identity,
+   and it is what the Admission view renders as a real per-image *"signed by `<id>` /
+   unsigned"* column for **all** images.
+
+2. **Learn a per-repo baseline (TOFU).** Persist, durably (alongside the decision
+   journal, so it survives restarts), a signing history keyed by **repository**
+   (`registry/repo`): the identity/issuer set observed signing images from that source,
+   and whether that source has an established signed history. A new *digest* under a
+   repo is normal (that is every deploy); the baseline is about the *source*, not the
+   tag or digest.
+
+3. **Detect drift and decide (the breach-relevant signal).** A **signing regression**
+   — an image from a repo with an established signed history that arrives *unsigned*,
+   or signed by an *identity not previously seen for that repo* — is surfaced as a
+   finding and, in enforced scope, blocked. Known-benign exceptions (a publisher that
+   legitimately stopped signing, a deliberate signer rotation) are managed by an
+   explicit pin/acknowledgement, not by disabling the signal.
+
+The old prefix-gated single-identity gate becomes **one pinned special case** of layer
+3: "repo prefix `ghcr.io/<org>` must always be signed by identity `X`" is a manually
+asserted baseline, equivalent to what TOFU would learn but declared up front. We keep
+that as an available pin; we no longer make it the whole model.
+
+Enforcement stays opt-in per scope (`PROTECTOR_ENFORCE_*`), audit-everywhere by
+default, exactly as today. Signature verification continues to use the already-
+sanctioned outbound path (the registry the cluster already pulls from + sigstore
+TUF/Rekor) — this is the existing ADR-0015 exception for advisory/verification data,
+now exercised for every distinct image rather than only gated ones, bounded by the
+existing verification cache and `PROTECTOR_MAX_IMAGES` cap. The security graph and
+evidence still never leave the cluster.
+
+## Consequences
+
+What becomes easier:
+
+- **Real supply-chain visibility.** The operator can finally see, in audit mode, which
+  images across the cluster are signed and by whom — the question the old model could
+  not answer for anything outside its prefix.
+- **The repo-compromise attack becomes catchable.** A signed→unsigned (or signer-change)
+  regression on an established source is precisely the push-access-compromise signal,
+  and it is now a first-class finding instead of structurally invisible.
+- **Trust scales to the real dependency set** without a hand-maintained identity map:
+  upstream publishers' identities are learned by observation; only exceptions need a pin.
+- **One coherent thesis.** Signing drift joins reachability and behavioral drift under
+  ADR-0016 — the model decides on a *deviation*, not on static posture — so the Admission
+  surface stops being a special-case allow-list and becomes another evidence channel.
+
+What becomes harder / the downsides we accept:
+
+- **More verification work.** Inspecting every distinct image's signature is more
+  outbound verification than gating a prefix. Bounded by the verification cache + the
+  `MAX_IMAGES` cap, but it scales with the number of distinct images, not with our org.
+- **TOFU cold-start is trust-on-*first*-use.** An image that was malicious *before*
+  protector first observed it looks clean — the baseline is "what we first saw," not
+  ground truth. The model catches *changes from* the baseline, not a poisoned origin;
+  a pin is the only way to assert ground truth up front. This is an honest limitation,
+  surfaced as such (a freshly-learned baseline is weaker evidence than an aged one).
+- **False positives on legitimate change.** A publisher that stops signing, or rotates
+  signing identity, trips the regression signal. We accept this and manage it with an
+  explicit pin/ack, deliberately *not* a global off-switch — silencing the channel must
+  be a scoped, recorded decision, never the default.
+- **New durable state.** The per-repo signing baseline must persist and be reasoned
+  about (keying granularity, eviction, replay on boot) — net-new state on the same
+  footing as the decision journal.
+- **Keying is a judgement call.** Repo-level baselines can miss a per-tag distinction
+  and can over-trust a repo that legitimately serves a mix; the staged rollout starts
+  at repo granularity and revisits if observation shows it is too coarse.
diff --git a/docs/adr/README.md b/docs/adr/README.md
@@ -27,5 +27,6 @@ Copy [`0000-template.md`](0000-template.md) to start one.
 | [0017](0017-isolation-persists-on-the-breach-condition.md) | Isolation persists on the breach condition: chain ∧ enrichment fingerprint (revert keys on `entry_fingerprint`) | Accepted |
 | [0018](0018-operator-configured-redacted-breach-notifier.md) | The breach notifier is the one sanctioned outbound path: operator-configured, off by default, redacted by default | Accepted |
 | [0019](0019-dashboard-v3-presentation-architecture.md) | Dashboard v3: server-rendered (maud), zero-egress, light-theme presentation — the view_model/component/page split + the honesty invariants | Accepted |
+| [0020](0020-signature-continuity.md) | Supply-chain trust is signature continuity: observe every image, learn a per-repo TOFU baseline, treat the signed→unsigned / identity-change regression as the signal — not prefix-gated single-identity | Accepted |
 
 See also [`../VISION.md`](../VISION.md) for the longer-form narrative this ADR realizes.