STRC h09 Phase 4n — Monte Carlo + sensitivity refinement of Phase 4j avidity Kd model
Goal
Phase 4j answered “for an arbitrary 4×4×5 grid, 47/80 cells PASS”. That is a property of the grid, not a probability statement. Phase 4n converts the same Mammen polyvalent formalism into the right question under literature-bracketed priors:
Given what literature constrains us to know about (Kd_mono, C_eff, N), what is P(Kd_eff < 50 µM)?
Plus a Sobol-style first-order sensitivity ranking — which input dominates outcome variance, and therefore which next experiment buys the most uncertainty reduction.
Method
scripts/phase4n_avidity_montecarlo.py — pure analytic, numpy + Accelerate, runs in seconds on M5 Max CPU (no GPU needed).
Priors (lit-bracketed, mirror Phase 4j grid spans):
Kd_mono ~ LogUniform(1 mM, 20 mM)— Husson 2010 Tβ4 × F-actin floor 5–10 mM ± 2× speculative band on either sideC_eff ~ LogUniform(13.4 mM, 500 mM)— Karpen 1998 PEG-3400 flexible-linker floor → Mammen 1998 rigid-scaffold ceilingN ~ DiscreteUniform({3, 4, 5, 6})— geometric minimum 3 contacts, maximum 6 per fibril × filament patch
Mammen entropic-penalty stress test: Mammen 1998 §1.16 cautions that real avidity falls 10–100× short of the ideal-cooperativity formula due to non-ideal entropic linker penalties. Stress-test multiplier = geometric mean = sqrt(10·100) ≈ 31.6× haircut applied to all Kd_eff before re-evaluating gate.
Samples: 100,000. Seed: 20260426.
Sensitivity: for each continuous variable, condition on quartiles of its sample distribution and report P(PASS | x_i ∈ q_k). Range across quartiles = first-order variance contribution proxy. For discrete N, condition directly on each value.
Results
Headline
| Reading | P(PASS) | 95% CI |
|---|---|---|
| Ideal cooperativity | 0.803 | [0.801, 0.806] |
| 30× entropic haircut (worst-case Mammen) | 0.590 | [0.587, 0.593] |
The literature-plausible parameter space passes the 50 µM gate with 80% probability under ideal cooperativity. Even after the worst-case Mammen-style entropic penalty (factor 31.6× weaker than ideal), 59% still passes — i.e., even the conservative-conservative reading clears the gate more often than not.
Kd_eff distribution (ideal cooperativity)
| Percentile | Kd_eff (µM) |
|---|---|
| 5% | ~0 (sub-nM) |
| 25% | 0.003 (3 nM) |
| 50% (median) | 0.378 (≈400 nM) |
| 75% | 19.8 |
| 95% | 1786 (≈1.8 mM) |
Median Kd_eff ≈ 400 nM = 125× under the gate; 75th percentile = 19.8 µM = 2.5× under gate; the ~20% tail above gate is dominated by the worst-case parameter combinations.
Sensitivity (first-order, by quartile spread of conditional P(PASS))
| Rank | Variable | ΔP(PASS) across quartiles |
|---|---|---|
| 1 | Kd_mono | 0.51 |
| 2 | C_eff | 0.45 |
| 3 | N | 0.26 |
Conditional P(PASS) by N (N is discrete):
| N | P(PASS) |
|---|---|
| 3 | 0.644 |
| 4 | 0.798 |
| 5 | 0.868 |
| 6 | 0.903 |
Interpretation
-
The gate is favored, not marginal. Even with the most pessimistic Mammen non-ideality correction, more than half the literature-plausible space passes. This formalizes why the Phase 4j conservative-FAIL was specifically a 1.6× margin within input noise: it sits at the edge of the failing tail, not in its bulk.
-
Kd_mono dominates uncertainty (ΔP ≈ 0.51), not N. This is a meaningful re-ordering of next-experiment priority. Before Phase 4n, “Phase 4k AF3 fibril × actin geometry” was implicitly highest-leverage because it was cheapest. Phase 4n shows Phase 4m (Kd_mono refinement via Chereau ipTM-vs-Kd calibration) buys more variance reduction per AFS quota unit.
-
Phase 4l (MD measurement of C_eff) is second-highest leverage, not third. The OpenMM 100 ns campaign on M5 Max is justified — it tightens the second-largest contributor.
-
Phase 4k is genuinely third in leverage despite being cheapest. The N axis still matters (the 0.26 spread is non-trivial — N=3 vs N=6 swings P(PASS) from 64% → 90%), but it gives the smallest information gain.
-
Re-ordered next-experiment priority (lit-leverage maximizing):
- Phase 4m — AFS, 6 jobs (Kd_mono soft-floor)
- Phase 4l — local OpenMM, 100 ns (C_eff direct measurement)
- Phase 4k — AFS, 5 jobs (N geometric verification)
Tomorrow’s combined 13-job AFS submission stays as built — these are independent jobs that share the same upload — but the analysis order on results changes.
Caveats
-
First-order conditional sensitivity ≠ formal Sobol indices. A proper variance-decomposition (Saltelli’s method, Sobol or PAWN) requires an additional ~2N samples per main effect. The quartile-spread proxy is well-known to under-report interaction terms. Conclusion: Kd_mono > C_eff > N is the first-order ranking; interactions could shift it modestly.
-
Priors are uniform on log scale, not lit-frequency-weighted. Husson 2010 Tβ4 is a single point measurement; LogU(1, 20) mM gives equal weight to “WH2 might be 5× tighter than Tβ4” and “WH2 might be 2× weaker”. A Bayesian prior with a peak at Husson’s central value would tighten P(PASS) further but at the cost of overcommitting to one literature analog.
-
No avidity-haircut model interaction. The 31.6× haircut is applied uniformly post-hoc. In reality, the haircut depends on linker rigidity (RADA16 is rigid → smaller haircut, maybe ~10×). Conservative reading: applied 31.6× is upper-bound penalty. Truer reading likely closer to the ideal P(PASS) = 80% than the stressed 59%.
-
Same load-bearing literature gap as Phase 4j. WH2 × F-actin Kd was never measured. The probability statement is “given Tβ4 is the right structural-class proxy, with bracketed uncertainty”. Phase 2c wet-lab (off-limits per project rule) is the only way to remove the proxy assumption.
-
No cooperativity-anti-cooperativity boundary check. Mammen formula assumes positive cooperativity (C_eff > Kd_mono). When it fails, formula returns Kd_eff > Kd_mono (anti-cooperative). The Phase 4n implementation computes regardless — those cases automatically fail the gate, which is correct directionally even if the absolute number is meaningless.
Ranking delta
- Tier: tentative-S held. Phase 4n strengthens but does not move the tier — promotion to full S still requires at least one of {4k geometry passes, 4m soft-floor < 5 mM, 4l C_eff > 100 mM} from tomorrow’s batch + Phase 4l outputs. What 4n does change: confidence in the tentative-S floor. P(PASS, stressed) = 59% means the worst-case literature reading is ALREADY at Phase 4j’s grid-PASS rate. Risk of demotion to A on tomorrow’s results is meaningfully lower than pre-4n.
- Mech: 5 → 5 held. Probabilistic refinement of Phase 4j; same formalism, same lit inputs.
- Deliv: 4 → 4 held.
- Misha-fit: 3 → 3 held.
- next_step: REORDER. Tomorrow’s 13-job AFS submission unchanged (one-shot upload). Analysis priority on results: Phase 4m first (highest variance contributor — Kd_mono refinement). Then Phase 4l (C_eff local MD on M5 Max OpenCL). Then Phase 4k (N geometry, lowest conditional ΔP per outcome).
- Risk that reverses tentative-S: unchanged — if Phase 4k tomorrow shows N < 3 achievable in ALL covalent-tandem-WH2 proxies, conservative reading degrades to Kd_eff > 1 mM = catastrophic FAIL. Phase 4n shows the conditional P(PASS|N=3) is 64%, so even N=3 alone wouldn’t kill the hypothesis under the prior, but it would lock conservative scenario into the failing tail.
Connections
[part-of]STRC Synthetic Peptide Hydrogel HTC (h09 hub)[refines]STRC h09 Phase 4j Avidity Kd Model 2026-04-26[informs]h09 hubnext_step(frontmatter)[uses]1998-mammen-polyvalent-interactions-angew (formalism + 10–100× haircut basis)[uses]2026-04-23-husson-wh2-multifunctionality (Kd_mono prior bracket)[uses]1998-kramer-karpen-polymer-ligand-dimers-nature (C_eff prior floor)[see-also]STRC h09 Phase 4i Kd Sensitivity Sweep 2026-04-23 (deterministic 5×5 sweep predecessor)[see-also]STRC Computational Scripts Inventory