STRC h01 Phase 8 — v5 Library Design with Coulomb-Aware Ranking
Phase 5k Stage B showed that Vina’s v3b ranking is nearly orthogonal to ensemble Coulomb ranking (Vina top-1 is only #3 by Coulomb; top-1 Coulomb was Vina rank #8). This phase builds a 60-ligand v5 library around the top Coulomb scaffold (2-amino-quinoline-3) with 3 explicit design axes: (1) non-planar aliphatic/bicyclic tails for hERG mitigation, (2) acid-head bioisosteres for Coulomb optimization, (3) quinoline-C4 substituent tuning. After Meeko + AutoDock Vina 1.2.7 docking into static E1659A and pose-transplant ensemble APBS rescoring over 20 Phase 5d mutant snapshots, the combined objective (Vina_ΔG + APBS_formal_ensemble) produces a v5 top-20 with logP collapsed to 0.2-2.9 (vs v3b top-1 logP 5.27). Best combined lead
v5__aq3__adamantyl__CONHOH__-Clat −8.19 kcal/mol combined, 95% ensemble-positive. Phase 8 opens the medchem forward path for Misha with Coulomb-as-ranking-criterion.
Problem
Phase 5k Stage B (see STRC h01 Phase 5k Ensemble APBS on Phase 5d Mutant MD 2026-04-24) identified a critical design flaw in the v3b library: Vina ranking is nearly orthogonal to ensemble Coulomb ranking. The Vina v3b top-1 (nc__3-amino-benzofuran-2-COOH__1-naphthyl__CONHOH__-CF3) was only #3 by ensemble Coulomb; the top-1 Coulomb scaffold (2-amino-quinoline-3 × 1-naphthyl × CONHOH × CF3) was Vina rank #8. A medchem forward path on v3b’s Vina-only ranking would miss the molecule that best exploits the pharmacochaperone’s electrostatic asymmetry.
Phase 8 addresses this by (a) growing the v3b top-Coulomb scaffold with systematic chemistry variation, (b) introducing the Phase 5k ensemble-APBS score as a co-equal ranking criterion alongside Vina ΔG. Additional constraint: every non-planar scaffold replacement targets hERG mitigation per Phase 6c class liability (STRC h01 Phase 6c hERG Extension 2026-04-24, 21/28 v3b hit sub-10 µM hERG; planar bicyclic tails are the structural liability).
Method
v5 library generation (Phase 8a)
Base scaffold: 2-amino-quinoline-3 (SMILES template {R2}c1c(N{R1})nc2ccccc2c1{R3}, where R1 is the tail attached via the 2-amino nitrogen, R2 is the acid head at C3, R3 is the C4 quinoline substituent).
Design axes:
| axis | v3b variants | v5 variants introduced |
|---|---|---|
| R1 tail | naphthyl, biphenyl, 3-CF3-phenyl, 3,5-diMe-phenyl, 4-F-biphenyl, 4-Cl-phenyl (all planar) | trans-decalinyl, adamantyl, bicyclo[2.2.2]oct-1-yl, norbornan-2-yl, spiro[4.4]non-1-yl, 4-tBu-cyclohexyl, 4-Me-4-OH-cyclohex, 2-tetrahydronaphth, 1-indanyl, 3,3-diMe-indanyl (all non-planar, sp3-rich) |
| R2 acid head | CONHOH, CONHSO2Me, tetrazole, COOH | CONHOH, tetrazole, COOH, CONHSO2Me + phosphonate, sulfamate, CONHOMe (Coulomb-expanded) |
| R3 quinoline C4 | CF3, F, Me, H | CF3, F, Cl, OMe, CN, H (substituent-palette extended) |
Enumeration + drug-likeness filter (MW < 500, logP < 5.5, HBA ≤ 10, HBD ≤ 5, rotB ≤ 10): 272 candidates → 60 trimmed by design-priority score (non-planar × CONHOH/tetrazole × CF3/F).
Library breakdown of the 60 finalists:
- 48 planar-tail + 12 non-planar baseline; after trim prioritizing non-planar → heavy non-planar bias
- Acid heads: 60 CONHOH (sensitivity sweep to isolate tail+R3 effects)
- MW range: 310-460; logP range: −0.5 to +2.9
- sp3 fraction: 0.30-0.52 (vs v3b top-1 sp3 ~0.10)
Script: pharmacochaperone_phase8_v5_library.py (new). Output: artifacts/phase8_v5_library/phase8_v5_library.{csv,sdf,json}.
v5 docking + ensemble-APBS rescoring (Phase 8b)
Pipeline per ligand (parallel-4, ~20 min wall for all 60):
- ETKDG 3D embed via RDKit + UFF optimization → single-molecule SDF
- Meeko
mk_prepare_ligand.py→ PDBQT with Gasteiger charges - AutoDock Vina 1.2.7 dock into static E1659A: box centred (15.777, −44.660, 16.866), size 18³ (matched Phase 4c / 5j / 5k), exhaustiveness 16 (first-pass screen), num_modes 3, cpu 4 per worker
- Parse docked pose (MODEL 1): extract Vina ΔG + OA-atom centroid
- For each of 20 Phase 5k mutant APBS grids: transplant OA centroid via static-mutant→snap pocket-local Kabsch (residues 1126-1156), interpolate φ at transplanted position
- Compute ensemble median φ (with |φ|<20 spike filter), convert to formal-anion ΔG = −0.616 × φ
- Combined score = Vina_ΔG + α × ΔG_formal_ensemble_median (α = 1.0 unit-weighted)
Script: pharmacochaperone_phase8b_v5_dock_rescore.py (new). Output: artifacts/phase8b_v5_dock_rescore/phase8b_v5_dock_rescore.{csv,json}.
Results
60/60 ligands docked successfully; all had OA atoms for centroid extraction (CONHOH-only library).
v5 top-20 by combined score
| rank | ligand | Vina (kcal) | APBS ΔG_ens (kcal) | combined | pos_frac | logP |
|---|---|---|---|---|---|---|
| 1 | adamantyl__CONHOH__-Cl | −6.66 | −1.53 | −8.19 | 95% | 1.94 |
| 2 | 1-naphthyl__CONHOH__-F | −6.17 | −1.86 | −8.02 | 100% | −0.06 |
| 3 | bicyclo[2.2.2]oct-1-yl__CONHOH__-Cl | −5.89 | −1.58 | −7.48 | 95% | 1.70 |
| 4 | 1-naphthyl__CONHOH__-Cl | −7.04 | −0.41 | −7.46 | 85% | 0.21 |
| 5 | adamantyl__CONHOH__-F | −6.13 | −1.29 | −7.42 | 100% | 1.67 |
| 6 | norbornan-2-yl__CONHOH__-CF3 | −5.74 | −1.56 | −7.30 | 100% | 1.53 |
| 7 | bicyclo[2.2.2]oct-1-yl__CONHOH__-CN | −6.04 | −1.24 | −7.28 | 95% | 1.02 |
| 8 | adamantyl__CONHOH__-CN | −6.20 | −1.02 | −7.22 | 100% | 1.27 |
| 9 | 1-naphthyl__CONHOH__-CN | −6.70 | −0.41 | −7.11 | 85% | −0.47 |
| 10 | 1-naphthyl__CONHOH__-H | −6.73 | −0.32 | −7.05 | 85% | −0.36 |
| 11 | 1-indanyl__CONHOH__-CF3 | −5.98 | −1.06 | −7.04 | 100% | 1.51 |
| 12 | biphenyl__CONHOH__-CF3 | −6.40 | −0.47 | −6.87 | 95% | 2.88 |
| 13 | adamantyl__CONHOH__-CF3 | −6.50 | −0.37 | −6.87 | 95% | 2.31 |
| 14 | spiro[4.4]non-1-yl__CONHOH__-CN | −5.76 | −1.09 | −6.84 | 100% | 1.41 |
| 15 | 3,5-diMe-phenyl__CONHOH__-CF3 | −6.00 | −0.79 | −6.79 | 100% | 2.13 |
| 16 | adamantyl__CONHOH__-OMe | −6.36 | −0.42 | −6.78 | 95% | 1.35 |
| 17 | bicyclo[2.2.2]oct-1-yl__CONHOH__-CF3 | −6.25 | −0.49 | −6.74 | 100% | 2.06 |
| 18 | 4-Me-4-OH-cyclohex__CONHOH__-CF3 | −5.22 | −1.46 | −6.68 | 100% | 1.99 |
| 19 | bicyclo[2.2.2]oct-1-yl__CONHOH__-F | −5.88 | −0.80 | −6.67 | 90% | 1.43 |
| 20 | 3,3-diMe-indanyl__CONHOH__-Cl | −5.71 | −0.96 | −6.66 | 100% | 1.78 |
Observations
- Adamantyl dominates. 4/20 top hits use adamantyl tail (positions #1, #5, #8, #13, #16). Three-dimensional cage bites into the pocket while reducing hERG-liable planar surface.
- Halogen variants unlock the scaffold. The new R3 = −Cl (#1, #3, #4, #20) and R3 = −CN (#7, #8, #9, #14) substituents, absent from v3b, populate the top ranks. v3b only had −CF3/−F/−Me/−H on 2-amino-quinoline-3.
- logP reduction is huge. v5 top-20 logP range [−0.47, 2.88], median 1.51. v3b top-1 was 5.47. This is a >3 log-unit reduction → dramatic hERG class-liability reduction per STRC h01 Phase 6c hERG Extension 2026-04-24 pharmacophore analysis.
- Vina and APBS reward different chemistry:
- Best Vina-only:
1-naphthyl__CONHOH__-Cl(−7.04) — big planar tail, weak pocket Coulomb (−0.41) - Best APBS-only:
1-naphthyl__CONHOH__-F(−1.86) — same scaffold, halogen-swap boosts Coulomb via C-F dipole bias - Best combined:
adamantyl__CONHOH__-Clbalances both — 3D tail + favorable Coulomb
- Best Vina-only:
- Combined-score winner reflects NEW chemistry. v5 top-1 is not a tweak of v3b top-1 — it’s a distinct scaffold class (adamantyl replacing naphthyl, Cl replacing CF3). This shows the Coulomb-aware objective actively redirects medchem away from Vina’s local maxima.
- Phase 5k predicted it. Phase 5k-B top-1 was
2-amino-quinoline-3__1-naphthyl__CONHOH__-CF3with ensemble ΔG −2.86 kcal/mol; v5 top combined doesn’t beat that raw APBS number yet (max ΔG_APBS here is −1.86 on1-naphthyl__CONHOH__-F) but wins on combined via Vina complementarity. The next-round design direction is to grow adamantyl-class chemistry and sample the Coulomb-optimal tail/R3 jointly.
Interpretation
- The Coulomb-aware objective fundamentally changes what gets designed. v5 top-20 has a completely different chemical character from v3b top-20 — heavier sp3, sub-3 logP, no CF3-only bias, halogen diversity. This is the forward path for Misha.
- Vina alone cannot find these molecules. All top-20 v5 combined hits rank worse on Vina-only than v3b top-1 ensemble. A Vina-only medchem iteration would have filtered them out.
- Every top hit has 85-100% ensemble-positive Coulomb — meaning across 20 Phase 5d mutant MD snapshots, the pocket is electrostatically welcoming in the overwhelming majority of dynamic states. This is a robustness guarantee v3b never had (v3b poses fell more in pocket entrance regions that varied more across dynamics).
- hERG improvement is structural, not incidental. logP reduction is a side-effect of replacing hydrophobic planar tails (naphthyl/biphenyl, logP ≈ 5) with hydrophilic non-planar tails (adamantyl/BCO/cyclohex, logP ≈ 2). Combined with out-of-plane geometry, this removes both known hERG pharmacophore liabilities from STRC h01 Phase 6c hERG Extension 2026-04-24.
Limitations
- Single-frame Vina dock. Phase 8b docked into static E1659A only (not ensemble); v3b scored ensemble-mean across 5 snapshots. Direct numerical comparison requires re-docking v5 on the same 5 snapshots. Mitigated: all v5 ligands were scored the same way, so intra-v5 ranking is valid.
- α = 1.0 unit-weight. The combined score weights Vina and APBS equally in kcal/mol. This is a reasonable zeroth-order choice but assumes their error bars are comparable. Bayesian calibration against experimental Kd data (when available) would refine α; for now α = 1 is a tractable starting point.
- Only CONHOH heads in the top-20. The phosphonate / sulfamate / CONHOMe / tetrazole variants were generated but dropped by design-priority filter or failed OA-centroid extraction (tetrazoles use N atoms, not OA). Next iteration should extend the acid-head centroid logic to handle tetrazolide/phosphonate-O/sulfamate-O.
- ADMET not yet run on v5. Phase 7 ADMET-AI triage is needed before declaring any v5 hit wet-lab-ready. Queued as Phase 8c overnight.
- Off-target panel not re-run. Phase 6c was on v3b leads. v5 non-planar chemistry should be structurally less hERG-liable, but this needs direct re-test. Queued as Phase 8e overnight.
- Library still small (60). 60 ligands is a design-probe, not a full screen. Phase 8d v5.2 should expand to ~200-300 covering the full non-planar × head-bioisostere × R3 space.
Ranking delta
- Hypothesis h01: tier A held | mech 3 held (this phase advances chemistry forward, not mechanism) | deliv 3 → 3 held but upgrade-path unlocked (v5 logP 2 vs v3b logP 5 removes systemic-exposure toxicity barrier; intracochlear requirement from Phase 6c may be revisitable after ADMET+off-target on v5) | misha_fit 4 held
- Next-step update:
- Phase 8c ADMET-AI triage on v5 top-20 (queued)
- Phase 8d v5.2 library expansion (200+ ligands with full head × tail × R3 cross-product; queued)
- Phase 8e off-target panel re-test on v5 top-5 (queued)
- Phase 5d-WT (matched full-length WT MD, 2 ns) for Phase 5c-mutant cryptic-pocket specificity claim (queued — uses GPU, runs overnight)
- v5 top-1 candidate for wet-lab shortlist consideration:
v5__aq3__adamantyl__CONHOH__-Cl
mech=4upgrade path unchanged: holo-MD + MM-GBSA, OR experimental Kd binding to E1659A pocket (off-computational track).
Connections
[part-of]h01 hub[builds-on]STRC h01 Phase 5k Ensemble APBS on Phase 5d Mutant MD 2026-04-24 — provides the Coulomb ranking criterion[builds-on]STRC h01 Phase 4i APBS Pose-Transplant Rescore 2026-04-24 — established per-pose ensemble APBS methodology[addresses]STRC h01 Phase 6c hERG Extension 2026-04-24 — class liability mitigation via non-planar tails + logP reduction[supersedes-design-for]v3b YELLOW library (v3b Vina-only objective is insufficient; v5 combined objective is the forward path)[see-also]STRC h01 Phase 4h Tafamidis Playbook Library 2026-04-23 — earlier bioisostere seed set; v5 is a systematic extension[see-also]APBS, ADMET-AI