STRC h01 Phase 9a — FEP / TI Alchemical ΔΔG (SKELETON)
Placeholder. Documents the question, method, smoke-test, production protocol, pass criteria, and known artifacts so the run can fire on a single day’s notice. No production calculation has been performed yet.
Question
For the top-N shortlisted leads from Phase 5q, what is the relative binding free energy difference between E1659A mutant STRC and WT STRC, computed by alchemical λ-mutation rather than by docking score or kinetic proxy?
ΔΔG_bind = ΔG_bind(mutant) − ΔG_bind(WT)
Sign expectation (from existing pipeline): ΔΔG_bind < 0 (mutant binds better).
Method
Two complementary approaches; pick one for production after smoke:
- (A) Relative binding FEP — same ligand, mutate E1659 → A on protein side via λ-windows in apo and holo states. Closes thermodynamic cycle.
- Tooling candidates: GROMACS-FEP + pmx (Gapsys), AMBER pmemd.cuda + pmx, OpenMM/Yank, BFEE2.
- (B) Absolute binding FEP — decouple ligand from solvent and from each protein variant separately; subtract.
- Tooling candidates: BAT.py (Heinzelmann/Gilson 2021), Loeffler ABFE protocol, Yank.
Recommended default: (A) relative, because protein-side mutation is exactly the variable we care about and the cycle closure cancels most systematic error.
Why orthogonal
- Different physics from docking: alchemical free energy uses the actual partition function via λ-mutation, no scoring function.
- Different physics from tauRAMD: equilibrium thermodynamics, no forced pulling.
- Different from MM-PBSA: explicit solvent, full configurational entropy, no continuum approximation.
- Same force field as MD pipeline → catches docking/scoring artifacts but does not catch force-field artifacts (see Phase 9d for that).
Inputs needed
- Phase 5q shortlist top-3 to top-5 ligands (SMILES + parameters in OpenFF or GAFF2).
- Phase 5d equilibrated holo MD frame for each ligand × {WT, mutant}.
- Force field: AMBER ff19SB protein + OpenFF-2.x or GAFF2 ligand + OPC or TIP3P water.
- Hardware: GPU. M5 Max MPS path needs verification (see Apple Silicon first rule); fall back to remote A100 if MPS path crashes for FEP kernels.
Smoke test (1-day, theoretical)
Single ligand, single protein variant, no production statistics:
- Build alchemical topology with pmx for one E→A mutation in apo-protein-only (no ligand) — confirms the topology generator works for STRC.
- Run 11 λ-windows × 250 ps each, NPT, T=300K — total ~3 ns × 11 = 33 ns wall.
- Check
dV/dλtime-series finite and bounded; check soft-core potentials don’t explode at λ=0/1. - Estimate apo ΔG_E→A via TI (gmx bar) — value is not the answer, just a pipeline-health signal.
Smoke pass: pipeline runs end-to-end, dV/dλ has no NaN/explosion, BAR analysis returns a finite number with reasonable error bar (<1 kcal/mol on this trivial apo case).
Smoke fail: any of: topology generation crashes, MD simulation explodes, soft-core blows up, BAR gives infinite variance. → debug before committing to production.
Production protocol (theoretical)
- 21 λ-windows × 5 ns each per leg × 2 legs (apo and holo) × 2 ligand-variant pairs = ~840 ns total per ligand.
- 3× independent replicas with different random seeds for error bars.
- Cycle closure check: |ΔG_cycle| < 1 kcal/mol.
- Per-ligand wall time on single A100: ~3-5 days. For 3 ligands: ~2 weeks GPU or parallelize across multiple GPUs.
Pass criteria
- PRIMARY: ΔΔG_bind sign matches existing pipeline (negative = mutant binds better) for ≥2 of 3 production ligands.
- SECONDARY: |ΔΔG_bind| > 1 kcal/mol (above thermal noise) for the lead ligand.
- TERTIARY: cycle-closure |ΔG_cycle| < 1 kcal/mol — sanity check on numerical convergence.
- FAIL state: ΔΔG_bind insignificant (|value| < 0.5 kcal/mol) or wrong sign on the lead → drops h01 deliv tier.
Known artifacts and risks
- Sampling: 5 ns/window may be insufficient for charged-residue mutations (E→A removes a carboxylate); replica-exchange λ-dynamics or Hamiltonian replica exchange might be required.
- Charge-changing mutation: E→A changes net charge by +1 — must use co-alchemical counterion or background-charge correction (Rocklin, Ross, Pavlova 2013 type artifact).
- Force-field bias: same FF as MD pipeline → if FF systematically over-favors charged interactions, FEP carries the same bias. Phase 9d (QM/MM) is the disambiguator.
- Pose dependence: starting pose for holo leg comes from docking; if docked pose is wrong, ΔG is wrong. Mitigate by 3 independent starting poses (Phase 5q replicas).
References (canonical)
- Mey et al. 2020. LiveCoMS 2:18378 — “Best practices for alchemical free energy calculations”.
- Wang et al. 2015. JACS 137:2695 — FEP+ benchmark.
- Cournia, Allen, Sherman 2017. J Chem Inf Model 57:2911 — relative binding FEP.
- Gapsys et al. 2020. Chem Sci 11:1140 — pmx large-scale validation.
- Rocklin, Mobley, Dill, Hünenberger 2013. J Chem Phys 139:184103 — finite-size correction for charge-changing FEP.
- Loeffler et al. 2018. J Chem Theory Comput 14:5567 — reproducibility of ABFE across engines.
Status
- 2026-04-27 — skeleton created. Software not installed. Smoke not run. Production not run.
Ranking delta
- No change. Skeleton only.
Connections
[parent]STRC h01 Phase 9 Orthogonal Cross-Checks Plan 2026-04-27[see-also]h01 hub[see-also]STRC h01 Phase 5q v5.3 Acyl Sulfonamide Boltz-2 + Vina Consensus 2026-04-26 — provides shortlist[see-also]STRC h01 Phase 5 MD Ensemble Rescoring 2026-04-23 — provides starting frames[about]Misha