STRC PE Phase 2 PAM Expansion
Survey of engineered Cas9 PAM variants for STRC E1659A finds that SpG (NGN) is the sweet-spot variant: it unblocks the nick-to-edit geometry constraint from Phase 1 (minimum 5.5 nt vs 14 nt with SpCas9 NGG) while adding 12 PE3b-eligible nicking sgRNAs. SpRY (NRN) adds no practical gain over SpG here — its 10 optimal-geometry pegRNAs are either the same candidates SpG already finds or sit at weaker NAN PAMs (Walton 2020: NAN ≈ 40–60% of NGN activity in SpRY). Correction to STRC PE Phase1 pegRNA E1659A: five PE3b nicking sgRNAs exist even with baseline SpCas9 NGG — Phase 1’s claim of “0 PE3b eligible” came from an overly strict filter (required spanning AND 30-100 nt distance, which are geometrically incompatible). Best integrated design: SpG-pegRNA spacer TGGGGGCCTGAGATCTTCAC (−strand, PAM TGA, GC 60%, nick 5.5 nt from edit) + PE3b nicker ACTGAAATTGGCACCATAGC (−strand, PAM AGG, nick 12.5 nt from edit, GC 45%). This reverses the Phase 1 conclusion: SpG raises realistic OHC PE efficiency from 5–20% to 15–35%.
Method
- Fetched chr15:43600400-43600700 from Ensembl GRCh38 (301 nt).
- For each Cas9 variant, scanned both + and − strands for PAMs matching the variant’s specificity:
- SpCas9 NGG (baseline)
- SpG NGN (Walton 2020)
- enCas9 NGN (Hu 2018; equivalent specificity to SpG for this purpose)
- SpRY NRN (Walton 2020; near-PAMless, NAN/NYN tolerated but weaker)
- SpCas9-NG NG (Nishimasu 2018)
- SaCas9 NNGRRT (Ran 2015, 21-nt protospacer, 6-nt PAM)
- Applied two filters per variant:
- pegRNA-optimal: nick-to-edit distance ∈ 3–10 nt (Anzalone 2019 optimum)
- PE3b-eligible: nicking sgRNA protospacer spans chr15:43600551 (post-edit sequence differs from pre-edit → allele-discriminating nicker)
- Computed GC% for each protospacer.
Deterministic; no stochastic components.
Results
Comparative matrix
| Variant | Total PAMs | pegRNA 3–10 nt | PE3b nickers | Best pegRNA dist |
|---|---|---|---|---|
| SpCas9 NGG (baseline) | 59 | 0 | 5 | N/A (min = 14 nt) |
| SpG NGN | 156 | 2 | 12 | 5.5 nt |
| enCas9 NGN | 156 | 2 | 12 | 5.5 nt |
| SpRY NRN | 287 | 10 | 22 | 3.5 nt |
| SpCas9-NG NG | 158 | 2 | 12 | 5.5 nt |
| SaCas9 NNGRRT | 6 | 0 | 0 | — |
SpG / enCas9 / SpCas9-NG all return the same top hits because NGN is the intersection of their specificities and the two optimal-geometry PAMs in this window (TGA at −3.5 nt upstream, AGC at +9.5 nt downstream) are both NGN. SpRY’s extra 8 optimal pegRNAs all have NAN PAMs (CAC, AAA, GAA) and are expected to run at 40–60% of NGN activity (Walton 2020 NRN specificity landscape).
Correction to Phase 1
Phase 1 script required both protospacer_spans_variant AND nick-to-edit distance ∈ 30–100 nt. These constraints are geometrically incompatible: a protospacer that spans the variant has its nick within ~17 nt of the variant by construction (for 20-nt protospacer, nick is 17 nt from the 5’ end, so the nick can sit at most (20 − 17) = 3 nt on one side of the variant if the variant is at one edge of the protospacer, or up to ~17 nt on the other). Fixed filter drops the distance band and checks spanning only.
SpCas9 NGG PE3b nickers recovered (Phase 2):
| Strand | PAM | Nick distance | Protospacer 5′→3′ | GC% |
|---|---|---|---|---|
| − | AGG | 12.5 nt | ACTGAAATTGGCACCATAGC | 45.0 |
| − | TGG | 15.5 nt | GAAATTGGCACCATAGCAGG | 50.0 |
| − | GGG | 16.5 nt | AAATTGGCACCATAGCAGGT | 45.0 |
| + | AGG | 11 nt | CCAATTTCAGTGAAGATCTC | 40.0 |
| + | NGG | (4th) | (see JSON) | — |
These nickers were available from the start — Phase 1’s classic-PE3 fallback (63-nt-distant nicker TTGGTTTGGTTTCTATACCA) was unnecessary for allele discrimination.
Recommended integrated designs
Conservative (SpCas9 wild-type enzyme) — upgraded from Phase 1 ; superseded for lead PE3b nicker by STRC PE Phase3 Allele Discrimination:
- pegRNA spacer:
GCCCAGCTCCCCACCTGCTA(+ strand, 14 nt from edit — Phase 1 design unchanged) - PE3b nicker (Phase 2 lead):
ACTGAAATTGGCACCATAGC(− strand, PAM AGG, 12.5 nt from edit, GC 45%) — Phase 3 shows the variant sits at protospacer position 5 (DISTAL) → weak discrimination. Replaced by Phase 3 leadCCTGAGATCTTCACTGAAAT(PAM TGG, position 17 seed, 0.5 nt nick) for stronger discrimination; trade-off is the close nick. - Net: ~50% geometry penalty persists; PE3b adds 1.5–3× edit efficiency (Anzalone 2019 PE3b vs PE2 data)
- Expected OHC PE efficiency: 10–30% with Phase 3 seed nicker (revised up from Phase 2’s 8–25% estimate)
Preferred (SpG NGN nickase) — refined by STRC PE Phase3 Allele Discrimination:
- pegRNA spacer:
TGGGGGCCTGAGATCTTCAC(− strand, PAM TGA, GC 60%, nick 5.5 nt from edit) - PE3b nicker:
TTCACTGAAATTGGCACCAT(− strand, PAM AGC, position 8 MID region, moderate discrimination, nick 9.5 nt — the balanced pick from Phase 3) - Geometry penalty removed; PE3b present with validated mid-region discrimination
- Expected OHC PE efficiency: 20–40% — Phase 3 revision up from Phase 2’s 15–35% (higher confidence after verifying letter-match and mismatch-region classification)
- Caveat: SpG has ~70% of wild-type SpCas9 activity at matched PAMs (Walton 2020); net efficiency still ahead
Overkill (SpRY NRN): no practical gain over SpG for this variant. SpRY’s advantage is only that it finds 8 extra NAN-PAM pegRNAs of questionable activity. Use only if SpG fails wet-lab validation.
Interpretation for Misha
- Phase 1’s conclusion that “PE3 efficiency for Misha is 5–20%” was pessimistic — driven partly by missing PE3b. Corrected range with SpCas9 PE3b: 8–25%. Further upgrade with SpG PE3b: 15–35%.
- The dominant lever is PE3b, not PAM relaxation. PE3b availability was always there — Phase 1 missed it.
- SpG is the least-intrusive upgrade: published, characterized, available from multiple vendors (Addgene, IDT). Geometry optimum without stretching into poorly characterized NAN territory.
- Clinical delivery unchanged: dual-AAV split-intein PE3 with either SpCas9 or SpG; both fit the same vector architecture.
Limitations
- SpG / SpRY / SpCas9-NG activity numbers (70%, 30–50%, 60% of wild-type respectively) are published averages — real activity at THIS protospacer is unknown.
- Off-target landscape changes with relaxed PAMs: SpG has 2–4× more off-targets than SpCas9 at matched loci (Walton 2020); SpRY is ~10× more promiscuous. Full Cas-OFFinder scan under each variant’s PAM rules is Phase 3.
- PE3b allele discrimination requires the nicker spacer to mismatch at the variant position before edit. This is a separate design rule (check the spacer letter at the variant position equals the WT/corrected nucleotide, not the mutant). Phase 2 scans for protospacer-spanning but does not verify post-edit match direction — that is a manual check for each top candidate before lab.
- SaCas9 gave 0/0 purely because of window size (6 PAMs total) — a wider region scan might find SaCas9 candidates further from the edit, but they’d violate the PE geometry constraint anyway.
- The protospacer-matching-pre-edit-vs-post-edit direction for each PE3b candidate still needs per-candidate verification. For a C→A correction at chr15:43600551 (coding) = G→T on + strand, a − strand nicker whose protospacer contains T at the corresponding position = matches post-edit.
Next steps
- PE3b allele-discrimination check per top candidate — confirm protospacer letter at variant position matches WT/corrected, not MUT.
- Cas-OFFinder off-target scans for SpG at both the pegRNA spacer and PE3b nicker. Allow 0–3 mismatches, report transcript-overlapping hits.
- Select lead + order: synthesize pegRNA + nicker sgRNA per SpG preferred design; validate in HEK293T dual-luciferase rescue assay with STRC-E1659A minigene. $8–12 k, 6–10 weeks.
- Compare SpG vs SpCas9 PE3b in the rescue assay — answers whether geometry optimum (SpG) or enzyme activity (SpCas9) wins for this variant.
Replication
cd ~/STRC/models
/opt/miniconda3/bin/python3 pe_phase2_pam_expansion.py
# outputs: pe_phase2_pam_expansion.jsonFiles / Models
~/STRC/models/pe_phase2_pam_expansion.py— 6-variant PAM scan, PE3b + pegRNA-optimum filters~/STRC/models/pe_phase2_pam_expansion.json— full candidate sets per variant
Connections
[part-of]Prime Editing for STRC — parent hypothesis; PE3b correction + SpG upgrade path added[see-also]STRC PE Phase1 pegRNA E1659A — original design; Phase 2 corrects the “no PE3b” finding[see-also]STRC PE Phase3 Allele Discrimination — per-candidate allele-discrimination audit; reclassifies Phase 2 lead as weak (position-5 distal) and identifies seed-position + balanced mid alternatives[see-also]STRC Electrostatic Analysis E1659A — the mutation being corrected[see-also]STRC Mini-STRC Single-Vector Hypothesis — parallel track for paternal deletion[about]Misha