STRC PE Phase 2 PAM Expansion

Survey of engineered Cas9 PAM variants for STRC E1659A finds that SpG (NGN) is the sweet-spot variant: it unblocks the nick-to-edit geometry constraint from Phase 1 (minimum 5.5 nt vs 14 nt with SpCas9 NGG) while adding 12 PE3b-eligible nicking sgRNAs. SpRY (NRN) adds no practical gain over SpG here — its 10 optimal-geometry pegRNAs are either the same candidates SpG already finds or sit at weaker NAN PAMs (Walton 2020: NAN ≈ 40–60% of NGN activity in SpRY). Correction to STRC PE Phase1 pegRNA E1659A: five PE3b nicking sgRNAs exist even with baseline SpCas9 NGG — Phase 1’s claim of “0 PE3b eligible” came from an overly strict filter (required spanning AND 30-100 nt distance, which are geometrically incompatible). Best integrated design: SpG-pegRNA spacer TGGGGGCCTGAGATCTTCAC (−strand, PAM TGA, GC 60%, nick 5.5 nt from edit) + PE3b nicker ACTGAAATTGGCACCATAGC (−strand, PAM AGG, nick 12.5 nt from edit, GC 45%). This reverses the Phase 1 conclusion: SpG raises realistic OHC PE efficiency from 5–20% to 15–35%.

Method

Fetched chr15:43600400-43600700 from Ensembl GRCh38 (301 nt).
For each Cas9 variant, scanned both + and − strands for PAMs matching the variant’s specificity:
- SpCas9 NGG (baseline)
- SpG NGN (Walton 2020)
- enCas9 NGN (Hu 2018; equivalent specificity to SpG for this purpose)
- SpRY NRN (Walton 2020; near-PAMless, NAN/NYN tolerated but weaker)
- SpCas9-NG NG (Nishimasu 2018)
- SaCas9 NNGRRT (Ran 2015, 21-nt protospacer, 6-nt PAM)
Applied two filters per variant:
- pegRNA-optimal: nick-to-edit distance ∈ 3–10 nt (Anzalone 2019 optimum)
- PE3b-eligible: nicking sgRNA protospacer spans chr15:43600551 (post-edit sequence differs from pre-edit → allele-discriminating nicker)
Computed GC% for each protospacer.

Deterministic; no stochastic components.

Results

Comparative matrix

Variant	Total PAMs	pegRNA 3–10 nt	PE3b nickers	Best pegRNA dist
SpCas9 NGG (baseline)	59	0	5	N/A (min = 14 nt)
SpG NGN	156	2	12	5.5 nt
enCas9 NGN	156	2	12	5.5 nt
SpRY NRN	287	10	22	3.5 nt
SpCas9-NG NG	158	2	12	5.5 nt
SaCas9 NNGRRT	6	0	0	—

SpG / enCas9 / SpCas9-NG all return the same top hits because NGN is the intersection of their specificities and the two optimal-geometry PAMs in this window (TGA at −3.5 nt upstream, AGC at +9.5 nt downstream) are both NGN. SpRY’s extra 8 optimal pegRNAs all have NAN PAMs (CAC, AAA, GAA) and are expected to run at 40–60% of NGN activity (Walton 2020 NRN specificity landscape).

Correction to Phase 1

Phase 1 script required both protospacer_spans_variant AND nick-to-edit distance ∈ 30–100 nt. These constraints are geometrically incompatible: a protospacer that spans the variant has its nick within ~17 nt of the variant by construction (for 20-nt protospacer, nick is 17 nt from the 5’ end, so the nick can sit at most (20 − 17) = 3 nt on one side of the variant if the variant is at one edge of the protospacer, or up to ~17 nt on the other). Fixed filter drops the distance band and checks spanning only.

SpCas9 NGG PE3b nickers recovered (Phase 2):

Strand	PAM	Nick distance	Protospacer 5′→3′	GC%
−	AGG	12.5 nt	ACTGAAATTGGCACCATAGC	45.0
−	TGG	15.5 nt	GAAATTGGCACCATAGCAGG	50.0
−	GGG	16.5 nt	AAATTGGCACCATAGCAGGT	45.0
+	AGG	11 nt	CCAATTTCAGTGAAGATCTC	40.0
+	NGG	(4th)	(see JSON)	—

These nickers were available from the start — Phase 1’s classic-PE3 fallback (63-nt-distant nicker TTGGTTTGGTTTCTATACCA) was unnecessary for allele discrimination.

Recommended integrated designs

Conservative (SpCas9 wild-type enzyme) — upgraded from Phase 1 ; superseded for lead PE3b nicker by STRC PE Phase3 Allele Discrimination:

pegRNA spacer: GCCCAGCTCCCCACCTGCTA (+ strand, 14 nt from edit — Phase 1 design unchanged)
PE3b nicker (Phase 2 lead): ACTGAAATTGGCACCATAGC (− strand, PAM AGG, 12.5 nt from edit, GC 45%) — Phase 3 shows the variant sits at protospacer position 5 (DISTAL) → weak discrimination. Replaced by Phase 3 lead CCTGAGATCTTCACTGAAAT (PAM TGG, position 17 seed, 0.5 nt nick) for stronger discrimination; trade-off is the close nick.
Net: ~50% geometry penalty persists; PE3b adds 1.5–3× edit efficiency (Anzalone 2019 PE3b vs PE2 data)
Expected OHC PE efficiency: 10–30% with Phase 3 seed nicker (revised up from Phase 2’s 8–25% estimate)

Preferred (SpG NGN nickase) — refined by STRC PE Phase3 Allele Discrimination:

pegRNA spacer: TGGGGGCCTGAGATCTTCAC (− strand, PAM TGA, GC 60%, nick 5.5 nt from edit)
PE3b nicker: TTCACTGAAATTGGCACCAT (− strand, PAM AGC, position 8 MID region, moderate discrimination, nick 9.5 nt — the balanced pick from Phase 3)
Geometry penalty removed; PE3b present with validated mid-region discrimination
Expected OHC PE efficiency: 20–40% — Phase 3 revision up from Phase 2’s 15–35% (higher confidence after verifying letter-match and mismatch-region classification)
Caveat: SpG has ~70% of wild-type SpCas9 activity at matched PAMs (Walton 2020); net efficiency still ahead

Overkill (SpRY NRN): no practical gain over SpG for this variant. SpRY’s advantage is only that it finds 8 extra NAN-PAM pegRNAs of questionable activity. Use only if SpG fails wet-lab validation.

Interpretation for Misha

Phase 1’s conclusion that “PE3 efficiency for Misha is 5–20%” was pessimistic — driven partly by missing PE3b. Corrected range with SpCas9 PE3b: 8–25%. Further upgrade with SpG PE3b: 15–35%.
The dominant lever is PE3b, not PAM relaxation. PE3b availability was always there — Phase 1 missed it.
SpG is the least-intrusive upgrade: published, characterized, available from multiple vendors (Addgene, IDT). Geometry optimum without stretching into poorly characterized NAN territory.
Clinical delivery unchanged: dual-AAV split-intein PE3 with either SpCas9 or SpG; both fit the same vector architecture.

Limitations

SpG / SpRY / SpCas9-NG activity numbers (70%, 30–50%, 60% of wild-type respectively) are published averages — real activity at THIS protospacer is unknown.
Off-target landscape changes with relaxed PAMs: SpG has 2–4× more off-targets than SpCas9 at matched loci (Walton 2020); SpRY is ~10× more promiscuous. Full Cas-OFFinder scan under each variant’s PAM rules is Phase 3.
PE3b allele discrimination requires the nicker spacer to mismatch at the variant position before edit. This is a separate design rule (check the spacer letter at the variant position equals the WT/corrected nucleotide, not the mutant). Phase 2 scans for protospacer-spanning but does not verify post-edit match direction — that is a manual check for each top candidate before lab.
SaCas9 gave 0/0 purely because of window size (6 PAMs total) — a wider region scan might find SaCas9 candidates further from the edit, but they’d violate the PE geometry constraint anyway.
The protospacer-matching-pre-edit-vs-post-edit direction for each PE3b candidate still needs per-candidate verification. For a C→A correction at chr15:43600551 (coding) = G→T on + strand, a − strand nicker whose protospacer contains T at the corresponding position = matches post-edit.

Next steps

PE3b allele-discrimination check per top candidate — confirm protospacer letter at variant position matches WT/corrected, not MUT.
Cas-OFFinder off-target scans for SpG at both the pegRNA spacer and PE3b nicker. Allow 0–3 mismatches, report transcript-overlapping hits.
Select lead + order: synthesize pegRNA + nicker sgRNA per SpG preferred design; validate in HEK293T dual-luciferase rescue assay with STRC-E1659A minigene. $8–12 k, 6–10 weeks.
Compare SpG vs SpCas9 PE3b in the rescue assay — answers whether geometry optimum (SpG) or enzyme activity (SpCas9) wins for this variant.

Replication

cd ~/STRC/models
/opt/miniconda3/bin/python3 pe_phase2_pam_expansion.py
# outputs: pe_phase2_pam_expansion.json

Files / Models

~/STRC/models/pe_phase2_pam_expansion.py — 6-variant PAM scan, PE3b + pegRNA-optimum filters
~/STRC/models/pe_phase2_pam_expansion.json — full candidate sets per variant

Connections

[part-of] Prime Editing for STRC — parent hypothesis; PE3b correction + SpG upgrade path added
[see-also] STRC PE Phase1 pegRNA E1659A — original design; Phase 2 corrects the “no PE3b” finding
[see-also] STRC PE Phase3 Allele Discrimination — per-candidate allele-discrimination audit; reclassifies Phase 2 lead as weak (position-5 distal) and identifies seed-position + balanced mid alternatives
[see-also] STRC Electrostatic Analysis E1659A — the mutation being corrected
[see-also] STRC Mini-STRC Single-Vector Hypothesis — parallel track for paternal deletion
[about] Misha

STRC Research

Explorer

STRC PE Phase2 PAM Expansion