STRC OTOA Paralog Phase 1A Sequence Audit
Human STRC (UniProt Q7RTU9, 1775 aa) and OTOA (UniProt Q7RTW8, 1153 aa) share 28.3% global identity and 45.7% BLOSUM62 similarity over 1126 ungapped residues — a real paralog signal. The mini-STRC window (700–1775) retains 29.8% local identity against OTOA. The C-terminal ARM-repeat core (STRC 1603–1770) hits 33.8% identity over 148 residues, mapped to a near-continuous OTOA stretch (977–1136) — the ARM region is more conserved than the average protein, the classic paralog-by-function signature. Two of the four pharmacochaperone pocket anchors are identically conserved (F1646→F1014, E1659→E1027); the other two (K1141, G1645) fall in alignment gaps — the druggable sub-pocket is STRC-specific. Gate passed. Proceed to Phase 1B structural superposition.
Correction log
Phase 1A initial run used Q7RTW9 — that is NMD3B_HUMAN (NMDA receptor 3B), not OTOA. Caught when the canonical OTOA AF DB entry returned 404 for Q7RTW9. The correct OTOA accession is Q7RTW8 (OTOAN_HUMAN, 1153 aa). Numbers above are the corrected run; the script file now hard-codes Q7RTW8.
Method
- Fetched canonical FASTA + annotation JSON for STRC (Q7RTU9) and OTOA (Q7RTW8) from UniProt REST.
- Global Needleman–Wunsch alignment, BLOSUM62 matrix, gap open −11, extend −1 (Biopython
PairwiseAligner). - Local alignments: (a) mini-STRC window 700–1775, (b) ARM-repeat span 1603–1770, both against full OTOA.
- Sliding-window local alignment: 50-aa windows, step 20, 87 windows across STRC → OTOA.
- Anchor-residue projection: K1141, G1645, F1646 (pharmacophore triangle) + E1659 (Misha’s mutation) mapped through the global alignment to OTOA.
Deterministic. No stochastic components.
Results
Global metrics
| Metric | Value |
|---|---|
| STRC length | 1775 aa |
| OTOA length | 1153 aa |
| Global score (BLOSUM62) | −158.0 |
| Matched (ungapped) residues | 1126 |
| Identity | 28.33% |
| Similarity (BLOSUM62 positive) | 45.74% |
Region-specific local alignments
| Region | STRC span | Local identity | Matched residues | OTOA blocks (1-idx) |
|---|---|---|---|---|
| Mini-STRC | 700–1775 | 29.83% | 362 | distributed |
| ARM repeats (C-term core) | 1603–1770 | 33.78% | 148 | 977–995, 996–1013, 1014–1067, 1068–1102, 1108–1119, 1127–1136 |
| Peak sliding window | 601–650 | 75.0% | ~38 | (local best) |
ARM-repeat local identity (34%) exceeds the global average (28%) — the C-terminal ARM region is the functionally conserved paralog axis. Aligned OTOA fragments 977–1136 form a nearly contiguous 160-residue stretch (three gaps, total 12 aa missing), consistent with evolution from a common ancestor rather than opportunistic sequence matches.
Anchor residue fates (global alignment)
| STRC residue | WT letter | OTOA equivalent | Conservation |
|---|---|---|---|
| K1141 (pharmacophore anion anchor) | K | gap | STRC-specific |
| G1645 (pharmacophore H-bond) | G | gap | STRC-specific |
| F1646 (pharmacophore aromatic) | F | F1014 | identical |
| E1659 (Misha’s mutation site) | E | E1027 | identical |
The pharmacochaperone druggable pocket (Phase 2B) is partially STRC-specific — K1141 and G1645 are structural features unique to STRC. However, F1646 and E1659 are identically conserved at the equivalent OTOA positions, which is a strong cross-paralog signal for these two residues. The E1659 conservation means Misha’s mutation (E→A) disrupts a residue type that has been preserved since the STRC/OTOA divergence — reinforcing the variant’s pathogenicity assessment.
Top sliding windows (STRC → OTOA local)
STRC 601- 650: 75.0%
STRC 541- 590: 70.6%
STRC 1361-1410: 68.2%
STRC 521- 570: 66.7%
STRC 461- 510: 65.0% (top window in prior wrong-accession run peaked here at 83%)
STRC 961-1010: 64.0%
STRC 621- 670: 63.6%
STRC 1181-1230: 63.2%
STRC 721- 770: 62.5%
STRC 361- 410: 60.9%
Peaks span both halves; multiple 60–75% identity patches distributed across STRC. OTOA is effectively a compressed paralog (1153 vs 1775 aa) carrying discontinuous fragments homologous to many STRC regions.
AAV feasibility recomputed
OTOA CDS = 1153 × 3 + 3 = 3462 bp (parent note’s 3357 bp is a different isoform estimate). With B8 enhancer (587) + polyA (250) + ITRs (290) → 4589 bp. 111 bp headroom inside 4700-bp AAV budget. Tighter than initially estimated, still feasible.
Interpretation
| Avenue | Status after Phase 1A |
|---|---|
| 1. Transcriptional upregulation of endogenous OTOA | Plausible at sequence level (ARM paralog confirmed) — needs structural check. |
| 2. Engineered OTOA-HTC chimera | Plausible pending structural homology — if OTOA 977–1136 superimposes on STRC 1603–1770, grafting a STRC HTC surface onto the OTOA paralog scaffold is engineering-tractable. |
| 3. Small-molecule OTOA activator | Unchanged. |
The sequence alignment alone cannot distinguish between “real functional homolog” and “evolutionary leftover with divergent fold.” The next gate is 3D superposition — see STRC OTOA Paralog Phase1B Structure Alignment.
Limitations
- Identity alone does not predict function. Paralogs at 30% identity can have unrelated biochemistry.
- UniProt has only signal peptide + glycosylation features — no Pfam/ZP domain calls for either protein. Structural homology inferred from external analyses.
- Local alignment with free end gaps counts only matched residues; reported “matched” is not “total covered.”
- The 75% peak at STRC 601–650 is in the N-terminal zone mini-STRC removes — not accessible from any mini-STRC-based design.
Replication
cd ~/STRC/models
/opt/miniconda3/bin/python3 otoa_phase1_sequence_alignment.pyFiles / Models
~/STRC/models/otoa_phase1_sequence_alignment.py— fetch, align, profile, anchor-map~/STRC/models/otoa_phase1_sequence_alignment.json— full metrics + features + profile
Connections
[part-of]STRC OTOA Paralog Cross-Rescue — parent hypothesis, Phase 1A gate PASSED[see-also]STRC OTOA Paralog Phase1B Structure Alignment — 3D follow-up[see-also]STRC Cross-Species Conservation Analysis — methodology template (same aligner + BLOSUM62)[see-also]STRC Pharmacochaperone K1141 Fragment Pocket — pocket partially conserved (F1646/E1659 yes; K1141/G1645 no)[see-also]STRC Electrostatic Analysis E1659A — E1659 identically conserved across STRC/OTOA paralogs[see-also]STRC Mini-STRC Single-Vector Hypothesis — competing therapy (fix STRC vs. recruit OTOA)[see-also]STRC TMEM145 GOLD Domain Interaction — ARM region is TMEM145 docking site; has OTOA homolog[about]Misha