De Novo Design Software — Scoring-Strategy Catalog

P0 reference — verbatim Table 1.3 from 2014-schneider-de-novo-molecular-design-book §1.6 (Schneider & Baringhaus, pp. ~33–35). Chronological catalog of named de novo drug-design software (1989–2013) with the implemented compound-scoring strategy (receptor-based, ligand-based, or both). Useful for citing prior art when an STRC scoring script claims to follow a specific scoring class.

The two element-occurrence statistics from the same chapter (Table 1.2 — H 46.2%, C 40.0%, O 6.2%, N 5.6%, F 0.8%, S 0.6%, Cl 0.4%, P 0.1%, other 0.1% — across 12,647 drugs in COBRA v12.6) and the GDB chemical-universe estimate (166 × 10⁹ molecules ≤ 17 heavy atoms) bracket the scope of the search problem.

Verbatim Table 1.3

De novo design methodYear of publicationReceptor-basedLigand-based
HSITE/2D skeletons1989X
3D skeletons1990X
Builder v11992X
LUDI1992X
NEWLEAD1993X
SPLICE1993X
GroupBuild1993X
CONCEPTS1993X
SPROUT1993X
MCSS and HOOK1994X
GrowMol1994X
Chemical Genesis1995XX
PRO_LIGAND1995XX
SMoG1996X
CONCERTS1996X
RASSE1996X
PRO_SELECT1997X
Skelgen1997XX
Nachbar1998X
Globus1999X
DycoBlock1999X
LEA2000X
LigBuilder2000X
TOPAS2000X
F-DycoBlock2001X
ADAPT2001X
Pellegrini and Field2003XX
SYNOPSIS2003X
CoG2004X
BREED2004X
Nikitin2005X
LEA3D2005X
Flux2006X
FlexNovo2006X
Feher2008X
GANDI2008XX
COLIBREE2008X
SQUIRRELnovo2009X
Hecht and Fogel2009XX
FOG2009X
MED-hybridize2009X
MEGA2009XX
Fragment-shuffling2009XX
AutoGrow2009X
NovoFLAP2010X
PhDD2010X
GARLig2010X
DOGS2010X
White and Wilson2010X
Qsearch2011X
EvoMD2011X
Contour2012X
MOEA2013X
Ulrich2013X

— Verbatim from Schneider 2014, Table 1.3, Ch. 1 (adapted in turn from Ref. [162] of Ch. 1).

Scoring-class equations (referenced from Schneider 2014 §1.6.1)

Force-field-based receptor scoring (Eq. 1.10):

E = Σ_{i∈ligand, j∈receptor} [ A_ij/r_ij^12 − B_ij/r_ij^6 + (q_i q_j)/(ε r_ij) ]

Empirical scoring function (Eq. 1.11):

ΔG = ΔG_0 + Σ_i [ΔG_i · count_i · penalty_i]

Knowledge-based scoring function (Eq. 1.12, Boltzmann-inverted atom-pair statistics):

E(i,j) = −k_B T ln [ p_ij^observed(r) / p_ij^expected(r) ]

How to use in STRC

  • h01 phase 4 docking pipeline: Vina is a force-field-based scorer (Eq. 1.10 family); cite Schneider 2014 §1.6.1 in phase4b.py docstring. AutoDock-class scoring is also force-field-class.
  • h01 phase 4f MM-GBSA: force-field family with continuum solvent correction. Cite Schneider 2014 §1.6.1 + §16.2.5.2 (Westermaier & Hubbard) for the empirical-correction layer.
  • Knowledge-based scoring lineage: SMoG (1996) is the first knowledge-based de novo scorer. If h01 ever moves to knowledge-based scoring (e.g., DSX-CSD), this lineage can be cited.
  • DOGS (Schneider’s lab, 2010): ligand-based, fragment-grow with reaction-driven assembly. Same lineage as h01 v4 scaffold-hop attempts. Provides a literature precedent.
  • LigBuilder (2000): receptor-based fragment-grow, used in Schneider 2014 §1.7 to discover EYA2 inhibitor 8 (IC50 = 6 µM) and BRAF inhibitor 9 (IC50 = 0.4 µM). Cited as a viable in-pocket fragment-grow method — relevant if h01 v4 fragment-grow is implemented from scratch.

Connections