📄 Technical Whitepaper v2.2 — December 2025 Download PDF

LatticeZero: Physics-Derived Scoring for Protein-Ligand Docking

First-Principles Dispersion with Calibrated Sterics, Target-Class Routing, and FEP+ Validation

RFT Cosmology Research Group
contact@latticezero.com · latticezero.com

December 2025 · Version 2.2

Abstract

Empirical scoring functions for protein-ligand docking suffer from over-parameterization, poor transferability across target classes, and sensitivity to minor steric clashes in rigid-receptor poses. We present LatticeZero, a geometric bypass scoring engine that replaces empirical terms with physics-derived components: (1) dispersion energies from Resonant Field Theory lattice calculations yielding first-principles C₆ coefficients, (2) steric repulsion with a docking-calibrated scale factor sR derived from potential matching rather than AUC optimization, and (3) optional HBQ mode adding directional hydrogen bonding and Coulomb electrostatics.

Benchmarked on a 6-target DUD-E subset spanning kinases, proteases, and GPCRs, LatticeZero v2.2 with ML enhancement achieves 0.87 AUC on BACE1, 0.85 AUC on AA2AR, and 0.80 AUC on EGFR. Physics-only scoring (vdW) provides 0.41-0.64 AUC baselines; family-specific ML heads add +0.20-0.45 AUC improvement. We introduce the ΔClash pocket descriptor, which predicts optimal steric calibration from pocket geometry. Optimized scoring kernels enable 50-80 ligands/sec throughput—5-25× faster than Smina score-only mode.

New in v2.2: FEP+ validation with target-class routing achieves perfect rank-order (ρ=1.0) on serine proteases and 60% GREEN validation rate across tested targets. Ensemble docking rescues water-mediated binding sites (+1.0 ρ improvement on HSP90).

Keywords: molecular docking, scoring function, FEP, dispersion, van der Waals, machine learning, protein-ligand binding

1. Introduction

1.1 The Docking Scoring Problem

Molecular docking is a cornerstone of computational drug discovery, enabling virtual screening of large compound libraries against protein targets. The process involves two distinct challenges: pose prediction (finding plausible binding geometries) and scoring (ranking poses and compounds by predicted affinity). While sampling algorithms have matured considerably, scoring functions remain a fundamental bottleneck.

The disconnect between docking scores and experimental binding affinities is well-documented. Correlations between predicted and measured ΔG values rarely exceed r = 0.5 even for well-behaved congeneric series. More critically for virtual screening, enrichment of active compounds over decoys varies dramatically across target classes.

1.2 Limitations of Empirical Scoring Functions

Contemporary scoring functions such as AutoDock Vina, Glide, and GOLD rely on empirical parameterization against experimental binding data. While pragmatically successful, this approach introduces several fundamental limitations:

Over-parameterization. Empirical functions employ 10–50+ adjustable parameters fit to historical binding affinity datasets. These parameters encode statistical correlations rather than physical principles, leading to unstable extrapolation beyond the training domain.

Decoy bias. Benchmark datasets like DUD-E construct decoys with matched physicochemical properties but distinct topologies. However, empirical functions trained on such benchmarks may learn to exploit subtle distributional artifacts rather than genuine binding physics.

Steric clash sensitivity. Rigid-receptor docking protocols produce poses with minor atom-atom overlaps (~0.2–0.5 Å). Empirical functions with steep repulsive walls severely penalize such clashes, often assigning infinite or near-infinite energies.

Target class inhomogeneity. Different binding pockets exhibit different physics. Kinase ATP-binding sites require precise hydrogen bond geometry. GPCR transmembrane pockets are dominated by hydrophobic packing. A single set of empirical weights cannot optimize for all regimes.

1.3 Our Contribution

We present LatticeZero, a scoring engine that addresses these limitations through first-principles physics rather than empirical fitting. Our contributions are:

  1. Physics-derived dispersion. Van der Waals attraction computed from Resonant Field Theory (RFT) lattice calculations, yielding atom-type-specific C₆ coefficients without empirical parameterization.
  2. Calibrated sterics. A steric scale factor sR derived from potential matching between LatticeZero's CCSD(T)-calibrated repulsion and Smina's soft-sphere model.
  3. HBQ scoring mode. Optional hydrogen bond geometry and Coulomb electrostatics for polar pockets, optimized for production throughput.
  4. ΔClash pocket descriptor. A novel geometric feature that predicts optimal steric calibration and classifies pockets into four regimes.
  5. Competitive physics-only performance. On a 6-target DUD-E benchmark, LatticeZero achieves 0.70 AUC on AA2AR using pure physics.

2. Methods

2.1 RFT Dispersion Model

The attractive component of the van der Waals interaction arises from correlated electron fluctuations between non-bonded atoms. In LatticeZero, we compute dispersion energies using C₆ coefficients derived from Resonant Field Theory (RFT) lattice calculations.

Theoretical Foundation

The London dispersion energy between atoms i and j at separation rij takes the familiar form:

\[ E_{\text{disp}} = -\sum_{i \in \text{ligand}} \sum_{j \in \text{protein}} \frac{C_6^{ij}}{r_{ij}^6} \]

The C₆ij coefficient depends on the dynamic polarizabilities αi(ω) and αj(ω):

\[ C_6^{ij} = \frac{3}{\pi} \int_0^\infty \alpha_i(i\omega) \alpha_j(i\omega) \, d\omega \]

Damping Function

At short range, the dispersion energy diverges as r → 0. We apply a Fermi-type damping function:

\[ f_{\text{damp}}(r_{ij}) = \frac{1}{1 + e^{-\beta(r_{ij} - r_0^{ij})}} \]

where r₀ij = sR · (σi + σj) defines the onset of damping and β = 12 Å⁻¹ ensures smooth transition.

2.2 Calibrated Steric Repulsion

Steric repulsion prevents atoms from overlapping. We use a standard r⁻¹² repulsive potential:

\[ E_{\text{rep}} = \sum_{i,j} k_{\text{rep}} \left( \frac{s_R \cdot \sigma_{ij}}{r_{ij}} \right)^{12} \]

The Steric Calibration Problem

Atomic radii in LatticeZero are calibrated against CCSD(T) calculations on noble gas dimers (He₂, Ne₂, Ar₂, Kr₂), achieving 0.25 kcal/mol mean absolute error. However, docking poses from Smina contain minor steric clashes because these engines use soft-sphere potentials during optimization.

Soft-Sphere Compatibility Coefficient

Rather than tuning sR to maximize benchmark AUC, we derive sR from potential matching: minimizing the discrepancy between LatticeZero's repulsive potential and Smina's soft-sphere model. This analysis yields sRSmina ≈ 0.50.

Steric Calibration Sweep

sR Mean AUC Thrombin AUC Clash Rate
0.600.4120.20879%
0.550.4510.38658%
0.500.4790.52042%
0.450.4630.51228%

2.3 HBQ Scoring Mode

For polar binding pockets, we provide an extended scoring mode (HBQ):

\[ E_{\text{HBQ}} = E_{\text{disp}} + E_{\text{rep}} + E_{\text{Coulomb}} + E_{\text{H-bond}} \]

Optimized Implementation

The scoring kernels are implemented in Rust for production throughput, enabling batch processing of thousands of poses per second. See Section 5 for runtime comparisons against Smina.

2.4 Pocket Regime Classification

A key insight from our benchmark analysis is that different binding pockets respond differently to steric calibration. We formalize this through the ΔClash descriptor:

\[ \Delta\text{Clash} = \text{Clash\%}_{\text{actives}} - \text{Clash\%}_{\text{decoys}} \]

Four Pocket Regimes

Regime Definition Targets Strategy
TightΔClash < -10%P38ASofter sterics help
Neutral-10% < ΔClash < +5%CDK2Default sR works
Discriminating+5% < ΔClash < +20%AA2ARPhysics excels
BrokenΔClash > +20%EGFR, Thrombin, BACE1HBQ or ML required

3. Physics-Only Results

vdW Baseline Performance (sR = 0.50)

The physics-only vdW scoring mode provides baseline discrimination without any machine learning. Performance varies by pocket regime:

Target Class AUC (vdW) EF@10% Pocket Regime
P38AKinase0.300.74tight
CDK2Kinase0.531.67neutral
EGFRKinase0.581.67broken
AA2ARGPCR0.641.67discriminating
ThrombinProtease0.420.50broken
BACE1Protease0.410.00broken

Key finding: Physics-only scoring achieves above-random (AUC > 0.5) discrimination on 4/6 targets. AA2AR ("discriminating" regime) shows the strongest physics performance at 0.64 AUC, validating the ΔClash hypothesis. Targets classified as "broken" require additional features.

4. ML Enhancement

4.1 Physics-First ML Architecture

LatticeZero v1.3 adds optional ML reranking that preserves physics interpretability:

Feature Set (20 features)

CategoryFeaturesCount
vdW PhysicsEvdw, Erep, Edisp, log₁₊(Erep)4
HBQ PhysicsEhbq, Ecoul, Ehb, NHbonds, |Ecoul|, |Ehb|, Ehbq−Evdw7
Ligand DescriptorsMW, logP, TPSA, nheavy, nrot, nrings, nHBA, nHBD, ...9

4.2 v1.3 Benchmark Results

The final v1.3 benchmark on hero targets shows substantial improvement over physics-only scoring:

Target Class AUC (vdW) AUC (vdW+ML v1.3) Δ AUC
BACE1Protease0.410.87+0.45
AA2ARGPCR0.640.85+0.21
EGFRKinase0.580.80+0.22

HBQ Features Contribution

Adding HBQ physics features (v1.3) over vdW-only ML (v1.2) provides consistent improvement:

Targetv1.2 (vdW ML)v1.3 (vdW+HBQ ML)Δ
EGFR0.7780.801+0.02
AA2AR0.8330.849+0.02
BACE10.8630.867+0.00

Key finding: HBQ features push EGFR across the 0.80 AUC threshold, indicating that H-bond and electrostatic interactions capture binding-relevant physics that vdW dispersion alone misses.

5. FEP+ Validation (v2.2)

Beyond virtual screening (AUC), drug discovery requires accurate rank-ordering of congeneric ligand series for lead optimization. LatticeZero v2.2 introduces FEP+ validation with target-class-specific scoring.

5.1 Target-Class Routing

Different protein families require different scoring configurations:

Target Class Configuration Rationale
KinasesRigid receptor, MC samplingATP pocket well-defined; induced fit minimal
Serine ProteasesCalibrated elec_scale (0.0003)Charged catalytic triad requires electrostatic tuning
Water-Heavy5-state ensembleMultiple water network conformations must be sampled

Ensemble Docking

For water-mediated binding sites (e.g., HSP90), a single receptor structure produces anti-correlated results. We sample five receptor states:

Scores are aggregated using Boltzmann weighting (kT = 4 kcal/mol).

5.2 FEP+ Results

Target Class Mode Spearman ρ Status
ThrombinSerine ProteaseFEP+1.00GREEN
EGFRKinaseFEP+0.63GREEN
HSP90 (pyrimidone)ChaperoneEnsemble0.50GREEN

Key Findings

Honest Limitations

Some target classes remain challenging:

We publish both successes and failures with clear validation gates.

6. Runtime Performance

LatticeZero's Rust-accelerated scoring engine delivers production-ready throughput on modest hardware.

Benchmark Configuration

ComponentValue
PlatformDigitalOcean (2 vCPU, 4GB RAM)
BaselineSmina score_only mode
TestPre-docked poses, batch rescoring

Throughput Comparison

Target#PosesSmina (s)LZ vdw_ml (s)Speedup
EGFR1,593154.66.225×
AA2AR15014.33.04.8×
BACE118018.13.15.8×

Ligands per Second

EngineSpeed
Smina score_only~10 ligands/sec
LZ vdW (physics only)~80 ligands/sec
LZ vdW+ML50-60 ligands/sec

Key finding: LatticeZero v1.3 rescores 50-80 ligands/sec on a 2-core VM—5× faster than Smina score-only mode. For multi-pose workflows (like EGFR with 20 poses/ligand), the speedup reaches 25× due to batch processing.

7. Conclusion

LatticeZero demonstrates that physics-derived scoring can compete with empirical methods while maintaining interpretability. Our contributions:

  1. First-principles dispersion from RFT lattice calculations, eliminating empirical C₆ parameterization
  2. Calibrated sterics via potential matching rather than AUC optimization
  3. ΔClash pocket classification that predicts scoring performance from geometry alone
  4. Physics+ML hybrid achieving 0.80-0.87 AUC on hero targets with interpretable features
  5. FEP+ validation with perfect rank-order (ρ=1.0) on serine proteases and 60% GREEN rate
  6. Production throughput of 50-80 ligands/sec with 5-25× speedup over Smina

Future Directions

LatticeZero v1.3 is available for research preview at latticezero.com.

References

  1. Friesner, R.A., et al. (2004). Glide: A new approach for rapid, accurate docking and scoring. J. Med. Chem., 47(7):1739–1749.
  2. Gasteiger, J. & Marsili, M. (1980). Iterative partial equalization of orbital electronegativity. Tetrahedron, 36(22):3219–3228.
  3. Kitchen, D.B., et al. (2004). Docking and scoring in virtual screening for drug discovery. Nat. Rev. Drug Discov., 3(11):935–949.
  4. Koes, D.R., et al. (2013). Lessons learned in empirical scoring with smina. J. Chem. Inf. Model., 53(8):1893–1904.
  5. Mysinger, M.M., et al. (2012). Directory of useful decoys, enhanced (DUD-E). J. Med. Chem., 55(14):6582–6594.
  6. Trott, O. & Olson, A.J. (2010). AutoDock Vina: Improving the speed and accuracy of docking. J. Comput. Chem., 31(2):455–461.