Extreme phase compression preserves buildable basins in macromolecular crystallography
Extreme phase compression preserves buildable basins in macromolecular crystallography
AMBROSIO, A. L. B.
AbstractMacromolecular crystallography is limited by the phase problem: diffraction experiments measure amplitudes but not the phases required to reconstruct electron density. Existing phasing routes usually seek enough continuous phase information for density modification and model building to converge. Here, we ask how much phase information can be discarded while preserving convergence. We analyzed 14,148 diffraction datasets from chiral crystals to characterize centric reflections in reciprocal-space asymmetric units. After conditioning by centric trace and, where required, index parity, the two theoretical symmetry-allowed phase values were populated near equally, close to 50:50, independent of space group, defining a compact symmetry scaffold. We then retained this exact scaffold while compressing reference acentric phases to a one-bit alphabet {0, {pi} }; as expected from their diffuse parent distribution, the assignments were also near-balanced. Although this binary representation, with fixed attenuation 2/{pi}, introduces large angular errors (mean of 52{degrees}), it frequently supported automated structure solution: in paired Phenix AutoBuild tests, 705 of 894 binary initializers met a conservative joint criterion of final Free R [≤]30% and relative chain recovery [≥]70%, within a 20.0-2.5 [A] resolution window. To rank candidate seeds without rebuilding, we developed a branch-balanced Basin Score from inexpensive density-modification and map-connectivity observables computed at 20.0-3.5 [A]. The empirical score quickly separates productive from unproductive initializers before AutoBuild. Controlled phase inversion shows that basin compatibility decays gradually and can reappear in an anti-phase-related branch, indicating that buildability is not confined to a single neighborhood around the reference phase set but extends to a much broader field. These results recast phase initialization as basin entry and support future symmetry-aware, binary phase-search strategies.