Probabilistic coupling of cellular and microenvironmental heterogeneity by masked self-supervised learning

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

Probabilistic coupling of cellular and microenvironmental heterogeneity by masked self-supervised learning

Authors

Kojima, Y.; Tanaka, Y.; Hirose, H.; Chiwaki, F.; Nishimura, K.; Hayashi, S.; Itahashi, K.; Ishikawa, M.; Shimamura, T.; Mano, H.

Abstract

Spatial omics technologies have advanced to single-cell resolution, enabling systematic analysis of tissue microenvironments alongside cellular-state heterogeneity. However, computationally defining microenvironmental states at single-cell resolution and identifying representations most informative for biological discovery remain major challenges. Here we present Mievformer, a Transformer-based masked self-supervised framework that learns microenvironmental embeddings by encoding neighboring cellular states and relative spatial configurations to parameterize the conditional distribution of continuous cell states at central spatial positions. Through InfoNCE optimization, Mievformer learns representations that capture the relative enrichment of cell states across microenvironments, formalized as a conditional density ratio, thereby enabling probabilistic inference of the coupling between microenvironmental and cellular heterogeneity. Mievformer outperformed existing methods in niche clustering on simulated spatial transcriptomics data and achieved the highest average performance across five real datasets spanning three spatial transcriptomics platforms when evaluated using DREC, a ground-truth-free metric that most strongly correlated with ground-truth performance in simulations. Beyond conventional clustering, Mievformer enables identification of cellular subpopulations based on their microenvironmental distribution and detection of gene-expression signatures associated with colocalization of specific cell populations. Together, these results establish Mievformer as a quantitatively robust and biologically informative framework for learning microenvironment representations in spatial omics.

Follow Us on

0 comments

Add comment