Lemonite: identification of regulatory metabolites through data-driven, interpretable integration of transcriptomics and metabolomics data
Lemonite: identification of regulatory metabolites through data-driven, interpretable integration of transcriptomics and metabolomics data
Vandemoortele, B.; Devlies, H.; Michoel, T.; Vanhaecke, L.; Vandenbroucke, R. E.; Laukens, D.; Vermeirssen, V.
AbstractBiological regulation emerges from coordinated interactions between genes, proteins, and metabolites; yet, despite their central regulatory potential, metabolites remain largely absent from genome-wide gene regulatory network inference. Current transcriptomics-metabolomics integration approaches are either limited by poor interpretability or constrained by incomplete prior knowledge, preventing the systematic identification of regulatory metabolites. Here, we present Lemonite, a data-driven and interpretable framework for integrating bulk transcriptomics and metabolomics data to uncover regulatory metabolites acting on gene modules. Lemonite extends module network inference to jointly associate transcription factors and metabolites with coexpressed gene programs, without requiring prior differential analysis or complete metabolite annotation. To contextualize predictions, we construct a comprehensive gene-metabolite knowledge graph integrating over 370,000 metabolite-gene and 2.1 million protein-protein interactions. Applied to glioblastoma and inflammatory bowel disease cohorts, Lemonite identifies over 50 functionally coherent gene modules per disease, revealing established and previously uncharacterized metabolite-gene regulatory relationships. In glioblastoma, myo-inositol and phosphatidylcholines, together with IRF6, regulate mesenchymal-like immune programs, which upon integration with single-cell transcriptomics are primarily expressed in tumor-associated macrophages and monocytes. In inflammatory bowel disease, regulatory metabolites are prioritized that change the expression of their predicted target genes in vitro. Together, Lemonite provides a principled framework to explore the genome-wide regulatory potential of the metabolome and to generate biologically interpretable, experimentally testable hypotheses from multi-omics data.