Empowering medaka fish biology with versatile genomic resources in MedakaBase
Empowering medaka fish biology with versatile genomic resources in MedakaBase
Morikami, K.; Tanizawa, Y.; Yagura, M.; Sakamoto, M.; Kawamoto, S.; Nakamura, Y.; Yamaguchi, K.; Shigenobu, S.; Naruse, K.; Ansai, S.; Kuraku, S.
AbstractMedaka, a group of small, mostly freshwater fishes in the teleost order Beloniformes, includes the rice fish Oryzias latipes, which is a prominent model organism for diverse biological fields. Chromosome-scale genome sequences of the Hd-rR strain of this species were obtained in 2007, and its improved version has facilitated various genome-wide studies. However, despite its widespread utility, omics data for O. latipes are dispersed across various public databases and lack a centralized platform. To address this, the medaka section of the National Bioresource Project (NBRP) of Japan established a genome informatics team in 2022 tasked with providing versatile in silico solutions for bench biologists. This initiative led to the launch of MedakaBase (https://medakabase.nbrp.jp), a web server that enables gene-oriented analysis including exhaustive sequence similarity searches. MedakaBase also provides genome-wide browsing of diverse datasets, including tissue-specific transcriptomes and intraspecific genomic variations, integrated with gene models from different sources. Additionally, the platform offers gene models optimized for single-cell transcriptome analysis, which often requires coverage of the 3\' untranslated region (UTR) of transcripts. Currently, MedakaBase provides genome-wide data for seven Oryzias species, including original data for O. mekongensis and O. luzonensis produced by the NBRP team. This article outlines technical details behind the data provided by MedakaBase.