Science Cast

Assemblies of long-read metagenomes suffer from diverse errors

librarianApril 25, 2025 12:56am

Views (0)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Assemblies of long-read metagenomes suffer from diverse errors

bioRxivPDFApril 24, 2025 12:00am

Authors

Trigodet, F.; Sachdeva, R.; Banfield, J. F.; Eren, A. M.

Abstract

Genomes from metagenomes have revolutionised our understanding of microbial diversity, ecology, and evolution, propelling advances in basic science, biomedicine, and biotechnology. Assembly algorithms that take advantage of increasingly available long-read sequencing technologies bring the recovery of complete genomes directly from metagenomes within reach. However, assessing the accuracy of the assembled long reads, especially from complex environments that often include poorly studied organisms, poses remarkable challenges. Here we show that erroneous reporting is pervasive among long-read assemblers and can take many forms, including multi-domain chimeras, prematurely circularized sequences, haplotyping errors, excessive repeats, and phantom sequences. Our study highlights the need for rigorous evaluation of the algorithms while they are in development, and options for users who may opt for more accurate reads than shorter runtimes.

TwitterandLinkedIn

0 comments

Add comment

Assemblies of long-read metagenomes suffer from diverse errors

Assemblies of long-read metagenomes suffer from diverse errors

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments