Finally, tools should be able to assemble metagenomes using the least computational resources possible. Furthermore, an assembler needs an intuitive and user-friendly interface to enable assembly with minimal effort and rapid processing of the metagenomic data. A good assembler should also utilize most of the raw sequence data to generate the largest assembly span possible. Long contigs allow for more accurate interpretation of full genes within a genomic context and facilitate the reconstruction of single genomes. Firstly, an assembler needs to produce a high proportion of long contigs (>1000 bp). Genome-centric questions require long contigs/scaffolds, while gene-centric questions require high confidence contigs and the assembly of a large proportion of the metagenomic dataset.Ĭonsidering the wealth of available assemblers, it is particularly important that researchers understand assembler performance, especially for investigators who lack appropriate bioinformatic expertise. Each of these questions require researchers to emphasise specific features of the metagenome. Improvements to assembly quality have greatly expanded the scope of questions that can be answered using shotgun metagenome sequencing including, for example: determination of microbial community composition and functional capacity, microbial population properties, comparisons of microbial communities from various environments, extraction of full genomes from metagenomes and genomics-informed microorganism isolation. Furthermore, de novo assembled metagenomes facilitate the discovery and reconstruction of novel genomes and/or genomic elements. De novo assembly is advantageous as it allows for more confident gene prediction than is attainable from unassembled data. In short, metagenomic sequences are split into predefined segments ( k-mers), which are overlapped into a network, and paths are traversed iteratively to create longer contigs. To this end, numerous metagenome assembly tools (assemblers) have been developed, the vast majority of which assemble sequences in de novo fashion. To assist in the accurate and thorough analysis of metagenomes, sequence data can be assembled into larger contiguous segments (contigs). Moreover, unassembled metagenomic sequence data are fragmented, noisy, error prone and contain uneven sequencing depths. Thus, raw sequence data alone are typically not sufficient for an in-depth analysis of a communities functional gene repertoire. Even though shotgun metagenomic sequencing provides comprehensive access to microbial genomic material, many of the encoded functional genes are substantially longer (~1000 bp ) than the length of reads provided by the sequencing platforms most commonly used for shotgun metagenomic studies (Illumina HiSeq 3000, 2 × 150 bp ). The ‘science’ of metagenomics has greatly accelerated the study of uncultured microorganisms in their natural environments, providing unparalleled insights into microbial community composition and putative functionality. We provide a concise workflow for the selection of the best assembly tool. We found that assembler choice ultimately depends on the scientific question, the available resources and the bioinformatic competence of the researcher. MEGAHIT emerged as a computationally inexpensive alternative to SPAdes, assembling the most complex dataset using less than 500 GB of RAM and within 10 hours. Overall, we found that SPAdes provided the largest contigs and highest N50 values across 6 of the 9 environmental datasets, followed by MEGAHIT and metaSPAdes. To assist with selection of an appropriate metagenome assembler we evaluated the capabilities of nine prominent assembly tools on nine publicly-available environmental metagenomes, as well as three simulated datasets. However, while several platforms have been developed for this critical step, there is currently no clear framework for the assembly of metagenomic sequence data. The analysis of metagenomic sequences facilitates gene prediction and annotation, and enables the assembly of draft genomes, including uncultured members of a community. Metagenomics allows unprecedented access to uncultured environmental microorganisms.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |