• search hit 1 of 84
Back to Result List

NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language

Please always quote using this URN: urn:nbn:de:bvb:20-opus-223161
  • Background Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data. Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline.Background Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data. Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline. Results We present NGLess and NG-meta-profiler. NGLess is a domain specific language for describing next-generation sequence processing pipelines. It was developed with the goal of enabling user-friendly computational reproducibility. It provides built-in support for many common operations on sequencing data and is extensible with external tools with configuration files. Using this framework, we developed NG-meta-profiler, a fast profiler for metagenomes which performs sequence preprocessing, mapping to bundled databases, filtering of the mapping results, and profiling (taxonomic and functional). It is significantly faster than either MOCAT2 or htseq-count and (as it builds on NGLess) its results are perfectly reproducible. Conclusions NG-meta-profiler is a high-performance solution for metagenomics processing built on NGLess. It can be used as-is to execute standard analyses or serve as the starting point for customization in a perfectly reproducible fashion. NGLess and NG-meta-profiler are open source software (under the liberal MIT license) and can be downloaded from https://ngless.embl.de or installed through bioconda.show moreshow less

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar Statistics
Metadaten
Author: Luis Pedro Coelho, Renato Alves, Paulo Monteiro, Jaime Huerta-Cepas, Ana Teresa Freitas, Peer Bork
URN:urn:nbn:de:bvb:20-opus-223161
Document Type:Journal article
Faculties:Fakultät für Biologie / Theodor-Boveri-Institut für Biowissenschaften
Language:English
Parent Title (English):Microbiome
Year of Completion:2019
Volume:7
Issue:84
Source:Microbiome (2019) 7:84. https://doi.org/10.1186/s40168-019-0684-8
DOI:https://doi.org/10.1186/s40168-019-0684-8
Dewey Decimal Classification:5 Naturwissenschaften und Mathematik / 57 Biowissenschaften; Biologie / 570 Biowissenschaften; Biologie
Tag:domain-specific language; metagenomics; next-generation sequencing
Release Date:2024/03/15
EU-Project number / Contract (GA) number:686070
EU-Project number / Contract (GA) number:669830
OpenAIRE:OpenAIRE
Licence (German):License LogoCC BY: Creative-Commons-Lizenz: Namensnennung 4.0 International