The vast microbial diversity on the planet represents an invaluable source for identifying novel activities with potential industrial and therapeutic applications. In this regard, metagenomics has emerged as a group of strategies that have significantly facilitated the analysis of DNA from multiple environments and have expanded the limits of known microbial diversity. However, the functional characterization of enzymes, metabolites, and products encoded by diverse microbial genomes is limited by the inefficient heterologous expression of foreign genes. We have implemented a pipeline that combines NGS and Sanger sequencing as a way to identify fosmids within metagenomic libraries. This strategy facilitated the identification of putative proteins, the subcloning of targeted genes, and the preliminary characterization of selected proteins. Overall, the in silico approach followed by experimental validation allowed us to efficiently recover the activity of previously hidden enzymes derived from agricultural soil samples. Therefore, the workflow described herein can be applied to recover activities encoded by environmental DNA from multiple sources.
Background
Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data.
Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline.
Results
We present NGLess and NG-meta-profiler. NGLess is a domain-specific language for describing next-generation sequence processing pipelines. It was developed with the goal of enabling user-friendly computational reproducibility. It provides built-in support for many common operations on sequencing data and can be extended with external tools through configuration files.
Using this framework, we developed NG-meta-profiler, a fast profiler for metagenomes which performs sequence preprocessing, mapping to bundled databases, filtering of the mapping results, and profiling (taxonomic and functional). It is significantly faster than either MOCAT2 or htseq-count and (as it builds on NGLess) its results are perfectly reproducible.
Conclusions
NG-meta-profiler is a high-performance solution for metagenomics processing built on NGLess. It can be used as-is to execute standard analyses or serve as the starting point for customization in a perfectly reproducible fashion.
NGLess and NG-meta-profiler are open source software (under the liberal MIT license) and can be downloaded from https://ngless.embl.de or installed through bioconda.
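The filtering and profiling steps named in this abstract can be illustrated with a minimal Python sketch. It is not NGLess code and not the NG-meta-profiler implementation: the function name `profile_abundance`, the tuple layout of the alignments, and the 0.97 identity cutoff are all assumptions made for illustration.

```python
from collections import Counter


def profile_abundance(alignments, min_identity=0.97):
    """Return the relative abundance per taxon.

    `alignments` is an iterable of (read_id, taxon, identity) tuples.
    Alignments below `min_identity` are dropped (filtering step); every
    remaining read adds one count to its taxon (profiling step).
    The 0.97 default is an illustrative assumption, not a value from the paper.
    """
    counts = Counter(
        taxon
        for _read_id, taxon, identity in alignments
        if identity >= min_identity
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {taxon: n / total for taxon, n in counts.items()}


# Hypothetical toy alignments, not data from the paper.
toy_alignments = [
    ("r1", "Bacteroides", 0.99),
    ("r2", "Bacteroides", 0.98),
    ("r3", "Prevotella", 0.97),
    ("r4", "Prevotella", 0.80),  # below the cutoff, so it is filtered out
]
toy_profile = profile_abundance(toy_alignments)
```

A real profiler also has to deal with multi-mapping reads and gene-length normalization, which this sketch leaves out.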
Background
Gut microbes influence their hosts in many ways, in particular by modulating the impact of diet. These effects have been studied most extensively in humans and mice. In this work, we used whole genome metagenomics to investigate the relationship between the gut metagenomes of dogs, humans, mice, and pigs.
Results
We present a dog gut microbiome gene catalog containing 1,247,405 genes (based on 129 metagenomes and a total of 1.9 terabasepairs of sequencing data). Based on this catalog and taxonomic abundance profiling, we show that the dog microbiome is closer to the human microbiome than the microbiome of either pigs or mice. To investigate this similarity in terms of response to dietary changes, we report on a randomized intervention with two diets (high-protein/low-carbohydrate vs. lower protein/higher carbohydrate). We show that diet has a large and reproducible effect on the dog microbiome, independent of breed or sex. Moreover, the responses were in agreement with those observed in previous human studies.
Conclusions
We conclude that findings in dogs may be predictive of human microbiome results. In particular, a novel finding is that overweight or obese dogs experience larger compositional shifts than lean dogs in response to a high-protein diet.
Outdoor dust carries a wide range of microbial agents that originate from land, transportation, human microbial flora (including pathogens and commensals), and the air. Dust aerosols are rich in bacterial communities that have a major impact on human health and living environments. In this study, outdoor samples from roadside barricades, safety walls, and fences (18 samples) were collected in Abu Dhabi, UAE, and bacterial diversity was assessed through a 16S rRNA amplicon next-generation sequencing approach. Clean data from HiSeq produced 1,099,892 total read pairs for the 18 samples. For all samples, taxonomic classifications were assigned to the representative sequence of each OTU (operational taxonomic unit) using the Ribosomal Database Project database. Analyses such as alpha diversity, beta diversity, differential species analysis, and species relative abundance were used to cluster the samples, and a functional profile heat map was derived from the OTUs using bioinformatics tools. A total of 2814 OTUs were identified from those samples with a coverage of more than 99%. At the phylum level, all 18 samples contained most of the major bacterial groups, such as Actinobacteria, Proteobacteria, Firmicutes, and Bacteroidetes. Twelve samples had Propionibacterium acnes, which was mainly found in RD16 and RD3. Major bacterial species such as Propionibacterium acnes, Bacillus persicus, and Staphylococcus capitis were found in all samples. Most of the samples had Streptococcus mitis, Staphylococcus capitis, and Nafulsella turpanensis, and Enhydrobacter aerosaccus was part of the normal microbes of the skin. Salinimicrobium sp., Bacillus alkalisediminis, and Bacillus persicus are halophilic bacteria found in sediments. The heat map clustered the samples and species along its vertical and horizontal axes, representing the relationship between the samples and bacterial diversity.
The functional profile heat map showed a high representation of amino acid, carbohydrate, and cofactor and vitamin metabolism for all bacterial species across all samples. Taken together, our analyses are highly relevant to outdoor air quality, airborne diseases, and epidemics, with broader implications for health safety and monitoring.
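The abstract does not say which alpha and beta diversity measures were used. As a hedged illustration only, the sketch below computes two common choices from per-sample OTU counts: the Shannon index (alpha diversity) and Bray-Curtis dissimilarity (beta diversity). The function names are hypothetical, and the choice of measures is an assumption.

```python
import math


def shannon_index(otu_counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over OTUs with nonzero counts."""
    total = sum(otu_counts)
    if total == 0:
        return 0.0
    h = 0.0
    for count in otu_counts:
        if count > 0:
            p = count / total
            h -= p * math.log(p)
    return h


def bray_curtis(counts_a, counts_b):
    """Bray-Curtis dissimilarity between two samples over the same OTU order."""
    if len(counts_a) != len(counts_b):
        raise ValueError("both samples must list the same OTUs in the same order")
    numerator = sum(abs(a - b) for a, b in zip(counts_a, counts_b))
    denominator = sum(counts_a) + sum(counts_b)
    return numerator / denominator if denominator else 0.0
```

An even community of four OTUs gives a Shannon index of ln 4, a sample with a single OTU gives 0, and two samples that share no OTUs have a Bray-Curtis dissimilarity of 1.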
Indoor house dust is a blend of organic and inorganic materials, upon which diverse microbial communities such as viruses, bacteria and fungi reside. Adequate moisture in the indoor environment helps microbial communities multiply quickly. The outdoor air and materials that are brought into buildings by airflow, sandstorms, pets and house occupants endow indoor dust particles with extra features that impact human health. The health effects of indoor dust particles, the types of indoor microbial inoculants, and the enzymes secreted by indoor insects as allergens merit detailed investigation. Here, we discuss the applications of next-generation sequencing (NGS) technology to assess the microbial diversity and abundance of indoor dust environments. Likewise, we discuss the applications of NGS to monitor the gene expression profiles of indoor human occupants, or their surrogate cellular models, when exposed to aqueous solutions of collected indoor dust samples. We also highlight detection methods for dust allergens and analytical procedures that quantify the chemical nature of indoor particulate matter with a potential impact on human health. Our review is thus unique in advocating interdisciplinary approaches that comprehensively assess the health effects of poor air quality in built environments.
Postencephalitic parkinsonism (PEP) is a disease of unknown etiology and pathophysiology following encephalitis lethargica (EL), an acute-onset polioencephalitis of cryptic cause in the 1920s. PEP is a tauopathy with multisystem neuronal loss and gliosis, clinically characterized by bradykinesia, rigidity, rest tremor, and oculogyric crises. Though a viral cause of EL is likely, past polymerase chain reaction-based investigations into the etiology of both PEP and EL were negative. PEP might be caused directly by an unknown viral pathogen or be the consequence of a post-infectious immunopathology. The recent development of metagenomic next-generation sequencing in conjunction with bioinformatic techniques has generated a broad-range tool for the detection of unknown pathogens. Retrospective identification and characterization of pathogens responsible for past infectious diseases can be successfully performed with formalin-fixed paraffin-embedded (FFPE) tissue samples. In this study, we analyzed 24 FFPE brain samples from six patients with PEP by unbiased metagenomic next-generation sequencing. We found no evidence for the presence of a specific or putative (novel) viral pathogen, suggesting a likely post-infectious immune-mediated etiology of PEP.
With the technological advances of the last decade, it is now feasible to analyze microbiome samples, such as human stool specimens, using multi-omic techniques. Given the inherent sample complexity, there is a need for sampling methods which preserve as much information as possible about the biological system at the time of sampling. Here, we analyzed human stool samples preserved and stored using different methods, applying metagenomics as well as metaproteomics. Our results demonstrate that sample preservation and storage have a significant effect on the taxonomic composition of identified proteins. The overall identification rates, as well as the proportion of proteins from Actinobacteria, were much higher when samples were flash frozen. Preservation in RNAlater overall led to fewer protein identifications and a considerable increase in the share of Bacteroidetes, as well as Proteobacteria. Additionally, a decrease in the share of metabolism-related proteins and an increase in the relative amount of proteins involved in the processing of genetic information were observed for RNAlater-stored samples. This suggests that great care should be taken in choosing methods for the preservation and storage of microbiome samples, as well as in comparing the results of analyses using different sampling and storage methods. Flash freezing and subsequent storage at −80 °C should be chosen wherever possible.
Microalgae are of high relevance for global carbon cycling, and it is well known that they are associated with a microbiota. However, it remains unclear whether the associated microbiota, often found in phycosphere biofilms, is specific to the microalgal strains and which role individual bacterial taxa play. Here we provide experimental evidence that Chlorella saccharophila, Scenedesmus quadricauda, and Micrasterias crux-melitensis, maintained in strain collections, are associated with unique and specific microbial populations. Deep metagenome sequencing, binning approaches, and secretome analyses in combination with RNA-Seq data implied fundamental differences in the gene expression profiles of the microbiota associated with the different microalgae. Our metatranscriptome analyses indicate that the transcriptionally most active bacteria with respect to key genes commonly involved in plant–microbe interactions in the Chlorella (Trebouxiophyceae) and Scenedesmus (Chlorophyceae) strains belong to the phylum of the α-Proteobacteria. In contrast, in the Micrasterias (Zygnematophyceae) phycosphere biofilm, bacteria affiliated with the phylum of the Bacteroidetes showed the highest gene expression rates. We furthermore show that effector molecules known from plant–microbe interactions as inducers of innate immunity are already of relevance at this evolutionarily early plant-microbiome level.
The gastrointestinal tract is abundantly colonized by microbes, yet the translocation of oral species to the intestine is considered a rare aberrant event, and a hallmark of disease. By studying salivary and fecal microbial strain populations of 310 species in 470 individuals from five countries, we found that transmission to, and subsequent colonization of, the large intestine by oral microbes is common and extensive among healthy individuals. We found evidence for a vast majority of oral species to be transferable, with increased levels of transmission in colorectal cancer and rheumatoid arthritis patients and, more generally, for species described as opportunistic pathogens. This establishes the oral cavity as an endogenous reservoir for gut microbial strains, and oral-fecal transmission as an important process that shapes the gastrointestinal microbiome in health and disease.
The biosphere harbors a large quantity and diversity of microbial organisms that can thrive in all environments. Estimates of the total number of microbial species reach up to 10^12, of which fewer than 15,000 have been characterized to date. It has been challenging to delineate phenotypically, evolutionarily and ecologically meaningful lineages such as species, subspecies and strains. Even within recognized species, gene content can vary considerably between sublineages (for example, strains), a problem that can be addressed by analyzing pangenomes, defined as the non-redundant set of genes within a phylogenetic clade, as evolutionary units.
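The pangenome definition above (the non-redundant set of genes within a clade) can be made concrete with a small Python sketch. The function names and the toy strains are hypothetical, and treating the core genome as the genes present in every genome is a strict simplifying assumption rather than the thesis's operational definition.

```python
def pangenome(genomes):
    """Union of gene families across all genomes of a clade."""
    result = set()
    for genes in genomes.values():
        result |= set(genes)
    return result


def core_genome(genomes):
    """Gene families present in every genome (strict core; an assumption)."""
    gene_sets = [set(genes) for genes in genomes.values()]
    if not gene_sets:
        return set()
    return set.intersection(*gene_sets)


# Hypothetical toy strains, not data from the thesis.
toy_strains = {
    "strain_A": {"dnaA", "gyrB", "lacZ"},
    "strain_B": {"dnaA", "gyrB", "blaTEM"},
    "strain_C": {"dnaA", "gyrB"},
}
toy_pan = pangenome(toy_strains)
toy_core = core_genome(toy_strains)
```

In this toy clade the pangenome holds four gene families and the core genome two, so half of the gene content varies between strains.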
Species are considered to be ecologically and evolutionarily coherent units; however, it is still not fully understood what the primary habitats and ecological niches of many prokaryotic species are, and how environmental preferences drive their genomic diversity. The majority of comparative genomics studies have focused on a single prokaryotic species in the context of clinical relevance and ecology. With the accumulation of sequencing data from genomics and metagenomics, it is now possible to investigate trends across many species, which will facilitate the understanding of pangenome evolution and of species and subspecies delineation.
The major aims of this thesis were 1) to annotate habitat preferences of prokaryotic species and strains; 2) to investigate to what extent these environmental preferences drive the genomic diversity of prokaryotes and to what extent phylogenetic constraints limit this diversification; 3) to explore natural nucleotide identity thresholds for delineating bacterial species in metagenomics gene catalogs; and 4) to explore species delineation for applications in subspecies and strain delineation in metagenomics.
The first part of the thesis describes methods to infer environmental preferences of microbial species. This data is a prerequisite for the analyses performed in the second part of the thesis, which explores how the structure of bacterial pangenomes is predetermined by past evolutionary history and how it is linked to the environmental preferences of the species. The main finding in this subchapter is that habitat preferences explained up to 49% of the variance in pangenome structure, compared to 18% explained by phylogenetic inertia. In general, this trend indicates that phylogenetic inertia does not limit the evolution of pangenome size and diversity, but that convergent evolution may overcome phylogenetic constraints. In this project we show that core genome size is associated with higher environmental ubiquity of species. This is likely because, to be highly prevalent, a species needs a more versatile genome, with most necessary genes present in the majority of its genomes. Taken together, these findings may be useful for future predictive analyses of ecological niches in newly discovered species.
The third part of the thesis explores data-driven, operational species boundaries. I show that homologous genes from different genomes of the same species tend to share at least 95% nucleotide identity, while different species within the same genus have lower nucleotide identity. This is in line with other studies showing that the genome-wide natural species boundary might lie in the range of 90-95% nucleotide identity. Finally, the fourth part of the thesis discusses how challenges in species delineation are relevant for the identification of meaningful within-species groups, followed by a discussion of how advancements in species delineation can be applied to the classification of within-species genomic diversity in the age of metagenomics.
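The 95% nucleotide identity boundary can be illustrated with a minimal Python sketch that scores one pair of pre-aligned homologous genes. The function names are hypothetical, and the sketch assumes equal-length, gap-padded input and ignores gapped columns; the thesis's actual procedure is not described in the abstract.

```python
def nucleotide_identity(aligned_a, aligned_b):
    """Fraction of identical bases over alignment columns without gaps."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # skip gapped columns (a simplifying assumption)
        compared += 1
        if x == y:
            matches += 1
    return matches / compared if compared else 0.0


def same_species(aligned_a, aligned_b, threshold=0.95):
    """Apply the operational boundary: identity at or above the threshold."""
    return nucleotide_identity(aligned_a, aligned_b) >= threshold
```

With 20 aligned bases, one mismatch gives 95% identity and falls on the same-species side of the boundary, while two mismatches give 90% and do not.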