OPUS Würzburg

Newly designed 16S rRNA metabarcoding primers amplify diverse and novel archaeal taxa from the environment (2019)

Bahram, Mohammad ; Anslan, Sten ; Hildebrand, Falk ; Bork, Peer ; Tedersoo, Leho

High-throughput studies of microbial communities suggest that Archaea are a widespread component of microbial diversity in various ecosystems. However, proper quantification of archaeal diversity and community ecology remains limited, as sequence coverage of Archaea is usually low owing to the inability of available prokaryotic primers to efficiently amplify archaeal compared to bacterial rRNA genes. To improve identification and quantification of Archaea, we designed and validated the utility of several primer pairs to efficiently amplify archaeal 16S rRNA genes based on up-to-date reference genes. We demonstrate that several of these primer pairs amplify phylogenetically diverse Archaea with high sequencing coverage, outperforming commonly used primers. Based on comparing the resulting long 16S rRNA gene fragments with public databases from all habitats, we found several novel family- to phylum-level archaeal taxa from topsoil and surface water. Our results suggest that archaeal diversity has been largely overlooked due to the limitations of available primers, and that improved primer pairs enable to estimate archaeal diversity more accurately.

Pervasive Protein Thermal Stability Variation during the Cell Cycle (2018)

Becher, Isabelle ; Andrés-Pons, Amparo ; Romanov, Natalie ; Stein, Frank ; Schramm, Maike ; Baudin, Florence ; Helm, Dominic ; Kurzawa, Nils ; Mateus, André ; Mackmull, Marie-Therese ; Typas, Athanasios ; Müller, Christoph W. ; Bork, Peer ; Beck, Martin ; Savitski, Mikhail M.

Quantitative mass spectrometry has established proteome-wide regulation of protein abundance and post-translational modifications in various biological processes. Here, we used quantitative mass spectrometry to systematically analyze the thermal stability and solubility of proteins on a proteome-wide scale during the eukaryotic cell cycle. We demonstrate pervasive variation of these biophysical parameters with most changes occurring in mitosis and G1. Various cellular pathways and components vary in thermal stability, such as cell-cycle factors, polymerases, and chromatin remodelers. We demonstrate that protein thermal stability serves as a proxy for enzyme activity, DNA binding, and complex formation in situ. Strikingly, a large cohort of intrinsically disordered and mitotically phosphorylated proteins is stabilized and solubilized in mitosis, suggesting a fundamental remodeling of the biophysical environment of the mitotic cell. Our data represent a rich resource for cell, structural, and systems biologists interested in proteome regulation during biological transitions.

A global ocean atlas of eukaryotic gene (2018)

While our knowledge about the roles of microbes and viruses in the ocean has increased tremendously due to recent advances in genomics and metagenomics, research on marine microbial eukaryotes and zooplankton has benefited much less from these new technologies because of their larger genomes, their enormous diversity, and largely unexplored physiologies. Here, we use a metatranscriptomics approach to capture expressed genes in open ocean Tara Oceans stations across four organismal size fractions. The individual sequence reads cluster into 116 million unigenes representing the largest reference collection of eukaryotic transcripts from any single biome. The catalog is used to unveil functions expressed by eukaryotic marine plankton, and to assess their functional biogeography. Almost half of the sequences have no similarity with known proteins, and a great number belong to new gene families with a restricted distribution in the ocean. Overall, the resource provides the foundations for exploring the roles of marine eukaryotes in ocean ecology and biogeochemistry.

OGEE v2: an update of the online gene essentiality database with special focus on differentially essential genes in human cancer cell lines (2017)

Chen, Wei-Hua ; Lu, Guanting ; Chen, Xiao ; Zhao, Xing-Ming ; Bork, Peer

OGEE is an Online GEne Essentiality database. To enhance our understanding of the essentiality of genes, in OGEE we collected experimentally tested essential and non-essential genes, as well as associated gene properties known to contribute to gene essentiality. We focus on large-scale experiments, and complement our data with text-mining results. We organized tested genes into data sets according to their sources, and tagged those with variable essentiality statuses across data sets as conditionally essential genes, intending to highlight the complex interplay between gene functions and environments/experimental perturbations. Developments since the last public release include increased number of species and gene essentiality data sets, inclusion of non-coding essential sequences and genes with intermediate essentiality statuses. In addition, we included 16 essentiality data sets from cancer cell lines, corresponding to 9 human cancers; with OGEE, users can easily explore the shared and differentially essential genes within and between cancer types. These genes, especially those derived from cell lines that are similar to tumor samples, could reveal the oncogenic drivers, paralogous gene expression pattern and chromosomal structure of the corresponding cancer types, and can be further screened to identify targets for cancer therapy and/or new drug development. OGEE is freely available at http://ogee.medgenius.info.

NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language (2019)

Coelho, Luis Pedro ; Alves, Renato ; Monteiro, Paulo ; Huerta-Cepas, Jaime ; Freitas, Ana Teresa ; Bork, Peer

Background Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data. Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline. Results We present NGLess and NG-meta-profiler. NGLess is a domain specific language for describing next-generation sequence processing pipelines. It was developed with the goal of enabling user-friendly computational reproducibility. It provides built-in support for many common operations on sequencing data and is extensible with external tools with configuration files. Using this framework, we developed NG-meta-profiler, a fast profiler for metagenomes which performs sequence preprocessing, mapping to bundled databases, filtering of the mapping results, and profiling (taxonomic and functional). It is significantly faster than either MOCAT2 or htseq-count and (as it builds on NGLess) its results are perfectly reproducible. Conclusions NG-meta-profiler is a high-performance solution for metagenomics processing built on NGLess. It can be used as-is to execute standard analyses or serve as the starting point for customization in a perfectly reproducible fashion. NGLess and NG-meta-profiler are open source software (under the liberal MIT license) and can be downloaded from https://ngless.embl.de or installed through bioconda.

Similarity of the dog and human gut microbiomes in gene content and response to diet (2018)

Coelho, Luis Pedro ; Kultima, Jens Roat ; Costea, Paul Igor ; Fournier, Coralie ; Pan, Yuanlong ; Czarnecki-Maulden, Gail ; Hayward, Matthew Robert ; Forslund, Sofia K. ; Schmidt, Thomas Sebastian Benedikt ; Descombes, Patrick ; Jackson, Janet R. ; Li, Qinghong ; Bork, Peer

Background Gut microbes influence their hosts in many ways, in particular by modulating the impact of diet. These effects have been studied most extensively in humans and mice. In this work, we used whole genome metagenomics to investigate the relationship between the gut metagenomes of dogs, humans, mice, and pigs. Results We present a dog gut microbiome gene catalog containing 1,247,405 genes (based on 129 metagenomes and a total of 1.9 terabasepairs of sequencing data). Based on this catalog and taxonomic abundance profiling, we show that the dog microbiome is closer to the human microbiome than the microbiome of either pigs or mice. To investigate this similarity in terms of response to dietary changes, we report on a randomized intervention with two diets (high-protein/low-carbohydrate vs. lower protein/higher carbohydrate). We show that diet has a large and reproducible effect on the dog microbiome, independent of breed or sex. Moreover, the responses were in agreement with those observed in previous human studies. Conclusions We conclude that findings in dogs may be predictive of human microbiome results. In particular, a novel finding is that overweight or obese dogs experience larger compositional shifts than lean dogs in response to a high-protein diet.

Subspecies in the global human gut microbiome (2017)

Costea, Paul I. ; Coelho, Louis Pedro ; Sunagawa, Shinichi ; Munch, Robin ; Huerta-Cepas, Jaime ; Forslund, Kristoffer ; Hildebrand, Falk ; Kushugulova, Almagul ; Zeller, Georg ; Bork, Peer

Population genomics of prokaryotes has been studied in depth in only a small number of primarily pathogenic bacteria, as genome sequences of isolates of diverse origin are lacking for most species. Here, we conducted a large‐scale survey of population structure in prevalent human gut microbial species, sampled from their natural environment, with a culture‐independent metagenomic approach. We examined the variation landscape of 71 species in 2,144 human fecal metagenomes and found that in 44 of these, accounting for 72% of the total assigned microbial abundance, single‐nucleotide variation clearly indicates the existence of sub‐populations (here termed subspecies). A single subspecies (per species) usually dominates within each host, as expected from ecological theory. At the global scale, geographic distributions of subspecies differ between phyla, with Firmicutes subspecies being significantly more geographically restricted. To investigate the functional significance of the delineated subspecies, we identified genes that consistently distinguish them in a manner that is independent of reference genomes. We further associated these subspecies‐specific genes with properties of the microbial community and the host. For example, two of the three Eubacterium rectale subspecies consistently harbor an accessory pro‐inflammatory flagellum operon that is associated with lower gut community diversity, higher host BMI, and higher blood fasting insulin levels. Using an additional 676 human oral samples, we further demonstrate the existence of niche specialized subspecies in the different parts of the oral cavity. Taken together, we provide evidence for subspecies in the majority of abundant gut prokaryotes, leading to a better functional and ecological understanding of the human gut microbiome in conjunction with its host.

Cell-specific proteome analyses of human bone marrow reveal molecular features of age-dependent functional decline (2018)

Hennrich, Marco L. ; Romanov, Natalie ; Horn, Patrick ; Jaeger, Samira ; Eckstein, Volker ; Steeples, Violetta ; Ye, Fei ; Ding, Ximing ; Poisa-Beiro, Laura ; Mang, Ching Lai ; Lang, Benjamin ; Boultwood, Jacqueline ; Luft, Thomas ; Zaugg, Judith B. ; Pellagatti, Andrea ; Bork, Peer ; Aloy, Patrick ; Gavin, Anne-Claude ; Ho, Anthony D.

Diminishing potential to replace damaged tissues is a hallmark for ageing of somatic stem cells, but the mechanisms remain elusive. Here, we present proteome-wide atlases of age-associated alterations in human haematopoietic stem and progenitor cells (HPCs) and five other cell populations that constitute the bone marrow niche. For each, the abundance of a large fraction of the ~12,000 proteins identified is assessed in 59 human subjects from different ages. As the HPCs become older, pathways in central carbon metabolism exhibit features reminiscent of the Warburg effect, where glycolytic intermediates are rerouted towards anabolism. Simultaneously, altered abundance of early regulators of HPC differentiation reveals a reduced functionality and a bias towards myeloid differentiation. Ageing causes alterations in the bone marrow niche too, and diminishes the functionality of the pathways involved in HPC homing. The data represent a valuable resource for further analyses, and for validation of knowledge gained from animal models.

Sample preservation and storage significantly impact taxonomic and functional profiles in metaproteomics studies of the human gut microbiome (2019)

Hickl, Oskar ; Heintz-Buschart, Anna ; Trautwein-Schult, Anke ; Hercog, Rajna ; Bork, Peer ; Wilmes, Paul ; Becher, Dörte

With the technological advances of the last decade, it is now feasible to analyze microbiome samples, such as human stool specimens, using multi-omic techniques. Given the inherent sample complexity, there exists a need for sample methods which preserve as much information as possible about the biological system at the time of sampling. Here, we analyzed human stool samples preserved and stored using different methods, applying metagenomics as well as metaproteomics. Our results demonstrate that sample preservation and storage have a significant effect on the taxonomic composition of identified proteins. The overall identification rates, as well as the proportion of proteins from Actinobacteria were much higher when samples were flash frozen. Preservation in RNAlater overall led to fewer protein identifications and a considerable increase in the share of Bacteroidetes, as well as Proteobacteria. Additionally, a decrease in the share of metabolism-related proteins and an increase of the relative amount of proteins involved in the processing of genetic information was observed for RNAlater-stored samples. This suggests that great care should be taken in choosing methods for the preservation and storage of microbiome samples, as well as in comparing the results of analyses using different sampling and storage methods. Flash freezing and subsequent storage at −80 °C should be chosen wherever possible.

Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees (2016)

Letunic, Ivica ; Bork, Peer

Interactive Tree Of Life (http://itol.embl.de) is a web-based tool for the display, manipulation and annotation of phylogenetic trees. It is freely available and open to everyone. The current version was completely redesigned and rewritten, utilizing current web technologies for speedy and streamlined processing. Numerous new features were introduced and several new data types are now supported. Trees with up to 100,000 leaves can now be efficiently displayed. Full interactive control over precise positioning of various annotation features and an unlimited number of datasets allow the easy creation of complex tree visualizations. iTOL 3 is the first tool which supports direct visualization of the recently proposed phylogenetic placements format. Finally, iTOL's account system has been redesigned to simplify the management of trees in user-defined workspaces and projects, as it is heavily used and currently handles already more than 500,000 trees from more than 10,000 individual users.

Author(s)
Title
Additional Person(s)
Referee(s)
Abstract
Fulltext

Refine

Has Fulltext

Is part of the Bibliography

Year of publication

Document Type

Language

Keywords

Author

Institute

EU-Project number / Contract (GA) number

20 search hits