Refine
Has Fulltext
- yes (31)
Is part of the Bibliography
- yes (31)
Year of publication
Document Type
- Journal article (29)
- Conference Proceeding (1)
- Doctoral Thesis (1)
Language
- English (31)
Keywords
- RNA-seq (4)
- escherichia coli (4)
- gene expression (3)
- transcriptomics (3)
- Antisense RNA (2)
- Rhodobacter sphaeroides (2)
- bacteria (2)
- dRNA-Seq (2)
- genome annotation (2)
- identification (2)
Institute
- Institut für Molekulare Infektionsbiologie (27)
- Medizinische Fakultät (3)
- Institut für Hygiene und Mikrobiologie (1)
- Institut für Pharmazie und Lebensmittelchemie (1)
- Julius-von-Sachs-Institut für Biowissenschaften (1)
- Klinik und Poliklinik für Psychiatrie, Psychosomatik und Psychotherapie (1)
- Lehrstuhl für Tissue Engineering und Regenerative Medizin (1)
- Theodor-Boveri-Institut für Biowissenschaften (1)
Sonstige beteiligte Institutionen
The topic of my doctorial research was the computational analysis of metagenomic data. A metagenome comprises the genomic information from all the microorganisms within a certain environment. The currently available metagenomic data sets cover only parts of these usually huge metagenomes due to the high technical and financial effort of such sequencing endeavors. During my thesis I developed bioinformatic tools and applied them to analyse genomic features of different metagenomic data sets and to search for enzymes of importance for biotechnology or pharmaceutical applications in those sequence collections. In these studies nine metagenomic projects (with up to 41 subsamples) were analysed. These samples originated from diverse environments like farm soil, acid mine drainage, microbial mats on whale bones, marine water, fresh water, water treatment sludges and the human gut flora. Additionally, data sets of conventionally retrieved sequence data were taken into account and compared with each other
Scientific research is a process concerned with the creation, collective accumulation, contextualization, updating and maintenance of knowledge. Wikis provide an environment that allows to collectively accumulate, contextualize, update and maintain knowledge in a coherent and transparent fashion. Here, we examine the potential of wikis as platforms for scholarly publishing. In the hope to stimulate further discussion, the article itself was drafted on Species-ID – a wiki that hosts a prototype for wiki-based scholarly publishing – where it can be updated, expanded or otherwise improved.
Despite the internet's dynamic and collaborative nature, scientists continue to produce grant proposals, lab notebooks, data files, conclusions etc. that stay in static formats or are not published online and therefore not always easily accessible to the interested public. Because of limited adoption of tools that seamlessly integrate all aspects of a research project (conception, data generation, data evaluation, peerreviewing and publishing of conclusions), much effort is later spent on reproducing or reformatting individual entities before they can be repurposed independently or as parts of articles.
We propose that workflows - performed both individually and collaboratively - could potentially become more efficient if all steps of the research cycle were coherently represented online and the underlying data were formatted, annotated and licensed for reuse. Such a system would accelerate the process of taking projects from conception to publication stages and allow for continuous updating of the data sets and their interpretation as well as their integration into other independent projects.
A major advantage of such work ows is the increased transparency, both with respect to the scientific process as to the contribution of each participant. The latter point is important from a perspective of motivation, as it enables the allocation of reputation, which creates incentives for scientists to contribute to projects. Such work ow platforms offering possibilities to fine-tune the accessibility of their content could gradually pave the path from the current static mode of research presentation into a more coherent practice of open science.
Although the DNA methyltransferase 2 family is highly conserved during evolution and recent reports suggested a dual specificity with stronger activity on transfer RNA (tRNA) than DNA substrates, the biological function is still obscure. We show that the Dictyostelium discoideum Dnmt2-homologue DnmA is an active tRNA methyltransferase that modifies C38 in \(tRNA^{Asp(GUC)}\) in vitro and in vivo. By an ultraviolet-crosslinking and immunoprecipitation approach, we identified further DnmA targets. This revealed specific tRNA fragments bound by the enzyme and identified \(tRNA^{Glu(CUC/UUC)}\) and \(tRNA^{Gly(GCC)}\) as new but weaker substrates for both human Dnmt2 and DnmA in vitro but apparently not in vivo. Dnmt2 enzymes form transient covalent complexes with their substrates. The dynamics of complex formation and complex resolution reflect methylation efficiency in vitro. Quantitative PCR analyses revealed alterations in dnmA expression during development, cell cycle and in response to temperature stress. However, dnmA expression only partially correlated with tRNA methylation in vivo. Strikingly, dnmA expression in the laboratory strain AX2 was significantly lower than in the NC4 parent strain. As expression levels and binding of DnmA to a target in vivo are apparently not necessarily accompanied by methylation, we propose an additional biological function of DnmA apart from methylation.
Integrative "Omics"-Approach Discovers Dynamic and Regulatory Features of Bacterial Stress Responses
(2013)
Bacteria constantly face stress conditions and therefore mount specific responses to ensure adaptation and survival. Stress responses were believed to be predominantly regulated at the transcriptional level. In the phototrophic bacterium Rhodobacter sphaeroides the response to singlet oxygen is initiated by alternative sigma factors. Further adaptive mechanisms include post-transcriptional and post-translational events, which have to be considered to gain a deeper understanding of how sophisticated regulation networks operate. To address this issue, we integrated three layers of regulation: (1) total mRNA levels at different time-points revealed dynamics of the transcriptome, (2) mRNAs in polysome fractions reported on translational regulation (translatome), and (3) SILAC-based mass spectrometry was used to quantify protein abundances (proteome). The singlet oxygen stress response exhibited highly dynamic features regarding short-term effects and late adaptation, which could in part be assigned to the sigma factors RpoE and RpoH2 generating distinct expression kinetics of corresponding regulons. The occurrence of polar expression patterns of genes within stress-inducible operons pointed to an alternative of dynamic fine-tuning upon stress. In addition to transcriptional activation, we observed significant induction of genes at the post-transcriptional level (translatome), which identified new putative regulators and assigned genes of quorum sensing to the singlet oxygen stress response. Intriguingly, the SILAC approach explored the stress-dependent decline of photosynthetic proteins, but also identified 19 new open reading frames, which were partly validated by RNA-seq. We propose that comparative approaches as presented here will help to create multi-layered expression maps on the system level ("expressome"). Finally, intense mass spectrometry combined with RNA-seq might be the future tool of choice to re-annotate genomes in various organisms and will help to understand how they adapt to alternating conditions.
Campylobacter jejuni is currently the leading cause of bacterial gastroenteritis in humans. Comparison of multiple Campylobacter strains revealed a high genetic and phenotypic diversity. However, little is known about differences in transcriptome organization, gene expression, and small RNA (sRNA) repertoires. Here we present the first comparative primary transcriptome analysis based on the differential RNA–seq (dRNA–seq) of four C. jejuni isolates. Our approach includes a novel, generic method for the automated annotation of transcriptional start sites (TSS), which allowed us to provide genome-wide promoter maps in the analyzed strains. These global TSS maps are refined through the integration of a SuperGenome approach that allows for a comparative TSS annotation by mapping RNA–seq data of multiple strains into a common coordinate system derived from a whole-genome alignment. Considering the steadily increasing amount of RNA–seq studies, our automated TSS annotation will not only facilitate transcriptome annotation for a wider range of pro- and eukaryotes but can also be adapted for the analysis among different growth or stress conditions. Our comparative dRNA–seq analysis revealed conservation of most TSS, but also single-nucleotide-polymorphisms (SNP) in promoter regions, which lead to strain-specific transcriptional output. Furthermore, we identified strain-specific sRNA repertoires that could contribute to differential gene regulation among strains. In addition, we identified a novel minimal CRISPR-system in Campylobacter of the type-II CRISPR subtype, which relies on the host factor RNase III and a trans-encoded sRNA for maturation of crRNAs. This minimal system of Campylobacter, which seems active in only some strains, employs a unique maturation pathway, since the crRNAs are transcribed from individual promoters in the upstream repeats and thereby minimize the requirements for the maturation machinery. Overall, our study provides new insights into strain-specific transcriptome organization and sRNAs, and reveals genes that could modulate phenotypic variation among strains despite high conservation at the DNA level.
Background
Prokaryotes have relatively small genomes, densely-packed with protein-encoding sequences. RNA sequencing has, however, revealed surprisingly complex transcriptomes and here we report the transcripts present in the model hyperthermophilic Archaeon, Thermococcus kodakarensis, under different physiological conditions.
Results
Sequencing cDNA libraries, generated from RNA isolated from cells under different growth and metabolic conditions has identified >2,700 sites of transcription initiation, established a genome-wide map of transcripts, and consensus sequences for transcription initiation and post-transcription regulatory elements. The primary transcription start sites (TSS) upstream of 1,254 annotated genes, plus 644 primary TSS and their promoters within genes, are identified. Most mRNAs have a 5'-untranslated region (5'-UTR) 10 to 50 nt long (median = 16 nt), but ~20% have 5'-UTRs from 50 to 300 nt long and ~14% are leaderless. Approximately 50% of mRNAs contain a consensus ribosome binding sequence. The results identify TSS for 1,018 antisense transcripts, most with sequences complementary to either the 5'- or 3'-region of a sense mRNA, and confirm the presence of transcripts from all three CRISPR loci, the RNase P and 7S RNAs, all tRNAs and rRNAs and 69 predicted snoRNAs. Two putative riboswitch RNAs were present in growing but not in stationary phase cells. The procedure used is designed to identify TSS but, assuming that the number of cDNA reads correlates with transcript abundance, the results also provide a semi-quantitative documentation of the differences in T. kodakarensis genome expression under different growth conditions and confirm previous observations of substrate-dependent specific gene expression. Many previously unanticipated small RNAs have been identified, some with relative low GC contents (≤50%) and sequences that do not fold readily into base-paired secondary structures, contrary to the classical expectations for non-coding RNAs in a hyperthermophile.
Conclusion
The results identify >2,700 TSS, including almost all of the primary sites of transcription initiation upstream of annotated genes, plus many secondary sites, sites within genes and sites resulting in antisense transcripts. The T. kodakarensis genome is small (~2.1 Mbp) and tightly packed with protein-encoding genes, but the transcriptomes established also contain many non-coding RNAs and predict extensive RNA-based regulation in this model Archaeon.
Base J, beta-d-glucosyl-hydroxymethyluracil, is an epigenetic modification of thymine in the nuclear DNA of flagellated protozoa of the order Kinetoplastida. J is enriched at sites involved in RNA polymerase ( RNAP) II initiation and termination. Reduction of J in Leishmania tarentolae via growth in BrdU resulted in cell death and indicated a role of J in the regulation of RNAP II termination. To further explore J function in RNAP II termination among kinetoplastids and avoid indirect effects associated with BrdU toxicity and genetic deletions, we inhibited J synthesis in Leishmania major and Trypanosoma brucei using DMOG. Reduction of J in L. major resulted in genome-wide defects in transcription termination at the end of polycistronic gene clusters and the generation of antisense RNAs, without cell death. In contrast, loss of J in T. brucei did not lead to genome-wide termination defects; however, the loss of J at specific sites within polycistronic gene clusters led to altered transcription termination and increased expression of downstream genes. Thus, J regulation of RNAP II transcription termination genome-wide is restricted to Leishmania spp., while in T. brucei it regulates termination and gene expression at specific sites within polycistronic gene clusters.
As matchmaker between mRNA and sRNA interactions, the RNA chaperone Hfq plays a key role in riboregulation of many bacteria. Often, the global influence of Hfq on the transcriptome is reflected by substantially altered proteomes and pleiotropic phenotypes in hfq mutants. Using quantitative proteomics and co-immunoprecipitation combined with RNA-sequencing (RIP-seq) of Hfq-bound RNAs, we demonstrate the pervasive role of Hfq in nutrient acquisition, metabolism and motility of the plant pathogen Agrobacterium tumefaciens. 136 of 2544 proteins identified by iTRAQ (isobaric tags for relative and absolute quantitation) were affected in the absence of Hfq. Most of them were associated with ABC transporters, general metabolism and motility. RIP-seq of chromosomally encoded Hfq 3xFlag revealed 1697 mRNAs and 209 non-coding RNAs (ncRNAs) associated with Hfq. 56 ncRNAs were previously undescribed. Interestingly, 55% of the Hfq-bound ncRNAs were encoded antisense (as) to a protein-coding sequence suggesting that A. tumefaciens Hfq plays an important role in asRNA-target interactions. The exclusive enrichment of 296 mRNAs and 31 ncRNAs under virulence conditions further indicates a role for post-transcriptional regulation in A. tumefaciens-mediated plant infection. On the basis of the iTRAQ and RIP-seq data, we assembled a comprehensive model of the Hfq core regulon in A. tumefaciens.
Role of oxygen and the OxyR protein in the response to iron limitation in Rhodobacter sphaeroides
(2014)
Background: High intracellular levels of unbound iron can contribute to the production of reactive oxygen species (ROS) via the Fenton reaction, while depletion of iron limits the availability of iron-containing proteins, some of which have important functions in defence against oxidative stress. Vice versa increased ROS levels lead to the damage of proteins with iron sulphur centres. Thus, organisms have to coordinate and balance their responses to oxidative stress and iron availability. Our knowledge of the molecular mechanisms underlying the co-regulation of these responses remains limited. To discriminate between a direct cellular response to iron limitation and indirect responses, which are the consequence of increased levels of ROS, we compared the response of the alpha-proteobacterium Rhodobacter sphaeroides to iron limitation in the presence or absence of oxygen. Results: One third of all genes with altered expression under iron limitation showed a response that was independent of oxygen availability. The other iron-regulated genes showed different responses in oxic or anoxic conditions and were grouped into six clusters based on the different expression profiles. For two of these clusters, induction in response to iron limitation under oxic conditions was dependent on the OxyR regulatory protein. An OxyR mutant showed increased ROS production and impaired growth under iron limitation. Conclusion: Some R. sphaeroides genes respond to iron limitation irrespective of oxygen availability. These genes therefore reflect a "core iron response" that is independent of potential ROS production under oxic, iron-limiting conditions. However, the regulation of most of the iron-responsive genes was biased by oxygen availability. Most strikingly, the OxyR-dependent activation of a subset of genes upon iron limitation under oxic conditions, including many genes with a role in iron metabolism, revealed that elevated ROS levels were an important trigger for this response. OxyR thus provides a regulatory link between the responses to oxidative stress and to iron limitation in R. sphaeroides.