TY - THES A1 - Förstner, Konrad Ulrich T1 - Computational analysis of metagenomic data: delineation of compositional features and screens for desirable enzymes T1 - Computergestützte Analyse von Metagenomedate: Beschreibung von kompositionellen Eigenschaften und Suchen nach gewünschten Enzymen N2 - The topic of my doctorial research was the computational analysis of metagenomic data. A metagenome comprises the genomic information from all the microorganisms within a certain environment. The currently available metagenomic data sets cover only parts of these usually huge metagenomes due to the high technical and financial effort of such sequencing endeavors. During my thesis I developed bioinformatic tools and applied them to analyse genomic features of different metagenomic data sets and to search for enzymes of importance for biotechnology or pharmaceutical applications in those sequence collections. In these studies nine metagenomic projects (with up to 41 subsamples) were analysed. These samples originated from diverse environments like farm soil, acid mine drainage, microbial mats on whale bones, marine water, fresh water, water treatment sludges and the human gut flora. Additionally, data sets of conventionally retrieved sequence data were taken into account and compared with each other N2 - Das Thema meiner Doktorarbeit war die bioinformatische Analyse von metagenomischen Sequenzdaten. Ein Metagenom umfasst die genomische Information aller Mikroorganismen eines Biotops. Die bisher durchgeführten metagenomische Projekte sequenzierten auf Grund des technischen und finanziellen Aufwands einer solchen Unternehmung nur kleine Teile dieser im allgemeinen sehr großen Metagenome. Im Zuge meiner Doktorarbeit, die auf solchen Sequenzierungprojekten aufbaut, wurden bioinformatische Werkzeuge entwickelt und angewandt um genomische Eigenschaften verschiedener metagenomische Datensätze zu analysieren und um biotechnologisch und pharmakologisch relevante Enzyme exemplarisch in diesen Datensätzen zu suchen. In den Analysen wurden neun publizierte, metagenomische Projektedatensammlungen (teilweise mit bis zu 41 Subproben) untersucht. Die Probem stammen von zahlreichen unterschiedlichen Habitaten wie Farmerde, sauerer Minendrainage, dem mikrobiellen Belag auf Walknochen, Meerwasser, Süßwasser, Abwasseraufbereitungssschlamm und der menschlichen Darmu flora. Zusätzlich wurden in den meisten Analysen konventionell gewonnene Sequenzdaten vergleichend hinzugezogen und analysiert. KW - Bioinformatik KW - Metagenomomanalyse KW - GC-Wert KW - Enyzme KW - PKS KW - NHase KW - Nitrilase KW - Metagenomics KW - GC-value KW - enzymes KW - PKS KW - NHase KW - Nitrilase Y1 - 2008 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-33577 ER - TY - CHAP A1 - Förstner, Konrad A1 - Hagedorn, Gregor A1 - Koltzenburg, Claudia A1 - Kubke, Fabiana A1 - Mietchen, Daniel T1 - Collaborative platforms for streamlining workflows in Open Science T2 - Proceedings of the 6th Open Knowledge Conference N2 - Despite the internet's dynamic and collaborative nature, scientists continue to produce grant proposals, lab notebooks, data files, conclusions etc. that stay in static formats or are not published online and therefore not always easily accessible to the interested public. Because of limited adoption of tools that seamlessly integrate all aspects of a research project (conception, data generation, data evaluation, peerreviewing and publishing of conclusions), much effort is later spent on reproducing or reformatting individual entities before they can be repurposed independently or as parts of articles. We propose that workflows - performed both individually and collaboratively - could potentially become more efficient if all steps of the research cycle were coherently represented online and the underlying data were formatted, annotated and licensed for reuse. Such a system would accelerate the process of taking projects from conception to publication stages and allow for continuous updating of the data sets and their interpretation as well as their integration into other independent projects. A major advantage of such work ows is the increased transparency, both with respect to the scientific process as to the contribution of each participant. The latter point is important from a perspective of motivation, as it enables the allocation of reputation, which creates incentives for scientists to contribute to projects. Such work ow platforms offering possibilities to fine-tune the accessibility of their content could gradually pave the path from the current static mode of research presentation into a more coherent practice of open science. KW - Open Science KW - Virtual Research Environment KW - collaboratories KW - workflow platform KW - automation Y1 - 2011 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-101678 ER - TY - JOUR A1 - Esken, Jens A1 - Goris, Tobias A1 - Gadkari, Jennifer A1 - Bischler, Thorsten A1 - Förstner, Konrad U. A1 - Sharma, Cynthia M. A1 - Diekert, Gabriele A1 - Schubert, Torsten T1 - Tetrachloroethene respiration in Sulfurospirillum species is regulated by a two‐component system as unraveled by comparative genomics, transcriptomics, and regulator binding studies JF - MicrobiologyOpen N2 - Energy conservation via organohalide respiration (OHR) in dehalogenating Sulfurospirillum species is an inducible process. However, the gene products involved in tetrachloroethene (PCE) sensing and signal transduction have not been unambiguously identified. Here, genome sequencing of Sulfurospirillum strains defective in PCE respiration and comparative genomics, which included the PCE‐respiring representatives of the genus, uncovered the genetic inactivation of a two‐component system (TCS) in the OHR gene region of the natural mutants. The assumption that the TCS gene products serve as a PCE sensor that initiates gene transcription was supported by the constitutive low‐level expression of the TCS operon in fumarate‐adapted cells of Sulfurospirillum multivorans. Via RNA sequencing, eight transcriptional units were identified in the OHR gene region, which includes the TCS operon, the PCE reductive dehalogenase operon, the gene cluster for norcobamide biosynthesis, and putative accessory genes with unknown functions. The OmpR‐family response regulator (RR) encoded in the TCS operon was functionally characterized by promoter‐binding assays. The RR bound a cis‐regulatory element that contained a consensus sequence of a direct repeat (CTATW) separated by 17 bp. Its location either overlapping the −35 box or 50 bp further upstream indicated different regulatory mechanisms. Sequence variations in the regulator binding sites identified in the OHR gene region were in accordance with differences in the transcript levels of the respective gene clusters forming the PCE regulon. The results indicate the presence of a fine‐tuned regulatory network controlling PCE metabolism in dehalogenating Sulfurospirillum species, a group of metabolically versatile organohalide‐respiring bacteria. KW - genomics KW - organohalide respiration KW - RNA sequencing KW - tetrachloroethene KW - transcriptomics KW - two‐component system Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-225754 VL - 9 IS - 12 ER - TY - JOUR A1 - Sharan, Malvika A1 - Förstner, Konrad U. A1 - Eulalio, Ana A1 - Vogel, Jörg T1 - APRICOT: an integrated computational pipeline for the sequence-based identification and characterization of RNA-binding proteins JF - Nucleic Acids Research N2 - RNA-binding proteins (RBPs) have been established as core components of several post-transcriptional gene regulation mechanisms. Experimental techniques such as cross-linking and co-immunoprecipitation have enabled the identification of RBPs, RNA-binding domains (RBDs) and their regulatory roles in the eukaryotic species such as human and yeast in large-scale. In contrast, our knowledge of the number and potential diversity of RBPs in bacteria is poorer due to the technical challenges associated with the existing global screening approaches. We introduce APRICOT, a computational pipeline for the sequence-based identification and characterization of proteins using RBDs known from experimental studies. The pipeline identifies functional motifs in protein sequences using position-specific scoring matrices and Hidden Markov Models of the functional domains and statistically scores them based on a series of sequence-based features. Subsequently, APRICOT identifies putative RBPs and characterizes them by several biological properties. Here we demonstrate the application and adaptability of the pipeline on large-scale protein sets, including the bacterial proteome of Escherichia coli. APRICOT showed better performance on various datasets compared to other existing tools for the sequence-based prediction of RBPs by achieving an average sensitivity and specificity of 0.90 and 0.91 respectively. The command-line tool and its documentation are available at https://pypi.python.org/pypi/bio-apricot. KW - RNA-binding proteins KW - identification KW - characterization Y1 - 2017 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-157963 VL - 45 IS - 11 ER - TY - JOUR A1 - Meier, Doreen A1 - Kruse, Janis A1 - Buttlar, Jann A1 - Friedrich, Michael A1 - Zenk, Fides A1 - Boesler, Benjamin A1 - Forstner, Konrad U. A1 - Hammann, Christian A1 - Nellen, Wolfgang T1 - Analysis of the Microprocessor in Dictyostelium: The Role of RbdB, a dsRNA Binding Protein JF - PLoS Genetics N2 - We identified the dsRNA binding protein RbdB as an essential component in miRNA processing in Dictyostelium discoideum. RbdB is a nuclear protein that accumulates, together with Dicer B, in nucleolar foci reminiscent of plant dicing bodies. Disruption of rbdB results in loss of miRNAs and accumulation of primary miRNAs. The phenotype can be rescued by ectopic expression of RbdB thus allowing for a detailed analysis of domain function. The lack of cytoplasmic dsRBD proteins involved in miRNA processing, suggests that both processing steps take place in the nucleus thus resembling the plant pathway. However, we also find features e.g. in the domain structure of Dicer which suggest similarities to animals. Reduction of miRNAs in the rbdB- strain and their increase in the Argonaute A knock out allowed the definition of new miRNAs one of which appears to belong to a new non-canonical class. KW - microprocessor KW - Dictyostelium discoideum KW - dsRNA binding protein KW - RbdB Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-166687 VL - 12 IS - 6 ER - TY - JOUR A1 - Lavysh, Daria A1 - Sokolova, Maria A1 - Slashcheva, Marina A1 - Förstner, Konrad U. A1 - Severinov, Konstantin T1 - Transcription profiling of "bacillus subtilis" cells infected with AR9, a giant phage encoding two multisubunit RNA polymerases JF - mBio N2 - Bacteriophage AR9 is a recently sequenced jumbo phage that encodes two multisubunit RNA polymerases. Here we investigated the AR9 transcription strategy and the effect of AR9 infection on the transcription of its host, Bacillus subtilis. Analysis of whole-genome transcription revealed early, late, and continuously expressed AR9 genes. Alignment of sequences upstream of the 5′ ends of AR9 transcripts revealed consensus sequences that define early and late phage promoters. Continuously expressed AR9 genes have both early and late promoters in front of them. Early AR9 transcription is independent of protein synthesis and must be determined by virion RNA polymerase injected together with viral DNA. During infection, the overall amount of host mRNAs is significantly decreased. Analysis of relative amounts of host transcripts revealed notable differences in the levels of some mRNAs. The physiological significance of up- or downregulation of host genes for AR9 phage infection remains to be established. AR9 infection is significantly affected by rifampin, an inhibitor of host RNA polymerase transcription. The effect is likely caused by the antibiotic-induced killing of host cells, while phage genome transcription is solely performed by viral RNA polymerases. KW - Bacteriaophage AR9 KW - Transcription profiling Y1 - 2017 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-181810 VL - 8 IS - 1 ER - TY - JOUR A1 - Weidner, Magdalena T. A1 - Lardenoije, Roy A1 - Eijssen, Lars A1 - Mogavero, Floriana A1 - De Groodt, Lilian P. M. T. A1 - Popp, Sandy A1 - Palme, Rupert A1 - Förstner, Konrad U. A1 - Strekalova, Tatyana A1 - Steinbusch, Harry W. M. A1 - Schmitt-Böhrer, Angelika G. A1 - Glennon, Jeffrey C. A1 - Waider, Jonas A1 - van den Hove, Daniel L. A. A1 - Lesch, Klaus-Peter T1 - Identification of cholecystokinin by genome-wide profiling as potential mediator of serotonin-dependent behavioral effects of maternal separation in the amygdala JF - Frontiers in Neuroscience N2 - Converging evidence suggests a role of serotonin (5-hydroxytryptamine, 5-HT) and tryptophan hydroxylase 2 (TPH2), the rate-limiting enzyme of 5-HT synthesis in the brain, in modulating long-term, neurobiological effects of early-life adversity. Here, we aimed at further elucidating the molecular mechanisms underlying this interaction, and its consequences for socio-emotional behaviors, with a focus on anxiety and social interaction. In this study, adult, male Tph2 null mutant (Tph2\(^{-/-}\)) and heterozygous (Tph2\(^{+/-}\)) mice, and their wildtype littermates (Tph2\(^{+/+}\)) were exposed to neonatal, maternal separation (MS) and screened for behavioral changes, followed by genome-wide RNA expression and DNA methylation profiling. In Tph2\(^{-/-}\) mice, brain 5-HT deficiency profoundly affected socio-emotional behaviors, i.e., decreased avoidance of the aversive open arms in the elevated plus-maze (EPM) as well as decreased prosocial and increased rule breaking behavior in the resident-intruder test when compared to their wildtype littermates. Tph2\(^{+/-}\) mice showed an ambiguous profile with context-dependent, behavioral responses. In the EPM they showed similar avoidance of the open arm but decreased prosocial and increased rule breaking behavior in the resident-intruder test when compared to their wildtype littermates. Notably, MS effects on behavior were subtle and depended on the Tph2 genotype, in particular increasing the observed avoidance of EPM open arms in wildtype and Tph2\(^{+/-}\) mice when compared to their Tph2\(^{-/-}\) littermates. On the genomic level, the interaction of Tph2 genotype with MS differentially affected the expression of numerous genes, of which a subset showed an overlap with DNA methylation profiles at corresponding loci. Remarkably, changes in methylation nearby and expression of the gene encoding cholecystokinin, which were inversely correlated to each other, were associated with variations in anxiety-related phenotypes. In conclusion, next to various behavioral alterations, we identified gene expression and DNA methylation profiles to be associated with TPH2 inactivation and its interaction with MS, suggesting a gene-by-environment interaction-dependent, modulatory function of brain 5-HT availability. KW - serotonin KW - maternal separation KW - mouse KW - emotional behavior KW - DNA methylation KW - RNA expression Y1 - 2019 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-201340 VL - 13 ER - TY - JOUR A1 - Westermann, Alexander J. A1 - Venturini, Elisa A1 - Sellin, Mikael E. A1 - Förstner, Konrad U. A1 - Hardt, Wolf-Dietrich A1 - Vogel, Jörg T1 - The major RNA-binding protein ProQ impacts virulence gene expression in Salmonella enterica serovar Typhimurium JF - mBio N2 - FinO domain proteins such as ProQ of the model pathogen Salmonella enterica have emerged as a new class of major RNA-binding proteins in bacteria. ProQ has been shown to target hundreds of transcripts, including mRNAs from many virulence regions, but its role, if any, in bacterial pathogenesis has not been studied. Here, using a Dual RNA-seq approach to profile ProQ-dependent gene expression changes as Salmonella infects human cells, we reveal dysregulation of bacterial motility, chemotaxis, and virulence genes which is accompanied by altered MAPK (mitogen-activated protein kinase) signaling in the host. Comparison with the other major RNA chaperone in Salmonella, Hfq, reinforces the notion that these two global RNA-binding proteins work in parallel to ensure full virulence. Of newly discovered infection-associated ProQ-bound small noncoding RNAs (sRNAs), we show that the 3′UTR-derived sRNA STnc540 is capable of repressing an infection-induced magnesium transporter mRNA in a ProQ-dependent manner. Together, this comprehensive study uncovers the relevance of ProQ for Salmonella pathogenesis and highlights the importance of RNA-binding proteins in regulating bacterial virulence programs. IMPORTANCE The protein ProQ has recently been discovered as the centerpiece of a previously overlooked “third domain” of small RNA-mediated control of gene expression in bacteria. As in vitro work continues to reveal molecular mechanisms, it is also important to understand how ProQ affects the life cycle of bacterial pathogens as these pathogens infect eukaryotic cells. Here, we have determined how ProQ shapes Salmonella virulence and how the activities of this RNA-binding protein compare with those of Hfq, another central protein in RNA-based gene regulation in this and other bacteria. To this end, we apply global transcriptomics of pathogen and host cells during infection. In doing so, we reveal ProQ-dependent transcript changes in key virulence and host immune pathways. Moreover, we differentiate the roles of ProQ from those of Hfq during infection, for both coding and noncoding transcripts, and provide an important resource for those interested in ProQ-dependent small RNAs in enteric bacteria. KW - Hfq KW - noncoding RNA KW - ProQ KW - RNA-seq KW - bacterial pathogen KW - posttranscriptional control Y1 - 2019 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-177722 VL - 10 IS - 1 ER - TY - JOUR A1 - Babski, Julia A1 - Haas, Karina A. A1 - Näther-Schindler, Daniela A1 - Pfeiffer, Friedhelm A1 - Förstner, Konrad U. A1 - Hammelmann, Matthias A1 - Hilker, Rolf A1 - Becker, Anke A1 - Sharma, Cynthia M. A1 - Marchfelder, Anita A1 - Soppa, Jörg T1 - Genome-wide identification of transcriptional start sites in the haloarchaeon Haloferax volcanii based on differential RNA-Seq (dRNA-Seq) JF - BMC Genomics N2 - Background Differential RNA-Seq (dRNA-Seq) is a recently developed method of performing primary transcriptome analyses that allows for the genome-wide mapping of transcriptional start sites (TSSs) and the identification of novel transcripts. Although the transcriptomes of diverse bacterial species have been characterized by dRNA-Seq, the transcriptome analysis of archaeal species is still rather limited. Therefore, we used dRNA-Seq to characterize the primary transcriptome of the model archaeon Haloferax volcanii. Results Three independent cultures of Hfx. volcanii grown under optimal conditions to the mid-exponential growth phase were used to determine the primary transcriptome and map the 5′-ends of the transcripts. In total, 4749 potential TSSs were detected. A position weight matrix (PWM) was derived for the promoter predictions, and the results showed that 64 % of the TSSs were preceded by stringent or relaxed basal promoters. Of the identified TSSs, 1851 belonged to protein-coding genes. Thus, fewer than half (46 %) of the 4040 protein-coding genes were expressed under optimal growth conditions. Seventy-two percent of all protein-coding transcripts were leaderless, which emphasized that this pathway is the major pathway for translation initiation in haloarchaea. A total of 2898 of the TSSs belonged to potential non-coding RNAs, which accounted for an unexpectedly high fraction (61 %) of all transcripts. Most of the non-coding TSSs had not been previously described (2792) and represented novel sequences (59 % of all TSSs). A large fraction of the potential novel non-coding transcripts were cis-antisense RNAs (1244 aTSSs). A strong negative correlation between the levels of antisense transcripts and cognate sense mRNAs was found, which suggested that the negative regulation of gene expression via antisense RNAs may play an important role in haloarchaea. The other types of novel non-coding transcripts corresponded to internal transcripts overlapping with mRNAs (1153 iTSSs) and intergenic small RNA (sRNA) candidates (395 TSSs). Conclusion This study provides a comprehensive map of the primary transcriptome of Hfx. volcanii grown under optimal conditions. Fewer than half of all protein-coding genes have been transcribed under these conditions. Unexpectedly, more than half of the detected TSSs belonged to several classes of non-coding RNAs. Thus, RNA-based regulation appears to play a more important role in haloarchaea than previously anticipated. KW - Archaea KW - dRNA-Seq KW - Promoter KW - Non-coding RNAs KW - sRNA KW - Haloferax volcanii KW - Transcriptome KW - Leaderless transcript KW - Antisense RNA Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-164553 VL - 17 IS - 629 ER - TY - JOUR A1 - Čuklina, Jelena A1 - Hahn, Julia A1 - Imakaev, Maxim A1 - Omasits, Ulrich A1 - Förstner, Konrad U. A1 - Ljubimov, Nikolay A1 - Goebel, Melanie A1 - Pessi, Gabriella A1 - Fischer, Hans-Martin A1 - Ahrens, Christian H. A1 - Gelfand, Mikhail S. A1 - Evguenieva-Hackenberg, Elena T1 - Genome-wide transcription start site mapping of Bradyrhizobium japonicum grown free-living or in symbiosis - a rich resource to identify new transcripts, proteins and to study gene regulation JF - BMC Genomics N2 - Background Differential RNA-sequencing (dRNA-seq) is indispensable for determination of primary transcriptomes. However, using dRNA-seq data to map transcriptional start sites (TSSs) and promoters genome-wide is a bioinformatics challenge. We performed dRNA-seq of Bradyrhizobium japonicum USDA 110, the nitrogen-fixing symbiont of soybean, and developed algorithms to map TSSs and promoters. Results A specialized machine learning procedure for TSS recognition allowed us to map 15,923 TSSs: 14,360 in free-living bacteria, 4329 in symbiosis with soybean and 2766 in both conditions. Further, we provide proteomic evidence for 4090 proteins, among them 107 proteins corresponding to new genes and 178 proteins with N-termini different from the existing annotation (72 and 109 of them with TSS support, respectively). Guided by proteomics evidence, previously identified TSSs and TSSs experimentally validated here, we assign a score threshold to flag 14 % of the mapped TSSs as a class of lower confidence. However, this class of lower confidence contains valid TSSs of low-abundant transcripts. Moreover, we developed a de novo algorithm to identify promoter motifs upstream of mapped TSSs, which is publicly available, and found motifs mainly used in symbiosis (similar to RpoN-dependent promoters) or under both conditions (similar to RpoD-dependent promoters). Mapped TSSs and putative promoters, proteomic evidence and updated gene annotation were combined into an annotation file. Conclusions The genome-wide TSS and promoter maps along with the extended genome annotation of B. japonicum represent a valuable resource for future systems biology studies and for detailed analyses of individual non-coding transcripts and ORFs. Our data will also provide new insights into bacterial gene regulation during the agriculturally important symbiosis between rhizobia and legumes. KW - Bradyrhizobium KW - RNA-seq KW - Promoter prediction KW - Genome re-annotation KW - Internal transcription start site KW - Nodule KW - Transcription start site KW - Proteogenomics KW - Antisense RNA Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-164565 VL - 17 ER -