TY - JOUR A1 - Sharan, Malvika A1 - Förstner, Konrad U. A1 - Eulalio, Ana A1 - Vogel, Jörg T1 - APRICOT: an integrated computational pipeline for the sequence-based identification and characterization of RNA-binding proteins JF - Nucleic Acids Research N2 - RNA-binding proteins (RBPs) have been established as core components of several post-transcriptional gene regulation mechanisms. Experimental techniques such as cross-linking and co-immunoprecipitation have enabled the identification of RBPs, RNA-binding domains (RBDs) and their regulatory roles in the eukaryotic species such as human and yeast in large-scale. In contrast, our knowledge of the number and potential diversity of RBPs in bacteria is poorer due to the technical challenges associated with the existing global screening approaches. We introduce APRICOT, a computational pipeline for the sequence-based identification and characterization of proteins using RBDs known from experimental studies. The pipeline identifies functional motifs in protein sequences using position-specific scoring matrices and Hidden Markov Models of the functional domains and statistically scores them based on a series of sequence-based features. Subsequently, APRICOT identifies putative RBPs and characterizes them by several biological properties. Here we demonstrate the application and adaptability of the pipeline on large-scale protein sets, including the bacterial proteome of Escherichia coli. APRICOT showed better performance on various datasets compared to other existing tools for the sequence-based prediction of RBPs by achieving an average sensitivity and specificity of 0.90 and 0.91 respectively. The command-line tool and its documentation are available at https://pypi.python.org/pypi/bio-apricot. KW - RNA-binding proteins KW - identification KW - characterization Y1 - 2017 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-157963 VL - 45 IS - 11 ER - TY - JOUR A1 - Sass, Andrea M. A1 - Van Acker, Heleen A1 - Förstner, Konrad U. A1 - Van Nieuwerburgh, Filip A1 - Deforce, Dieter A1 - Vogel, Jörg A1 - Coenye, Tom T1 - Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315 JF - BMC Genomics N2 - Background: Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content, the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. Methods: RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. Results: Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start site of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Conclusions: Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation. KW - persistence KW - genomic islands KW - pathogen KW - identification KW - bacteria KW - small RNAs KW - translation initiation KW - cepedia complex KW - global gene expression KW - SEQ KW - resistance KW - burkholderia cenocepacia KW - biofilms KW - dRNA-Seq KW - transcription start site KW - antisense RNA Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-139748 VL - 16 IS - 775 ER - TY - JOUR A1 - Wagner, Ines A1 - Volkmer, Michael A1 - Sharan, Malvika A1 - Villaveces, Jose M. A1 - Oswald, Felix A1 - Surendranath, Vineeth A1 - Habermann, Bianca H. T1 - morFeus: a web-based program to detect remotely conserved orthologs using symmetrical best hits and orthology network scoring JF - BMC Bioinformatics N2 - Background: Searching the orthologs of a given protein or DNA sequence is one of the most important and most commonly used Bioinformatics methods in Biology. Programs like BLAST or the orthology search engine Inparanoid can be used to find orthologs when the similarity between two sequences is sufficiently high. They however fail when the level of conservation is low. The detection of remotely conserved proteins oftentimes involves sophisticated manual intervention that is difficult to automate. Results: Here, we introduce morFeus, a search program to find remotely conserved orthologs. Based on relaxed sequence similarity searches, morFeus selects sequences based on the similarity of their alignments to the query, tests for orthology by iterative reciprocal BLAST searches and calculates a network score for the resulting network of orthologs that is a measure of orthology independent of the E-value. Detecting remotely conserved orthologs of a protein using morFeus thus requires no manual intervention. We demonstrate the performance of morFeus by comparing it to state-of-the-art orthology resources and methods. We provide an example of remotely conserved orthologs, which were experimentally shown to be functionally equivalent in the respective organisms and therefore meet the criteria of the orthology-function conjecture. Conclusions: Based on our results, we conclude that morFeus is a powerful and specific search method for detecting remotely conserved orthologs. KW - reciprocal best hit KW - finder using symmetrical best hits KW - sequences KW - annotation KW - identification KW - database KW - genomes KW - proteins KW - homologs KW - hidden markov-models KW - phylogenetic trees KW - PSI-blast KW - eigenvector centrality KW - meta-analysis based orthology KW - orthology KW - remote sequence conservation KW - alignment clustering KW - orthology network Y1 - 2014 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-115590 VL - 15 IS - 263 ER - TY - JOUR A1 - Wang, Huiqiang A1 - Chen, Nanhai G. A1 - Minev, Boris R. A1 - Szalay, Aladar A. T1 - Oncolytic vaccinia virus GLV-1h68 strain shows enhanced replication in human breast cancer stem-like cells in comparison to breast cancer cells JF - Journal of Translational Medicine N2 - Background: Recent data suggest that cancer stem cells (CSCs) play an important role in cancer, as these cells possess enhanced tumor-forming capabilities and are responsible for relapses after apparently curative therapies have been undertaken. Hence, novel cancer therapies will be needed to test for both tumor regression and CSC targeting. The use of oncolytic vaccinia virus (VACV) represents an attractive anti-tumor approach and is currently under evaluation in clinical trials. The purpose of this study was to demonstrate whether VACV does kill CSCs that are resistant to irradiation and chemotherapy. Methods: Cancer stem-like cells were identified and separated from the human breast cancer cell line GI-101A by virtue of increased aldehyde dehydrogenase 1 (ALDH1) activity as assessed by the ALDEFLUOR assay and cancer stem cell-like features such as chemo-resistance, irradiation-resistance and tumor-initiating were confirmed in cell culture and in animal models. VACV treatments were applied to both ALDEFLUOR-positive cells in cell culture and in xenograft tumors derived from these cells. Moreover, we identified and isolated CD44\(^+\)CD24\(^+\)ESA\(^+\) cells from GI-101A upon an epithelial-mesenchymal transition (EMT). These cells were similarly characterized both in cell culture and in animal models. Results: We demonstrated for the first time that the oncolytic VACV GLV-1h68 strain replicated more efficiently in cells with higher ALDH1 activity that possessed stem cell-like features than in cells with lower ALDH1 activity. GLV-1h68 selectively colonized and eventually eradicated xenograft tumors originating from cells with higher ALDH1 activity. Furthermore, GLV-1h68 also showed preferential replication in CD44\(^+\)CD24\(^+\)ESA\(^+\) cells derived from GI-101A upon an EMT induction as well as in xenograft tumors originating from these cells that were more tumorigenic than CD44\(^+\)CD24\(^-\)ESA\(^+\) cells. Conclusions: Taken together, our findings indicate that GLV-1h68 efficiently replicates and kills cancer stem-like cells. Thus, GLV-1h68 may become a promising agent for eradicating both primary and metastatic tumors, especially tumors harboring cancer stem-like cells that are resistant to chemo and/or radiotherapy and may be responsible for recurrence of tumors. KW - tumors KW - therapy KW - metastasis KW - identification KW - lines KW - gene expression KW - in-vitro propagation KW - acute myeloid leukemia KW - epithelial-mesenchymal transition KW - subpopulation Y1 - 2012 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-130019 VL - 10 IS - 167 ER - TY - JOUR A1 - Feller, Tatjana A1 - Thom, Pascal A1 - Koch, Natalie A1 - Spiegel, Holger A1 - Addai-Mensah, Otchere A1 - Fischer, Rainer A1 - Reimann, Andreas A1 - Pradel, Gabriele A1 - Fendel, Rolf A1 - Schillberg, Stefan A1 - Scheuermayer, Matthias A1 - Schinkel, Helga T1 - Plant-Based Production of Recombinant Plasmodium Surface Protein Pf38 and Evaluation of its Potential as a Vaccine Candidate JF - PLOS ONE N2 - Pf38 is a surface protein of the malarial parasite Plasmodium falciparum. In this study, we produced and purified recombinant Pf38 and a fusion protein composed of red fluorescent protein and Pf38 (RFP-Pf38) using a transient expression system in the plant Nicotiana benthamiana. To our knowledge, this is the first description of the production of recombinant Pf38. To verify the quality of the recombinant Pf38, plasma from semi-immune African donors was used to confirm specific binding to Pf38. ELISA measurements revealed that immune responses to Pf38 in this African subset were comparable to reactivities to AMA-1 and \(MSP1_{19}\). Pf38 and RFP-Pf38 were successfully used to immunise mice, although titres from these mice were low (on average 1:11.000 and 1:39.000, respectively). In immune fluorescence assays, the purified IgG fraction from the sera of immunised mice recognised Pf38 on the surface of schizonts, gametocytes, macrogametes and zygotes, but not sporozoites. Growth inhibition assays using \(\alpha Pf38\) antibodies demonstrated strong inhibition \((\geq 60 \% ) \) of the growth of blood-stage P. falciparum. The development of zygotes was also effectively inhibited by \(\alpha Pf38\) antibodies, as determined by the zygote development assay. Collectively, these results suggest that Pf38 is an interesting candidate for the development of a malaria vaccine. KW - malaria vaccine KW - balancing selection KW - N-glycans KW - falciparum KW - expression KW - antibodies KW - identification KW - transmission KW - tobacco KW - antigen Y1 - 2013 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-128221 SN - 1932-6203 VL - 8 IS - 11 ER - TY - JOUR A1 - Okoro, Chinyere K. A1 - Barquist, Lars A1 - Connor, Thomas R. A1 - Harris, Simon R. A1 - Clare, Simon A1 - Stevens, Mark P. A1 - Arends, Mark J. A1 - Hale, Christine A1 - Kane, Leanne A1 - Pickard, Derek J. A1 - Hill, Jennifer A1 - Harcourt, Katherine A1 - Parkhill, Julian A1 - Dougan, Gordon A1 - Kingsley, Robert A. T1 - Signatures of adaptation in human invasive Salmonella Typhimurium ST313 populations from sub-Saharan Africa JF - PLoS Neglected Tropical Diseases N2 - Two lineages of Salmonella enterica serovar Typhimurium (S. Typhimurium) of multi-locus sequence type ST313 have been linked with the emergence of invasive Salmonella disease across sub-Saharan Africa. The expansion of these lineages has a temporal association with the HIV pandemic and antibiotic usage. We analysed the whole genome sequence of 129 ST313 isolates representative of the two lineages and found evidence of lineage-specific genome degradation, with some similarities to that observed in S. Typhi. Individual ST313 S. Typhimurium isolates exhibit a distinct metabolic signature and modified enteropathogenesis in both a murine and cattle model of colitis, compared to S. Typhimurium outside of the ST313 lineages. These data define phenotypes that distinguish ST313 isolates from other S. Typhimurium and may represent adaptation to a distinct pathogenesis and lifestyle linked to an-immuno-compromised human population. KW - genome sequence KW - infection KW - pathogenicity KW - children KW - disease KW - adults KW - identification KW - Escherichia coli KW - virulence Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-143779 VL - 9 IS - 3 ER -