TY - JOUR A1 - Barquist, Lars A1 - Mayho, Matthew A1 - Cummins, Carla A1 - Cain, Amy K. A1 - Boinett, Christine J. A1 - Page, Andrew J. A1 - Langridge, Gemma C. A1 - Quail, Michael A. A1 - Keane, Jacqueline A. A1 - Parkhill, Julian T1 - The TraDIS toolkit: sequencing and analysis for dense transposon mutant libraries JF - Bioinformatics N2 - Transposon insertion sequencing is a high-throughput technique for assaying large libraries of otherwise isogenic transposon mutants providing insight into gene essentiality, gene function and genetic interactions. We previously developed the Transposon Directed Insertion Sequencing (TraDIS) protocol for this purpose, which utilizes shearing of genomic DNA followed by specific PCR amplification of transposon-containing fragments and Illumina sequencing. Here we describe an optimized high-yield library preparation and sequencing protocol for TraDIS experiments and a novel software pipeline for analysis of the resulting data. The Bio-Tradis analysis pipeline is implemented as an extensible Perl library which can either be used as is, or as a basis for the development of more advanced analysis tools. This article can serve as a general reference for the application of the TraDIS methodology. KW - mechanisms KW - Transposon insertion sequencing KW - sequencing protocol KW - TraDIS Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-189667 VL - 32 IS - 7 ER - TY - JOUR A1 - Bauriedl, Saskia A1 - Gerovac, Milan A1 - Heidrich, Nadja A1 - Bischler, Thorsten A1 - Barquist, Lars A1 - Vogel, Jörg A1 - Schoen, Christoph T1 - The minimal meningococcal ProQ protein has an intrinsic capacity for structure-based global RNA recognition JF - Nature Communications N2 - FinO-domain proteins are a widespread family of bacterial RNA-binding proteins with regulatory functions. Their target spectrum ranges from a single RNA pair, in the case of plasmid-encoded FinO, to global RNA regulons, as with enterobacterial ProQ. To assess whether the FinO domain itself is intrinsically selective or promiscuous, we determine in vivo targets of Neisseria meningitidis, which consists of solely a FinO domain. UV-CLIP-seq identifies associations with 16 small non-coding sRNAs and 166 mRNAs. Meningococcal ProQ predominantly binds to highly structured regions and generally acts to stabilize its RNA targets. Loss of ProQ alters transcript levels of >250 genes, demonstrating that this minimal ProQ protein impacts gene expression globally. Phenotypic analyses indicate that ProQ promotes oxidative stress resistance and DNA damage repair. We conclude that FinO domain proteins recognize some abundant type of RNA shape and evolve RNA binding selectivity through acquisition of additional regions that constrain target recognition. FinO-domain proteins are bacterial RNA-binding proteins with a wide range of target specificities. Here, the authors employ UV CLIP-seq and show that minimal ProQ protein of Neisseria meningitidis binds to various small non-coding RNAs and mRNAs involved in virulence. KW - Neisseria meningitidis KW - natural transformation KW - dual function KW - FinO family KW - HFQ KW - chaperone KW - transcriptome KW - regulator KW - sequence KW - in vivo Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-230040 VL - 11 ER - TY - JOUR A1 - Dembek, Marcin A1 - Barquist, Lars A1 - Boinett, Christine J. A1 - Cain, Amy K. A1 - Mayho, Matthew A1 - Lawley, Trevor D. A1 - Fairweather, Neil F. A1 - Fagan, Robert P. T1 - High-throughput analysis of gene essentiality and sporulation in Clostridium difficile JF - mBio N2 - Clostridium difficile is the most common cause of antibiotic-associated intestinal infections and a significant cause of morbidity and mortality. Infection with C. difficile requires disruption of the intestinal microbiota, most commonly by antibiotic usage. Therapeutic intervention largely relies on a small number of broad-spectrum antibiotics, which further exacerbate intestinal dysbiosis and leave the patient acutely sensitive to reinfection. Development of novel targeted therapeutic interventions will require a detailed knowledge of essential cellular processes, which represent attractive targets, and species-specific processes, such as bacterial sporulation. Our knowledge of the genetic basis of C. difficile infection has been hampered by a lack of genetic tools, although recent developments have made some headway in addressing this limitation. Here we describe the development of a method for rapidly generating large numbers of transposon mutants in clinically important strains of C. difficile. We validated our transposon mutagenesis approach in a model strain of C. difficile and then generated a comprehensive transposon library in the highly virulent epidemic strain R20291 (027/BI/NAP1) containing more than 70,000 unique mutants. Using transposon-directed insertion site sequencing (TraDIS), we have identified a core set of 404 essential genes, required for growth in vitro. We then applied this technique to the process of sporulation, an absolute requirement for C. difficile transmission and pathogenesis, identifying 798 genes that are likely to impact spore production. The data generated in this study will form a valuable resource for the community and inform future research on this important human pathogen. KW - Bacillus subtilis KW - expression KW - spores KW - toxin KW - transcription KW - germination KW - transposition KW - metabolism KW - infection KW - in vitro Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-143745 VL - 6 IS - 2 ER - TY - JOUR A1 - Heidrich, Nadja A1 - Bauriedl, Saskia A1 - Barquist, Lars A1 - Li, Lei A1 - Schoen, Christoph A1 - Vogel, Jörg T1 - The primary transcriptome of Neisseria meningitidis and its interaction with the RNA chaperone Hfq JF - Nucleic Acids Research N2 - Neisseria meningitidis is a human commensal that can also cause life-threatening meningitis and septicemia. Despite growing evidence for RNA-based regulation in meningococci, their transcriptome structure and output of regulatory small RNAs (sRNAs) are incompletely understood. Using dRNA-seq, we have mapped at single-nucleotide resolution the primary transcriptome of N. meningitidis strain 8013. Annotation of 1625 transcriptional start sites defines transcription units for most protein-coding genes but also reveals a paucity of classical σ70-type promoters, suggesting the existence of activators that compensate for the lack of −35 consensus sequences in N. meningitidis. The transcriptome maps also reveal 65 candidate sRNAs, a third of which were validated by northern blot analysis. Immunoprecipitation with the RNA chaperone Hfq drafts an unexpectedly large post-transcriptional regulatory network in this organism, comprising 23 sRNAs and hundreds of potential mRNA targets. Based on this data, using a newly developed gfp reporter system we validate an Hfq-dependent mRNA repression of the putative colonization factor PrpB by the two trans-acting sRNAs RcoF1/2. Our genome-wide RNA compendium will allow for a better understanding of meningococcal transcriptome organization and riboregulation with implications for colonization of the human nasopharynx. KW - RNA KW - Neisseria meningitidis KW - dRNA-seq KW - transcriptome KW - RNA chaperone Hfq Y1 - 2017 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-170828 VL - 45 IS - 10 ER - TY - JOUR A1 - Homberger, Christina A1 - Barquist, Lars A1 - Vogel, Jörg T1 - Ushering in a new era of single-cell transcriptomics in bacteria JF - microLife N2 - Transcriptome analysis of individual cells by single-cell RNA-seq (scRNA-seq) has become routine for eukaryotic tissues, even being applied to whole multicellular organisms. In contrast, developing methods to read the transcriptome of single bacterial cells has proven more challenging, despite a general perception of bacteria as much simpler than eukaryotes. Bacterial cells are harder to lyse, their RNA content is about two orders of magnitude lower than that of eukaryotic cells, and bacterial mRNAs are less stable than their eukaryotic counterparts. Most importantly, bacterial transcripts lack functional poly(A) tails, precluding simple adaptation of popular standard eukaryotic scRNA-seq protocols that come with the double advantage of specific mRNA amplification and concomitant depletion of rRNA. However, thanks to very recent breakthroughs in methodology, bacterial scRNA-seq is now feasible. This short review will discuss recently published bacterial scRNA-seq approaches (MATQ-seq, microSPLiT, and PETRI-seq) and a spatial transcriptomics approach based on multiplexed in situ hybridization (par-seqFISH). Together, these novel approaches will not only enable a new understanding of cell-to-cell variation in bacterial gene expression, they also promise a new microbiology by enabling high-resolution profiling of gene activity in complex microbial consortia such as the microbiome or pathogens as they invade, replicate, and persist in host tissue. KW - single-cell RNA-seq KW - heterogeneity KW - microSPLiT KW - PETRI-seq KW - MATQ-seq KW - par-seqFISH Y1 - 2022 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-313292 VL - 3 ER - TY - JOUR A1 - Homberger, Christina A1 - Hayward, Regan J. A1 - Barquist, Lars A1 - Vogel, Jörg T1 - Improved bacterial single-cell RNA-seq through automated MATQ-seq and Cas9-based removal of rRNA reads JF - mBio N2 - Bulk RNA sequencing technologies have provided invaluable insights into host and bacterial gene expression and associated regulatory networks. Nevertheless, the majority of these approaches report average expression across cell populations, hiding the true underlying expression patterns that are often heterogeneous in nature. Due to technical advances, single-cell transcriptomics in bacteria has recently become reality, allowing exploration of these heterogeneous populations, which are often the result of environmental changes and stressors. In this work, we have improved our previously published bacterial single-cell RNA sequencing (scRNA-seq) protocol that is based on multiple annealing and deoxycytidine (dC) tailing-based quantitative scRNA-seq (MATQ-seq), achieving a higher throughput through the integration of automation. We also selected a more efficient reverse transcriptase, which led to reduced cell loss and higher workflow robustness. Moreover, we successfully implemented a Cas9-based rRNA depletion protocol into the MATQ-seq workflow. Applying our improved protocol on a large set of single Salmonella cells sampled over different growth conditions revealed improved gene coverage and a higher gene detection limit compared to our original protocol and allowed us to detect the expression of small regulatory RNAs, such as GcvB or CsrB at a single-cell level. In addition, we confirmed previously described phenotypic heterogeneity in Salmonella in regard to expression of pathogenicity-associated genes. Overall, the low percentage of cell loss and high gene detection limit makes the improved MATQ-seq protocol particularly well suited for studies with limited input material, such as analysis of small bacterial populations in host niches or intracellular bacteria. IMPORTANCE: Gene expression heterogeneity among isogenic bacteria is linked to clinically relevant scenarios, like biofilm formation and antibiotic tolerance. The recent development of bacterial single-cell RNA sequencing (scRNA-seq) enables the study of cell-to-cell variability in bacterial populations and the mechanisms underlying these phenomena. Here, we report a scRNA-seq workflow based on MATQ-seq with increased robustness, reduced cell loss, and improved transcript capture rate and gene coverage. Use of a more efficient reverse transcriptase and the integration of an rRNA depletion step, which can be adapted to other bacterial single-cell workflows, was instrumental for these improvements. Applying the protocol to the foodborne pathogen Salmonella, we confirmed transcriptional heterogeneity across and within different growth phases and demonstrated that our workflow captures small regulatory RNAs at a single-cell level. Due to low cell loss and high transcript capture rates, this protocol is uniquely suited for experimental settings in which the starting material is limited, such as infected tissues. KW - MATQ-seq KW - single-cell RNA-seq KW - Salmonella enterica KW - rRNA depletion KW - gene expression heterogeneity KW - DASH KW - Cas9 Y1 - 2023 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-350059 VL - 14 IS - 2 ER - TY - JOUR A1 - Lindgreen, Stinus A1 - Umu, Sinan Uğur A1 - Lai, Alicia Sook-Wei A1 - Eldai, Hisham A1 - Liu, Wenting A1 - McGimpsey, Stephanie A1 - Wheeler, Nicole E. A1 - Biggs, Patrick J. A1 - Thomson, Nick R. A1 - Barquist, Lars A1 - Poole, Anthony M. A1 - Gardner, Paul P. T1 - Robust Identification of Noncoding RNA from Transcriptomes Requires Phylogenetically-Informed Sampling JF - PLOS Computational Biology N2 - Noncoding RNAs are integral to a wide range of biological processes, including translation, gene regulation, host-pathogen interactions and environmental sensing. While genomics is now a mature field, our capacity to identify noncoding RNA elements in bacterial and archaeal genomes is hampered by the difficulty of de novo identification. The emergence of new technologies for characterizing transcriptome outputs, notably RNA-seq, are improving noncoding RNA identification and expression quantification. However, a major challenge is to robustly distinguish functional outputs from transcriptional noise. To establish whether annotation of existing transcriptome data has effectively captured all functional outputs, we analysed over 400 publicly available RNA-seq datasets spanning 37 different Archaea and Bacteria. Using comparative tools, we identify close to a thousand highly-expressed candidate noncoding RNAs. However, our analyses reveal that capacity to identify noncoding RNA outputs is strongly dependent on phylogenetic sampling. Surprisingly, and in stark contrast to protein-coding genes, the phylogenetic window for effective use of comparative methods is perversely narrow: aggregating public datasets only produced one phylogenetic cluster where these tools could be used to robustly separate unannotated noncoding RNAs from a null hypothesis of transcriptional noise. Our results show that for the full potential of transcriptomics data to be realized, a change in experimental design is paramount: effective transcriptomics requires phylogeny-aware sampling. KW - protein families database KW - small nucleolar RNAs KW - bacterial genomes KW - comparative genomics KW - dark-matter KW - homology search KW - archaea KW - sequence KW - alignment KW - insights Y1 - 2014 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-115259 VL - 10 IS - 10 ER - TY - JOUR A1 - Michaux, Charlotte A1 - Gerovac, Milan A1 - Hansen, Elisabeth E. A1 - Barquist, Lars A1 - Vogel, Jörg T1 - Grad-seq analysis of Enterococcus faecalis and Enterococcus faecium provides a global view of RNA and protein complexes in these two opportunistic pathogens JF - microLife N2 - Enterococcus faecalis and Enterococcus faecium are major nosocomial pathogens. Despite their relevance to public health and their role in the development of bacterial antibiotic resistance, relatively little is known about gene regulation in these species. RNA–protein complexes serve crucial functions in all cellular processes associated with gene expression, including post-transcriptional control mediated by small regulatory RNAs (sRNAs). Here, we present a new resource for the study of enterococcal RNA biology, employing the Grad-seq technique to comprehensively predict complexes formed by RNA and proteins in E. faecalis V583 and E. faecium AUS0004. Analysis of the generated global RNA and protein sedimentation profiles led to the identification of RNA–protein complexes and putative novel sRNAs. Validating our data sets, we observe well-established cellular RNA–protein complexes such as the 6S RNA–RNA polymerase complex, suggesting that 6S RNA-mediated global control of transcription is conserved in enterococci. Focusing on the largely uncharacterized RNA-binding protein KhpB, we use the RIP-seq technique to predict that KhpB interacts with sRNAs, tRNAs, and untranslated regions of mRNAs, and might be involved in the processing of specific tRNAs. Collectively, these datasets provide departure points for in-depth studies of the cellular interactome of enterococci that should facilitate functional discovery in these and related Gram-positive species. Our data are available to the community through a user-friendly Grad-seq browser that allows interactive searches of the sedimentation profiles (https://resources.helmholtz-hiri.de/gradseqef/). KW - Enterococcus faecalis KW - Enterococcus faecium KW - Grad-seq KW - KhpB protein Y1 - 2023 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-313311 VL - 4 ER - TY - JOUR A1 - Michaux, Charlotte A1 - Hansen, Elisabeth E. A1 - Jenniches, Laura A1 - Gerovac, Milan A1 - Barquist, Lars A1 - Vogel, Jörg T1 - Single-Nucleotide RNA Maps for the Two Major Nosocomial Pathogens Enterococcus faecalis and Enterococcus faecium JF - Frontiers in Cellular and Infection Microbiology N2 - Enterococcus faecalis and faecium are two major representative clinical strains of the Enterococcus genus and are sadly notorious to be part of the top agents responsible for nosocomial infections. Despite their critical implication in worldwide public healthcare, essential and available resources such as deep transcriptome annotations remain poor, which also limits our understanding of post-transcriptional control small regulatory RNA (sRNA) functions in these bacteria. Here, using the dRNA-seq technique in combination with ANNOgesic analysis, we successfully mapped and annotated transcription start sites (TSS) of both E. faecalis V583 and E. faecium AUS0004 at single nucleotide resolution. Analyzing bacteria in late exponential phase, we capture ~40% (E. faecalis) and 43% (E. faecium) of the annotated protein-coding genes, determine 5′ and 3′ UTR (untranslated region) length, and detect instances of leaderless mRNAs. The transcriptome maps revealed sRNA candidates in both bacteria, some found in previous studies and new ones. Expression of candidate sRNAs is being confirmed under biologically relevant environmental conditions. This comprehensive global TSS mapping atlas provides a valuable resource for RNA biology and gene expression analysis in the Enterococci. It can be accessed online at www.helmholtz-hiri.de/en/datasets/enterococcus through an instance of the genomic viewer JBrowse. KW - transcription start sites KW - RNA-seq KW - sRNA atlas KW - Gram-positive bacteria KW - post-transcriptional regulation Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-217947 SN - 2235-2988 VL - 10 ER - TY - JOUR A1 - Okoro, Chinyere K. A1 - Barquist, Lars A1 - Connor, Thomas R. A1 - Harris, Simon R. A1 - Clare, Simon A1 - Stevens, Mark P. A1 - Arends, Mark J. A1 - Hale, Christine A1 - Kane, Leanne A1 - Pickard, Derek J. A1 - Hill, Jennifer A1 - Harcourt, Katherine A1 - Parkhill, Julian A1 - Dougan, Gordon A1 - Kingsley, Robert A. T1 - Signatures of adaptation in human invasive Salmonella Typhimurium ST313 populations from sub-Saharan Africa JF - PLoS Neglected Tropical Diseases N2 - Two lineages of Salmonella enterica serovar Typhimurium (S. Typhimurium) of multi-locus sequence type ST313 have been linked with the emergence of invasive Salmonella disease across sub-Saharan Africa. The expansion of these lineages has a temporal association with the HIV pandemic and antibiotic usage. We analysed the whole genome sequence of 129 ST313 isolates representative of the two lineages and found evidence of lineage-specific genome degradation, with some similarities to that observed in S. Typhi. Individual ST313 S. Typhimurium isolates exhibit a distinct metabolic signature and modified enteropathogenesis in both a murine and cattle model of colitis, compared to S. Typhimurium outside of the ST313 lineages. These data define phenotypes that distinguish ST313 isolates from other S. Typhimurium and may represent adaptation to a distinct pathogenesis and lifestyle linked to an-immuno-compromised human population. KW - genome sequence KW - infection KW - pathogenicity KW - children KW - disease KW - adults KW - identification KW - Escherichia coli KW - virulence Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-143779 VL - 9 IS - 3 ER - TY - JOUR A1 - Prezza, Gianluca A1 - Ryan, Daniel A1 - Mädler, Gohar A1 - Reichardt, Sarah A1 - Barquist, Lars A1 - Westermann, Alexander J. T1 - Comparative genomics provides structural and functional insights into Bacteroides RNA biology JF - Molecular Microbiology N2 - Bacteria employ noncoding RNA molecules for a wide range of biological processes, including scaffolding large molecular complexes, catalyzing chemical reactions, defending against phages, and controlling gene expression. Secondary structures, binding partners, and molecular mechanisms have been determined for numerous small noncoding RNAs (sRNAs) in model aerobic bacteria. However, technical hurdles have largely prevented analogous analyses in the anaerobic gut microbiota. While experimental techniques are being developed to investigate the sRNAs of gut commensals, computational tools and comparative genomics can provide immediate functional insight. Here, using Bacteroides thetaiotaomicron as a representative microbiota member, we illustrate how comparative genomics improves our understanding of RNA biology in an understudied gut bacterium. We investigate putative RNA-binding proteins and predict a Bacteroides cold-shock protein homolog to have an RNA-related function. We apply an in silico protocol incorporating both sequence and structural analysis to determine the consensus structures and conservation of nine Bacteroides noncoding RNA families. Using structure probing, we validate and refine these predictions and deposit them in the Rfam database. Through synteny analyses, we illustrate how genomic coconservation can serve as a predictor of sRNA function. Altogether, this work showcases the power of RNA informatics for investigating the RNA biology of anaerobic microbiota members. KW - BT_1884 KW - cold-shock protein KW - GibS KW - RNA-binding proteins KW - secondary structure KW - 6S RNA Y1 - 2022 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-259594 VL - 117 IS - 1 ER - TY - JOUR A1 - Read, Hannah M. A1 - Mills, Grant A1 - Johnson, Sarah A1 - Tsai, Peter A1 - Dalton, James A1 - Barquist, Lars A1 - Print, Cristin G. A1 - Patrick, Wayne M. A1 - Wiles, Siouxsie T1 - The in vitro and in vivo effects of constitutive light expression on a bioluminescent strain of the mouse enteropathogen Citrobacter rodentium JF - PeerJ N2 - Bioluminescent reporter genes, such as those from fireflies and bacteria, let researchers use light production as a non-invasive and non-destructive surrogate measure of microbial numbers in a wide variety of environments. As bioluminescence needs microbial metabolites, tagging microorganisms with luciferases means only live metabolically active cells are detected. Despite the wide use of bioluminescent reporter genes, very little is known about the impact of continuous (also called constitutive) light expression on tagged bacteria. We have previously made a bioluminescent strain of Citrobacter rodentium, a bacterium which infects laboratory mice in a similar way to how enteropathogenic Escherichia coli (EPEC) and enterohaemorrhagic E. coli (EHEC) infect humans. In this study, we compared the growth of the bioluminescent C. rodentium strain ICC180 with its non-bioluminescent parent (strain ICC169) in a wide variety of environments. To understand more about the metabolic burden of expressing light, we also compared the growth profiles of the two strains under approximately 2,000 different conditions. We found that constitutive light expression in ICC180 was near-neutral in almost every non-toxic environment tested. However, we also found that the non-bioluminescent parent strain has a competitive advantage over ICC180 during infection of adult mice, although this was not enough for ICC180 to be completely outcompeted. In conclusion, our data suggest that constitutive light expression is not metabolically costly to C. rodentium and supports the view that bioluminescent versions of microbes can be used as a substitute for their non-bioluminescent parents to study bacterial behaviour in a wide variety of environments. KW - bioluminescence KW - lux KW - luciferase KW - biophotonic imaging KW - bioluminescence imaging KW - enteric pathogens KW - animal model KW - reporter genes KW - phenotypic microarray KW - biolog Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-166576 VL - 4 IS - e2130 ER - TY - JOUR A1 - Westermann, Alexander J. A1 - Barquist, Lars A1 - Vogel, Jörg T1 - Resolving host-pathogen interactions by dual RNA-seq JF - PLoS Pathogens N2 - The transcriptome is a powerful proxy for the physiological state of a cell, healthy or diseased. As a result, transcriptome analysis has become a key tool in understanding the molecular changes that accompany bacterial infections of eukaryotic cells. Until recently, such transcriptomic studies have been technically limited to analyzing mRNA expression changes in either the bacterial pathogen or the infected eukaryotic host cell. However, the increasing sensitivity of high-throughput RNA sequencing now enables “dual RNA-seq” studies, simultaneously capturing all classes of coding and noncoding transcripts in both the pathogen and the host. In the five years since the concept of dual RNA-seq was introduced, the technique has been applied to a range of infection models. This has not only led to a better understanding of the physiological changes in pathogen and host during the course of an infection but has also revealed hidden molecular phenotypes of virulence-associated small noncoding RNAs that were not visible in standard infection assays. Here, we use the knowledge gained from these recent studies to suggest experimental and computational guidelines for the design of future dual RNA-seq studies. We conclude this review by discussing prospective applications of the technique. KW - Medicine KW - RNA sequencing KW - Salmonellosis KW - Transcriptome analysis KW - Gene expression KW - Bacterial pathogens KW - Salmonella KW - Host cells KW - Lysis (medicine) Y1 - 2017 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-171921 VL - 13 IS - 2 ER - TY - JOUR A1 - Wheeler, Nicole E. A1 - Barquist, Lars A1 - Kingsley, Robert A. A1 - Gardner, Paul P. T1 - A profile-based method for identifying functional divergence of orthologous genes in bacterial genomes JF - Bioinformatics N2 - Motivation: Next generation sequencing technologies have provided us with a wealth of information on genetic variation, but predi cting the functional significance of this variation is a difficult task. While many comparative genomics studies have focused on gene flux and large scale changes, relatively little attention has been paid to quantifying the effects of single nucleotide polymorphisms and indels on protein function, particularly in bacterial genomics. Results: We present a hidden Markov model based approach we call delta-bitscore (DBS) for identifying orthologous proteins that have diverged at the amino acid sequence level in a way that is likely to impact biological function. We benchmark this approach with several widely used datasets and apply it to a proof-of-concept study of orthologous proteomes in an investigation of host adaptation in Salmonella enterica. We highlight the value of the method in identifying functional divergence of genes, and suggest that this tool may be a better approach than the commonly used dN/dS metric for identifying functionally significant genetic changes occurring in recently diverged organisms. KW - Host adaptation KW - Salmonella-enteritidis KW - Sequence identity KW - Rapid evolution KW - Variants KW - Cystic-fibriosis KW - Strains KW - Pathogenicity KW - Typhimurium KW - Yersinia Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-186502 VL - 32 IS - 23 ER -