TY - JOUR A1 - Fukushima, Kenji A1 - Pollock, David D. T1 - Amalgamated cross-species transcriptomes reveal organ-specific propensity in gene expression evolution JF - Nature Communications N2 - The origins of multicellular physiology are tied to evolution of gene expression. Genes can shift expression as organisms evolve, but how ancestral expression influences altered descendant expression is not well understood. To examine this, we amalgamate 1,903 RNA-seq datasets from 182 research projects, including 6 organs in 21 vertebrate species. Quality control eliminates project-specific biases, and expression shifts are reconstructed using gene-family-wise phylogenetic Ornstein-Uhlenbeck models. Expression shifts following gene duplication result in more drastic changes in expression properties than shifts without gene duplication. The expression properties are tightly coupled with protein evolutionary rate, depending on whether and how gene duplication occurred. Fluxes in expression patterns among organs are nonrandom, forming modular connections that are reshaped by gene duplication. Thus, if expression shifts, ancestral expression in some organs induces a strong propensity for expression in particular organs in descendants. Regardless of whether the shifts are adaptive or not, this supports a major role for what might be termed preadaptive pathways of gene expression evolution. KW - phylogenetic trees KW - adaptive conflict KW - divergence times KW - duplicate genes KW - recent origin KW - package KW - selection KW - alignmen KW - rates KW - biology Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-230468 VL - 11, ER - TY - JOUR A1 - Schwarz, Roland F. A1 - Tamuri, Asif U. A1 - Kultys, Marek A1 - King, James A1 - Godwin, James A1 - Florescu, Ana M. A1 - Schultz, Jörg A1 - Goldman, Nick T1 - ALVIS: interactive non-aggregative visualization and explorative analysis of multiple sequence alignments JF - Nucleic Acids Research N2 - Sequence Logos and its variants are the most commonly used method for visualization of multiple sequence alignments (MSAs) and sequence motifs. They provide consensus-based summaries of the sequences in the alignment. Consequently, individual sequences cannot be identified in the visualization and covariant sites are not easily discernible. We recently proposed Sequence Bundles, a motif visualization technique that maintains a one-to-one relationship between sequences and their graphical representation and visualizes covariant sites. We here present Alvis, an open-source platform for the joint explorative analysis of MSAs and phylogenetic trees, employing Sequence Bundles as its main visualization method. Alvis combines the power of the visualization method with an interactive toolkit allowing detection of covariant sites, annotation of trees with synapomorphies and homoplasies, and motif detection. It also offers numerical analysis functionality, such as dimension reduction and classification. Alvis is user-friendly, highly customizable and can export results in publication-quality figures. It is available as a full-featured standalone version (http://www.bitbucket.org/rfs/alvis) and its Sequence Bundles visualization module is further available as a web application (http://science-practice.com/projects/sequence-bundles). KW - visualization KW - multiple sequence alignments KW - phylogenetic trees KW - Alvis Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-166374 VL - 44 IS - 8 ER - TY - JOUR A1 - Letunic, Ivica A1 - Bork, Peer T1 - Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees JF - Nucleic Acids Research N2 - Interactive Tree Of Life (http://itol.embl.de) is a web-based tool for the display, manipulation and annotation of phylogenetic trees. It is freely available and open to everyone. The current version was completely redesigned and rewritten, utilizing current web technologies for speedy and streamlined processing. Numerous new features were introduced and several new data types are now supported. Trees with up to 100,000 leaves can now be efficiently displayed. Full interactive control over precise positioning of various annotation features and an unlimited number of datasets allow the easy creation of complex tree visualizations. iTOL 3 is the first tool which supports direct visualization of the recently proposed phylogenetic placements format. Finally, iTOL's account system has been redesigned to simplify the management of trees in user-defined workspaces and projects, as it is heavily used and currently handles already more than 500,000 trees from more than 10,000 individual users. KW - Interactive Tree Of Life (iTOL) KW - phylogenetic trees KW - visualization KW - tool Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-166181 VL - 44 IS - W1 ER - TY - JOUR A1 - Rybalka, Nataliya A1 - Wolf, Matthias A1 - Andersen, Robert A1 - Friedl, Thomas T1 - Congruence of chloroplast- and nuclear-encoded DNA sequence variations used to assess species boundaries in the soil microalga Heterococcus (Stramenopiles, Xanthophyceae) JF - BMC Evolutionary Biology N2 - Background: Heterococcus is a microalgal genus of Xanthophyceae (Stramenopiles) that is common and widespread in soils, especially from cold regions. Species are characterized by extensively branched filaments produced when grown on agarized culture medium. Despite the large number of species described exclusively using light microscopic morphology, the assessment of species diversity is hampered by extensive morphological plasticity. Results: Two independent types of molecular data, the chloroplast-encoded psbA/rbcL spacer complemented by rbcL gene and the internal transcribed spacer 2 of the nuclear rDNA cistron (ITS2), congruently recovered a robust phylogenetic structure. With ITS2 considerable sequence and secondary structure divergence existed among the eight species, but a combined sequence and secondary structure phylogenetic analysis confined to helix II of ITS2 corroborated relationships as inferred from the rbcL gene phylogeny. Intra-genomic divergence of ITS2 sequences was revealed in many strains. The 'monophyletic species concept', appropriate for microalgae without known sexual reproduction, revealed eight different species. Species boundaries established using the molecular-based monophyletic species concept were more conservative than the traditional morphological species concept. Within a species, almost identical chloroplast marker sequences (genotypes) were repeatedly recovered from strains of different origins. At least two species had widespread geographical distributions; however, within a given species, genotypes recovered from Antarctic strains were distinct from those in temperate habitats. Furthermore, the sequence diversity may correspond to adaptation to different types of habitats or climates. Conclusions: We established a method and a reference data base for the unambiguous identification of species of the common soil microalgal genus Heterococcus which uses DNA sequence variation in markers from plastid and nuclear genomes. The molecular data were more reliable and more conservative than morphological data. KW - xanthophyceae KW - psbA/rbcL spacer KW - ITS2 KW - tool KW - RBCL KW - alignment KW - evolution KW - chlorophyta KW - RNA secondary structure KW - terrestrial habitats KW - phylogenetic trees KW - mixed models KW - green algae KW - heterococcus KW - systematics KW - molecular phylogeny KW - species concept Y1 - 2013 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-121848 SN - 1471-2148 VL - 13 IS - 39 ER - TY - JOUR A1 - Koetschan, Christian A1 - Kittelmann, Sandra A1 - Lu, Jingli A1 - Al-Halbouni, Djamila A1 - Jarvis, Graeme N. A1 - Müller, Tobias A1 - Wolf, Matthias A1 - Janssen, Peter H. T1 - Internal Transcribed Spacer 1 Secondary Structure Analysis Reveals a Common Core throughout the Anaerobic Fungi (Neocallimastigomycota) JF - PLOS ONE N2 - The internal transcribed spacer (ITS) is a popular barcode marker for fungi and in particular the ITS1 has been widely used for the anaerobic fungi (phylum Neocallimastigomycota). A good number of validated reference sequences of isolates as well as a large number of environmental sequences are available in public databases. Its highly variable nature predisposes the ITS1 for low level phylogenetics; however, it complicates the establishment of reproducible alignments and the reconstruction of stable phylogenetic trees at higher taxonomic levels (genus and above). Here, we overcame these problems by proposing a common core secondary structure of the ITS1 of the anaerobic fungi employing a Hidden Markov Model-based ITS1 sequence annotation and a helix-wise folding approach. We integrated the additional structural information into phylogenetic analyses and present for the first time an automated sequence-structure-based taxonomy of the ITS1 of the anaerobic fungi. The methodology developed is transferable to the ITS1 of other fungal groups, and the robust taxonomy will facilitate and improve high-throughput anaerobic fungal community structure analysis of samples from various environments. KW - profile distances KW - ITS2 KW - phylogenetic trees KW - RNA sequence KW - reconstruction KW - diversity KW - populations KW - tool KW - systematics KW - herbivores Y1 - 2014 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-117058 VL - 9 IS - 3 ER - TY - JOUR A1 - Wagner, Ines A1 - Volkmer, Michael A1 - Sharan, Malvika A1 - Villaveces, Jose M. A1 - Oswald, Felix A1 - Surendranath, Vineeth A1 - Habermann, Bianca H. T1 - morFeus: a web-based program to detect remotely conserved orthologs using symmetrical best hits and orthology network scoring JF - BMC Bioinformatics N2 - Background: Searching the orthologs of a given protein or DNA sequence is one of the most important and most commonly used Bioinformatics methods in Biology. Programs like BLAST or the orthology search engine Inparanoid can be used to find orthologs when the similarity between two sequences is sufficiently high. They however fail when the level of conservation is low. The detection of remotely conserved proteins oftentimes involves sophisticated manual intervention that is difficult to automate. Results: Here, we introduce morFeus, a search program to find remotely conserved orthologs. Based on relaxed sequence similarity searches, morFeus selects sequences based on the similarity of their alignments to the query, tests for orthology by iterative reciprocal BLAST searches and calculates a network score for the resulting network of orthologs that is a measure of orthology independent of the E-value. Detecting remotely conserved orthologs of a protein using morFeus thus requires no manual intervention. We demonstrate the performance of morFeus by comparing it to state-of-the-art orthology resources and methods. We provide an example of remotely conserved orthologs, which were experimentally shown to be functionally equivalent in the respective organisms and therefore meet the criteria of the orthology-function conjecture. Conclusions: Based on our results, we conclude that morFeus is a powerful and specific search method for detecting remotely conserved orthologs. KW - reciprocal best hit KW - finder using symmetrical best hits KW - sequences KW - annotation KW - identification KW - database KW - genomes KW - proteins KW - homologs KW - hidden markov-models KW - phylogenetic trees KW - PSI-blast KW - eigenvector centrality KW - meta-analysis based orthology KW - orthology KW - remote sequence conservation KW - alignment clustering KW - orthology network Y1 - 2014 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-115590 VL - 15 IS - 263 ER -