TY - JOUR A1 - Plieger, Tanja A1 - Wolf, Matthias T1 - 18S and ITS2 rDNA sequence-structure phylogeny of Prototheca (Chlorophyta, Trebouxiophyceae) JF - Biologia N2 - Protothecosis is an infectious disease caused by organisms currently classified within the green algal genus Prototheca. The disease can manifest as cutaneous lesions, olecranon bursitis or disseminated or systemic infections in both immunocompetent and immunosuppressed patients. Concerning diagnostics, taxonomic validity is important. Prototheca, closely related to the Chlorella species complex, is known to be polyphyletic, branching with Auxenochlorella and Helicosporidium. The phylogeny of Prototheca was discussed and revisited several times in the last decade; new species have been described. Phylogenetic analyses were performed using ribosomal DNA (rDNA) and partial mitochondrial cytochrome b (cytb) sequence data. In this work we use Internal Transcribed Spacer 2 (ITS2) as well as 18S rDNA data. However, for the first time, we reconstruct phylogenetic relationships of Prototheca using primary sequence and RNA secondary structure information simultaneously, a concept shown to increase robustness and accuracy of phylogenetic tree estimation. Using encoded sequence-structure data, Neighbor-Joining, Maximum-Parsimony and Maximum-Likelihood methods yielded well-supported trees in agreement with other trees calculated on rDNA; but differ in several aspects from trees using cytb as a phylogenetic marker. ITS2 secondary structures of Prototheca sequences are in agreement with the well-known common core structure of eukaryotes but show unusual differences in their helix lengths. An elongation of the fourth helix of some species seems to have occurred independently in the course of evolution. KW - secondary structure KW - 18S KW - ITS2 KW - phylogeny KW - prototheca Y1 - 2022 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-269897 SN - 1336-9563 VL - 77 IS - 2 ER - TY - JOUR A1 - Rybalka, Nataliya A1 - Wolf, Matthias A1 - Andersen, Robert A1 - Friedl, Thomas T1 - Congruence of chloroplast- and nuclear-encoded DNA sequence variations used to assess species boundaries in the soil microalga Heterococcus (Stramenopiles, Xanthophyceae) JF - BMC Evolutionary Biology N2 - Background: Heterococcus is a microalgal genus of Xanthophyceae (Stramenopiles) that is common and widespread in soils, especially from cold regions. Species are characterized by extensively branched filaments produced when grown on agarized culture medium. Despite the large number of species described exclusively using light microscopic morphology, the assessment of species diversity is hampered by extensive morphological plasticity. Results: Two independent types of molecular data, the chloroplast-encoded psbA/rbcL spacer complemented by rbcL gene and the internal transcribed spacer 2 of the nuclear rDNA cistron (ITS2), congruently recovered a robust phylogenetic structure. With ITS2 considerable sequence and secondary structure divergence existed among the eight species, but a combined sequence and secondary structure phylogenetic analysis confined to helix II of ITS2 corroborated relationships as inferred from the rbcL gene phylogeny. Intra-genomic divergence of ITS2 sequences was revealed in many strains. The 'monophyletic species concept', appropriate for microalgae without known sexual reproduction, revealed eight different species. Species boundaries established using the molecular-based monophyletic species concept were more conservative than the traditional morphological species concept. Within a species, almost identical chloroplast marker sequences (genotypes) were repeatedly recovered from strains of different origins. At least two species had widespread geographical distributions; however, within a given species, genotypes recovered from Antarctic strains were distinct from those in temperate habitats. Furthermore, the sequence diversity may correspond to adaptation to different types of habitats or climates. Conclusions: We established a method and a reference data base for the unambiguous identification of species of the common soil microalgal genus Heterococcus which uses DNA sequence variation in markers from plastid and nuclear genomes. The molecular data were more reliable and more conservative than morphological data. KW - xanthophyceae KW - psbA/rbcL spacer KW - ITS2 KW - tool KW - RBCL KW - alignment KW - evolution KW - chlorophyta KW - RNA secondary structure KW - terrestrial habitats KW - phylogenetic trees KW - mixed models KW - green algae KW - heterococcus KW - systematics KW - molecular phylogeny KW - species concept Y1 - 2013 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-121848 SN - 1471-2148 VL - 13 IS - 39 ER - TY - JOUR A1 - Sickel, Wiebke A1 - Ankenbrand, Markus J. A1 - Grimmer, Gudrun A1 - Holzschuh, Andrea A1 - Härtel, Stephan A1 - Lanzen, Jonathan A1 - Steffan-Dewenter, Ingolf A1 - Keller, Alexander T1 - Increased efficiency in identifying mixed pollen samples by meta-barcoding with a dual-indexing approach JF - BMC Ecology N2 - Background Meta-barcoding of mixed pollen samples constitutes a suitable alternative to conventional pollen identification via light microscopy. Current approaches however have limitations in practicability due to low sample throughput and/or inefficient processing methods, e.g. separate steps for amplification and sample indexing. Results We thus developed a new primer-adapter design for high throughput sequencing with the Illumina technology that remedies these issues. It uses a dual-indexing strategy, where sample-specific combinations of forward and reverse identifiers attached to the barcode marker allow high sample throughput with a single sequencing run. It does not require further adapter ligation steps after amplification. We applied this protocol to 384 pollen samples collected by solitary bees and sequenced all samples together on a single Illumina MiSeq v2 flow cell. According to rarefaction curves, 2,000–3,000 high quality reads per sample were sufficient to assess the complete diversity of 95% of the samples. We were able to detect 650 different plant taxa in total, of which 95% were classified at the species level. Together with the laboratory protocol, we also present an update of the reference database used by the classifier software, which increases the total number of covered global plant species included in the database from 37,403 to 72,325 (93% increase). Conclusions This study thus offers improvements for the laboratory and bioinformatical workflow to existing approaches regarding data quantity and quality as well as processing effort and cost-effectiveness. Although only tested for pollen samples, it is furthermore applicable to other research questions requiring plant identification in mixed and challenging samples. KW - pollination ecology KW - next generation sequencing KW - ITS2 KW - illumina MiSeq platform KW - high throughput sequencing KW - DNA barcoding KW - NGS KW - osmia KW - palynolog Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-125730 VL - 15 IS - 20 ER - TY - JOUR A1 - Koetschan, Christian A1 - Kittelmann, Sandra A1 - Lu, Jingli A1 - Al-Halbouni, Djamila A1 - Jarvis, Graeme N. A1 - Müller, Tobias A1 - Wolf, Matthias A1 - Janssen, Peter H. T1 - Internal Transcribed Spacer 1 Secondary Structure Analysis Reveals a Common Core throughout the Anaerobic Fungi (Neocallimastigomycota) JF - PLOS ONE N2 - The internal transcribed spacer (ITS) is a popular barcode marker for fungi and in particular the ITS1 has been widely used for the anaerobic fungi (phylum Neocallimastigomycota). A good number of validated reference sequences of isolates as well as a large number of environmental sequences are available in public databases. Its highly variable nature predisposes the ITS1 for low level phylogenetics; however, it complicates the establishment of reproducible alignments and the reconstruction of stable phylogenetic trees at higher taxonomic levels (genus and above). Here, we overcame these problems by proposing a common core secondary structure of the ITS1 of the anaerobic fungi employing a Hidden Markov Model-based ITS1 sequence annotation and a helix-wise folding approach. We integrated the additional structural information into phylogenetic analyses and present for the first time an automated sequence-structure-based taxonomy of the ITS1 of the anaerobic fungi. The methodology developed is transferable to the ITS1 of other fungal groups, and the robust taxonomy will facilitate and improve high-throughput anaerobic fungal community structure analysis of samples from various environments. KW - profile distances KW - ITS2 KW - phylogenetic trees KW - RNA sequence KW - reconstruction KW - diversity KW - populations KW - tool KW - systematics KW - herbivores Y1 - 2014 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-117058 VL - 9 IS - 3 ER - TY - THES A1 - Förster, Frank T1 - Making the most of phylogeny: Unique adaptations in tardigrades and 216374 internal transcribed spacer 2 structures T1 - Einzigartige Anpassungen in Tardigraden und 216374 "internal transcribed spacer 2" Strukturen N2 - The phylum Tardigrada consists of about 1000 described species to date. The animals live in habitats within marine, freshwater and terrestrial ecosystems allover the world. Tardigrades are polyextremophiles. They are capable to resist extreme temperature, pressure or radiation. In the event of desiccation, tardigrades enter a so-called tun stage. The reason for their great tolerance capabilities against extreme environmental conditions is not discovered yet. Our Funcrypta project aims at finding answers to the question what mechanisms underlie these adaption capabilities particularly with regard to the species Milnesium tardigradum. The first part of this thesis describes the establishment of expressed sequence tags (ESTs) libraries for different stages of M. tardigradum. From proteomics data we bioinformatically identified 144 proteins with a known function and additionally 36 proteins which seemed to be specific for M. tardigradum. The generation of a comprehensive web-based database allows us to merge the proteome and transcriptome data. Therefore we created an annotation pipeline for the functional annotation of the protein and nucleotide sequences. Additionally, we clustered the obtained proteome dataset and identified some tardigrade-specific proteins (TSPs) which did not show homology to known proteins. Moreover, we examined the heat shock proteins of M. tardigradum and their different expression levels depending on the actual state of the animals. In further bioinformatical analyses of the whole data set, we discovered promising proteins and pathways which are described to be correlated with the stress tolerance, e.g. late embryogenesis abundant (LEA) proteins. Besides, we compared the tardigrades with nematodes, rotifers, yeast and man to identify shared and tardigrade specific stress pathways. An analysis of the 50 and 30 untranslated regions (UTRs) demonstrates a strong usage of stabilising motifs like the 15-lipoxygenase differentiation control element (15-LOX-DICE) but also reveals a lack of other common UTR motifs normally used, e.g. AU rich elements. The second part of this thesis focuses on the relatedness between several cryptic species within the tardigrade genus Paramacrobiotus. Therefore for the first time, we used the sequence-structure information of the internal transcribed spacer 2 (ITS2) as a phylogenetic marker in tardigrades. This allowed the description of three new species which were indistinguishable using morphological characters or common molecular markers like the 18S ribosomal ribonucleic acid (rRNA) or the Cytochrome c oxidase subunit I (COI). In a large in silico simulation study we also succeeded to show the benefit for the phylogenetic tree reconstruction by adding structure information to the ITS2 sequence. Next to the genus Paramacrobiotus we used the ITS2 to corroborate a monophyletic DO-group (Sphaeropleales) within the Chlorophyceae. Additionally we redesigned another comprehensive database—the ITS2 database resulting in a doubled number of sequence-structure pairs of the ITS2. In conclusion, this thesis shows the first insights (6 first author publications and 4 coauthor publications) into the reasons for the enormous adaption capabilities of tardigrades and offers a solution to the debate on the phylogenetic relatedness within the tardigrade genus Paramacrobiotus. N2 - Der Tierstamm Tardigrada besteht aus derzeitig etwa 1000 beschriebenen Arten. Die Tiere leben in Habitaten in marinen, limnischen und terrestrischen Ökosystemen auf der ganzen Welt. Tardigraden sind polyextremophil. Sie können extremer Temperatur, Druck oder Strahlung widerstehen. Beim Austrocknen bilden sie ein so genanntes Tönnchenstadium. Der Grund für die hohe Toleranz gegenüber extremen Umweltbedingungen ist bis jetzt nicht aufgeklärt worden. Unser Funcrypta Projekt versucht Antworten darauf zu finden, was die hinter dieser Anpassungsfähigkeit liegenden Mechanismen sind. Dabei steht die Art Milnesium tardigradum im Mittelpunkt. Der erste Teil dieser Arbeit beschreibt die Etablierung einer expressed sequence tags (ESTs) Bibliothek für verschiedene Stadien von M. tardigradum. Aus unseren Proteomansatz konnten wir bislang 144 Proteine bioinformatisch identifizieren, denen eine Funktion zugeordnet werden konnte. Darüber hinaus wurden 36 Proteine gefunden, welche spezifisch für M. tardigradum zu sein scheinen. Die Erstellung einer umfassenden internetbasierenden Datenbank erlaubt uns die Verknüpfung der Proteom und Transkriptomdaten. Dafür wurde eine Annotations-Pipeline erstellt um den Sequenzen Funktionen zuordnen zu können. Außerdem wurden die erhaltenen Proteindaten von uns geclustert. Dabei konnten wir einige Tardigraden-spezifische Proteine (tardigrade-specific protein, TSP) identifizieren die keinerlei Homologie zu bekannten Proteinen zeigen. Außerdem untersuchten wir die Hitze-Schock-Proteine von M. tardigradum und deren differenzielle Expression in Abhängigkeit vom Stadium der Tiere. In weiteren bioinformatischen Analysen konnten wir viel versprechende Proteine und Stoffwechselwege entdecken für die beschrieben ist, dass sie mit Stressreaktionen in Verbindung stehen, beispielsweise late embryogenesis abundant (LEA) Proteine. Des Weiteren verglichen wir Tardigraden mit Nematoden, Rotatorien, Hefe und dem Menschen, um gemeinsame und Tardigraden-spezifische Stoffwechselwege identifizieren zu können. Analysen der 50 und 30 untranslatierten Bereiche zeigen eine verstärkte Nutzung von stabilisierenden Motiven, wie dem 15-lipoxygenase differentiation control element (LEA). Im Gegensatz dazu werden häufig benutzte Motive, wie beispielsweise AU-reiche Bereiche, gar nicht gefunden. Der zweite Teil der Doktorarbeit beschäftigt sich mit den Verwandtschaftsverhältnissen einiger kryptischer Arten in der Tardigradengattung Paramacrobiotus. Hierfür haben wir, zum ersten Mal in Tardigraden, die Sequenz-Struktur-Informationen der internal transcribed spacer 2 Region als phylogenetischen Marker verwendet. Dies erlaubte uns die Beschreibung von drei neuen Arten, welche mit klassischen morphologischen Merkmalen oder anderen molekularen Markern wie 18S ribosomaler RNA oder Cytochrome c oxidase subunit I (COI) nicht unterschieden werden konnten. In einer umfangreichen in silico Simulationsstudie zeigten wir den Vorteil der bei der Rekonstruktion phylogenetischer Bäume unter der Hinzunahme der Strukturinformationen zur Sequenz der ITS2 entsteht. ITS2 Sequenz-Struktur-Informationen wurden außerdem auch dazu benutzt, eine monophyletische DO-Gruppe (Sphaeropleales) in den Chlorophyceae zu bestätigen. Zusätzlich haben wir eine umfassende Datenbank, die ITS2-Datenbank, überarbeitet. Dadurch konnten die Sequenz-Struktur-Informationen verdoppelt werden, die in dieser Datenbank verfügbar sind. Die vorliegende Doktorarbeit zeigt erste Einblicke (6 Erstautor- und 4 Koautor-Publikationen) in die Ursachen für die hervorragende Anpassungsfähigkeit der Tardigraden und beschreibt die erfolgreiche Aufklärung der Verwandtschaftsverhältnisse in der Tardigradengattung Paramacrobiotus. KW - Phylogenie KW - Bioinformatik KW - Würzburg / Universität / Lehrstuhl für Bioinformatik KW - Anpassung KW - Datenbank KW - ITS2 KW - Marker KW - Tardigraden KW - Bärtierchen KW - ITS2 KW - Marker KW - Tardigrades KW - Waterbear Y1 - 2010 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-51466 ER - TY - THES A1 - Selig, Christian T1 - The ITS2 Database - Application and Extension N2 - Der internal transcribed spacer 2 (ITS2) des ribosomalen Genrepeats ist ein zunehmend wichtiger phylogenetischer Marker, dessen RNA-Sekundärstruktur innerhalb vieler eukaryontischer Organismen konserviert ist. Die ITS2-Datenbank hat zum Ziel, eine umfangreiche Ressource für ITS2-Sequenzen und -Sekundärstrukturen auf Basis direkter thermodynamischer als auch homologiemodellierter RNA-Faltung zu sein. Ergebnisse: (a) Eine komplette Neufassung der ursprünglichen die ITS2-Datenbank generierenden Skripte, angewandt auf einen aktuellen NCBI-Datensatz, deckte mehr als 65.000 ITS2-Strukturen auf. Dies verdoppelt den Inhalt der ursprünglichen Datenbank und verdreifacht ihn, wenn partielle Strukturen mit einbezogen werden. (b) Die Endbenutzer-Schnittstelle wurde neu geschrieben, erweitert und ist jetzt in der Lage, benutzerdefinierte Homologiemodellierungen durchzuführen. (c) Andere möglichen RNA-Strukturaufklärungsmethoden (suboptimales und formenbasiertes Falten) sind hilfreich, können aber Homologiemodellierung nicht ersetzen. (d) Ein Anwendungsfall der ITS2-Datenbank in Zusammenhang mit anderen am Lehrstuhl entwickelten Werkzeugen gab Einblick in die Verwendung von ITS2 für molekulare Phylogenie. N2 - The internal transcribed spacer 2 (ITS2) of the ribosomal gene repeat is an increasingly important phylogenetic marker whose RNA secondary structure is widely conserved across eukaryotic organisms. The ITS2 database aims to be a comprehensive resource on ITS2 sequence and secondary structure, based on direct thermodynamic as well as homology modelled RNA folds. Results: (a) A rebuild of the original ITS2 database generation scripts applied to a current NCBI dataset reveal more than 60,000 ITS2 structures. This more than doubles the contents of the original database and triples it when including partial structures. (b) The end-user interface was rewritten, extended and now features user-defined homology modelling. (c) Other possible RNA structure discovery methods (namely suboptimal and shape folding) prove helpful but are not able to replace homology modelling. (d) A use case of the ITS2 database in conjunction with other tools developed at the department gave insight into molecular phylogenetic analysis with ITS2. KW - Phylogenie KW - Bioinformatik KW - Würzburg / Universität / Lehrstuhl für Bioinformatik KW - Datenbank KW - Perl KW - SQL KW - Trichoplax adhaerens KW - Placozoa KW - RNS KW - S KW - internal transcribed spacer 2 KW - ITS-2 KW - ITS2 KW - Phylogeny KW - Database KW - Perl KW - SQL KW - Placozoa Y1 - 2007 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-23895 ER -