TY  - JOUR
A1  - Wolf, Beat
A1  - Kuonen, Pierre
A1  - Dandekar, Thomas
A1  - Atlan, David
T1  - DNAseq workflow in a diagnostic context and an example of a user friendly implementation
JF  - BioMed Research International
N2  - Over recent years next generation sequencing (NGS) technologies evolved from costly tools used by very few, to a much more accessible and economically viable technology. Through this recently gained popularity, its use-cases expanded from research environments into clinical settings. But the technical know-how and infrastructure required to analyze the data remain an obstacle for a wider adoption of this technology, especially in smaller laboratories. We present GensearchNGS, a commercial DNAseq software suite distributed by Phenosystems SA. The focus of GensearchNGS is the optimal usage of already existing infrastructure, while keeping its use simple. This is achieved through the integration of existing tools in a comprehensive software environment, as well as custom algorithms developed with the restrictions of limited infrastructures in mind. This includes the possibility to connect multiple computers to speed up computing intensive parts of the analysis such as sequence alignments. We present a typical DNAseq workflow for NGS data analysis and the approach GensearchNGS takes to implement it. The presented workflow goes from raw data quality control to the final variant report. This includes features such as gene panels and the integration of online databases, like Ensembl for annotations or Cafe Variome for variant sharing.
KW  - next generation sequencing
KW  - genome browser
KW  - mutation
KW  - algorithm
KW  - database
KW  - format
KW  - discovery
KW  - exome
KW  - variants
KW  - alignment
Y1  - 2015
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-144527
IS  - 403497
ER  - 
TY  - JOUR
A1  - Rybalka, Nataliya
A1  - Wolf, Matthias
A1  - Andersen, Robert
A1  - Friedl, Thomas
T1  - Congruence of chloroplast- and nuclear-encoded DNA sequence variations used to assess species boundaries in the soil microalga Heterococcus (Stramenopiles, Xanthophyceae)
JF  - BMC Evolutionary Biology
N2  - Background: Heterococcus is a microalgal genus of Xanthophyceae (Stramenopiles) that is common and widespread in soils, especially from cold regions. Species are characterized by extensively branched filaments produced when grown on agarized culture medium. Despite the large number of species described exclusively using light microscopic morphology, the assessment of species diversity is hampered by extensive morphological plasticity. 
Results: Two independent types of molecular data, the chloroplast-encoded psbA/rbcL spacer complemented by rbcL gene and the internal transcribed spacer 2 of the nuclear rDNA cistron (ITS2), congruently recovered a robust phylogenetic structure. With ITS2 considerable sequence and secondary structure divergence existed among the eight species, but a combined sequence and secondary structure phylogenetic analysis confined to helix II of ITS2 corroborated relationships as inferred from the rbcL gene phylogeny. Intra-genomic divergence of ITS2 sequences was revealed in many strains. The 'monophyletic species concept', appropriate for microalgae without known sexual reproduction, revealed eight different species. Species boundaries established using the molecular-based monophyletic species concept were more conservative than the traditional morphological species concept. Within a species, almost identical chloroplast marker sequences (genotypes) were repeatedly recovered from strains of different origins. At least two species had widespread geographical distributions; however, within a given species, genotypes recovered from Antarctic strains were distinct from those in temperate habitats. Furthermore, the sequence diversity may correspond to adaptation to different types of habitats or climates. 
Conclusions: We established a method and a reference data base for the unambiguous identification of species of the common soil microalgal genus Heterococcus which uses DNA sequence variation in markers from plastid and nuclear genomes. The molecular data were more reliable and more conservative than morphological data.
KW  - xanthophyceae
KW  - psbA/rbcL spacer
KW  - ITS2
KW  - tool
KW  - RBCL
KW  - alignment
KW  - evolution
KW  - chlorophyta
KW  - RNA secondary structure
KW  - terrestrial habitats
KW  - phylogenetic trees
KW  - mixed models
KW  - green algae
KW  - heterococcus
KW  - systematics
KW  - molecular phylogeny
KW  - species concept
Y1  - 2013
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-121848
SN  - 1471-2148
VL  - 13
IS  - 39
ER  - 
TY  - JOUR
A1  - Merget, Benjamin
A1  - Koetschan, Christian
A1  - Hackl, Thomas
A1  - Förster, Frank
A1  - Dandekar, Thomas
A1  - Müller, Tobias
A1  - Schultz, Jörg
A1  - Wolf, Matthias
T1  - The ITS2 Database
JF  - Journal of Visual Expression
N2  - The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1 and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation.

The ITS2 Database presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank accurately reannotated. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold (direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.

The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE and ProfDistS for multiple sequence-structure alignment calculation and Neighbor Joining tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.

In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
KW  - homology modeling
KW  - molecular systematics
KW  - internal transcribed spacer 2
KW  - alignment
KW  - genetics
KW  - secondary structure
KW  - ribosomal RNA
KW  - phylogenetic tree
KW  - phylogeny
Y1  - 2012
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-124600
VL  - 61
IS  - e3806
ER  -