TY - JOUR A1 - Merget, Benjamin A1 - Wolf, Matthias T1 - A molecular phylogeny of Hypnales (Bryophyta) inferred from ITS2 sequence-structure data N2 - Background: Hypnales comprise over 50% of all pleurocarpous mosses. They provide a young radiation complicating phylogenetic analyses. To resolve the hypnalean phylogeny, it is necessary to use a phylogenetic marker providing highly variable features to resolve species on the one hand and conserved features enabling a backbone analysis on the other. Therefore we used highly variable internal transcribed spacer 2 (ITS2) sequences and conserved secondary structures, as deposited with the ITS2 Database, simultaneously. Findings: We built an accurate and in parts robustly resolved large scale phylogeny for 1,634 currently available hypnalean ITS2 sequence-structure pairs. Conclusions: Profile Neighbor-Joining revealed a possible hypnalean backbone, indicating that most of the hypnalean taxa classified as different moss families are polyphyletic assemblages awaiting taxonomic changes. KW - Moose KW - Hypnales KW - Bryophyta Y1 - 2010 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-67997 ER - TY - JOUR A1 - Buchheim, Mark A. A1 - Keller, Alexander A1 - Koetschan, Christian A1 - Förster, Frank A1 - Merget, Benjamin A1 - Wolf, Matthias T1 - Internal Transcribed Spacer 2 (nu ITS2 rRNA) Sequence-Structure Phylogenetics: Towards an Automated Reconstruction of the Green Algal Tree of Life JF - PLoS ONE N2 - Background: Chloroplast-encoded genes (matK and rbcL) have been formally proposed for use in DNA barcoding efforts targeting embryophytes. Extending such a protocol to chlorophytan green algae, though, is fraught with problems including non homology (matK) and heterogeneity that prevents the creation of a universal PCR toolkit (rbcL). Some have advocated the use of the nuclear-encoded, internal transcribed spacer two (ITS2) as an alternative to the traditional chloroplast markers. However, the ITS2 is broadly perceived to be insufficiently conserved or to be confounded by introgression or biparental inheritance patterns, precluding its broad use in phylogenetic reconstruction or as a DNA barcode. A growing body of evidence has shown that simultaneous analysis of nucleotide data with secondary structure information can overcome at least some of the limitations of ITS2. The goal of this investigation was to assess the feasibility of an automated, sequence-structure approach for analysis of IT2 data from a large sampling of phylum Chlorophyta. Methodology/Principal Findings: Sequences and secondary structures from 591 chlorophycean, 741 trebouxiophycean and 938 ulvophycean algae, all obtained from the ITS2 Database, were aligned using a sequence structure-specific scoring matrix. Phylogenetic relationships were reconstructed by Profile Neighbor-Joining coupled with a sequence structure-specific, general time reversible substitution model. Results from analyses of the ITS2 data were robust at multiple nodes and showed considerable congruence with results from published phylogenetic analyses. Conclusions/Significance: Our observations on the power of automated, sequence-structure analyses of ITS2 to reconstruct phylum-level phylogenies of the green algae validate this approach to assessing diversity for large sets of chlorophytan taxa. Moreover, our results indicate that objections to the use of ITS2 for DNA barcoding should be weighed against the utility of an automated, data analysis approach with demonstrated power to reconstruct evolutionary patterns for highly divergent lineages. KW - RBCL Gene-sequences KW - Colonial volvocales chlorophyta KW - 26S RDNA Data KW - Land plants KW - Molecular systematics KW - Secondary structure KW - Nuclear RDNA KW - DNA KW - Barcodes KW - Dasycladales chlorophyta KW - Profile distances Y1 - 2011 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-140866 VL - 6 IS - 2 ER - TY - JOUR A1 - Merget, Benjamin A1 - Koetschan, Christian A1 - Hackl, Thomas A1 - Förster, Frank A1 - Dandekar, Thomas A1 - Müller, Tobias A1 - Schultz, Jörg A1 - Wolf, Matthias T1 - The ITS2 Database JF - Journal of Visual Expression N2 - The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1 and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation. The ITS2 Database presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank accurately reannotated. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold (direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold. The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE and ProfDistS for multiple sequence-structure alignment calculation and Neighbor Joining tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure. In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses. KW - homology modeling KW - molecular systematics KW - internal transcribed spacer 2 KW - alignment KW - genetics KW - secondary structure KW - ribosomal RNA KW - phylogenetic tree KW - phylogeny Y1 - 2012 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-124600 VL - 61 IS - e3806 ER - TY - THES A1 - Merget, Benjamin T1 - Computational methods for assessing drug-target residence times in bacterial enoyl-ACP reductases and predicting small-molecule permeability for the \(Mycobacterium\) \(tuberculosis\) cell wall T1 - Computermethoden zur Bestimmung von Protein-Ligand Verweilzeiten in bakteriellen Enoyl-ACP Reduktasen und Vorhersage der Permeabilitätswahrscheinlichkeit kleiner Moleküle gegenüber der \(Mycobacterium\) \(tuberculosis\) Zellwand N2 - \textbf{Molecular Determinants of Drug-Target Residence Times of Bacterial Enoyl-ACP Reductases.} Whereas optimization processes of early drug discovery campaigns are often affinity-driven, the drug-target residence time $t_R$ should also be considered due to an often strong correlation with \textit{in vivo} efficacy of compounds. However, rational optimization of $t_R$ is not straightforward and generally hampered by the lack of structural information about the transition states of ligand association and dissociation. The enoyl-ACP reductase FabI of the fatty acid synthesis (FAS) type II is an important drug-target in antibiotic research. InhA is the FabI enzyme of \textit{Mycobacterium tuberculosis}, which is known to be inhibited by various compound classes. Slow-onset inhibition of InhA is assumed to be associated with the ordering of the most flexible protein region, the substrate binding loop (SBL). Diphenylethers are one class of InhA inhibitors that can promote such SBL ordering, resulting in long drug-target residence times. Although these inhibitors are energetically and kinetically well characterized, it is still unclear how the structural features of a ligand affect $t_R$. Using classical molecular dynamics (MD) simulations, recurring conformational families of InhA protein-ligand complexes were detected and structural determinants of drug-target residence time of diphenyl\-ethers with different kinetic profiles were described. This information was used to deduce guidelines for efficacy improvement of InhA inhibitors, including 5'-substitution on the diphenylether B-ring. The validity of this suggestion was then analyzed by means of MD simulations. Moreover, Steered MD (SMD) simulations were employed to analyze ligand dissociation of diphenylethers from the FabI enzyme of \textit{Staphylococcus aureus}. This approach resulted in a very accurate and quantitative linear regression model of the experimental $ln(t_R)$ of these inhibitors as a function of the calculated maximum free energy change of induced ligand extraction. This model can be used to predict the residence times of new potential inhibitors from crystal structures or valid docking poses. Since correct structural characterization of the intermediate enzyme-inhibitor state (EI) and the final state (EI*) of two-step slow-onset inhibition is crucial for rational residence time optimization, the current view of the EI and EI* states of InhA was revisited by means of crystal structure analysis, MD and SMD simulations. Overall, the analyses affirmed that the EI* state is a conformation resembling the 2X23 crystal structure (with slow-onset inhibitor \textbf{PT70}), whereas a twist of residues Ile202 and Val203 with a further opened helix $\alpha 6$ corresponds to the EI state. Furthermore, MD simulations emphasized the influence of close contacts to symmetry mates in the SBL region on SBL stability, underlined by the observation that an MD simulation of \textbf{PT155} chain A with chain B' of a symmetry mate in close proximity of the SBL region showed significantly more stable loops, than a simulation of the tetrameric assembly. Closing Part I, SMD simulations were employed which allow the delimitation of slow-onset InhA inhibitors from rapid reversible ligands. \textbf{Prediction of \textit{Mycobacterium tuberculosis} Cell Wall Permeability.} The cell wall of \textit{M. tuberculosis} hampers antimycobacterial drug design due to its unique composition, providing intrinsic antibiotic resistance against lipophilic and hydrophilic compounds. To assess the druggability space of this pathogen, a large-scale data mining endeavor was conducted, based on multivariate statistical analysis of differences in the physico-chemical composition of a normally distributed drug-like chemical space and a database of antimycobacterial--and thus very likely permeable--compounds. The approach resulted in the logistic regression model MycPermCheck, which is able to predict the permeability probability of small organic molecules based on their physico-chemical properties. Evaluation of MycPermCheck suggests a high predictive power. The model was implemented as a freely accessible online service and as a local stand-alone command-line version. Methodologies and findings from both parts of this thesis were combined to conduct a virtual screening for antimycobacterial substances. MycPermCheck was employed to screen the chemical permeability space of \textit{M. tuberculosis} from the entire ZINC12 drug-like database. After subsequent filtering steps regarding ADMET properties, InhA was chosen as an exemplary target. Docking to InhA led to a principal hit compound, which was further optimized. The quality of the interaction of selected derivatives with InhA was subsequently evaluated using MD and SMD simulations in terms of protein and ligand stability, as well as maximum free energy change of induced ligand egress. The results of the presented computational experiments suggest that compounds with an indole-3-acethydrazide scaffold might constitute a novel class of InhA inhibitors, worthwhile of further investigation. N2 - \textbf{Molekulare Determinanten von Wirkstoff-Angriffsziel Verweilzeiten bakterieller Enoyl-ACP Reduktasen.} In frühen Phasen der Wirkstoffentwicklung sind Optimierungsprozesse häufig affini\-täts\-geleitet. Darüber hinaus sollte zusätzlich die Wirkstoff-Angriffsziel Verweilzeit $t_R$ berücksichtigt werden, da diese oft eine starke Korrelation zur \textit{in vivo} Wirksamkeit der Substanzen aufweist. Rationale Optimierung von $t_R$ ist jedoch auf Grund eines Mangels an struktureller Information über den Übergangszustand der Ligandbindung und Dissoziierung nicht einfach umsetzbar. Die Enoyl-ACP Reduktase FabI der Fettsäurebio\-synthese (FAS) Typ II ist ein wichtiger Angriffspunkt in der Antibiotikaforschung. InhA ist das FabI Enzym des Organismus \textit{Mycobacterium tuberculosis} und kann durch Substanzen diverser Klassen gehemmt werden. Es wird vermutet, dass Hemmung von InhA durch langsam-bindende (``slow-onset'') Inhibitoren mit der Ordnung der flexibelsten Region des Enzyms assoziiert ist, dem Substratbindungsloop (SBL). Diphenylether sind eine InhA Inhibitorenklasse, die eine solche SBL Ordnung fördern und dadurch lange Verweilzeiten im Angriffsziel aufweisen. Obwohl diese Inhibitoren energetisch und kinetisch gut charakterisiert sind, ist noch immer unklar, wie die strukturellen Eigenschaften eines Liganden $t_R$ beeinflussen. Durch die Verwendung klassischer Molekulardynamik (MD) Simulationen wurden wiederkehrende Konformationsfamilien von InhA Protein-Ligand Komplexen entdeckt und strukturelle Determinanten der Wirkstoff-Angriffsziel Verweilzeit von Diphenylethern mit verschiedenen kinetischen Profilen beschrieben. Anhand dieser Ergebnisse wurden Richtlinien zur Wirksamkeitsoptimierung von InhA Inhibitoren abgeleitet, einschließlich einer 5'-Substitution am Diphenylether B-Ring. Die Validität dieses Vorschlags wurde mittels MD Simulationen nachfolgend analysiert. Darüber hinaus wurden ``Steered MD'' (SMD) Simulationen als MD Technik für umfangreicheres Sampling verwendet um die Liganddissoziation von Diphenylethern aus dem FabI Enzym von \textit{Staphylococcus aureus} zu untersuchen. Dieser Ansatz resultierte in einem sehr akkuraten, quantitativen linearen Regressionsmodell der experimentellen Verweilzeit $ln(t_R)$ dieser Inhibitoren als Funktion der berechneten maximalen freien Energieänderung induzierter Ligandextraktion. Dieses Modell kann genutzt werden um die Verweilzeiten neuer potentieller Inhibitoren aus Kristallstrukturen oder validen Dockingposen vorherzusagen. Die korrekte strukturelle Charakterisierung des intermediären und des finalen Zustandes (EI und EI*-Zustand) eines Enzym-Inhibitor Komplexes bei einem zweistufigen Inhibitionsmechanismus durch langsam-bindende Hemmstoffe ist essentiell für rationale Verweilzeitoptimierung. Daher wurde die gegenwärtige Ansicht des EI und EI*-Zustandes von InhA mittels Kristallstrukturanalyse, MD und SMD Simulationen erneut aufgegriffen. Insgesamt bestätigten die Analysen, dass der EI*-Zustand einer Konformation ähnlich der 2X23 Kristallstruktur (mit langsam-bindenden Inhibitor \textbf{PT70}) gleicht, während eine Drehung der Reste Ile202 und Val203 mit einer weiter geöffneten Helix $\alpha 6$ dem EI-Zustand entspricht. Des Weiteren zeigten MD Simulationen den Einfluss naher Kristallkontakte zu Symmetrie-Nachbarn in der SBL Region auf die SBL Stabilität. Dies wird durch die Beobachtung hervorgehoben, dass die Ketten A und B' eines InhA-\textbf{PT155}-Komplexes und des angrenzenden Symmetrie-Nachbars, welche in engem Kontakt in der SBL Region stehen, signifikant stabilere SBLs aufweisen, als die Ketten A und B in einer Simulation des Tetramers. Zum Abschluss von Teil I wurden SMD Simulationen angewandt, auf deren Basis es möglich war, langsam-bindende InhA Inhibitoren von schnell-reversiblen (``rapid reversible'') Liganden zu unterscheiden. \textbf{Vorhersage von \textit{Mycobacterium tuberculosis} Zellwand Permeabilität.} Die Zellwand von \textit{M.~tuberculosis} erschwert die antimycobakterielle Wirkstofffindung auf Grund ihrer einzigartigen Zusammensetzung und bietet eine intrinsische Antibiotikaresistenz gegenüber lipophilen und hydrophilen Substanzen. Um den chemischen Raum wirkstoffähnlicher Moleküle gegen diesen Erreger (``Druggability Space'') einzugrenzen, wurde eine groß angelegte Dataminingstudie durchgeführt, welche auf multivariater statistischer Analyse der Unterschiede der physikochemischen Zusammensetzung eines normalverteilten wirkstoffähnlichen chemischen Raumes und einer Datenbank von antimycobakteriellen -- und somit höchstwahrscheinlich permeablen -- Substanzen beruht. Dieser Ansatz resultierte in dem logistischen Regressionsmodell MycPermCheck, welches in der Lage ist die Permeabilitätswahrscheinlichkeit kleiner organischer Moleküle anhand ihrer physikochemischen Eigenschaften vorherzusagen. Die Evaluation von MycPermCheck deutet auf eine große Vorhersagekraft hin. Das Modell wurde als frei zugänglicher online Service und als lokale Kommandozeilenversion implementiert. Methodiken und Ergebnisse aus beiden Teilen dieser Dissertation wurden kombiniert um ein virtuelles Screening nach antimycobakteriellen Substanzen durchzuführen. Myc\-PermCheck wurde verwendet um den chemischen Permeabilitätsraum von \textit{M.~tuberculosis} anhand der gesamten ZINC12 Datenbank wirkstoffähnlicher Moleküle abzuschätzen. Nach weiteren Filterschritten mit Bezug auf ADMET Eigenschaften, wurde InhA als exemplarisches Angriffsziel ausgewählt. Docking nach InhA führte schließlich zu einer Treffersubstanz, welche in darauffolgenden Schritten weiter optimiert wurde. Die Interaktionsqualität ausgewählter Derivate mit InhA wurde daraufhin mittels MD und SMD Simulationen in Bezug auf Protein und Ligand Stabilität, sowie auch der maximalen freien Energieänderung induzierter Ligandextraktion, untersucht. Die Ergebnisse der vorgestellten computerbasierten Experimente legen nahe, dass Substanzen mit einem Indol-3-Acethydrazid Gerüst eine neuartige Klasse von InhA Inhibitoren darstellen könnten. Weiterführende Untersuchungen könnten sich somit als lohnenswert erweisen. KW - Computational chemistry KW - Arzneimitteldesign KW - Molekulardynamik KW - Permeabilität KW - Tuberkelbakterium KW - Computational drug-design KW - steered molecular dynamics KW - molecular dynamics KW - residence time KW - mycobacterium tuberculosis KW - staphylococcus aureus KW - permeability KW - InhA KW - FabI KW - Enoyl-acyl-carrier-protein-Reductase KW - Drug design KW - Computational chemistry Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-127386 ER - TY - JOUR A1 - Merget, Benjamin A1 - Sotriffer, Christoph A. T1 - Slow-Onset Inhibition of Mycobacterium tuberculosis InhA: Revealing Molecular Determinants of Residence Time by MD Simulations JF - PLoS One N2 - An important kinetic parameter for drug efficacy is the residence time of a compound at a drug target, which is related to the dissociation rate constant koff. For the essential antimycobacterial target InhA, this parameter is most likely governed by the ordering of the flexible substrate binding loop (SBL). Whereas the diphenyl ether inhibitors 6PP and triclosan (TCL) do not show loop ordering and thus, no slow-binding inhibition and high koff values, the slightly modified PT70 leads to an ordered loop and a residence time of 24 minutes. To assess the structural differences of the complexes from a dynamic point of view, molecular dynamics (MD) simulations with a total sampling time of 3.0 µs were performed for three ligand-bound and two ligand-free (perturbed) InhA systems. The individual simulations show comparable conformational features with respect to both the binding pocket and the SBL, allowing to define five recurring conformational families. Based on their different occurrence frequencies in the simulated systems, the conformational preferences could be linked to structural differences of the respective ligands to reveal important determinants of residence time. The most abundant conformation besides the stable EI* state is characterized by a shift of Ile202 and Val203 toward the hydrophobic pocket of InhA. The analyses revealed potential directions for avoiding this conformational change and, thus, hindering rapid dissociation: (1) an anchor group in 2'-position of the B-ring for scaffold stabilization, (2) proper occupation of the hydrophobic pocket, and (3) the introduction of a barricade substituent in 5'-position of the diphenyl ether B-ring. KW - crystal structure KW - ethers KW - oxygen KW - cofactors (biochemistry) KW - binding analysis KW - biochemical simulations KW - hydrogen bonding mycobacterium tuberculosis Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-125607 VL - 10 IS - 5 ER -