Refine
Has Fulltext
- yes (36)
Is part of the Bibliography
- yes (36)
Year of publication
Document Type
- Doctoral Thesis (36) (remove)
Keywords
- Genexpression (36) (remove)
Institute
- Theodor-Boveri-Institut für Biowissenschaften (36) (remove)
The transcription factor NRF2 is considered as the master regulator of cytoprotective and ROS-detoxifying gene expression. Due to their vulnerability to accumulating reactive oxygen species, melanomas are dependent on an efficient oxidative stress response, but to what extent melanomas rely on NRF2 is only scarcely investigated so far. In tumor entities harboring activating mutations of NRF2, such as lung adenocarcinoma, NRF2 activation is closely connected to therapy resistance. In melanoma, activating mutations are rare and triggers and effectors of NRF2 are less well characterized.
This work revealed that NRF2 is activated by oncogenic signaling, cytokines and pro-oxidant triggers, released cell-autonomously or by the tumor microenvironment. Moreover, silencing of NRF2 significantly reduced melanoma cell proliferation and repressed well-known NRF2 target genes, indicating basal transcriptional activity of NRF2 in melanoma. Transcriptomic analysis showed a large set of deregulated gene sets, besides the well-known antioxidant effectors. NRF2 suppressed the activity of MITF, a marker for the melanocyte lineage, and induced expression of epidermal growth factor receptor (EGFR), thereby stabilizing the dedifferentiated melanoma phenotype and limiting pigmentation markers and melanoma-associated antigens. In general, the dedifferentiated melanoma phenotype is associated with a reduced tumor immunogenicity. Furthermore, stress-inducible cyclooxygenase 2 (COX2) expression, a crucial immune-modulating gene, was regulated by NRF2 in an ATF4-dependent manner. Only in presence of both transcription factors was COX2 robustly induced by H2O2 or TNFα. COX2 catalyzes the first step of the prostaglandin E2 (PGE2) synthesis, which was described to be associated with tumor immune evasion and reduction of the innate immune response.
In accordance with these potentially immune-suppressive features, immunocompetent mice injected with NRF2 knockout melanoma cells had a strikingly longer tumor-free survival compared to NRF2-proficient cells. In line with the in vitro data, NRF2-deficient tumors showed suppression of COX2 and induction of MITF. Furthermore, transcriptomic analyses of available tumors revealed a strong induction of genes belonging to the innate immune response, such as RSAD2 and IFIH1. The expression of these genes strongly correlated with immune evasion parameters in human melanoma datasets and NRF2 activation or PGE2 supplementation limited the innate immune response in vitro.
In summary, the stress dependent NRF2 activation stabilizes the dedifferentiated melanoma phenotype and facilitates the synthesis of PGE2. As a result, NRF2 reduces gene expression of the innate immune response and promotes the generation of an immune-cold tumor microenvironment. Therefore, NRF2 not only elevated the ROS resilience, but also strongly contributed to tumor growth, maintenance, and immune control in cutaneous melanoma.
Synapsen als Stellen der Kommunikation zwischen Neuronen besitzen spezialisierte Bereiche – Aktive Zonen (AZs) genannt –, die aus einem hoch komplexen Netzwerk von Proteinen aufgebaut sind und die Maschinerie für den Prozess der Neurotransmitter-Ausschüttung und das Vesikel-Recycling beinhalten. In Drosophila ist das Protein Bruchpilot (BRP) ein wichtiger Baustein für die T-förmigen Bänder („T-Bars“) der präsynaptischen Aktiven Zonen. BRP ist notwendig für eine intakte Struktur der Aktiven Zone und eine normale Exocytose von Neurotransmitter-Vesikeln. Auf der Suche nach Mutationen, welche die Verteilung von Bruchpilot im Gewebe beeinträchtigen, wurde eine P-Element-Insertion im Gen CG11489 an der Position 79D identifiziert, welches eine Kinase kodiert, die einen hohen Grad an Homologie zur Familie der SR Proteinkinasen (SRPKs) von Säugern aufweist. Die Mitglieder dieser Familie zeichnen sich durch eine evolutionär hoch konservierte zweigeteilte Kinasedomäne aus, die durch eine nicht konservierte Spacer-Sequenz unterbrochen ist. SRPKs phosphorylieren SR-Proteine, die zu einer evolutionär hoch konservierten Familie Serin/Arginin-reicher Spleißfaktoren gehören und konstitutive sowie alternative Spleißprozesse steuern und damit auf post-transkriptioneller Ebene die Genexpression regulieren. Mutation des Srpk79D-Gens durch die P-Element-Insertion (Srpk79DP1) oder eine Deletion im Gen (Srpk79DVN Nullmutante) führt zu auffälligen BRP-Akkumulationen in larvalen und adulten Nerven. In der vorliegenden Arbeit wird gezeigt, dass diese BRP-Akkumulationen auf Ultrastruktur-Ebene ausgedehnten axonalen Agglomeraten elektronendichter Bänder entsprechen und von klaren Vesikeln umgeben sind. Charakterisierung durch Immuno-Elektronenmikroskopie ergab, dass diese Strukturen BRP-immunoreaktiv sind. Um die Bildung BRP-enthaltender Agglomerate in Axonen zu verhindern und damit eine intakte Gehirnfunktion zu gewährleisten, scheint die SRPK79D nur auf niedrigem Niveau exprimiert zu werden, da die endogene Kinase mit verschiedenen Antikörpern nicht nachweisbar war. Wie in anderen Arbeiten gezeigt werden konnte, ist die Expression der PB-, PC- oder PF-Isoform der vier möglichen SRPK79D-Varianten, die durch alternativen Transkriptionsstart in Exon eins beziehungsweise drei und alternatives Spleißen von Exon sieben zustande kommen, zur Rettung des Phänotyps der BRP-Akkumulation im Srpk79DVN Nullmutanten-Hintergrund ausreichend. Zur Charakterisierung der Rescue-Eigenschaften der SRPK79D-PE-Isoform wurde mit der Klonierung der cDNA in einen UAS-Vektor begonnen. Offenbar beruht die Bildung der axonalen BRP-Agglomerate nicht auf einer Überexpression von BRP in den betroffenen Neuronen, denn auch bei reduzierter Expression des BRP-Proteins im Srpk79DVN Nullmutanten-Hintergrund entstehen die BRP-Agglomerate. In Köpfen der Srpk79DVN Nullmutante ist die Gesamtmenge an Bruchpilot-Protein im Vergleich zum Wildtyp nicht deutlich verändert. Auch die auf Protein-Ebene untersuchte Expression der verschiedenen Isoformen der präsynaptischen Proteine Synapsin, Sap47 und CSP weicht in der Srpk79DVN Nullmutante nicht wesentlich von der Wildtyp-Situation ab, sodass sich keine Hinweise auf verändertes Spleißen der entsprechenden prä-mRNAs ergeben. Jedes der sieben bekannten SR-Proteine von Drosophila ist ein potentielles Zielprotein der SRPK79D. Knock-down-Experimente für die drei hier untersuchten SR-Proteine SC35, X16/9G8 und B52/SRp55 im gesamten Nervensystem durch RNA-Interferenz zeigten allerdings keinen Effekt auf die Verteilung von BRP im Gewebe. Hinsichtlich der Flugfähigkeit der Tiere hat die Srpk79DVN Nullmutation keinen additiven Effekt zum Knock-down des BRP-Proteins, denn die Doppelmutanten zeigten bei der Bestimmung des Anteils an flugunfähigen Tieren vergleichbare Werte wie die Einzelmutanten, die entweder die Nullmutation im Srpk79D-Gen trugen, oder BRP reduziert exprimierten. Vermutlich sind Bruchpilot und die SR Proteinkinase 79D somit Teil desselben Signalwegs. Durch Doppelfärbungen mit Antikörpern gegen BRP und CAPA-Peptide wurde abschließend entdeckt, dass Bruchpilot auch im Median- und Transvers-Nervensystem (MeN/TVN) von Drosophila zu finden ist, welche die Neurohämal-Organe beherbergen. Aufgabe dieser Organe ist die Speicherung und Ausschüttung von Neuropeptid-Hormonen. Daher ist zu vermuten, dass das BRP-Protein neben Funktionen bei der Neurotransmitter-Exocytose möglicherweise eine Rolle bei der Ausschüttung von Neuropeptiden spielt. Anders als in den Axonen der larvalen Segmental- und Intersegmentalnerven der Srpk79DVN Nullmutante, die charakteristische BRP-Agglomerate aufweisen, hat die Mutation des Srpk79D-Gens in den Axonen der Va-Neurone, die das MeN/TVN-System bilden, keinen sichtbaren Effekt auf die Verteilung von Brp, denn das Muster bei Färbung gegen BRP weist keine deutlichen Veränderungen zum Wildtyp auf.
Cardiovascular disease is the leading cause of mortality in both men and women in the Western world. Earlier observations have pointed out that pre-menopausal women have a lower risk of developing cardiovascular disease than age-matched men, with an increase in risk after the onset of menopause. This observation has directed the attention to estrogen as a potential protective factor in the heart. So far the focus of research and clinical studies has been the vascular system, leaving the current knowledge on the role of estrogen in the myocardium itself rather scarce. Functional estrogen receptor-alpha as well as -beta have recently been identified in the myocardium, making the myocardium an estrogen target organ. The focus of this thesis was 1) to investigate the role of estrogen and estrogen receptors in modulating myocardial gene expression both in vivo in an animal model for cardiac hypertrophy (spontaneously hypertensive rats; SHR), as well as in vitro in isolated neonatal cardiomyocytes, 2) to investigate the mechanisms of the rapid induction of an estrogen target gene, the early growth response gene-1 (Egr-1) and 3) to initiate the search for novel estrogen target genes in the myocardium. 1) The effects of estrogen on the expression of one of the major myocardial specific contractile proteins, the alpha-myosin heavy chain (alpha-MHC) have been investigated. In ovarectomised animals treated either with 17beta-estradiol alone or in combination with a specific estrogen receptor antagonist, ICI 182780, it was shown that both alpha-MHC mRNA and protein were upregulated by estrogen in an estrogen receptor specific manner. The in vivo results were confirmed in vitro in isolated neonatal cardiomyocytes which showed that estrogen has a direct action on the myocardium potent enough to upregulate the expression of alpha-MHC. Furthermore it was shown that the alpha-MHC promoter is induced by estrogen in an estrogen receptor-dependent manner and first investigations into the mechanisms involved in this upregulation identified Egr-1 as a potential transcription factor which, upon induction by estrogen, drives the expression of the alpha-MHC promoter. 2) Previously it was shown that Egr-1 is rapidly induced by estrogen in an estrogen receptor-dependent manner which was mediated via 5 serum response elements (SREs) in the promoter region and surprisingly not via the estrogen response elements (EREs). In this study it was shown that estrogen-treatment of cardiomyocytes resulted in the recruitment of serum response factor (SRF), or an antigenically related protein, to the SREs in the Egr-1 promoter, which was specifically inhibited by the estrogen receptor antagonist ICI 182780. Transfection experiments showed that estrogen induced a heterologous promoter consisting only of 5 tandem repeats of the c-fos SRE in an ER-dependent manner, which identified SREs as promoter elements able to confer an estrogen response to target genes. 3) Potentially new target genes regulated by estrogen in vivo were analysed using hearts of ovarectomised animals as well as ovarectomised animals treated with estrogen. Analyses of cDNA microarray filters containing 1250 known genes identified 24 genes that were modified by estrogen in vivo. Among these genes, some might have potentially important functions in the heart and further analyses of these genes will create a more global picture of the role and function of estrogen in the myocardium. Taken together, the results showed that estrogen does have a direct action on the myocardium both by regulating the expression of myocardial specific genes in vivo, as well as exerting rapid non-nuclear effects in cardiac myocytes. It was shown that SREs in the promoter region of genes can confer an estrogen response to genes identifying SREs as important elements in regulation of genes by estrogen. Furthermore, 24 potentially new estrogen targets were identified in the myocardium, contributing to the general understanding of estrogen action in the myocardium.
WISP3 is a member of the CCN family which comprises six members found in the 1990’s: Cysteine-rich,angiogenic inducer 61 (CYR61, CCN1), Connective tissue growth factor (CTGF, CCN2), Nephroblastoma overexpressed (NOV, CNN3) and the Wnt1 inducible signalling pathway protein 1-3 (WISP1-3, CCN4-6).They are involved in the adhesion, migration, mitogenesis, chemotaxis, proliferation, cell survival, angiogenesis, tumorigenesis, and wound healing by the interaction with different integrins and heparan sulfate proteoglycans. Until now the only member correlated to the musculoskeletal autosomal disease Progressive Pseudorheumatoid Dysplasia (PPD) is WISP3. PPD is characterised by normal embryonic development followed by cartilage degradation over time starting around the age of three to eight years. Animal studies in mice exhibited no differences between knock out or overexpression compared to wild type litter mates, thus were not able to reproduce the symptoms observed in PPD patients. Studies in vitro and in vivo revealed a role for WISP3 in antagonising BMP, IGF and Wnt signalling pathways. Since most of the knowledge of WISP3 was gained in epithelial cells, cancer cells or chondrocyte cell lines, we investigated the roll of WISP3 in primary human mesenchymal stem cells (hMSCs) as well as primary chondrocytes.
WISP3 knock down was efficiently established with three short hairpin RNAs in both cell types, displaying a change of morphology followed by a reduction in cell number. Simultaneous treatment with recombinant WISP3 was not enough to rescue the observed phenotype nor increase the endogenous expression of WISP3. We concluded that WISP3 acts as an essential survival factor, where the loss resulted in the passing of cell cycle control points followed by apoptosis. Nevertheless, Annexin V-Cy3 staining and detection of active caspases by Western blot and immunofluorescence staining detected no clear evidence for apoptosis. Furthermore, the gene expression of the death receptors TRAILR1 and TRAILR2,important for the extrinsic activation of apoptosis, remained unchanged during WISP3 mRNA reduction. Autophagy as cause of cell death was also excluded, given that the autophagy marker LC3 A/B demonstrated to be uncleaved in WISP3-deficient hMSCs. To reveal correlated signalling pathways to WISP3 a whole genome expression analyses of WISP3-deficient hMSCs compared to a control (scramble) was performed. Microarray analyses exhibited differentially regulated genes involved in cell cycle control, adhesion, cytoskeleton and cell death. Cell death observed by WISP3 knock down in hMSCs and chondrocytes might be explained by the induction of necroptosis through the BMP/TAK1/RIPK1 signalling axis. Loss of WISP3 allows BMP to bind its receptor activating the Smad 2/3/4 complex which in turn can activate TAK1 as previously demonstrated in epithelial cells. TAK1 is able to block
caspase-dependent apoptosis thereby triggering the assembly of the necrosome resulting in cell death by necroptosis.
Together with its role in cell cycle control and extracellular matrix adhesion, as demonstrated in human mammary epithelial cells, the data supports the role of WISP3 as tumor suppressor and survival factor in cells of the musculoskeletal system as well as epithelial cells.
Ziel dieser Arbeit ist es ein besseres Verständinis der molekularen Prozesse der Melanomentstehung und Tumorprogression zu gewinnen. Hierfür wurde ein Tiermodell transgener Medakas (Oryzias latipes) verwendet, welche als stabiles Transgen das Konstrukt mitf::xmrk besitzen. Diese Fische entwickelten Pigmentzelltumore, welche für eine Microarrayanalyse herangezogen wurden. Aus diesem Microarraydatensatz wurden 11 Gene ausgewählt, welche in dieser Arbeit näher untersucht wurden. Beobachtungen haben ergeben, dass sich bei transgenen Medakas, welche Xmrk exprimieren, verschiedene pigmentierte Hauttumore entwickeln. Diese Tumore wurden je nach ihrem verschiedenen Histiotyp klassifiziert und untersucht. Um einen Eindruck zu gewinnen, wie Xmrk die Transkription verschiedener Gene, welche in der Krebsentstehung und –progression eine wichtige Rolle spielen, beeinflusst, wurden pigmentierte Hauttumore transgener Medakas, so wie zu Vergleichszwecken hyperpigmentierte Haut transgener Medakas und Lymphome und gesunde Organe von Wildtyp-Medakas, untersucht. Mit Hilfe von Real-time-PCR’s wurden die folgenden Gene untersucht: G6PC, GAMT, GM2A, MAPK3, NID1, SLC24A5, SPP1, PDIA4, RASL11B, TACC2 und ZFAND5. Dabei konnte festgestellt werden, dass die Expression der Gene GM2A, MAPK3, NID1, PDIA4, RASL11B, SLC24A5 und ZFAND5 von Xmrk beeinflusst wird, während dies für die Gene G6PC, GAMT, SPP1 und TACC2 nicht zutrifft. Im Vergleich zu gesunder Haut werden GM2A, MAPK3, PDIA4, RASL11B, SLC24A5 und ZFAND5 in Tumoren höher exprimiert. Die Gene G6PC, GAMT, NID1, SPP1 und TACC2 werden dagegen verglichen mit gesunder Haut unverändert oder niedriger exprimiert. Die Bedeutung der erhöhten Genexpression lässt sich in vielen Fällen zurzeit nur theoretisch erfassen. Eine höhere Expression von SLC24A5 beispielsweise lässt vermuten, dass ein Zusammenhang zwischen der Melaninproduktion und der Zellproliferation besteht. Die Überexpression von GM2A weist dagegen auf eine Rolle von GM2A als Tumormarker hin. Dahingegen scheint die erniedrigte Expression von GAMT und G6PC Auskunft über den veränderten Stoffwechsel in Tumoren zu geben. Um diese Ergebnisse zu bestätigen und zu entschlüsseln wie genau Xmrk die Expression der getesteten Gene beeinflusst, sind allerdings noch weitere funktionelle Studien nötig. Generell kommt man zu dem Schluss, dass die Genexpression sich in jedem Tumor unterscheidet. Daher scheint jeder Tumor seinen eigenen Evolutionsweg zu beschreiten.
Das Ziel der vorliegenden Dissertation war die Entwicklung neuartige Ansätze zur Identifizierung von biologisch aktiven Wirkstoffen, die in die Metamorphose von holometabolen Insekten eingreifen. Hexamerine und Neuropeptide besitzen sehr unterschiedliche Funktionen. Während Neuropeptide zusammen mit anderen Gewebshormonen auf einer übergeordneten regulatorischen Ebene wirken, sind Hexamerine als Speicher- und Verteidigungsproteine ein Endglied dieser hormonellen Regulationskaskade. In der vorliegenden Arbeit wurden zwei Fragestellungen bearbeitet: 1) Im ersten Projekt sollten allatotrope Substanzen im Gehirn der großen Wachsmotte Galleria mellonella durch Screening einer Expressionsbibliothek mit polyklonalen Antiseren identifiziert werden. Dabei wurde das Neuropeptid Corazonin identifiziert. Die vollständige Corazonin-mRNA wurde kloniert und sequenziert. Das Expressionsmuster der Corazonin-mRNA und des Peptids wurde mittels Northern-Analyse und in-situ-Hybridisierung charakterisiert. Corazonin wird in vier Zellpaaren, die zu den lateralen neurosekretorischen Zellen gehören, exprimiert. Die Axone dieser Zellen verlaufen ipsilateral zu den Nervi corpori cardiaci I+II, feine Fasern verzweigen sich in die am Ösophagus angrenzende Hirnregion hinein. Corazonin wird offensichtlich an den Axon- Endigungen in den Corpora cardiaca in die Hämolymphe freigesetzt. Einige feine Fasern enden in den Corpora allata bzw. am Vorderdarm. Der Nachweis, dass Corazonin tatsächlich eine allatotrope Wirkung hat, konnte nicht erbracht werden. 2) Die Protein/Protein-Interaktion zwischen Hexamerinen und dem Hexamerinrezeptor der Schmeißfliege Calliphora vicina wurde durch Two-Hybrid-Experimenten analysiert. Durch Interaktionstest mit trunkierten Proteinfragmenten wurden die Bindungsdomänen beider Proteine kartiert. Als rezeptorbindende Domäne des Arylphorins wurde ein 49 AS großes Peptid in der Domäne-3 des Arylphorin- Monomers identifiziert. Die Ligandenbindungsdomäne des Hexamerinrezeptors wurde in den ersten 24 AS des N-Terminus kartiert. Ausgehend von diesen Ergebnissen wurde ein HTS-Protokoll entwickelt, das zur Identifizierung von Substanzen verwendet werden kann, welche die Bindung dieser beiden Proteine beeinflussen. Eine Two-Hybrid-Bibliothek wurde ausgehend von 7dL-Fettkörper-RNA konstruiert und mit "Hexamerinrezeptor-Ködern" gescreent. Dabei wurden zwei neue Interaktionspartner des Hexamerinrezeptors gefunden und genauer charakterisiert. Der erste identifizierte Interaktionspartner - d-AP-3 - ist Teil eines Adaptin- Komplexes, der als Adapter zwischen membranständigen Rezeptoren und Clathrin oder ähnlichen Proteinen an der rezeptorvermittelten Endozytose beteiligt ist. Die Adaptin-Interaktionsdomäne liegt innerhalb des ABP64-Spaltprodukts des Hexamerinrezeptors. Die Funktion des zweiten Interaktionspartners - AFP - ist unbekannt. AFP wird im anterioren Teil des Fettkörpers und in Hämozyten exprimiert. Die Interaktion zwischen dem Hexamerinrezeptor und AFP ist demnach auf diesen Teil des Fettkörpers beschränkt. Die mit AFP interagierende Domäne des Hexamerinrezeptors liegt innerhalb des P30-Spaltprodukts.
Im Genom von Listeria monocytogenes konnten zwei Gene identifiziert werden, die mutmaßlich für niedermolekulare Protein-Tyrosin Phosphatasen (LMW-PTPs) kodieren, Lmo0938/Ptp-1 und Lmo2540/Ptp-2, beide ähneln LMW-PTPs von B. subtilis. Einzel- und Doppeldeletionen der ptp-Gene beeinflussten die Transkription zahlreicher Gene, wie anhand von Gesamtgenom-DNA-Microarray-Analysen und quantitativer RT-PCR gezeigt werden konnten. Insbesondere waren die Gene für i) die Internaline A und B, ii) den Osmoprotektanten-Transporter OpuC, iii) MCP, notwendig zur Flagellen-Bewegung und iv) eine Anzahl von den Proteinen, die in die Nährstoffaufnahme sowie den intrazellulären Metabolismus involviert sind, in vitro herunterreguliert. Die PrfA-regulierten Virulenzgene wurden in den Mutanten verstärkt exprimiert. Im Wesentlichen konnte das gleiche Transkriptionsmuster in infizierten Caco-2-Enterocyten beobachtet werden. Die verringerte Invasivität (abhängig von InlA) und die Unbeweglichkeit der Mutanten passt zu den Transkriptionsergebnissen. Jedoch wurden weder die intrazelluläre Replikation innerhalb eukaryontischer Wirtszellen noch die Resistenz gegen Stressbedingungen durch die Deletion beeinträchtigt. Die Proteome des Wildtyps und der ptp-Mutanten wurden durch 2-dimensionale Gelelektrophorese verglichen und es zeigte sich, dass die Transkriptionsergebnisse nicht vollständig im Proteom reflektiert wurden. Die Ergebnisse zeigen, dass die Ptps in die Regulationsnetzwerke des alternativen Stress-Sigmafaktor SigB und von PrfA eingreifen. Der ähnliche Effekt beider Ptps auf die Transkription oder auf den Proteinlevel deutet eine Interaktion oder Kooperation der beiden Enzyme an.
Recent progresses and developments in molecular biology provide a wealth of new but insufficiently characterised data. This fund comprises amongst others biological data of genomic DNA, protein sequences, 3-dimensional protein structures as well as profiles of gene expression. In the present work, this information is used to develop new methods for the characterisation and classification of organisms and whole groups of organisms as well as to enhance the automated gain and transfer of information. The first two presented approaches (chapters 4 und 5) focus on the medically and scientifically important enterobacteria. Its impact in medicine and molecular biology is founded in versatile mechanisms of infection, their fundamental function as a commensal inhabitant of the intestinal tract and their use as model organisms as they are easy to cultivate. Despite many studies on single pathogroups with clinical distinguishable pathologies, the genotypic factors that contribute to their diversity are still partially unknown. The comprehensive genome comparison described in Chapter 4 was conducted with numerous enterobacterial strains, which cover nearly the whole range of clinically relevant diversity. The genome comparison constitutes the basis of a characterisation of the enterobacterial gene pool, of a reconstruction of evolutionary processes and of comprehensive analysis of specific protein families in enterobacterial subgroups. Correspondence analysis, which is applied for the first time in this context, yields qualitative statements to bacterial subgroups and the respective, exclusively present protein families. Specific protein families were identified for the three major subgroups of enterobacteria namely the genera Yersinia and Salmonella as well as to the group of Shigella and E. coli by applying statistical tests. In conclusion, the genome comparison-based methods provide new starting points to infer specific genotypic traits of bacterial groups from the transfer of functional annotation. Due to the high medical importance of enterobacterial isolates their classification according to pathogenicity has been in focus of many studies. The microarray technology offers a fast, reproducible and standardisable means of bacterial typing and has been proved in bacterial diagnostics, risk assessment and surveillance. The design of the diagnostic microarray of enterobacteria described in chapter 5 is based on the availability of numerous enterobacterial genome sequences. A novel probe selection strategy based on the highly efficient algorithm of string search, which considers both coding and non-coding regions of genomic DNA, enhances pathogroup detection. This principle reduces the risk of incorrect typing due to restrictions to virulence-associated capture probes. Additional capture probes extend the spectrum of applications of the microarray to simultaneous diagnostic or surveillance of antimicrobial resistance. Comprehensive test hybridisations largely confirm the reliability of the selected capture probes and its ability to robustly classify enterobacterial strains according to pathogenicity. Moreover, the tests constitute the basis of the training of a regression model for the classification of pathogroups and hybridised amounts of DNA. The regression model features a continuous learning capacity leading to an enhancement of the prediction accuracy in the process of its application. A fraction of the capture probes represents intergenic DNA and hence confirms the relevance of the underlying strategy. Interestingly, a large part of the capture probes represents poorly annotated genes suggesting the existence of yet unconsidered factors with importance to the formation of respective virulence phenotypes. Another major field of microarray applications is gene expression analysis. The size of gene expression databases rapidly increased in recent years. Although they provide a wealth of expression data, it remains challenging to integrate results from different studies. In chapter 6 the methodology of an unsupervised meta-analysis of genome-wide A. thaliana gene expression data sets is presented, which yields novel insights in function and regulation of genes. The application of kernel-based principal component analysis in combination with hierarchical clustering identified three major groups of contrasts each sharing overlapping expression profiles. Genes associated with two groups are known to play important roles in Indol-3 acetic acid (IAA) mediated plant growth and development as well as in pathogen defence. Yet uncharacterised serine-threonine kinases could be assigned to novel functions in pathogen defence by meta-analysis. In general, hidden interrelation between genes regulated under different conditions could be unravelled by the described approach. HMMs are applied to the functional characterisation of proteins or the detection of genes in genome sequences. Although HMMs are technically mature and widely applied in computational biology, I demonstrate the methodical optimisation with respect to the modelling accuracy on biological data with various distributions of sequence lengths. The subunits of these models, the states, are associated with a certain holding time being the link to length distributions of represented sequences. An adaptation of simple HMM topologies to bell-shaped length distributions described in chapter 7 was achieved by serial chain-linking of single states, while residing in the class of conventional HMMs. The impact of an optimisation of HMM topologies was underlined by performance evaluations with differently adjusted HMM topologies. In summary, a general methodology was introduced to improve the modelling behaviour of HMMs by topological optimisation with maximum likelihood and a fast and easily implementable moment estimator. Chapter 8 describes the application of HMMs to the prediction of interaction sites in protein domains. As previously demonstrated, these sites are not trivial to predict because of varying degree in conservation of their location and type within the domain family. The prediction of interaction sites in protein domains is achieved by a newly defined HMM topology, which incorporates both sequence and structure information. Posterior decoding is applied to the prediction of interaction sites providing additional information of the probability of an interaction for all sequence positions. The implementation of interaction profile HMMs (ipHMMs) is based on the well established profile HMMs and inherits its known efficiency and sensitivity. The large-scale prediction of interaction sites by ipHMMs explained protein dysfunctions caused by mutations that are associated to inheritable diseases like different types of cancer or muscular dystrophy. As already demonstrated by profile HMMs, the ipHMMs are suitable for large-scale applications. Overall, the HMM-based method enhances the prediction quality of interaction sites and improves the understanding of the molecular background of inheritable diseases. With respect to current and future requirements I provide large-scale solutions for the characterisation of biological data in this work. All described methods feature a highly portable character, which allows for the transfer to related topics or organisms, respectively. Special emphasis was put on the knowledge transfer facilitated by a steadily increasing wealth of biological information. The applied and developed statistical methods largely provide learning capacities and hence benefit from the gain of knowledge resulting in increased prediction accuracies and reliability.
Various types of cancer involve aberrant cell cycle regulation. Among the pathways responsible for tumor growth, the YAP oncogene, a key downstream effector of the Hippo pathway, is responsible for oncogenic processes including cell proliferation, and metastasis by controlling the expression of cell cycle genes. In turn, the MMB multiprotein complex (which is formed when B-MYB binds to the MuvB core) is a master regulator of mitotic gene expression, which has also been associated with cancer. Previously, our laboratory identified a novel crosstalk between the MMB-complex and YAP. By binding to enhancers of MMB target genes and promoting B-MYB binding to promoters, YAP and MMB co-regulate a set of mitotic and cytokinetic target genes which promote cell proliferation. This doctoral thesis addresses the mechanisms of YAP and MMB mediated transcription, and it characterizes the role of YAP regulated enhancers in transcription of cell cycle genes.
The results reported in this thesis indicate that expression of constitutively active, oncogenic YAP5SA leads to widespread changes in chromatin accessibility in untransformed human MCF10A cells. ATAC-seq identified that newly accessible and active regions include YAP-bound enhancers, while the MMB-bound promoters were found to be already accessible and remain open during YAP induction. By means of CRISPR-interference (CRISPRi) and chromatin immuniprecipitation (ChIP), we identified a role of YAP-bound enhancers in recruitment of CDK7 to MMB-regulated promoters and in RNA Pol II driven transcriptional initiation and elongation of G2/M genes. Moreover, by interfering with the YAP-B-MYB protein interaction, we can show that binding of YAP to B-MYB is also critical for the initiation of transcription at MMB-regulated genes. Unexpectedly, overexpression of YAP5SA also leads to less accessible chromatin regions or chromatin closing. Motif analysis revealed that the newly closed regions contain binding motifs for the p53 family of transcription factors. Interestingly, chromatin closing by YAP is linked to the reduced expression and loss of chromatin-binding of the p53 family member Np63. Furthermore, I demonstrate that downregulation of Np63 following expression of YAP is a key step in driving cellular migration.
Together, the findings of this thesis provide insights into the role of YAP in the chromatin changes that contribute to the oncogenic activities of YAP. The overexpression of YAP5SA not only leads to the opening of chromatin at YAP-bound enhancers which together with the MMB complex stimulate the expression of G2/M genes, but also promotes the closing of chromatin at ∆Np63 -bound regions in order to lead to cell migration.
In this thesis, the development of a phylogenetic DNA microarray, the analysis of several gene expression microarray datasets and new approaches for improved data analysis and interpretation are described. In the first publication, the development and analysis of a phylogenetic microarray is presented. I could show that species detection with phylogenetic DNA microarrays can be significantly improved when the microarray data is analyzed with a linear regression modeling approach. Standard methods have so far relied on pure signal intensities of the array spots and a simple cutoff criterion was applied to call a species present or absent. This procedure is not applicable to very closely related species with high sequence similarity because cross-hybridization of non-target DNA renders species detection impossible based on signal intensities alone. By modeling hybridization and cross-hybridization with linear regression, as I have presented in this thesis, even species with a sequence similarity of 97% in the marker gene can be detected and distinguished from related species. Another advantage of the modeling approach over existing methods is that the model also performs well on mixtures of different species. In principle, also quantitative predictions can be made. To make better use of the large amounts of microarray data stored in public databases, meta-analysis approaches need to be developed. In the second publication, an explorative meta-analysis exemplified on Arabidopsis thaliana gene expression datasets is presented. Integrating datasets studying effects such as the influence of plant hormones, pathogens and different mutations on gene expression levels, clusters of similarly treated datasets could be found. From the clusters of pathogen-treated and indole-3-acetic acid (IAA) treated datasets, representative genes were selected which pointed to functions which had been associated with pathogen attack or IAA effects previously. Additionally, hypotheses about the functions of so far uncharacterized genes could be set up. Thus, this kind of meta-analysis could be used to propose gene functions and their regulation under different conditions. In this work, also primary data analysis of Arabidopsis thaliana datasets is presented. In the third publication, an experiment which was conducted to find out if microwave irradiation has an effect on the gene expression of a plant cell culture is described. During the first steps, the data analysis was carried out blinded and exploratory analysis methods were applied to find out if the irradiation had an effect on gene expression of plant cells. Small but statistically significant changes in a few genes were found and could be experimentally confirmed. From the functions of the regulated genes and a meta-analysis with publicly available microarray data, it could be suspected that the plant cell culture somehow perceived the irradiation as energy, similar to perceiving light rays. The fourth publication describes the functional analysis of another Arabidopsis thaliana gene expression dataset. The gene expression data of the plant tumor dataset pointed to a switch from a mainly aerobic, auxotrophic to an anaerobic and heterotrophic metabolism in the plant tumor. Genes involved in photosynthesis were found to be repressed in tumors; genes of amino acid and lipid metabolism, cell wall and solute transporters were regulated in a way that sustains tumor growth and development. Furthermore, in the fifth publication, GEPAT (Genome Expression Pathway Analysis Tool), a tool for the analysis and integration of microarray data with other data types, is described. It consists of a web application and database which allows comfortable data upload and data analysis. In later chapters of this thesis (publication 6 and publication 7), GEPAT is used to analyze human microarray datasets and to integrate results from gene expression analysis with other datatypes. Gene expression and comparative genomic hybridization data from 71 Mantle Cell Lymphoma (MCL) patients was analyzed and allowed proposing a seven gene predictor which facilitates survival predictions for patients compared to existing predictors. In this study, it was shown that CGH data can be used for survival predictions. For the dataset of Diffuse Large B-cell lymphoma (DLBCL) patients, an improved survival predictor could be found based on the gene expression data. From the genes differentially expressed between long and short surviving MCL patients as well as for regulated genes of DLBCL patients, interaction networks could be set up. They point to differences in regulation for cell cycle and proliferation genes between patients with good and bad prognosis.