Refine
Has Fulltext
- yes (28)
Is part of the Bibliography
- yes (28)
Year of publication
Document Type
- Doctoral Thesis (28)
Keywords
- miRNS (5)
- SMN (4)
- Genexpression (3)
- Messenger-RNS (3)
- Small nuclear RNP (3)
- Biogenese (2)
- Messenger-RNP (2)
- Multiproteinkomplex (2)
- RNS (2)
- SMN complex (2)
Institute
- Graduate School of Life Sciences (18)
- Lehrstuhl für Biochemie (9)
- Fakultät für Biologie (3)
- Fakultät für Chemie und Pharmazie (2)
- Institut für Organische Chemie (2)
- Institut für Klinische Neurobiologie (1)
- Institut für Molekulare Infektionsbiologie (1)
- Medizinische Fakultät (1)
- Neurologische Klinik und Poliklinik (1)
- Theodor-Boveri-Institut für Biowissenschaften (1)
Sonstige beteiligte Institutionen
In mammals, a major fraction of the genome is transcribed as non-coding RNAs. An increasing amount of evidence has accumulated showing that non-coding RNAs play important roles both for normal cell function and in disease processes such as cancer or neurodegeneration. Interpreting the functions of non-coding RNAs and the molecular mechanisms through which they act is one of the most important challenges facing RNA biology today.
In my Ph.D. thesis, I have been investigating the role of 7SK, one of the most abundant non-coding RNAs, in the development and function of motoneurons. 7SK is a highly structured 331 nt RNA transcribed by RNA polymerase III. It forms four stem-loop (SL) structures that serve as binding sites for different proteins. Larp7 binds to SL4 and protects the 3' end from exonucleolytic degradation. SL1 serves as a binding site for HEXIM1, which recruits the pTEFb complex composed of CDK9 and cyclin T1. pTEFb has a stimulatory role for transcription and is regulated through sequestration by 7SK. More recently, a number of heterogeneous nuclear ribonucleoproteins (hnRNPs) have been identified as 7SK interactors. One of these is hnRNP R, which has been shown to have a role in motoneuron development by regulating axon growth. Taken together, 7SK’s function involves interactions with RNA binding proteins, and different RNA binding proteins interact with different regions of 7SK, such that 7SK can be considered as a hub for recruitment and release of different proteins. The questions I have addressed during my Ph.D. are as follows: 1) which region of 7SK interacts with hnRNP R, a main interactor of 7SK? 2) What effects occur in motoneurons after the protein binding sites of 7SK are abolished? 3) Are there additional 7SK binding proteins that regulate the functions of the 7SK RNP?
Using in vitro and in vivo experiments, I found that hnRNP R binds both the SL1 and SL3 region of 7SK, and also that pTEFb cannot be recruited after deleting the SL1 region but is able to bind to a 7SK mutant with deletion of SL3. In order to answer the question of how the 7SK mutations affect axon outgrowth and elongation in mouse primary motoneurons, we proceeded to conduct rescue experiments in motoneurons by using lentiviral vectors. The constructs were designed to express 7SK deletion mutants under the mouse U6 promoter and at the same time to drive expression of a 7SK shRNA from an H1 promoter for the depletion of endogenous 7SK. Using this system we found that 7SK mutants harboring deletions of either SL1 or SL3 could not rescue the axon growth defect of 7SK-depleted motoneurons suggesting that 7SK/hnRNP R complexes are integral for this process.
In order to identify novel 7SK binding proteins and investigate their functions, I proceeded to conduct pull-down experiments by using a biotinylated RNA antisense oligonucleotide that targets the U17-C33 region of 7SK thereby purifying endogenous 7SK complexes. Following mass spectrometry of purified 7SK complexes, we identified a number of novel 7SK interactors. Among these is the Smn complex. Deficiency of the Smn complex causes the motoneuron disease spinal muscular atrophy (SMA) characterized by loss of lower motoneurons in the spinal cord. Smn has previously been shown to interact with hnRNP R. Accordingly, we found Smn as part of 7SK/hnRNP R complexes. These proteomics data suggest that 7SK potentially plays important roles in different signaling pathways in addition to transcription.
The expression of genetic information into proteins is a key aspect of life. The efficient and exact regulation of this process is essential for the cell to produce the correct amounts of these effector molecules to a given situation. For this purpose, eukaryotic cells have developed many different levels of transcriptional and posttranscriptional gene regulation. These mechanisms themselves heavily rely on interactions of proteins with associated nucleic acids. In the case of posttranscriptional gene regulation an orchestrated interplay between RNA-binding proteins, messenger RNAs (mRNA), and non-coding RNAs is compulsory to achieve this important function.
A pivotal factor hereby are RNA secondary structures. One of the most stable and diverse representatives is the G-quadruplex structure (G4) implicated in many cellular mechanisms, such as mRNA processing and translation. In protein biosynthesis, G4s often act as obstacles but can also assist in this process. However, their presence has to be tightly regulated, a task which is often fulfilled by helicases.
One of the best characterized G4-resolving factors is the DEAH-box protein DHX36. The in vitro function of this helicase is extensively described and individual reports aimed to address diverse cellular functions as well. Nevertheless, a comprehensive and systems-wide study on the function of this specific helicase was missing, so far.
The here-presented doctoral thesis provides a detailed view on the global cellular function of DHX36. The binding sites of this helicase were defined in a transcriptome-wide manner, a consensus binding motif was deviated, and RNA targets as well as the effect this helicase exerts on them were examined. In human embryonic kidney cells, DHX36 is a mainly cytoplasmic protein preferentially binding to G-rich and G4-forming sequence motifs on more than 4,500 mRNAs. Loss of DHX36 leads to increased target mRNA levels whereas ribosome occupancy on and protein output of these transcripts are reduced. Furthermore, DHX36 knockout leads to higher RNA G4 levels and concomitant stress reactions in the cell. I hypothesize that, upon loss of this helicase, translationally-incompetent structured DHX36 target mRNAs, prone to localize in stress granules, accumulate in the cell. The cell reacts with basal stress to avoid cytotoxic effects produced by these mis-regulated and structured transcripts.
The biogenesis of spliceosomal UsnRNPs is a highly elaborate cellular process that occurs both in the nucleus and the cytoplasm. A major part of the process is the assembly of the Sm-core particle, which consists of a ring shaped heptameric unit of seven Sm proteins (SmD1•D2•F•E•G•D3•B) wrapped around a single stranded RNA motif (termed Sm-site) of spliceosomal UsnRNAs. This process occurs mainly in the cytoplasm by the sequential action of two biogenesis factors united in PRMT5- and SMN-complexes, respectively. The PRMT5-complex composed of the three proteins PRMT5, WD45 and pICln is responsible for the symmetric dimethylation of designated arginine residues in the C-terminal tails of some Sm proteins. The action of the PRMT5- complex results in the formation of assembly incompetent Sm-protein intermediates sequestered by the assembly chaperone pICln (SmD1•D2•F•E•G•pICln and pICln•D3•B). Due to the action of pICln, the Sm proteins in these complexes fail to interact with UsnRNAs to form the mature Sm-core. This kinetic trap is relieved by the action of the SMN-complex, which removes the pICln subunit and facilitates the binding of the Sm-core intermediates to the UsnRNA, thus forming the mature Sm-core particle. The human SMN complex consists of 9 subunits termed SMN, Gemin2-8 and Unrip. So far, there are no available atomic structures of the whole SMN-complex, but structures of isolated domains and subunits of the complex have been reported by several laboratories in the past years. The lack of structural information about the entire SMN complex most likely lies in the biophysical properties of the SMN complex, which possesses an oligomeric SMN core, and many unstructured and flexible regions. These were the biggest roadblocks for its structural elucidation using traditional methods such as X-ray crystallography, NMR or CryoEM. To circumvent these obstacles and to obtain structural insight into the SMN-complex, the Schizosaccharomyces pombe SMN complex was used as a model system in this work. In a collaboration with the laboratory of Dr. Remy Bordonne (IGMM, CNRS, France), we could show that the SpSMN complex is minimalistic in its composition, consisting only of SpSMN, SpGemin2, SpGemin8, SpGemin7 and SpGemin6. Using biochemical experiments, an interaction map of the SpSMN complex was established which was found to be highly similar to the reported map of the human SMN complex. The results of this study clearly show that SpSMN is the oligomeric core of the complex and provides the binding sites for the rest of the subunits. Through biochemical and X-ray scattering experiments, the properties of the SpSMN subunit such as oligomerization viii and intrinsic disorder, were shown to determine the overall biophysical characteristics of the whole complex. The structural basis of SpSMN oligomerization is presented in atomic detail which establishes a dimeric SpSMN as the fundamental unit of higher order SpSMN oligomers. In addition to oligomerization, the YG-box domain of SpSMN serves as the binding site for SpGemin8. The unstructured region of SpSMN imparts an unusual large hydrodynamic size, intrinsic disorder, and flexibility to the whole complex. Interestingly, these biophysical properties are partially mitigated by the presence of SpGemin8•SpGemin7•SpGemin6 subunits. These results classify the SpSMN complex as a multidomain entity connected with flexible linkers and characterize the SpSMN subunit to be the central oligomeric structural organizer of the whole complex.
The transcription factor MYC is a onco-protein, found to be deregulated in many human cancers. High MYC levels correlate with an aggressive tumor outcome and poor survival rates. Despite MYC being discovered as an oncogene already in the 1970s, how MYC regulates transcription of its target genes, which are involved in cellular growth and proliferation, is not fully understood yet.
In this study, the question how MYC influences factors interacting with the RNA polymerase II ensuring productive transcription of its target genes was addressed using quantitative mass spectrometry. By comparing the interactome of RNA polymerase II under varying MYC levels, several potential factors involved in transcriptional elongation were identified. Furthermore, the question which of those factors interact with MYC was answered by employing quantitative mass spectrometry of MYC itself. Thereby, the direct interaction of MYC with the transcription elongation factor SPT5, a subunit of the DRB-sensitivity inducing factor, was discovered and analyzed in greater detail. SPT5 was shown to be recruited to chromatin by MYC. In addition, the interaction site of MYC on SPT5 was narrowed down to its evolutionary conserved NGN-domain, which is the known binding site for SPT4, the earlier characterized second subunit of the DRB-sensitivity inducing factor. This finding suggests a model in which MYC and SPT4 compete for binding the NGN-domain of SPT5.
Investigations of the SPT5-interacting region on MYC showed binding of SPT5 to MYC’s N-terminus including MYC-boxes 0, I and II.
In order to analyze proteins interacting specifically with the N-terminal region of MYC, a truncated MYC-mutant was used for quantitative mass spectrometric analysis uncovering reduced binding for several proteins including the well-known interactor TRRAP and TRRAP-associated complexes.
Summarized, ...
Complex formation between macromolecules constitutes the foundation of most cellular processes. Most known complexes are made up of two or more proteins interacting in order to build a functional entity and therefore enabling activities which
the single proteins could otherwise not fulfill. With the increasing knowledge about
noncoding RNAs (ncRNAs) it has become evident that, similar to proteins, many of
them also need to form a complex to be functional. This functionalization is usually executed by specific or global RNA-binding proteins (RBPs) that are specialized
binders of a certain class of ncRNAs. For instance, the enterobacterial global RBPs
Hfq and ProQ together bind >80 % of the known small regulatory RNAs (sRNAs),
a class of ncRNAs involved in post-transcriptional regulation of gene expression.
However, identification of RNA-protein interactions so far was performed individually by employing low-throughput biochemical methods and thereby hindered the discovery of such interactions, especially in less studied organisms such
as Gram-positive bacteria. Using gradient profiling by sequencing (Grad-seq), the
present thesis aimed to establish high-throughput, global RNA/protein complexome resources for Escherichia coli and Streptococcus pneumoniae in order to provide a
new way to investigate RNA-protein as well as protein-protein interactions in these
two important model organisms.
In E. coli, Grad-seq revealed the sedimentation profiles of 4,095 (∼85 % of
total) transcripts and 2,145 (∼49 % of total) proteins and with that reproduced
its major ribonucleoprotein particles. Detailed analysis of the in-gradient distribution of the RNA and protein content uncovered two functionally unknown
molecules—the ncRNA RyeG and the small protein YggL—to be ribosomeassociated. Characterization of RyeG revealed it to encode for a 48 aa long, toxic protein that drastically increases lag times when overexpressed. YggL was shown to
be bound by the 50S subunit of the 70S ribosome, possibly indicating involvement
of YggL in ribosome biogenesis or translation of specific mRNAs.
S. pneumoniae Grad-seq detected 2,240 (∼88 % of total) transcripts and 1,301
(∼62 % of total) proteins, whose gradient migration patterns were successfully reconstructed, and thereby represents the first RNA/protein complexome resource
of a Gram-positive organism. The dataset readily verified many conserved major
complexes for the first time in S. pneumoniae and led to the discovery of a specific
interaction between the 3’!5’ exonuclease Cbf1 and the competence-regulating ciadependent sRNAs (csRNAs). Unexpectedly, trimming of the csRNAs by Cbf1 stabilized the former, thereby promoting their inhibitory function. cbf1 was further shown
to be part of the late competence genes and as such to act as a negative regulator of
competence.
Herpes Simplex Virus type 1 (HSV-1) is an ubiquitous neurotropic human pathogen that infects a large majority of the world’s population. It is the causative agent of the common cold sore but also responsible for life-threatening infections (e.g., encephalitis), particularly in immunocompromised individuals and neonates. Like other herpesviruses, HSV-1 takes over the cellular RNA machinery to facilitate productive infection while efficiently shutting down host gene expression by targeting multiple steps of RNA metabolism. The two viral proteins, vhs and ICP27, play a crucial role in this process. Delivered by the tegument of the incoming virus, the virion host shut-off (vhs) endonuclease rapidly starts cleaving both cellular and viral mRNAs. With the onset of viral gene expression, the HSV-1 immediate-early protein ICP27 promotes the expression of viral early and late genes through various mechanisms, including mRNA processing, export, and translation.
Prior research by the Dölken lab demonstrated that lytic HSV-1 infection results in the disruption of transcription termination (DoTT) of most cellular genes by the viral ICP27 protein. This significantly contributes to HSV-1 induced host shut-off. DoTT results in transcription for tens of thousands of nucleotides beyond poly(A) sites and into downstream genes. Interestingly, this was found to be accompanied by a dramatic increase in chromatin accessibility downstream of the affected poly(A) sites. This is consistent with the formation of extensive downstream open chromatin regions (dOCR) and indicative of impaired histone repositioning in the wake of RNA polymerase II (Pol II) downstream of the affected poly(A) sites.
In my PhD thesis, I demonstrate that dOCR formation is dependent on the viral ICP22 protein when poly(A) read-through transcription is triggered by the ectopic expression of ICP27 or salt stress. I show that dOCR formation occurs when a high level of transcriptional activity arises downstream of genes due to the HSV-1-induced DoTT. To investigate whether histone composition is affected downstream of genes, I established the ChIPmentation approach to study associated changes and the influence of DoTT and dOCR formation on major histone modification marks. In HSV-1 WT infection, dOCR formation was reflected in alterations of canonical H1 histone downstream of affected genes, which was absent in ICP22 infection. To elucidate the underlying molecular mechanism, two major histone chaperones SPT6 and FACT (SPT16 and SSRP1), which govern histone repositioning and may thus play a role in H1 homeostasis, were extensively studied. Both histone chaperones have been recently shown to be recruited to the viral genome by interactions with ICP22 protein. To investigate whether the depletion of SSRP1 or SPT6 would complement the loss of ICP22 to induce dOCR, T-HF cells with doxycycline-inducible knock-down of either of the two factors were generated. ATAC-seq analysis revealed that the interaction between the two histone chaperones and ICP22 is not involved in HSV-1-induced dOCR formation, suggesting the involvement of other proteins. In summary, this work sheds new light on a fundamental molecular mechanism of the cellular transcriptional machinery that is manipulated by the concerted actions of the two HSV-1 immediate-early proteins ICP22 and ICP27.
RNA helicases are key players in the regulation of gene expression. They act by remodeling local RNA secondary structures as well as RNA-protein interactions to enable the dynamic association of RNA binding proteins to their targets. The putative RNA helicase DHX30 is a member of the family of DEAH-box helicases with a putative role in the ATP-dependent unwinding of RNA secondary structures. Mutations in the DHX30 gene causes the autosomal dominant neuronal disease “Neurodevelopmental Disorder with severe Motor Impairment and Absent Language” (NEDMIAL;OMIM#617804). In this thesis, a strategy was established that enabled the large-scale purification of enzymatically active DHX30. Through enzymatic studies performed in vitro, DHX30 was shown to act as an ATP-dependent 3’ → 5’ RNA helicase that catalyzes the unwinding of RNA:RNA and RNA:DNA substrates. Using recombinant DHX30, it could be shown that disease-causing missense mutations in the conserved helicase core caused the disruption of its ATPase and helicase activity. The protein interactome of DHX30 however, was unchanged indicating that the pathogenic missense-mutations do not cause misfolding of DHX30, but rather specifically affect its catalytic activity. DHX30 localizes predominantly in the cytoplasm where it forms a complex with ribosomes and polysomes. Using a cross-linking mass spectrometry approach, a direct interaction of the N-terminal double strand RNA binding domain of DHX30 with sites next to the ribosome’s mRNA entry channel and the subunit interface was uncovered. RNA sequencing of DHX30 knockout cells revealed a strong de-regulation of mRNAs involved in neurogenesis and nervous system development, which is in line with the NEDMIAL disease phenotype. The knockdown of DHX30 results in a decreased 80S peak in polysome gradients, indicating that DHX30 has an effect on the translation machinery. Sequencing of the pool of active translating mRNAs revealed that upon DHX30 knockout mainly 5’TOP mRNAs are downregulated. These mRNAs are coding for proteins of the translational machinery and translation initiation factors. This study identified DHX30 as a factor of the translation machinery that selectively impacts the expression of a subset of proteins and provides insight on the etiology of NEDMIAL.
mRNA is co- or post-transcriptionally processed from a precursor mRNA to a mature mRNA. In addition to 5'capping and splicing, these modifications also include polyadenylation, the addition of a polyA tail to the 3'end of the mRNA. In recent years, alternative polyadenylation in particular has increasingly been taken into account as a mechanism for regulating gene expression. It is assumed that approximately 70-75 % of human protein coding genes contain alternative polyadenylation signals, which are often located within intronic sequences of protein-coding genes. The use of such polyadenylation signals leads to shortened mRNA transcripts and thus to the generation of C-terminal shortened protein isoforms.
Interestingly, the majority of microRNAs, small non-coding RNAs that play an essential role in post-transcriptional gene regulation, are also encoded in intronic sequences of protein-coding genes and are co-transcriptionally expressed with their host genes. The biogenesis of microRNA has been well studied and is well known, but mechanisms that may influence the expression regulation of mature microRNAs are just poorly understood.
In the presented work, I aimed to investigate the influence of alternative intronic polyadenylation on the biogenesis of microRNAs. The human ion channel TRPM1 could already be associated with melanoma pathogenesis and truncated isoforms of this protein have already been described in literature. In addition, TRPM1 harbors a microRNA, miR211, in its sixth intron, which is assumed to act as a tumor suppressor. Since both, TRPM1 and miR211 have already been associated with melanoma pathogenesis, the shift towards truncated transcripts during the development of various cancers is already known and it has been shown that certain microRNAs play a crucial role in the development and progression of melanoma, melanoma cell lines were used as an in vitro model for these investigations.