Characterization of mRNA Transcription Termination and Cleavage in C. elegans

Description
In eukaryotes, most messenger RNA precursors (pre-mRNAs) undergo extensive processing, including cleavage of the transcript followed by the addition of a poly(A) tail. This process is executed by a large complex known as the Cleavage and Polyadenylation Complex (CPC). Its central subcomplex, the Cleavage and Polyadenylation Specificity Factor (CPSF) complex, is responsible for recognizing a short hexameric element, AAUAAA, located at the 3’ end of the nascent mRNA molecule and for catalyzing the pre-mRNA cleavage. In the nematode C. elegans, the cleavage reaction is executed by a subunit of this complex named CPSF3, a highly conserved RNA endonuclease. While the crystal structure of its human ortholog CPSF73 has recently been solved, we still do not understand the molecular mechanisms and sequence specificity this protein uses to induce cleavage, knowledge that would help explain how this process is executed in detail. Additionally, we do not know whether additional factors are needed for this process. To address these issues, we performed a comparative analysis of the CPSF3 protein in higher eukaryotes to identify conserved functional domains. The overall percent identities for members of the CPSF complex range from 33.68% to 56.49%, suggesting that the human and C. elegans orthologs retain a high level of conservation. CPSF73 has the highest overall percent identity in the CPSF complex, and its active site-containing domain shares 74.60% identity with CPSF3. Additionally, we obtained CPSF3 and a mutant that is unable to perform the cleavage reaction, expressed both in a bacterial expression system, and developed an in vitro cleavage assay to test whether CPSF3 activity is necessary and sufficient to induce nascent mRNA cleavage. This project establishes tools to better understand how CPSF3 functions within the CPC and sheds light on the biology surrounding the transcription process as a whole.
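The percent identities quoted above depend on how identity is counted over a pairwise alignment. The abstract does not state the alignment tool or the denominator used, so the following is only a minimal sketch under one common assumption: identity is the number of identical aligned residues divided by the number of alignment columns. The function name and the toy sequences are hypothetical and are not CPSF3 or CPSF73 data.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumption (not from the abstract): gaps are written as '-', and the
    denominator is the number of alignment columns, skipping columns
    where both rows are gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep every column except those that are gaps in both rows.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    # A match is an identical residue pair; a gap never counts as a match.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Hypothetical toy alignment: 4 identical residues over 6 columns.
toy_identity = percent_identity("ACD-EF", "ACDKEY")
print(f"{toy_identity:.2f}%")
```

Other conventions, such as dividing by the length of the shorter sequence, would give different values for the same alignment, which is why the definition matters when comparing reported percentages.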
Date Created
2020-05

RNA-Based Computing Devices for Intracellular and Diagnostic Applications

Description
The fundamental building blocks for constructing complex synthetic gene networks are effective biological parts with wide dynamic range, low crosstalk, and modularity. RNA-based components are promising sources of such parts since they can provide regulation at the level of transcription and translation and their predictable base pairing properties enable large libraries to be generated through in silico design. This dissertation studies two different approaches for initiating interactions between RNA molecules to implement RNA-based components that achieve translational regulation. First, single-stranded domains known as toeholds were employed for detection of the highly prevalent foodborne pathogen norovirus. Toehold switch riboregulators activated by trigger RNAs from the norovirus RNA genome were designed, validated, and coupled with paper-based cell-free transcription-translation systems. Integration of paper-based reactions with synbody enrichment and isothermal RNA amplification enables as few as 160 copies/mL of norovirus from clinical samples to be detected in reactions that do not require sophisticated equipment and can be read directly by eye. Second, a new type of riboregulator that initiates RNA-RNA interactions through the loop portions of RNA stem-loop structures was developed. These loop-initiated RNA activators (LIRAs) provide multiple advantages compared to toehold-based riboregulators, exhibiting ultralow signal leakage in vivo, lacking any trigger RNA sequence constraints, and appending no additional residues to the output protein. Harnessing LIRAs as modular parts, logic gates that exploit loop-mediated control of mRNA folding state to implement AND and OR operations with up to three sequence-independent input RNAs were constructed. LIRA circuits can also be ported to paper-based cell-free reactions to implement portable systems with molecular computing and sensing capabilities.
LIRAs can detect RNAs from a variety of different pathogens, such as HIV, Zika, dengue, yellow fever, and norovirus, and after coupling to isothermal amplification reactions, provide visible test results down to concentrations of 20 aM (12 RNA copies/µL). In addition, the logic functionality of LIRA circuits can be used to specifically identify different HIV strains and influenza A subtypes. These findings demonstrate that toehold- and loop-mediated RNA-RNA interactions are both powerful strategies for implementing RNA-based computing systems for intracellular and diagnostic applications.
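The equivalence between 20 aM and 12 RNA copies/µL stated above follows from Avogadro's number. A minimal sketch of the conversion is shown here; the function name is hypothetical and only the two figures come from the abstract.

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI value)


def molar_to_copies_per_microliter(molar: float) -> float:
    """Convert a molar concentration (mol/L) to molecules per microliter."""
    copies_per_liter = molar * AVOGADRO
    return copies_per_liter / 1e6  # 1 L = 1e6 µL


# 20 attomolar = 20e-18 mol/L
copies = molar_to_copies_per_microliter(20e-18)
print(round(copies, 1))  # 12.0
```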
Date Created
2019

Mechanisms of miRNA-based gene regulation in C. elegans and human cells

Description
Multicellular organisms use precise gene regulation, executed throughout development, to build and sustain various cell and tissue types. Post-transcriptional gene regulation is essential for metazoan development and acts on mRNA to determine its localization, stability, and translation. MicroRNAs (miRNAs) and RNA binding proteins (RBPs) are the principal effectors of post-transcriptional gene regulation and act by targeting the 3' untranslated regions (3'UTRs) of mRNAs. MiRNAs are small non-coding RNAs that have the potential to regulate hundreds to thousands of genes and are dysregulated in many prevalent human diseases such as diabetes, Alzheimer's disease, Duchenne muscular dystrophy, and cancer. However, the precise contribution of miRNAs to the pathology of these diseases is not known.

MiRNA-based gene regulation occurs in a tissue-specific manner and is implemented by an interplay of poorly understood and complex mechanisms, which control both the presence of the miRNAs and their targets. As a consequence, the precise contributions of miRNAs to gene regulation are not well known. The research presented in this thesis systematically explores the targets and effects of miRNA-based gene regulation in cell lines and tissues.

I hypothesize that miRNAs have distinct tissue-specific roles that contribute to the gene expression differences seen across tissues. To address this hypothesis and expand our understanding of miRNA-based gene regulation, I carried out three studies. 1) I developed the human 3'UTRome v1, a resource for studying post-transcriptional gene regulation. Using this resource, I explored the targets of two cancer-associated miRNAs, miR-221 and let-7c. I identified novel targets of both miRNAs, which suggest potential mechanisms by which they contribute to cancer. 2) I identified in vivo, tissue-specific miRNA targets in the intestine and body muscle of the model organism Caenorhabditis elegans. The results from this study revealed that miRNAs regulate tissue homeostasis, and that alternative polyadenylation and miRNA expression patterns modulate miRNA targeting at the tissue-specific level. 3) I explored the functional relevance of miRNA targeting to tissue-specific gene expression and found that miRNAs contribute to the biogenesis of mRNAs, through alternative splicing, by regulating tissue-specific expression of splicing factors. These results expand our understanding of the mechanisms that guide miRNA targeting and its effects on tissue-specific gene expression.
Date Created
2019

Study of the expression pattern and tissue specific roles of the Caenorhabditis elegans dystrophin glycoprotein complex

Description
Duchenne muscular dystrophy (DMD) is a lethal, X-linked disease that occurs in approximately 1 in 3,500 male births. This disease is characterized by progressive muscle wasting and causes premature death. One of the earliest symptoms of this disease is mitochondrial dysfunction. Dystrophin is a protein found under the sarcolemma. Its N terminus binds to actin and its C terminus binds to the dystrophin glycoprotein complex (DGC). DMD is caused by mutations in the dystrophin gene. C. elegans possesses an ortholog of dystrophin, DYS-1. Though there is evidence that C. elegans can be used as a model organism for DMD, the nematode DGC has not been well characterized. Additionally, while mitochondrial dysfunction has been found in humans and other model organisms, it has not been well defined in C. elegans. To address these issues, we crossed the SJ4103 worm strain (myo-3p::GFP(mit)) with dys-1(cx18) to visualize and quantify changes in mitochondria in a dys-1 background. SJ4103;cx18 nematodes were found to have fewer mitochondria than SJ4103, which suggests that mitochondrial dysfunction does occur in dys-1 worms. Furthermore, mitochondrial dysfunction was studied by knocking down members of the DGC (dys-1, dyb-1, sgn-1, sgca-1, and sgcb-1) in the SJ4103 strain. Knockdown of each gene resulted in a decrease in the abundance of mitochondria, which suggests that each member of the DGC contributes to the overall health of nematode muscle. The ORF of dyb-1 was successfully cloned and tagged with GFP to visualize this DGC member in C. elegans. Imaging of the transgenic dyb-1::GFP worm shows green fluorescence expression, which suggests that dyb-1 is a functional component of the muscle fibers. This project will enable us to better understand the effects of dystrophin deficiency on mitochondrial function and to visualize the expression of certain members of the DGC, helping to establish C. elegans as a good model organism for studying this disease.
Date Created
2019-05

Study of the Requirements of RNA Cleavage and Polyadenylation in C. elegans

Description
Cleavage and polyadenylation is a step in mRNA processing in which the 3’UTR is cleaved and a poly(A) tail is added to create a final mature transcript. This process relies on RNA sequence elements that guide a large multimeric protein complex named the Cleavage and Polyadenylation Complex to dock on the 3’UTR and execute the cleavage reaction. Interactions of the complex with the RNA and the specific dynamics of complex recruitment and formation remain largely uncharacterized. In our lab we have identified an adenosine residue as the nucleotide most often present at the cleavage site, although it is unclear whether this specific element is required to direct cleavage and polyadenylation. To address whether the adenosine residue is necessary and sufficient for the cleavage and polyadenylation reaction, we mutated this nucleotide at the cleavage site in three C. elegans protein-coding genes, forced the expression of these wt and mutant 3’UTRs, and studied how the cleavage and polyadenylation machinery processes these genes in vivo. We found that interrupting the wt sequence elements found at the cleavage site interferes with the cleavage and polyadenylation reaction, suggesting that the sequence close to the end of the transcript plays a role in modulating the site of the RNA cleavage. This activity is also gene-specific. Genes such as ges-1 showed little disruption in the cleavage of the transcript, with cleavage occurring at a similar location in both the wt and mutant 3’UTRs. On the other hand, mutation of the cleavage site in genes such as Y106G6H.9 caused the activation of new cryptic cleavage sites within the transcript. Taken together, my experiments suggest that the sequence elements at the cleavage site participate in the reaction and guide cleavage to occur at an exact site.
This work will help to better understand the mechanisms of transcription termination in vivo and will advance research on post-transcriptional gene regulation in eukaryotes.
Date Created
2019-05

ERK/MAPK Requirements for the Development of Long-Range Axonal Projections and Motor Learning in Cortical Glutamatergic Neurons

Description
The RASopathies are a collection of developmental diseases caused by germline mutations in components of the RAS/MAPK signaling pathway and are among the world’s most common sets of genetic diseases. A majority of these mutations result in an upregulation of RAS/MAPK signaling and cause a variety of both physical and neurological symptoms. Neurodevelopmental symptoms of the RASopathies include cognitive and motor delays, learning and intellectual disabilities, and various behavioral problems. Recent noninvasive imaging studies have detected widespread abnormalities within white matter tracts in the brains of RASopathy patients. These abnormalities are believed to be indicative of underlying connectivity deficits and a possible source of the behavioral and cognitive deficits. To evaluate these long-range connectivity and behavioral issues in a cell-autonomous manner, MEK1 loss- and gain-of-function (LoF and GoF) mutations were induced solely in the cortical glutamatergic neurons using a Nex:Cre mouse model. Layer-autonomous effects in the cortex were also tested in the GoF mouse using a layer 5-specific Rbp4:Cre mouse. Immunohistochemical analysis showed that activated ERK1/2 (P-ERK1/2) was expressed at high levels in the axonal compartments and at reduced levels in the soma compared with control mice. Axonal tract tracing using a lipophilic dye and an adeno-associated viral (AAV) tract tracing vector identified significant corticospinal tract (CST) elongation deficits in the LoF and GoF Nex:Cre mice and in the GoF Rbp4:Cre mouse. AAV tract tracing was further used to identify significant deficits in axonal innervation of the contralateral cortex, the dorsal striatum, and the hindbrain of the Nex:Cre GoF mouse and of the contralateral cortex and dorsal striatum of the Rbp4:Cre mouse. Behavioral testing of the Nex:Cre GoF mouse indicated deficits in motor learning acquisition, while the Rbp4:Cre GoF mouse showed no failure to acquire motor skills as tested.
Analysis of the expression levels of the immediate early gene ARC in Nex:Cre and Rbp4:Cre mice showed a specific reduction in a cell- and layer-autonomous manner. These findings suggest that hyperactivation of the RAS/MAPK pathway in cortical glutamatergic neurons induces changes to the expression patterns of P-ERK1/2, disrupts axonal elongation and innervation patterns, and disrupts motor learning abilities.
Date Created
2018

Molecular profiling plasma extracellular vesicles from breast cancer patients

Description
Extracellular vesicles (EVs) represent a heterogeneous population of small vesicles, consisting of a phospholipid bilayer surrounding a soluble interior cargo. These vesicles play an important role in cellular communication by virtue of their protein, RNA, and lipid content, which can be transferred among cells. Peripheral blood is a rich source of circulating EVs. An analysis of EVs in peripheral blood could provide access to unparalleled numbers of biomarkers of great diagnostic, prognostic, and therapeutic value. In the current study, a plasma EV enrichment method based on pluronic co-polymer was first established and characterized. Plasma EVs from breast cancer patients were then enriched, profiled, and compared to non-cancer controls. Protein signatures that contributed to distinguishing cancer samples from non-cancer controls were created by a random-forest based cross-validation approach. We found that a large portion of these signatures were related to breast cancer aggression. To verify these findings, KIAA0100, one of the features identified, was chosen for in vitro molecular and cellular studies in the breast cancer cell line MDA-MB-231. We found that KIAA0100 regulates cancer cell aggression in MDA-MB-231 in an anchorage-independent manner and is particularly associated with anoikis resistance through its interaction with HSPA1A. Lastly, plasma EVs contain not only individual proteins, but also numerous molecular complexes. In order to measure millions of proteins, isoforms, and complexes simultaneously, the Adaptive Dynamic Artificial Poly-ligand Targeting (ADAPT) platform was applied. ADAPT employs an enriched library of single-stranded oligodeoxynucleotides to profile complex biological samples, thus achieving a deep coverage of system-wide, native biomolecules.
Profiling of EVs from breast cancer patients achieved a prediction AUC of 0.73 when comparing biopsy-positive cancer patients with healthy controls and 0.64 when comparing them with biopsy-negative controls, and this performance was not associated with the physical breast condition indicated by BI-RADS scores. Taken together, the current research demonstrates the potential of profiling plasma EVs in searching for therapeutic targets as well as diagnostic signatures.
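For readers unfamiliar with the AUC figures reported above, the metric can be read as the probability that a randomly chosen case receives a higher classifier score than a randomly chosen control. The sketch below computes it by direct pairwise comparison; it is a generic illustration and not the study's pipeline, and the function name and scores are hypothetical.

```python
def roc_auc(case_scores, control_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (case, control) pairs in which the case
    scores higher, giving half credit to ties.
    """
    if not case_scores or not control_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical classifier scores, not data from the study.
example_auc = roc_auc([0.9, 0.8, 0.4], [0.7, 0.3, 0.2])
print(f"{example_auc:.3f}")
```

An AUC of 0.5 corresponds to chance-level separation and 1.0 to perfect separation, which places the reported 0.73 and 0.64 in context.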
Date Created
2018

Exploring nuclease resistance and biological stability of threose nucleic acid

Description
Nucleic acid polymers have numerous applications in both therapeutics and research to control gene expression and bind biologically relevant targets. However, their clinical applications are limited by poor biological stability. Chemical modifications can improve both intracellular and extracellular stability and enhance resistance to nuclease degradation. To identify a potential candidate for a highly stable synthetic nucleic acid, the biostability of α-L-threofuranosyl nucleic acid (TNA) was evaluated under simulated biological conditions. TNA contains a four-carbon sugar, and its backbone is linked by 2’,3’-phosphodiester bonds. We hypothesized that this distinct chemical structure would yield greater nuclease resistance in human serum and human liver microsomes, which were selected as biologically relevant nuclease conditions. We found that TNA oligonucleotides remained undigested for 7 days in these conditions. In addition, TNA/DNA heteropolymers and TNA/RNA oligonucleotide duplexes displayed nuclease resistance, suggesting that TNA has a protective effect over DNA and RNA. In conclusion, TNA demonstrates potential as a viable synthetic nucleic acid for use in numerous clinical and therapeutic applications.
Date Created
2016-12

Multiplexed, In-Solution Protein Array (MISPA) for Identification of Novel Protein Interactions and Early Detection of Pathogen Induced Cancers

Description
Disturbances in the protein interactome often play a large role in cancer progression. Investigation of protein-protein interactions (PPIs) can increase our understanding of cancer pathways and reveal unknown targets involved in cancer biology. Although numerous methods are available to study protein interactions, most platforms suffer from drawbacks including high false positive rates, low throughput, and lack of quantification. Moreover, most methods are not suitable for use in a clinical setting. To address these limitations, we have developed a multiplexed, in-solution protein microarray (MISPA) platform with broad applications in proteomics. MISPA can be used to quantitatively profile PPIs and as a robust technology for early detection of cancers. This method utilizes unique DNA barcoding of individual proteins coupled with next generation sequencing to quantitatively assess interactions via barcode enrichment. We have tested the feasibility of this technology in the detection of patient immune responses to oropharyngeal carcinomas and in the discovery of novel PPIs in the B-cell receptor (BCR) pathway. To achieve this goal, 96 human papillomavirus (HPV) antigen genes were cloned into pJFT7-cHalo (99% success) and pJFT7-n3xFlag-Halo (100% success) expression vectors. These libraries were expressed via a cell-free in vitro transcription-translation system with 93% and 96% success, respectively. A small-scale study of patient serum interactions with barcoded HPV16 antigens was performed, and an HPV proteome-wide study will follow using additional patient samples. In addition, 15 query proteins were cloned into pJFT7_nGST expression vectors, expressed, and purified with 93% success to probe a library of 100 BCR pathway proteins and detect novel PPIs.
Date Created
2016-12

Transcriptome gene expression analysis of breast cancer using RNA-Seq

Description
Background: Breast cancer is the most frequently diagnosed cancer and the leading cause of cancer deaths in females worldwide, accounting for 23% of all new cancer cases and 14% of all cancer deaths in 2008. Five tumor-normal pairs of primary breast epithelial cells were treated to proliferate indefinitely by using a ROCK inhibitor and mouse feeder cells. Methods: Raw paired-end, 100x coverage RNA-Seq data were aligned to the Human Reference Genome Version 19 using BWA and TopHat. Differential gene expression analysis was completed using Cufflinks and Cuffdiff. Integrative Genomics Viewer was used for data visualization. Results: 15 genes were found to be down-regulated by at least one log-fold change in 4/5 of the tumor samples, and 75 genes were found to be down-regulated by at least one log-fold change in 3/5 of the tumor samples. 11 genes were found to be up-regulated in 4/5 of the tumor samples, and 68 genes were found to be up-regulated in 3/5 of the tumor samples, by at least one log-fold change. Conclusion: Expression changes in genes such as AZGP1, AGER, ALG11, and S1007 suggest a disruption in the glycosylation pathway. No correlation was found between HER2 gene expression as estimated by Cufflinks and DAKO score classification.
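The "at least one log-fold change in 4/5 of tumor samples" criterion can be illustrated with a short sketch. It assumes a base-2 logarithm and a pseudocount of 1 to avoid division by zero; neither detail is stated in the abstract, and this is not how Cuffdiff computes its statistics. The function names and expression values are hypothetical.

```python
import math


def log2_fold_change(tumor: float, normal: float, pseudocount: float = 1.0) -> float:
    """log2(tumor / normal), with a pseudocount (an assumption) for stability."""
    return math.log2((tumor + pseudocount) / (normal + pseudocount))


def count_pairs_changed(pairs, direction: str, threshold: float = 1.0) -> int:
    """Count tumor-normal pairs whose log2 fold change passes the threshold."""
    if direction not in ("up", "down"):
        raise ValueError("direction must be 'up' or 'down'")
    count = 0
    for tumor, normal in pairs:
        lfc = log2_fold_change(tumor, normal)
        if direction == "down" and lfc <= -threshold:
            count += 1
        elif direction == "up" and lfc >= threshold:
            count += 1
    return count


# Hypothetical (tumor, normal) expression values for one gene in five pairs.
gene_pairs = [(10.0, 50.0), (5.0, 40.0), (8.0, 30.0), (20.0, 25.0), (3.0, 33.0)]
down_in = count_pairs_changed(gene_pairs, "down")
print(f"down-regulated in {down_in}/5 pairs")
```

Under these assumptions, a gene would join the 4/5 down-regulated set when `down_in` is at least 4.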
Date Created
2013-05