8 resultados para Single-strand RNA

em CaltechTHESIS


Relevância:

80.00% 80.00%

Publicador:

Resumo:

The main focus of this thesis is the use of high-throughput sequencing technologies in functional genomics (in particular in the form of ChIP-seq, chromatin immunoprecipitation coupled with sequencing, and RNA-seq) and the study of the structure and regulation of transcriptomes. Some parts of it are of a more methodological nature while others describe the application of these functional genomic tools to address various biological problems. A significant part of the research presented here was conducted as part of the ENCODE (ENCyclopedia Of DNA Elements) Project.

The first part of the thesis focuses on the structure and diversity of the human transcriptome. Chapter 1 contains an analysis of the diversity of the human polyadenylated transcriptome based on RNA-seq data generated for the ENCODE Project. Chapter 2 presents a simulation-based examination of the performance of some of the most popular computational tools used to assemble and quantify transcriptomes. Chapter 3 includes a study of variation in gene expression, alternative splicing and allelic expression bias on the single-cell level and on a genome-wide scale in human lymphoblastoid cells; it also brings forward a number of critical to the practice of single-cell RNA-seq measurements methodological considerations.

The second part presents several studies applying functional genomic tools to the study of the regulatory biology of organellar genomes, primarily in mammals but also in plants. Chapter 5 contains an analysis of the occupancy of the human mitochondrial genome by TFAM, an important structural and regulatory protein in mitochondria, using ChIP-seq. In Chapter 6, the mitochondrial DNA occupancy of the TFB2M transcriptional regulator, the MTERF termination factor, and the mitochondrial RNA and DNA polymerases is characterized. Chapter 7 consists of an investigation into the curious phenomenon of the physical association of nuclear transcription factors with mitochondrial DNA, based on the diverse collections of transcription factor ChIP-seq datasets generated by the ENCODE, mouseENCODE and modENCODE consortia. In Chapter 8 this line of research is further extended to existing publicly available ChIP-seq datasets in plants and their mitochondrial and plastid genomes.

The third part is dedicated to the analytical and experimental practice of ChIP-seq. As part of the ENCODE Project, a set of metrics for assessing the quality of ChIP-seq experiments was developed, and the results of this activity are presented in Chapter 9. These metrics were later used to carry out a global analysis of ChIP-seq quality in the published literature (Chapter 10). In Chapter 11, the development and initial application of an automated robotic ChIP-seq (in which these metrics also played a major role) is presented.

The fourth part presents the results of some additional projects the author has been involved in, including the study of the role of the Piwi protein in the transcriptional regulation of transposon expression in Drosophila (Chapter 12), and the use of single-cell RNA-seq to characterize the heterogeneity of gene expression during cellular reprogramming (Chapter 13).

The last part of the thesis provides a review of the results of the ENCODE Project and the interpretation of the complexity of the biochemical activity exhibited by mammalian genomes that they have revealed (Chapters 15 and 16), an overview of the expected in the near future technical developments and their impact on the field of functional genomics (Chapter 14), and a discussion of some so far insufficiently explored research areas, the future study of which will, in the opinion of the author, provide deep insights into many fundamental but not yet completely answered questions about the transcriptional biology of eukaryotes and its regulation.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The genomes of many positive stranded RNA viruses and of all retroviruses are translated as large polyproteins which are proteolytically processed by cellular and viral proteases. Viral proteases are structurally related to two families of cellular proteases, the pepsin-like and trypsin-like proteases. This thesis describes the proteolytic processing of several nonstructural proteins of dengue 2 virus, a representative member of the Flaviviridae, and describes methods for transcribing full-length genomic RNA of dengue 2 virus. Chapter 1 describes the in vitro processing of the nonstructural proteins NS2A, NS2B and NS3. Chapter 2 describes a system that allows identification of residues within the protease that are directly or indirectly involved with substrate recognition. Chapter 3 describes methods to produce genome length dengue 2 RNA from cDNA templates.

The nonstructural protein NS3 is structurally related to viral trypsinlike proteases from the alpha-, picorna-, poty-, and pestiviruses. The hypothesis that the flavivirus nonstructural protein NS3 is a viral proteinase that generates the termini of several nonstructural proteins was tested using an efficient in vitro expression system and antisera specific for the nonstructural proteins NS2B and NS3. A series of cDNA constructs was transcribed using T7 RNA polymerase and the RNA translated in reticulocyte lysates. Proteolytic processing occurred in vitro to generate NS2B and NS3. The amino termini of NS2B and NS3 produced in vitro were found to be the same as the termini of NS2B and NS3 isolated from infected cells. Deletion analysis of cDNA constructs localized the protease domain necessary and sufficient for correct cleavage to the first 184 amino acids of NS3. Kinetic analysis of processing events in vitro and experiments to examine the sensitivity of processing to dilution suggested that an intramolecular cleavage between NS2A and NS2B preceded an intramolecular cleavage between NS2B and NS3. The data from these expression experiments confirm that NS3 is the viral proteinase responsible for cleavage events generating the amino termini of NS2B and NS3 and presumably for cleavages generating the termini of NS4A and NS5 as well.

Biochemical and genetic experiments using viral proteinases have defined the sequence requirements for cleavage site recognition, but have not identified residues within proteinases that interact with substrates. A biochemical assay was developed that could identify residues which were important for substrate recognition. Chimeric proteases between yellow fever and dengue 2 were constructed that allowed mapping of regions involved in substrate recognition, and site directed mutagenesis was used to modulate processing efficiency.

Expression in vitro revealed that the dengue protease domain efficiently processes the yellow fever polyprotein between NS2A and NS2B and between NS2B and NS3, but that the reciprocal construct is inactive. The dengue protease processes yellow fever cleavage sites more efficiently than dengue cleavage sites, suggesting that suboptimal cleavage efficiency may be used to increase levels of processing intermediates in vivo. By mutagenizing the putative substrate binding pocket it was possible to change the substrate specificity of the yellow fever protease; changing a minimum of three amino acids in the yellow fever protease enabled it to recognize dengue cleavage sites. This system allows identification of residues which are directly or indirectly involved with enzyme-substrate interaction, does not require a crystal structure, and can define the substrate preferences of individual members of a viral proteinase family.

Full-length cDNA clones, from which infectious RNA can be transcribed, have been developed for a number of positive strand RNA viruses, including the flavivirus type virus, yellow fever. The technology necessary to transcribe genomic RNA of dengue 2 virus was developed in order to better understand the molecular biology of the dengue subgroup. A 5' structural region clone was engineered to transcribe authentic dengue RNA that contains an additional 1 or 2 residues at the 5' end. A 3' nonstructural region clone was engineered to allow production of run off transcripts, and to allow directional ligation with the 5' structural region clone. In vitro ligation and transcription produces full-length genomic RNA which is noninfectious when transfected into mammalian tissue culture cells. Alternative methods for constructing cDNA clones and recovering live dengue virus are discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Oligonucleotide-directed triple helix formation is one of the most versatile methods for the sequence specific recognition of double helical DNA. Chapter 2 describes affinity cleaving experiments carried out to assess the recognition potential for purine-rich oligonucleotides via the formation of triple helices. Purine-rich oligodeoxyribonucleotides were shown to bind specifically to purine tracts of double helical DNA in the major groove antiparallel to the purine strand of the duplex. Specificity was derived from the formation of reverse Hoogsteen G•GC, A•AT and T•AT triplets and binding was limited to mostly purine tracts. This triple helical structure was stabilized by multivalent cations, destabilized by high concentrations of monovalent cations and was insensitive to pH. A single mismatched base triplet was shown to destabilize a 15 mer triple helix by 1.0 kcal/mole at 25°C. In addition, stability appeared to be correlated to the number of G•GC triplets formed in the triple helix. This structure provides an additional framework as a basis for the design of new sequence specific DNA binding molecules.

In work described in Chapter 3, the triplet specificities and required strand orientations of two classes of DNA triple helices were combined to target double helical sequences containing all four base pairs by alternate strand triple helix formation. This allowed for the use of oligonucleotides containing only natural 3'-5' phosphodiester linkages to simultaneously bind both strands of double helical DNA in the major groove. The stabilities and structures of these alternate strand triple helices depended on whether the binding site sequence was 5'-(purine)_m (pyrimidine)_n-3' or 5'- (pyrimidine)_m (purine)_n-3'.

In Chapter 4, the ability of oligonucleotide-cerium(III) chelates to direct the transesterfication of RNA was investigated. Procedures were developed for the modification of DNA and RNA oligonucleotides with a hexadentate Schiff-base macrocyclic cerium(III) complex. In addition, oligoribonucleotides modified by covalent attachment of the metal complex through two different linker structures were prepared. The ability of these structures to direct transesterification to specific RNA phosphodiesters was assessed by gel electrophoresis. No reproducible cleavage of the RNA strand consistent with transesterification could be detected in any of these experiments.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Distinct structures delineating the introns of Simian Virus 40 T-antigen and Adenovirus 2 E1A genes have been discovered. The structures, which are centered around the branch points of the genes inserted in supercoiled double-stranded plasmids, are specifically targeted through photoactivated strand cleavage by the metal complex tris(4,7-diphenyl-1,10-phenanthroline)rhodium(III). The DNA sites that are recognized lack sequence homology but are similar in demarcating functionally important sites on the RNA level. The single-stranded DNA fragments corresponding to the coding strands of the genes were also found to fold into a structure apparently identical to that in the supercoiled genes based on the recognition by the metal complex. Further investigation of different single-stranded DNA fragments with other structural probes, such as another metal complex bis(1,10-phenanthroline)(phenanthrenequinone diimine)rhodium(III), AMT (4'aminomethyl-4,5',8 trimethylpsoralen), restriction enzyme Mse I, and mung bean nuclease, showed that the structures require the sequ ences at both ends of the intron plus the flanking sequences but not the middle of the intron. The two ends form independent helices which interact with each other to form the global tertiary structures. Both of the intron structures share similarities to the structure of the Holliday junction, which is also known to be specifically targeted by the former metal complex. These structures may have arisen from early RNA intron structures and may have been used to facilitate the evolution of genes through exon shuffling by acting as target sites for recombinase enzymes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Over the last century, the silicon revolution has enabled us to build faster, smaller and more sophisticated computers. Today, these computers control phones, cars, satellites, assembly lines, and other electromechanical devices. Just as electrical wiring controls electromechanical devices, living organisms employ "chemical wiring" to make decisions about their environment and control physical processes. Currently, the big difference between these two substrates is that while we have the abstractions, design principles, verification and fabrication techniques in place for programming with silicon, we have no comparable understanding or expertise for programming chemistry.

In this thesis we take a small step towards the goal of learning how to systematically engineer prescribed non-equilibrium dynamical behaviors in chemical systems. We use the formalism of chemical reaction networks (CRNs), combined with mass-action kinetics, as our programming language for specifying dynamical behaviors. Leveraging the tools of nucleic acid nanotechnology (introduced in Chapter 1), we employ synthetic DNA molecules as our molecular architecture and toehold-mediated DNA strand displacement as our reaction primitive.

Abstraction, modular design and systematic fabrication can work only with well-understood and quantitatively characterized tools. Therefore, we embark on a detailed study of the "device physics" of DNA strand displacement (Chapter 2). We present a unified view of strand displacement biophysics and kinetics by studying the process at multiple levels of detail, using an intuitive model of a random walk on a 1-dimensional energy landscape, a secondary structure kinetics model with single base-pair steps, and a coarse-grained molecular model that incorporates three-dimensional geometric and steric effects. Further, we experimentally investigate the thermodynamics of three-way branch migration. Our findings are consistent with previously measured or inferred rates for hybridization, fraying, and branch migration, and provide a biophysical explanation of strand displacement kinetics. Our work paves the way for accurate modeling of strand displacement cascades, which would facilitate the simulation and construction of more complex molecular systems.

In Chapters 3 and 4, we identify and overcome the crucial experimental challenges involved in using our general DNA-based technology for engineering dynamical behaviors in the test tube. In this process, we identify important design rules that inform our choice of molecular motifs and our algorithms for designing and verifying DNA sequences for our molecular implementation. We also develop flexible molecular strategies for "tuning" our reaction rates and stoichiometries in order to compensate for unavoidable non-idealities in the molecular implementation, such as imperfectly synthesized molecules and spurious "leak" pathways that compete with desired pathways.

We successfully implement three distinct autocatalytic reactions, which we then combine into a de novo chemical oscillator. Unlike biological networks, which use sophisticated evolved molecules (like proteins) to realize such behavior, our test tube realization is the first to demonstrate that Watson-Crick base pairing interactions alone suffice for oscillatory dynamics. Since our design pipeline is general and applicable to any CRN, our experimental demonstration of a de novo chemical oscillator could enable the systematic construction of CRNs with other dynamic behaviors.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

I. ELECTROPHORESIS OF THE NUCLEIC ACIDS

A zone electrophoresis apparatus using ultraviolet optics has been constructed to study nucleic acids at concentrations less than 0.004%. Native DNA has a mobility about 15% higher than denatured DNA over a range of conditions. Otherwise, the electrophoretic mobility is independent of molecular weight, base composition or source. DNA mobilities change in the expected way with pH but the fractional change in mobility is less than the calculated change in charge. A small decrease in mobility accompanies an increase in ionic strength. RNA’s from various sources have mobilities slightly lower than denatured DNA except for s-RNA which travels slightly faster. The important considerations governing the mobility of nucleic acids appear to be the nature of the hydrodynamic segment, and the binding of counterions. The differences between electrophoresis and sedimentation stem from the fact that all random coil polyelectrolytes are fundamentally free draining in electrophoresis.

II. THE CYTOCHROME C/DNA COMPLEX

The basic protein, cytochrome c, has been complexed to DNA. Up to a cytochrome:DNA mass ratio of 2, a single type of complex is formed. Dissociation of this complex occurs between 0.05F and 0.1F NaCl. The complexing of cytochrome to DNA causes a slight increase in the melting temperature of the DNA, and a reduction of the electrophoretic mobility proportional to the decrease in net charge. Above a cytochrome:DNA mass ratio of 2.5, a different type of complex is formed. The results suggest that complexes such as are formed in the Kleinschmidt technique of electron microscopy would not exist in bulk solution and are exclusively film phenomena.

III. STUDIES OF THE ELECTROPHORESIS AND MELTING BEHAVIOUR OF NUCLEOHISTONES

Electrophoresis studies on reconstituted nucleohistones indicate that the electrophoretic mobility for these complexes is a function of the net charge of the complex. The mobility is therefore dependent on the charge density of the histone complexing the DNA, as well as on the histone/DNA ratio. It is found that the different histones affect the transition from native to denatured DNA in different ways. It appears that histone I is exchanging quite rapidly between DNA molecules in 0.01 F salt, while histone II is irreversibly bound. Histone III-IV enhances the capacity of non-strand separated denatured DNA to reanneal. Studies on native nucleoproteins indicate that there are no gene-sized uncomplexed DNA regions in any preparations studied.

IV. THE DISSOCIATION OF HISTONE FROM CALF THYMUS CROMATIN

Calf thymus nucleoprotein was treated with varying concentrations of NaCl. The identity of the histones associated and dissociated from the DNA at each salt concentration was determined by gel electrophoresis. It was found that there is no appreciable histone dissociation below 0.4 F NaCl. The lysine rich histones dissociate between 0.4 and 0.5 F NaCl. Their dissociation is accompanies by a marked increase in the solubility of the chromatin. The moderately lysine rich histones dissociate mainly between 0.8 and 1.1 F NaCl. There are two arginine rich histone components: the first dissociates between 0.8 F and 1.1 F NaCl, but the second class is the very last to be dissociated from the DNA (dissociation beginning at 1.0 F NaCl). By 2.0 F NaCl, essentially all the histones are dissociated.

The properties of the extracted nucleoprotein were studied. The electrophoretic mobility increases and the melting temperature decreases as more histones are dissociated from the DNA. A comparison with the dissociation of histones from DNA in NaClO4 shows that to dissociate the same class of histones, the concentration of NaCl required is twice that of NaClO4.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Part I. The cellular slime mold Dictyostelium discoideum is a simple eukaryote which undergoes a multi-cellular developmental process. Single cell myxamoebae divide vegetatively in the presence of a food source. When the food is depleted or removed, the cells aggregate, forming a migrating pseudoplasmodium which differentiates into a fruiting body containing stalk and spore cells. I have shown that during the developmental cycle glycogen phosphorylase, aminopeptidase, and alanine transaminase are developmentally regulated, that is their specific activities increased at a specific time in the developmental cycle. Phosphorylase activity is undetectable in developing cells until mid-aggregation whereupon it increases and reaches a maximum at mid-culmination. Thereafter the enzyme disappears. Actinomycin D and cycloheximide studies as well as studies with morphologically aberrant and temporally deranged mutants indicate that prior RNA and concomitant protein synthesis are necessary for the rise and decrease in activity and support the view that the appearance of the enzyme is regulated at the transcriptional level. Aminopeptidase and alanine transaminase increase 3 fold starting at starvation and reach maximum activity at 18 and 5 hours respectively.

The cellular DNA s of D. discoideum were characterized by CsC1 buoyant density gradient centrifugation and by renaturation kinetics. Whole cell DNA exhibits three bands in CsCl: ρ = 1.676 g/cc (nuclear main band), 1.687 (nuclear satellite), and 1.682 (mitochondrial). Reassociation kinetics at a criterion of Tm -23°C indicates that the nuclear reiterated sequences make up 30% of the genome (Cot1/2 (pure) 0.28) and the single-copy DNA 70% (Cot1/2(pure) 70). The complexity of the nuclear genome is 30 x 109 daltons and that of the mitochondrial DNA is 35-40 x 106 daltons (Cot1/2 0.15). rRNA cistrons constitute 2.2% of nuclear DNA and have a ρ = 1.682.

RNA extracted from 4 stages during developmental cycle of Dictyostelium was hybridized with purified single-copy nuclear DNA. The hybrids had properties indicative of single-copy DNA-RNA hybrids. These studies indicate that there are, during development, qualitative and quantitative changes in the portion of the single-copy of the genome transcribed. Overall, 56% of the genome is represented by transcripts between the amoeba and mid-culmination stages. Some 19% are sequences which are represented at all stages while 37% of the genome consists of stage specific sequences.

Part II. RNA and protein synthesis and polysome formation were studied during early development of the surf clam Spisula solidissima embryos. The oocyte has a small number of polysomes and a low but measurable rate of protein synthesis (leucine-3H incorporation). After fertilization, there is a continual increase in the percentage of ribosomes sedimenting in the polysome region. Newly synthesized RNA (uridine-5-3H incorporation) was found in polysomes as early as the 2-cell stage. During cleavage, the newly formed RNA is associated mainly with the light polysomes.

RNA extracted from polysomes labeled at the 4-cell stage is polydisperse, nonribosomal, and non-4 S. Actinomycin D causes a reduction of about 30% of the polysomes formed between fertilization and the 16-cell stage.

In the early cleavage stages the light polysomes are mostly affected by actinomycin.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Part I. Complexes of Biological Bases and Oligonucleotides with RNA

The physical nature of complexes of several biological bases and oligonucleotides with single-stranded ribonucleic acids have been studied by high resolution proton magnetic resonance spectroscopy. The importance of various forces in the stabilization of these complexes is also discussed.

Previous work has shown that purine forms an intercalated complex with single-stranded nucleic acids. This complex formation led to severe and stereospecific broadening of the purine resonances. From the field dependence of the linewidths, T1 measurements of the purine protons and nuclear Overhauser enhancement experiments, the mechanism for the line broadening was ascertained to be dipole-dipole interactions between the purine protons and the ribose protons of the nucleic acid.

The interactions of ethidium bromide (EB) with several RNA residues have been studied. EB forms vertically stacked aggregates with itself as well as with uridine, 3'-uridine monophosphate and 5'-uridine monophosphate and forms an intercalated complex with uridylyl (3' → 5') uridine and polyuridylic acid (poly U). The geometry of EB in the intercalated complex has also been determined.

The effect of chain length of oligo-A-nucleotides on their mode of interaction with poly U in D20 at neutral pD have also been studied. Below room temperatures, ApA and ApApA form a rigid triple-stranded complex involving a stoichiometry of one adenine to two uracil bases, presumably via specific adenine-uracil base pairing and cooperative base stacking of the adenine bases. While no evidence was obtained for the interaction of ApA with poly U above room temperature, ApApA exhibited complex formation of a 1:1 nature with poly U by forming Watson-Crick base pairs. The thermodynamics of these systems are discussed.

Part II. Template Recognition and the Degeneracy of the Genetic Code

The interaction of ApApG and poly U was studied as a model system for the codon-anticodon interaction of tRNA and mRNA in vivo. ApApG was shown to interact with poly U below ~20°C. The interaction was of a 1:1 nature which exhibited the Hoogsteen bonding scheme. The three bases of ApApG are in an anti conformation and the guanosine base appears to be in the lactim tautomeric form in the complex.

Due to the inadequacies of previous models for the degeneracy of the genetic code in explaining the observed interactions of ApApG with poly U, the "tautomeric doublet" model is proposed as a possible explanation of the degenerate interactions of tRNA with mRNA during protein synthesis in vivo.