8 results for Sequencing

in CaltechTHESIS


Abstract:

The initial objective of Part I was to determine the nature of upper mantle discontinuities, the average velocities through the mantle, and differences between mantle structure under continents and oceans by the use of P'dP', the seismic core phase P'P' (PKPPKP) that reflects at depth d in the mantle. In order to accomplish this, it was found necessary to also investigate core phases themselves and their inferences on core structure. P'dP' at both single stations and at the LASA array in Montana indicates that the following zones are candidates for discontinuities with varying degrees of confidence: 800-950 km, weak; 630-670 km, strongest; 500-600 km, strong but interpretation in doubt; 350-415 km, fair; 280-300 km, strong, varying in depth; 100-200 km, strong, varying in depth, may be the bottom of the low-velocity zone. It is estimated that a single station cannot easily discriminate between asymmetric P'P' and P'dP' for lead times of about 30 sec from the main P'P' phase, but the LASA array reduces this uncertainty range to less than 10 sec. The problems of scatter of P'P' main-phase times, mainly due to asymmetric P'P', incorrect identification of the branch, and lack of the proper velocity structure at the velocity point, are avoided and the analysis shows that one-way travel of P waves through oceanic mantle is delayed by 0.65 to 0.95 sec relative to United States mid-continental mantle.

A new P-wave velocity core model is constructed from observed times, dt/dΔ's, and relative amplitudes of P'; the observed times of SKS, SKKS, and PKiKP; and a new mantle-velocity determination by Jordan and Anderson. The new core model is smooth except for a discontinuity at the inner-core boundary determined to be at a radius of 1215 km. Short-period amplitude data do not require the inner core Q to be significantly lower than that of the outer core. Several lines of evidence show that most, if not all, of the arrivals preceding the DF branch of P' at distances shorter than 143° are due to scattering as proposed by Haddon and not due to spherically symmetric discontinuities just above the inner core as previously believed. Calculation of the travel-time distribution of scattered phases and comparison with published data show that the strongest scattering takes place at or near the core-mantle boundary close to the seismic station.

In Part II, the largest events in the San Fernando earthquake series, initiated by the main shock at 14 00 41.8 GMT on February 9, 1971, were chosen for analysis from the first three months of activity, 87 events in all. The initial rupture location coincides with the lower, northernmost edge of the main north-dipping thrust fault and the aftershock distribution. The best focal mechanism fit to the main shock P-wave first motions constrains the fault plane parameters to: strike, N 67° (± 6°) W; dip, 52° (± 3°) NE; rake, 72° (67°-95°) left lateral. Focal mechanisms of the aftershocks clearly outline a downstep of the western edge of the main thrust fault surface along a northeast-trending flexure. Faulting on this downstep is left-lateral strike-slip and dominates the strain release of the aftershock series, which indicates that the downstep limited the main event rupture on the west. The main thrust fault surface dips at about 35° to the northeast at shallow depths and probably steepens to 50° below a depth of 8 km. This steep dip at depth is a characteristic of other thrust faults in the Transverse Ranges and indicates the presence at depth of laterally-varying vertical forces that are probably due to buckling or overriding that causes some upward redirection of a dominant north-south horizontal compression. Two sets of events exhibit normal dip-slip motion with shallow hypocenters and correlate with areas of ground subsidence deduced from gravity data. Several lines of evidence indicate that a horizontal compressional stress in a north or north-northwest direction was added to the stresses in the aftershock area 12 days after the main shock. After this change, events were contained in bursts along the downstep and sequencing within the bursts provides evidence for an earthquake-triggering phenomenon that propagates with speeds of 5 to 15 km/day. 
Seismicity before the San Fernando series and the mapped structure of the area suggest that the downstep of the main fault surface is not a localized discontinuity but is part of a zone of weakness extending from Point Dume, near Malibu, to Palmdale on the San Andreas fault. This zone is interpreted as a decoupling boundary between crustal blocks that permits them to deform separately in the prevalent crustal-shortening mode of the Transverse Ranges region.

Abstract:

The main focus of this thesis is the use of high-throughput sequencing technologies in functional genomics (in particular in the form of ChIP-seq, chromatin immunoprecipitation coupled with sequencing, and RNA-seq) and the study of the structure and regulation of transcriptomes. Some parts of it are of a more methodological nature while others describe the application of these functional genomic tools to address various biological problems. A significant part of the research presented here was conducted as part of the ENCODE (ENCyclopedia Of DNA Elements) Project.

The first part of the thesis focuses on the structure and diversity of the human transcriptome. Chapter 1 contains an analysis of the diversity of the human polyadenylated transcriptome based on RNA-seq data generated for the ENCODE Project. Chapter 2 presents a simulation-based examination of the performance of some of the most popular computational tools used to assemble and quantify transcriptomes. Chapter 3 includes a study of variation in gene expression, alternative splicing and allelic expression bias on the single-cell level and on a genome-wide scale in human lymphoblastoid cells; it also brings forward a number of methodological considerations critical to the practice of single-cell RNA-seq measurements.

The second part presents several studies applying functional genomic tools to the study of the regulatory biology of organellar genomes, primarily in mammals but also in plants. Chapter 5 contains an analysis of the occupancy of the human mitochondrial genome by TFAM, an important structural and regulatory protein in mitochondria, using ChIP-seq. In Chapter 6, the mitochondrial DNA occupancy of the TFB2M transcriptional regulator, the MTERF termination factor, and the mitochondrial RNA and DNA polymerases is characterized. Chapter 7 consists of an investigation into the curious phenomenon of the physical association of nuclear transcription factors with mitochondrial DNA, based on the diverse collections of transcription factor ChIP-seq datasets generated by the ENCODE, mouseENCODE and modENCODE consortia. In Chapter 8 this line of research is further extended to existing publicly available ChIP-seq datasets in plants and their mitochondrial and plastid genomes.

The third part is dedicated to the analytical and experimental practice of ChIP-seq. As part of the ENCODE Project, a set of metrics for assessing the quality of ChIP-seq experiments was developed, and the results of this activity are presented in Chapter 9. These metrics were later used to carry out a global analysis of ChIP-seq quality in the published literature (Chapter 10). In Chapter 11, the development and initial application of an automated robotic ChIP-seq (in which these metrics also played a major role) is presented.

The fourth part presents the results of some additional projects the author has been involved in, including the study of the role of the Piwi protein in the transcriptional regulation of transposon expression in Drosophila (Chapter 12), and the use of single-cell RNA-seq to characterize the heterogeneity of gene expression during cellular reprogramming (Chapter 13).

The last part of the thesis provides a review of the results of the ENCODE Project and the interpretation of the complexity of the biochemical activity exhibited by mammalian genomes that they have revealed (Chapters 15 and 16), an overview of the technical developments expected in the near future and their impact on the field of functional genomics (Chapter 14), and a discussion of some so far insufficiently explored research areas, the future study of which will, in the opinion of the author, provide deep insights into many fundamental but not yet completely answered questions about the transcriptional biology of eukaryotes and its regulation.

Abstract:

Being able to detect a single molecule without the use of labels has been a long-standing goal of bioengineers and physicists. This would simplify applications ranging from single-molecule binding studies to those involving public health and security, improved drug screening, medical diagnostics, and genome sequencing. One promising technique that has the potential to detect single molecules is the microtoroid optical resonator. The main obstacle to detecting single molecules, however, is the noise level of the measurements, which must be reduced so that a single molecule can be distinguished from background. We have used laser frequency locking in combination with balanced detection and data processing techniques to reduce the noise level of these devices and report the detection of a wide range of nanoscale objects ranging from nanoparticles with radii from 100 to 2.5 nm, to exosomes, ribosomes, and single protein molecules (mouse immunoglobulin G and human interleukin-2). We further extend the exosome results towards creating a non-invasive tumor biopsy assay. Our results, covering several orders of magnitude of particle radius (100 nm to 2 nm), agree with the "reactive" model prediction for the frequency shift of the resonator upon particle binding. In addition, we demonstrate that molecular weight may be estimated from the frequency shift through a simple formula, thus providing a basis for an "optical mass spectrometer" in solution. We anticipate that our results will enable many applications, including more sensitive medical diagnostics and fundamental studies of single receptor-ligand and protein-protein interactions in real time. The thesis summarizes what we have achieved thus far and shows that the goal of detecting a single molecule without the use of labels can now be realized.

Abstract:

Protein structure prediction has remained a major challenge in structural biology for more than half a century. Accelerated and cost-efficient sequencing technologies have allowed researchers to sequence new organisms and discover new protein sequences. Novel protein structure prediction technologies will allow researchers to study the structure of proteins, to determine their roles in the underlying biological processes, and to develop novel therapeutics.

The difficulty of the problem is twofold: (a) describing the energy landscape that corresponds to the protein structure, commonly referred to as the force field problem; and (b) sampling the energy landscape to find the lowest-energy configuration, which is hypothesized to be the native state of the structure in solution. The two problems are interwoven and have to be solved simultaneously. This thesis is composed of three major contributions. In the first chapter we describe a novel high-resolution protein structure refinement algorithm called GRID. In the second chapter we present REMCGRID, an algorithm for generation of low-energy decoy sets. In the third chapter, we present a machine learning approach to ranking decoys by incorporating coarse-grain features of protein structures.
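
As a rough illustration of what "sampling the energy landscape" means, the sketch below runs a generic Metropolis Monte Carlo search on a one-dimensional toy energy function. It is not the GRID or REMCGRID algorithm, whose details the abstract does not give; the function `toy_energy`, the step size, the temperature, and the seed are all assumptions chosen only to make the example runnable.

```python
import math
import random


def toy_energy(x: float) -> float:
    """Hypothetical double-well energy; a stand-in for a real force field."""
    return (x * x - 1.0) ** 2 + 0.3 * x


def metropolis_minimize(energy, x0, steps=5000, step_size=0.5,
                        temperature=0.2, seed=0):
    """Generic Metropolis sampling that tracks the lowest energy visited.

    All parameter values are illustrative assumptions, not thesis settings.
    """
    rng = random.Random(seed)
    x, e = x0, energy(x0)
    best_x, best_e = x, e
    for _ in range(steps):
        # Propose a small random move from the current configuration.
        candidate = x + rng.uniform(-step_size, step_size)
        e_candidate = energy(candidate)
        delta = e_candidate - e
        # Always accept downhill moves; accept uphill moves with
        # Boltzmann probability exp(-delta / T) so the walk can escape
        # local minima.
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            x, e = candidate, e_candidate
            if e < best_e:
                best_x, best_e = x, e
    return best_x, best_e


if __name__ == "__main__":
    best_x, best_e = metropolis_minimize(toy_energy, x0=2.0)
    print(f"lowest energy found: {best_e:.3f} at x = {best_x:.3f}")
```

The occasional acceptance of uphill moves is what distinguishes sampling from plain descent: it lets the search cross barriers between minima, at the cost of needing an accurate energy function to rank what it finds, which is why the two problems have to be addressed together.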

Abstract:

With recent advances in high-throughput sequencing, mapping of genome-wide transcription factor occupancy has become feasible. To advance the understanding of skeletal muscle differentiation specifically and transcriptional regulation in general, I determined the genome-wide occupancy map for myogenin in differentiating C2C12 myocyte cells. I then analyzed the myogenin map for underlying sequence content and the association between occupied elements and expression trajectories of adjacent genes. Having determined that myogenin primarily associates with expressed genes, I performed a similar analysis on occupancy maps of other transcription factors active during skeletal muscle differentiation, including an extensive analysis of co-occupancy. This analysis provided strong motif evidence for protein-protein interactions as the primary driving force in the formation of Myogenin / Mef2 and MyoD / AP-1 complexes at jointly-occupied sites. Finally, factor occupancy analysis was extended to include bHLH transcription factors in tissues other than skeletal muscle. The cross-tissue analysis led to the emergence of a motif structure used by bHLH TFs to encode either tissue-specific or "general" (public) access in a variety of lineages.

Abstract:

The investigations presented in this thesis use various in vivo techniques to understand how trans-acting factors control gene expression. The first part addresses the transcriptional regulation of muscle creatine kinase (MCK). MCK expression is activated during the course of development and is found only in differentiated muscle. Several in vivo footprints are observed at the enhancer of this gene, but all of these interactions are limited to cell types that express MCK. This is interesting because two of the footprints appear to represent muscle-specific use of general transcription factors, while the other two correspond to sites that can bind the myogenic regulator, MyoD1, in vitro. MyoD1 and these general factors are present in myoblasts, but can bind to the enhancer only in myocytes. This suggests that either the factors themselves are post-translationally modified (phosphorylation or protein:protein interactions), or the accessibility of the enhancer to the factors is limited (changes in chromatin structure). The in vivo footprinting study of MCK was performed with a new ligation-mediated, single-sided PCR (polymerase chain reaction) technique that I have developed.

The second half of the thesis concerns the regulation of mouse metallothionein (MT). Metallothioneins are a family of highly conserved housekeeping genes whose expression can be induced by heavy metals, steroids, and other stresses. By adapting a primer extension method of genomic sequencing to in vivo footprinting, I've observed both metal inducible and noninducible interactions at the promoter of MT-I. From these results I've been able to limit the possible mechanisms by which metal responsive trans-acting factors induce transcription. These interpretations correlate with a second line of experiments involving the stable titration of positive acting factors necessary for induction of MT. I've amplified the promoter of MT to 10^2-10^3 copies per cell by fusing the 5' and 3' ends of the MT gene to the coding region of DHFR and selecting cells for methotrexate resistance. In these cells, there is a metal-specific titration effect, and although it acts at the level of transcription, it appears to be independent of direct DNA binding factors.

Abstract:

Transcription factor p53 is the most commonly altered gene in human cancer. As a redox-active protein in direct contact with DNA, p53 can directly sense oxidative stress through DNA-mediated charge transport. Electron hole transport occurs with a shallow distance dependence over long distances through the π-stacked DNA bases, leading to the oxidation and dissociation of DNA-bound p53. The extent of p53 dissociation depends upon the redox potential of the response element DNA in direct contact with each p53 monomer. The DNA sequence dependence of p53 oxidative dissociation was examined by electrophoretic mobility shift assays using radiolabeled oligonucleotides containing both synthetic and human p53 response elements with an appended anthraquinone photooxidant. Greater p53 dissociation is observed from DNA sequences containing low redox potential purine regions, particularly guanine triplets, within the p53 response element. Using denaturing polyacrylamide gel electrophoresis of irradiated anthraquinone-modified DNA, the DNA damage sites, which correspond to locations of preferred electron hole localization, were determined. The resulting DNA damage preferentially localizes to guanine doublets and triplets within the response element. Oxidative DNA damage is inhibited in the presence of p53, but only at DNA sites within the response element, which are therefore in direct contact with p53. From these data, predictions about the sensitivity of human p53-binding sites to oxidative stress, as well as possible biological implications, have been made. On the basis of our data, the guanine pattern within the purine region of each p53-binding site determines the response of p53 to DNA-mediated oxidation, yielding for some sequences the oxidative dissociation of p53 from a distance and thereby providing another potential role for DNA charge transport chemistry within the cell.

To determine whether the change in p53 response element occupancy observed in vitro also correlates in cellulo, chromatin immunoprecipitation (ChIP) and quantitative PCR (qPCR) were used to directly quantify p53 binding to certain response elements in HCT116N cells. The HCT116N cells containing a wild-type p53 were treated with the photooxidant [Rh(phi)2bpy]3+ and with Nutlin-3 to upregulate p53, and were subsequently irradiated to induce oxidative genomic stress. To covalently tether p53 interacting with DNA, the cells were fixed with disuccinimidyl glutarate and formaldehyde. The nuclei of the harvested cells were isolated, sonicated, and immunoprecipitated using magnetic beads conjugated with a monoclonal p53 antibody. The purified immunoprecipitated DNA was then quantified via qPCR and genomic sequencing. The ChIP results varied significantly over ten experimental trials, but one overall trend is observed: greater variation of p53 occupancy is observed in response elements from which oxidative dissociation would be expected, while significantly less change in p53 occupancy occurs for response elements from which oxidative dissociation would not be anticipated.

The chemical oxidation of transcription factor p53 via DNA CT was also investigated with respect to the protein at the amino acid level. Transcription factor p53 plays a critical role in the cellular response to stress stimuli, which may be modulated through the redox modulation of conserved cysteine residues within the DNA-binding domain. Residues within p53 that enable oxidative dissociation are herein investigated. Of the 8 mutants studied by electrophoretic mobility shift assay (EMSA), only the C275S mutation significantly decreased the protein affinity (KD) for the Gadd45 response element. EMSAs of p53 oxidative dissociation promoted by photoexcitation of anthraquinone-tethered Gadd45 oligonucleotides were used to determine the influence of p53 mutations on oxidative dissociation; mutation to C275S severely attenuates oxidative dissociation while C277S substantially attenuates dissociation. Differential thiol labeling was used to determine the oxidation states of cysteine residues within p53 after DNA-mediated oxidation. Reduced cysteines were iodoacetamide labeled, while oxidized cysteines participating in disulfide bonds were 13C2D2-iodoacetamide labeled. Intensities of respective iodoacetamide-modified peptide fragments were analyzed using a QTRAP 6500 LC-MS/MS system, quantified with Skyline, and directly compared. A distinct shift in peptide labeling toward 13C2D2-iodoacetamide labeled cysteines is observed in oxidized samples as compared to the respective controls. All of the observable cysteine residues trend toward the heavy label under conditions of DNA CT, indicating the formation of multiple disulfide bonds potentially among C124, C135, C141, C182, C275, and C277. Based on these data it is proposed that disulfide formation involving C275 is critical for inducing oxidative dissociation of p53 from DNA.

Abstract:

Systems-level studies of biological systems rely on observations taken at a resolution lower than the essential unit of biology, the cell. Recent technical advances in DNA sequencing have enabled measurements of the transcriptomes in single cells excised from their environment, but it remains a daunting technical problem to reconstruct in situ gene expression patterns from sequencing data. In this thesis I develop methods for the routine, quantitative in situ measurement of gene expression using fluorescence microscopy.

The number of molecular species that can be measured simultaneously by fluorescence microscopy is limited by the palette of spectrally distinct fluorophores. Thus, fluorescence microscopy is traditionally limited to the simultaneous measurement of only five labeled biomolecules at a time. The two methods described in this thesis, super-resolution barcoding and temporal barcoding, represent strategies for overcoming this limitation to monitor expression of many genes in a single cell. Super-resolution barcoding employs optical super-resolution microscopy (SRM) and combinatorial labeling via smFISH (single molecule fluorescence in situ hybridization) to uniquely label individual mRNA species with distinct barcodes resolvable at nanometer resolution. This method dramatically increases the optical space in a cell, allowing a large number of barcodes to be visualized simultaneously. As a proof of principle this technology was used to study the S. cerevisiae calcium stress response. The second method, sequential barcoding, reads out a temporal barcode through multiple rounds of oligonucleotide hybridization to the same mRNA. The multiplexing capacity of sequential barcoding increases exponentially with the number of rounds of hybridization, allowing over a hundred genes to be profiled in only a few rounds of hybridization.
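
The exponential scaling can be made concrete with a short calculation. The sketch below assumes the simplest possible scheme, in which every mRNA shows exactly one of F colors in each of N rounds and no codes are reserved for error correction, so the number of distinct barcodes is F^N. The function names and the choice of five colors (taken from the five-fluorophore limit mentioned above) are illustrative assumptions, not details from the thesis.

```python
def barcode_capacity(num_colors: int, num_rounds: int) -> int:
    """Distinct barcodes when each round shows one of num_colors (F**N)."""
    if num_colors < 1 or num_rounds < 0:
        raise ValueError("need num_colors >= 1 and num_rounds >= 0")
    return num_colors ** num_rounds


def rounds_needed(num_genes: int, num_colors: int) -> int:
    """Smallest number of rounds whose capacity covers num_genes."""
    if num_genes < 1 or num_colors < 2:
        raise ValueError("need num_genes >= 1 and num_colors >= 2")
    rounds, capacity = 0, 1
    # Each extra round multiplies the number of available barcodes
    # by the number of colors.
    while capacity < num_genes:
        capacity *= num_colors
        rounds += 1
    return rounds


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, "rounds ->", barcode_capacity(5, n), "barcodes")
    print("rounds for 100 genes with 5 colors:", rounds_needed(100, 5))
```

Under these assumptions, five colors and three rounds give 125 barcodes, which is consistent with profiling over a hundred genes in only a few rounds.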

The utility of sequential barcoding was further demonstrated by adapting this method to study gene expression in mammalian tissues. Mammalian tissues suffer from both a large amount of autofluorescence and light scattering, making detection of smFISH probes on mRNA difficult. An amplified single molecule detection technology, smHCR (single molecule hybridization chain reaction), was developed to allow for the quantification of mRNA in tissue. This technology is demonstrated in combination with light sheet microscopy and background-reducing tissue clearing technology, enabling whole-organ sequential barcoding to monitor in situ gene expression directly in intact mammalian tissue.

The methods presented in this thesis, specifically sequential barcoding and smHCR, enable multiplexed transcriptional observations in any tissue of interest. These technologies will serve as a general platform for future transcriptomic studies of complex tissues.