5 resultados para Genome sequencing

em CaltechTHESIS


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Being able to detect a single molecule without the use of labels has been a long standing goal of bioengineers and physicists. This would simplify applications ranging from single molecular binding studies to those involving public health and security, improved drug screening, medical diagnostics, and genome sequencing. One promising technique that has the potential to detect single molecules is the microtoroid optical resonator. The main obstacle to detecting single molecules, however, is decreasing the noise level of the measurements such that a single molecule can be distinguished from background. We have used laser frequency locking in combination with balanced detection and data processing techniques to reduce the noise level of these devices and report the detection of a wide range of nanoscale objects ranging from nanoparticles with radii from 100 to 2.5 nm, to exosomes, ribosomes, and single protein molecules (mouse immunoglobulin G and human interleukin-2). We further extend the exosome results towards creating a non-invasive tumor biopsy assay. Our results, covering several orders of magnitude of particle radius (100 nm to 2 nm), agree with the `reactive' model prediction for the frequency shift of the resonator upon particle binding. In addition, we demonstrate that molecular weight may be estimated from the frequency shift through a simple formula, thus providing a basis for an ``optical mass spectrometer'' in solution. We anticipate that our results will enable many applications, including more sensitive medical diagnostics and fundamental studies of single receptor-ligand and protein-protein interactions in real time. The thesis summarizes what we have achieved thus far and shows that the goal of detecting a single molecule without the use of labels can now be realized.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The main focus of this thesis is the use of high-throughput sequencing technologies in functional genomics (in particular in the form of ChIP-seq, chromatin immunoprecipitation coupled with sequencing, and RNA-seq) and the study of the structure and regulation of transcriptomes. Some parts of it are of a more methodological nature while others describe the application of these functional genomic tools to address various biological problems. A significant part of the research presented here was conducted as part of the ENCODE (ENCyclopedia Of DNA Elements) Project.

The first part of the thesis focuses on the structure and diversity of the human transcriptome. Chapter 1 contains an analysis of the diversity of the human polyadenylated transcriptome based on RNA-seq data generated for the ENCODE Project. Chapter 2 presents a simulation-based examination of the performance of some of the most popular computational tools used to assemble and quantify transcriptomes. Chapter 3 includes a study of variation in gene expression, alternative splicing and allelic expression bias on the single-cell level and on a genome-wide scale in human lymphoblastoid cells; it also brings forward a number of critical to the practice of single-cell RNA-seq measurements methodological considerations.

The second part presents several studies applying functional genomic tools to the study of the regulatory biology of organellar genomes, primarily in mammals but also in plants. Chapter 5 contains an analysis of the occupancy of the human mitochondrial genome by TFAM, an important structural and regulatory protein in mitochondria, using ChIP-seq. In Chapter 6, the mitochondrial DNA occupancy of the TFB2M transcriptional regulator, the MTERF termination factor, and the mitochondrial RNA and DNA polymerases is characterized. Chapter 7 consists of an investigation into the curious phenomenon of the physical association of nuclear transcription factors with mitochondrial DNA, based on the diverse collections of transcription factor ChIP-seq datasets generated by the ENCODE, mouseENCODE and modENCODE consortia. In Chapter 8 this line of research is further extended to existing publicly available ChIP-seq datasets in plants and their mitochondrial and plastid genomes.

The third part is dedicated to the analytical and experimental practice of ChIP-seq. As part of the ENCODE Project, a set of metrics for assessing the quality of ChIP-seq experiments was developed, and the results of this activity are presented in Chapter 9. These metrics were later used to carry out a global analysis of ChIP-seq quality in the published literature (Chapter 10). In Chapter 11, the development and initial application of an automated robotic ChIP-seq (in which these metrics also played a major role) is presented.

The fourth part presents the results of some additional projects the author has been involved in, including the study of the role of the Piwi protein in the transcriptional regulation of transposon expression in Drosophila (Chapter 12), and the use of single-cell RNA-seq to characterize the heterogeneity of gene expression during cellular reprogramming (Chapter 13).

The last part of the thesis provides a review of the results of the ENCODE Project and the interpretation of the complexity of the biochemical activity exhibited by mammalian genomes that they have revealed (Chapters 15 and 16), an overview of the expected in the near future technical developments and their impact on the field of functional genomics (Chapter 14), and a discussion of some so far insufficiently explored research areas, the future study of which will, in the opinion of the author, provide deep insights into many fundamental but not yet completely answered questions about the transcriptional biology of eukaryotes and its regulation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The recombination-activating gene products, RAG1 and RAG2, initiate V(D)J recombination during lymphocyte development by cleaving DNA adjacent to conserved recombination signal sequences (RSSs). The reaction involves DNA binding, synapsis, and cleavage at two RSSs located on the same DNA molecule and results in the assembly of antigen receptor genes. Since their discovery full-length, RAG1 and RAG2 have been difficult to purify, and core derivatives are shown to be most active when purified from adherent 293-T cells. However, the protein yield from adherent 293-T cells is limited. Here we develop a human suspension cell purification and change the expression vector to boost RAG production 6-fold. We use these purified RAG proteins to investigate V(D)J recombination on a mechanistic single molecule level. As a result, we are able to measure the binding statistics (dwell times and binding energies) of the initial RAG binding events with or without its co-factor high mobility group box protein 1 (HMGB1), and to characterize synapse formation at the single-molecule level yielding insights into the distribution of dwell times in the paired complex and the propensity for cleavage upon forming the synapse. We then go on to investigate HMGB1 further by measuring it compact single DNA molecules. We observed concentration dependent DNA compaction, differential DNA compaction depending on the divalent cation type, and found that at a particular HMGB1 concentration the percentage of DNA compacted is conserved across DNA lengths. Lastly, we investigate another HMGB protein called TFAM, which is essential for packaging the mitochondrial genome. We present crystal structures of TFAM bound to the heavy strand promoter 1 (HSP1) and to nonspecific DNA. We show TFAM dimerization is dispensable for DNA bending and transcriptional activation, but is required for mtDNA compaction. We propose that TFAM dimerization enhances mtDNA compaction by promoting looping of mtDNA.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

With recent advances in high-throughput sequencing, mapping of genome-wide transcription factor occupancy has become feasible. To advance the understanding of skeletal muscle differentiation specifically and transcriptional regulation in general, I determined the genome-wide occupancy map for myogenin in differentiating C2C12 myocyte cells. I then analyzed the myogenin map for underlying sequence content and the association between occupied elements and expression trajectories of adjacent genes. Having determined that myogenin primarily associates with expressed genes, I performed a similar analysis on occupancy maps of other transcription factors active during skeletal muscle differentiation, including an extensive analysis of co-occupancy. This analysis provided strong motif evidence for protein-protein interactions as the primary driving force in the formation of Myogenin / Mef2 and MyoD / AP-1 complexes at jointly-occupied sites. Finally, factor occupancy analysis was extended to include bHLH transcription factors in tissues other than skeletal muscle. The cross-tissue analysis led to the emergence of a motif structure used by bHLH TFs to encode either tissue-specific or "general" (public) access in a variety of lineages.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Biological information storage and retrieval is a dynamic process that requires the genome to undergo dramatic structural rearrangements. Recent advances in single-molecule techniques have allowed precise quantification of the nano-mechanical properties of DNA [1, 2], and direct in vivo observation of molecules in action [3]. In this work, we will examine elasticity in protein-mediated DNA looping, whose structural rearrangement is essential for transcriptional regulation in both prokaryotes and eukaryotes. We will look at hydrodynamics in the process of viral DNA ejection, which mediates information transfer and exchange and has prominent implications in evolution. As in the case of Kepler's laws of planetary motion leading to Newton's gravitational theory, and the allometric scaling laws in biology revealing the organizing principles of complex networks [4], experimental data collapse in these biological phenomena has guided much of our studies and urged us to find the underlying physical principles.