978 results for gene discovery
Abstract:
Chromatin, composed of repeating nucleosome units, is the genetic polymer of life. To aid in DNA compaction and organized storage, the double helix wraps around a core complex of histone proteins to form the nucleosome, and is therefore no longer freely accessible to cellular proteins for the processes of transcription, replication and DNA repair. Over the course of evolution, DNA-based applications have developed routes to access DNA bound up in chromatin, and further, have actually utilized the chromatin structure to create another level of complexity and information storage. The histone molecules that DNA surrounds have free-floating tails that extend out of the nucleosome. These tails are post-translationally modified to create docking sites for the proteins involved in transcription, replication and repair, thus providing one prominent way that specific genomic sequences are accessed and manipulated. Adding another degree of information storage, histone tail-modifications paint the genome in precise manners to influence a state of transcriptional activity or repression, to generate euchromatin, containing gene-dense regions, or heterochromatin, containing repeat sequences and low-density gene regions. The work presented here is the study of histone tail modifications, how they are written and how they are read, divided into two projects. Both begin with protein microarray experiments where we discover the protein domains that can bind modified histone tails, and how multiple tail modifications can influence this binding. Project one then looks deeper into the enzymes that lay down the tail modifications. Specifically, we studied histone-tail arginine methylation by PRMT6. We found that methylation of a specific histone residue by PRMT6, arginine 2 of H3, can antagonize the binding of protein domains to the H3 tail and therefore affect transcription of genes regulated by the H3-tail binding proteins. 
Project two focuses on PHF20, a protein we identified as binding modified histone tails, and sought to discover its biological role. Thus, in total, we examine a complete process: (1) histone tail modification by an enzyme (here, PRMT6), (2) how this and other modifications are bound by conserved protein domains, and (3) using PHF20 as an example, the functional outcome of binding, investigated through the biological role of a chromatin reader.
Abstract:
The difficulty of detecting differential gene expression in microarray data has existed for many years. Several correction procedures address the multiple comparison problem: the Bonferroni and Sidak single-step p-value adjustments and Holm's step-down correction method control the family-wise error rate, while Benjamini and Hochberg's procedure controls the false discovery rate (FDR). Each multiple comparison technique has its advantages and weaknesses. We studied each multiple comparison method through numerical studies (simulations) and applied the methods to real exploratory DNA microarray data collected to detect molecular signatures in papillary thyroid cancer (PTC) patients. According to the results of our simulation studies, the Benjamini and Hochberg step-up FDR-controlling procedure performed best among these multiple comparison methods, and we discovered 1277 potential biomarkers among 54675 probe sets after applying the Benjamini and Hochberg method to the PTC microarray data.
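The four adjustments named in this abstract can be sketched in a few lines. The code below is a minimal illustration using only the Python standard library, not the implementation used in the study; the function names and the example p-values are assumptions made for the illustration. Each function returns adjusted p-values in the original input order.

```python
def bonferroni(pvals):
    """Single-step Bonferroni: multiply each p-value by the number of tests."""
    m = len(pvals)
    return [min(1.0, p * m) for p in pvals]


def sidak(pvals):
    """Single-step Sidak: 1 - (1 - p)^m, exact for independent tests."""
    m = len(pvals)
    return [1.0 - (1.0 - p) ** m for p in pvals]


def holm(pvals):
    """Holm step-down: scale the i-th smallest p-value by (m - i), keep monotone."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):  # rank 0 is the smallest p-value
        value = min(1.0, (m - rank) * pvals[idx])
        running_max = max(running_max, value)  # enforce non-decreasing order
        adjusted[idx] = running_max
    return adjusted


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg step-up: scale the i-th smallest p-value by m / i."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m - 1, -1, -1):  # walk from the largest p-value down
        idx = order[rank]
        value = min(1.0, pvals[idx] * m / (rank + 1))
        running_min = min(running_min, value)  # enforce non-increasing order
        adjusted[idx] = running_min
    return adjusted


if __name__ == "__main__":
    example = [0.01, 0.04, 0.03, 0.005]  # made-up p-values for illustration
    for name, method in [("Bonferroni", bonferroni), ("Sidak", sidak),
                         ("Holm", holm), ("BH", benjamini_hochberg)]:
        print(name, [round(q, 4) for q in method(example)])
```

A probe set would be called significant when its adjusted p-value falls below the chosen level. Because the Benjamini and Hochberg adjustment is never larger than the Holm adjustment for the same input, it rejects at least as many hypotheses, which is consistent with its use for exploratory screening in the abstract.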
Abstract:
My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status. Section 1: The cost of DNA sequencing has reduced dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However it remains elusive how the sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves along with the increasing sequencing depth. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model with a lower false discovery rate. Section 2: Although a number of methods have been proposed in order to accurately analyze differential RNA expression on the gene level, modeling on the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases on the base pair level. Also, we propose four models and compare them with each other. As expected, our beta binomial model with a dynamic overdispersion rate is shown to be superior. Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe an undiscovered bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. Also, we find that this influence is related to the local sequence of the random hexamer that is used in priming. 
We suggest a model of the inequality between samples to correct this type of bias. Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume that the thresholds are specific to each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called “L-shaped” genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of BRCA. In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. We also reveal a new bias in RNA-seq and characterize the relationship between this bias and the local sequence. Finally, we develop a powerful method to dichotomize methylation status and, with it, identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.
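The contrast between the binomial and beta-binomial models mentioned in this abstract can be illustrated with a short sketch. It uses the standard beta-binomial parameterized by a mean proportion and an overdispersion rate rho = 1 / (alpha + beta + 1). The abstract does not give the form of its depth-dependence parameter, so the function `rho_at_depth` below is a purely hypothetical power-law stand-in, and all function names and numbers are assumptions made for illustration.

```python
import math


def log_beta(a, b):
    """Logarithm of the Beta function, via log-gamma for numerical stability."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def beta_binomial_pmf(k, n, mean, rho):
    """P(K = k) for a beta-binomial with n trials, mean proportion and rho in (0, 1)."""
    total = 1.0 / rho - 1.0          # alpha + beta
    alpha = mean * total
    beta = (1.0 - mean) * total
    log_choose = (math.lgamma(n + 1) - math.lgamma(k + 1)
                  - math.lgamma(n - k + 1))
    return math.exp(log_choose + log_beta(k + alpha, n - k + beta)
                    - log_beta(alpha, beta))


def binomial_variance(n, mean):
    """Variance of a plain binomial count."""
    return n * mean * (1.0 - mean)


def beta_binomial_variance(n, mean, rho):
    """Binomial variance inflated by the factor 1 + (n - 1) * rho."""
    return binomial_variance(n, mean) * (1.0 + (n - 1) * rho)


def rho_at_depth(n, rho0, gamma):
    """HYPOTHETICAL depth dependence: rho shrinks as a power of the depth n.

    This form is an assumption for illustration; the abstract only states
    that overdispersion decreases as sequencing depth increases.
    """
    return rho0 * n ** (-gamma)


if __name__ == "__main__":
    n, mean = 20, 0.3
    rho = 0.1
    print("binomial variance:     ", binomial_variance(n, mean))
    print("beta-binomial variance:", beta_binomial_variance(n, mean, rho))
    for depth in (100, 400, 1600):
        print(depth, rho_at_depth(depth, rho0=0.5, gamma=0.5))
```

With a fixed rho the variance inflation factor grows with depth, so letting rho depend on depth is one way to encode the observation that accuracy improves as sequencing depth increases.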
Abstract:
The use of data mining techniques for discovering the gene profiles of diseases, such as cancer, is becoming common in research. These techniques do not usually analyze in depth how the relationships between genes depend on the different manifestations of the disease across patients. This kind of analysis takes a considerable amount of time and is not always the focus of the research. However, it is crucial for generating personalized treatments to fight the disease. Thus, this research focuses on finding a mechanism for gene profile analysis that can be used by medical and biological experts. Results: In this research, the MedVir framework is proposed. It is an intuitive mechanism based on the visualization of medical data such as gene profiles, patients, clinical data, etc. MedVir, which is based on an Evolutionary Optimization technique, is a Dimensionality Reduction (DR) approach that presents the data in a three-dimensional space. Furthermore, thanks to Virtual Reality technology, MedVir allows experts to interact with the data and tailor it to their own experience and knowledge.
Abstract:
Carbon catabolite repression (CCR) of several Bacillus subtilis catabolic genes is mediated by ATP-dependent phosphorylation of histidine-containing protein (HPr), a phosphocarrier protein of the phosphoenolpyruvate (PEP): sugar phosphotransferase system. In this study, we report the discovery of a new B. subtilis gene encoding an HPr-like protein, Crh (for catabolite repression HPr), composed of 85 amino acids. Crh exhibits 45% sequence identity with HPr, but the active site His-15 of HPr is replaced with a glutamine in Crh. Crh is therefore not phosphorylated by PEP and enzyme I, but is phosphorylated by ATP and the HPr kinase in the presence of fructose-1,6-bisphosphate. We determined Ser-46 as the site of phosphorylation in Crh by carrying out mass spectrometry with peptides obtained by tryptic digestion or CNBr cleavage. In a B. subtilis ptsH1 mutant strain, synthesis of β-xylosidase, inositol dehydrogenase, and levanase was only partially relieved from CCR. Additional disruption of the crh gene caused almost complete relief from CCR. In a ptsH1 crh1 mutant, producing HPr and Crh in which Ser-46 is replaced with a nonphosphorylatable alanyl residue, expression of β-xylosidase was also completely relieved from glucose repression. These results suggest that CCR of certain catabolic operons requires, in addition to CcpA, ATP-dependent phosphorylation of Crh and HPr at Ser-46.
Abstract:
Tuberculosis is a chronic infectious disease that is transmitted by cough-propelled droplets that carry the etiologic bacterium, Mycobacterium tuberculosis. Although currently available drugs kill most isolates of M. tuberculosis, strains resistant to each of these have emerged, and multiply resistant strains are increasingly widespread. The growing problem of drug resistance combined with a global incidence of seven million new cases per year underscore the urgent need for new antituberculosis therapies. The recent publication of the complete sequence of the M. tuberculosis genome has made possible, for the first time, a comprehensive genomic approach to the biology of this organism and to the drug discovery process. We used a DNA microarray containing 97% of the ORFs predicted from this sequence to monitor changes in M. tuberculosis gene expression in response to the antituberculous drug isoniazid. Here we show that isoniazid induced several genes that encode proteins physiologically relevant to the drug’s mode of action, including an operonic cluster of five genes encoding type II fatty acid synthase enzymes and fbpC, which encodes trehalose dimycolyl transferase. Other genes, not apparently within directly affected biosynthetic pathways, also were induced. These genes, efpA, fadE23, fadE24, and ahpC, likely mediate processes that are linked to the toxic consequences of the drug. Insights gained from this approach may define new drug targets and suggest new methods for identifying compounds that inhibit those targets.
Abstract:
Gene silencing is an important but little understood regulatory mechanism in plants. Here we report that a viral sequence, initially identified as a mediator of synergistic viral disease, acts to suppress the establishment of both transgene-induced and virus-induced posttranscriptional gene silencing. The viral suppressor of silencing comprises the 5′-proximal region of the tobacco etch potyviral genomic RNA encoding P1, helper component-proteinase (HC-Pro) and a small part of P3, and is termed the P1/HC-Pro sequence. A reversal of silencing assay was used to assess the effect of the P1/HC-Pro sequence on transgenic tobacco plants (line T4) that are posttranscriptionally silenced for the uidA reporter gene. Silencing was lifted in offspring of T4 crosses with four independent transgenic lines expressing P1/HC-Pro, but not in offspring of control crosses. Viral vectors were used to assess the effect of P1/HC-Pro expression on virus-induced gene silencing (VIGS). The ability of a potato virus X vector expressing green fluorescent protein to induce silencing of a green fluorescent protein transgene was eliminated or greatly reduced when P1/HC-Pro was expressed from the same vector or from coinfecting potato virus X vectors. Expression of the HC-Pro coding sequence alone was sufficient to suppress virus-induced gene silencing, and the HC-Pro protein product was required for the suppression. This discovery points to the role of gene silencing as a natural antiviral defense system in plants and offers different approaches to elucidate the molecular basis of gene silencing.
Abstract:
Telomerase is an essential enzyme that maintains telomeres on eukaryotic chromosomes. In mammals, telomerase is required for the lifelong proliferative capacity of normal regenerative and reproductive tissues and for sustained growth in a dedifferentiated state. Although the importance of telomeres was first elucidated in plants 60 years ago, little is known about the role of telomeres and telomerase in plant growth and development. Here we report the cloning and characterization of the Arabidopsis telomerase reverse transcriptase (TERT) gene, AtTERT. AtTERT is predicted to encode a highly basic protein of 131 kDa that harbors the reverse transcriptase and telomerase-specific motifs common to all known TERT proteins. AtTERT mRNA is 10–20 times more abundant in callus, which has high levels of telomerase activity, versus leaves, which contain no detectable telomerase. Plants homozygous for a transfer DNA insertion into the AtTERT gene lack telomerase activity, confirming the identity and function of this gene. Because telomeres in wild-type Arabidopsis are short, the discovery that telomerase-null plants are viable for at least two generations was unexpected. In the absence of telomerase, telomeres decline by approximately 500 bp per generation, a rate 10 times slower than seen in telomerase-deficient mice. This gradual loss of telomeric DNA may reflect a reduced rate of nucleotide depletion per round of DNA replication, or the requirement for fewer cell divisions per organismal generation. Nevertheless, progressive telomere shortening in the mutants, however slow, ultimately should be lethal.
Abstract:
X-linked lymphoproliferative syndrome (XLP) is an inherited immunodeficiency characterized by increased susceptibility to Epstein–Barr virus (EBV). In affected males, primary EBV infection leads to the uncontrolled proliferation of virus-containing B cells and reactive cytotoxic T cells, often culminating in the development of high-grade lymphoma. The XLP gene has been mapped to chromosome band Xq25 through linkage analysis and the discovery of patients harboring large constitutional genomic deletions. We describe here the presence of small deletions and intragenic mutations that specifically disrupt a gene named DSHP in 6 of 10 unrelated patients with XLP. This gene encodes a predicted protein of 128 amino acids composing a single SH2 domain with extensive homology to the SH2 domain of SHIP, an inositol polyphosphate 5-phosphatase that functions as a negative regulator of lymphocyte activation. DSHP is expressed in transformed T cell lines and is induced following in vitro activation of peripheral blood T lymphocytes. Expression of DSHP is restricted in vivo to lymphoid tissues, and RNA in situ hybridization demonstrates DSHP expression in activated T and B cell regions of reactive lymph nodes and in both T and B cell neoplasms. These observations confirm the identity of DSHP as the gene responsible for XLP, and suggest a role in the regulation of lymphocyte activation and proliferation. Induction of DSHP may sustain the immune response by interfering with SHIP-mediated inhibition of lymphocyte activation, while its inactivation in XLP patients results in a selective immunodeficiency to EBV.
Abstract:
The cell matrix adhesion regulator (CMAR) gene has been suggested to be a signal transduction molecule influencing cell adhesion to collagen and, through this, possibly involved in tumor suppression. The originally reported CMAR cDNA was 464 bp long with a tyrosine phosphorylation site at the extreme 3′ end, which mutagenesis studies had shown to be central to the function of this gene. Since the discovery of a 4-bp insertion polymorphism within the originally reported coding region, further sequence information has been obtained. The cDNA has been extended 5′ by ≈2 kb revealing a 559-bp region showing strong homology to the proposed 5′ untranslated sequence of a murine protein kinase receptor family member, variant in kinase (vik). CMAR genomic sequencing has shown the presence of an intron, the intron/exon boundary lying within this region of homology. An RNA transcript for CMAR of ≈2.5 kb has also been identified. The data suggest complex mechanisms for control of expression of two closely associated genes, CMAR and the vik-associated sequence.
Abstract:
The discovery that the dilute gene encodes a class V myosin led to the hypothesis that this molecular motor is involved in melanosome transport and/or dendrite outgrowth in mammalian melanocytes. The present studies were undertaken to gain insight into the subcellular distribution of myosin-V in the melanoma cell line B16-F10, which is wild-type for the dilute gene. Immunofluorescence studies showed some degree of superimposed labeling of myosin-V with melanosomes that predominated at the cell periphery. A subcellular fraction highly enriched in melanosomes was also enriched in myosin-V based on Western blot analysis. Immunoelectron microscopy showed myosin-V labeling associated with melanosomes and other organelles. The stimulation of B16 cells with the α-melanocyte-stimulating hormone led to a significant increase in myosin-V expression. This is the first evidence that a cAMP signaling pathway might regulate the dilute gene expression. Immunofluorescence also showed an intense labeling of myosin-V independent of melanosomes that was observed within the dendrites and at the perinuclear region. Although the results presented herein are consistent with the hypothesis that myosin-V might act as a motor for melanosome translocation, they also suggest a broader cytoplasmic function for myosin-V, acting on other types of organelles or in cytoskeletal dynamics.
Abstract:
The proper localization of resident membrane proteins to the trans-Golgi network (TGN) involves mechanisms for both TGN retention and retrieval from post-TGN compartments. In this study we report identification of a new gene, GRD20, involved in protein sorting in the TGN/endosomal system of Saccharomyces cerevisiae. A strain carrying a transposon insertion allele of GRD20 exhibited rapid vacuolar degradation of the resident TGN endoprotease Kex2p and aberrantly secreted ∼50% of the soluble vacuolar hydrolase carboxypeptidase Y. The Kex2p mislocalization and carboxypeptidase Y missorting phenotypes were exhibited rapidly after loss of Grd20p function in grd20 temperature-sensitive mutant strains, indicating that Grd20p plays a direct role in these processes. Surprisingly, little if any vacuolar degradation was observed for the TGN membrane proteins A-ALP and Vps10p, underscoring a difference in trafficking patterns for these proteins compared with that of Kex2p. A grd20 null mutant strain exhibited extremely slow growth and a defect in polarization of the actin cytoskeleton, and these two phenotypes were invariably linked in a collection of randomly mutagenized grd20 alleles. GRD20 encodes a hydrophilic protein that partially associates with the TGN. The discovery of GRD20 suggests a link between the cytoskeleton and function of the yeast TGN.
Abstract:
Cloning and sequencing of the upstream region of the gene of the CC chemokine HCC-1 led to the discovery of an adjacent gene coding for a CC chemokine that was named “HCC-2.” The two genes are separated by 12-kbp and reside in a head-to-tail orientation on chromosome 17. At variance with the genes for HCC-1 and other human CC chemokines, which have a three-exon-two-intron structure, the HCC-2 gene consists of four exons and three introns. Expression of HCC-2 and HCC-1 as studied by Northern analysis revealed, in addition to the regular, monocistronic mRNAs, a common, bicistronic transcript. In contrast to HCC-1, which is expressed constitutively in numerous human tissues, HCC-2 is expressed only in the gut and the liver. HCC-2 shares significant sequence homology with CKβ8 and the murine chemokines C10, CCF18/MRP-2, and macrophage inflammatory protein 1γ, which all contain six instead of four conserved cysteines. The two additional cysteines of HCC-2 form a third disulfide bond, which anchors the COOH-terminal domain to the core of the molecule. Highly purified recombinant HCC-2 was tested on neutrophils, eosinophils, monocytes, and lymphocytes and was found to exhibit marked functional similarities to macrophage inflammatory protein 1α. It is a potent chemoattractant and inducer of enzyme release in monocytes and a moderately active attractant for eosinophils. Desensitization studies indicate that HCC-2 acts mainly via CC chemokine receptor CCR1.
Abstract:
Mitochondrial genomes of all vertebrate animals analyzed to date have the same 37 genes, whose arrangement in the circular DNA molecule varies only in the relative position of a few genes. This relative conservation suggests that mitochondrial gene order characters have potential utility as phylogenetic markers for higher-level vertebrate taxa. We report discovery of a mitochondrial gene order that has had multiple independent originations within birds, based on sampling of 137 species representing 13 traditionally recognized orders. This provides evidence of parallel evolution in mitochondrial gene order for animals. Our results indicate operation of physical constraints on mitochondrial gene order changes and support models for gene order change based on replication error. Bird mitochondria have a displaced OL (origin of light-strand replication site) as do various other Reptilia taxa prone to gene order changes. Our findings point to the need for broad taxonomic sampling in using mitochondrial gene order for phylogenetic analyses. We found, however, that the alternative mitochondrial gene orders distinguish the two primary groups of songbirds (order Passeriformes), oscines and suboscines, in agreement with other molecular as well as morphological data sets. Thus, although mitochondrial gene order characters appear susceptible to some parallel evolution because of mechanistic constraints, they do hold promise for phylogenetic studies.
Abstract:
The biological bases of learning and memory are being revealed today with a wide array of molecular approaches, most of which entail the analysis of dysfunction produced by gene disruptions. This perspective derives both from early “genetic dissections” of learning in mutant Drosophila by Seymour Benzer and colleagues and from earlier behavior-genetic analyses of learning and in Diptera by Jerry Hirsch and coworkers. Three quantitative-genetic insights derived from these latter studies serve as guiding principles for the former. First, interacting polygenes underlie complex traits. Consequently, learning/memory defects associated with single-gene mutants can be quantified accurately only in equilibrated, heterogeneous genetic backgrounds. Second, complex behavioral responses will be composed of genetically distinct functional components. Thus, genetic dissection of complex traits into specific biobehavioral properties is likely. Finally, disruptions of genes involved with learning/memory are likely to have pleiotropic effects. As a result, task-relevant sensorimotor responses required for normal learning must be assessed carefully to interpret performance in learning/memory experiments. In addition, more specific conclusions will be obtained from reverse-genetic experiments, in which gene disruptions are restricted in time and/or space.