989 resultados para 060407 Genome Structure and Regulation
Resumo:
Insects are useful models for the study of innate immune reactions and development. The distinction between recognition mechanisms preceding the breakdown of apoptotic cells during metamorphosis, and the breakdown of cells in response to infections, is unclear. Hemolin, a Lepidopteran member of the immunoglobulin superfamily, is a candidate molecule in self/nonself recognition. This thesis investigates hemolin function and hemolin gene regulation at a molecular level. We investigated the binding and cell adhesion properties of hemolin from H. cecropia and demonstrated that the proteins could homodimerize in presence of calcium. Moreover, a higher molecular weight membrane form of hemolin was present on hemocytes. These results, taken together with an earlier finding that soluble hemolin inhibits hemocyte adhesion, indicated that the secreted hemolin could modulate hemocyte aggregation in a competitive manner in the blood. In addition, hemolin was expressed in different tissues and at different developmental stages. Since hemolin is expressed both during development and during the immune response, its different regulatory factors must act in concert. We found that the third intron contains an enhancer, through which Dif, C/EBP and HMGI synergistically activate a reporter construct in vitro. We concluded that the enhancer is used during infection, since the κB-site is crucial for an immune response. Interestingly, we also found that the active form of the steroid hormone, ecdysone, induces the hemolin gene transcription in vivo, and in addition, acts synergistically during bacterial infection. Preliminary in vivo results indicate a secondary effect of ecdysone and the importance of hormone receptor elements in the upstream promoter region of hemolin. To explore the use of Drosophila as a genetic tool for understanding hemolin function and regulation, we sought to isolate the functional homologue in this species. A fly cDNA library in yeast was screened using H. cecropia hemolin as bait. The screen was not successful. However, it did lead to the discovery of a Drosophila protein with true binding specificity for hemolin. Subsequent characterization revealed a new, highly conserved gene, which we named yippee. Yippee is distantly related to zinc finger proteins and represents a novel family of proteins present in numerous eukaryotes, including fungi, plants and humans. Notably, when the Drosophila genome sequence was revealed, no hemolin orthologue could be detected. Finally, an extensive Drosophila genome chip analysis was initiated. The goal was to investigate the Drosophila immune response, and, in contrast to earlier studies of artificially injected flies, to examine a set of natural microbes, orally and externally applied. In parallel experiments viruses, bacteria, fungi and parasites were compared to unchallenged controls. We obtained a unique set of genes that were up-regulated in the response to the parasite Octosporea muscadomesticae and to the fungus Beauveria bassiana. We expect both down-regulated and up-regulated genes to serve as a source for the discovery of new effector molecules, in particular those that are active against parasites and fungi.
Resumo:
The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.
Resumo:
Eukaryotic ribosomal DNA constitutes a multi gene family organized in a cluster called nucleolar organizer region (NOR); this region is composed usually by hundreds to thousands of tandemly repeated units. Ribosomal genes, being repeated sequences, evolve following the typical pattern of concerted evolution. The autonomous retroelement R2 inserts in the ribosomal gene 28S, leading to defective 28S rDNA genes. R2 element, being a retrotransposon, performs its activity in the genome multiplying its copy number through a “copy and paste” mechanism called target primed reverse transcription. It consists in the retrotranscription of the element’s mRNA into DNA, then the DNA is integrated in the target site. Since the retrotranscription can be interrupted, but the integration will be carried out anyway, truncated copies of the element will also be present in the genome. The study of these truncated variants is a tool to examine the activity of the element. R2 phylogeny appears, in general, not consistent with that of its hosts, except some cases (e.g. Drosophila spp. and Reticulitermes spp.); moreover R2 is absent in some species (Fugu rubripes, human, mouse, etc.), while other species have more R2 lineages in their genome (the turtle Mauremys reevesii, the Japanese beetle Popilia japonica, etc). R2 elements here presented are isolated in 4 species of notostracan branchiopods and in two species of stick insects, whose reproductive strategies range from strict gonochorism to unisexuality. From sequencing data emerges that in Triops cancriformis (Spanish gonochoric population), in Lepidurus arcticus (two putatively unisexual populations from Iceland) and in Bacillus rossius (gonochoric population from Capalbio) the R2 elements are complete and encode functional proteins, reflecting the general features of this family of transposable elements. On the other hand, R2 from Italian and Austrian populations of T. cancriformis (respectively unisexual and hermaphroditic), Lepidurus lubbocki (two elements within the same Italian population, gonochoric but with unfunctional males) and Bacillus grandii grandii (gonochoric population from Ponte Manghisi) have sequences that encode incomplete or non-functional proteins in which it is possible to recognize only part of the characteristic domains. In Lepidurus couesii (Italian gonochoric populations) different elements were found as in L. lubbocki, and the sequencing is still in progress. Two hypothesis are given to explain the inconsistency of R2/host phylogeny: vertical inheritance of the element followed by extinction/diversification or horizontal transmission. My data support previous study that state the vertical transmission as the most likely explanation; nevertheless horizontal transfer events can’t be excluded. I also studied the element’s activity in Spanish populations of T. cancriformis, in L. lubbocki, in L. arcticus and in gonochoric and parthenogenetic populations of B. rossius. In gonochoric populations of T. cancriformis and B. rossius I found that each individual has its own private set of truncated variants. The situation is the opposite for the remaining hermaphroditic/parthenogenetic species and populations, all individuals sharing – in the so far analyzed samples - the majority of variants. This situation is very interesting, because it isn’t concordant with the Muller’s ratchet theory that hypothesizes the parthenogenetic populations being either devoided of transposable elements or TEs overloaded. My data suggest a possible epigenetic mechanism that can block the retrotransposon activity, and in this way deleterious mutations don’t accumulate.
Resumo:
The research presented in my PhD thesis is part of a wider European project, FishPopTrace, focused on traceability of fish populations and products. My work was aimed at developing and analyzing novel genetic tools for a widely distributed marine fish species, the European hake (Merluccius merluccius), in order to investigate population genetic structure and explore potential applications to traceability scenarios. A total of 395 SNPs (Single Nucleotide Polymorphisms) were discovered from a massive collection of Expressed Sequence Tags, obtained by high-throughput sequencing, and validated on 19 geographic samples from Atlantic and Mediterranean. Genome-scan approaches were applied to identify polymorphisms on genes potentially under divergent selection (outlier SNPs), showing higher genetic differentiation among populations respect to the average observed across loci. Comparative analysis on population structure were carried out on putative neutral and outlier loci at wide (Atlantic and Mediterranean samples) and regional (samples within each basin) spatial scales, to disentangle the effects of demographic and adaptive evolutionary forces on European hake populations genetic structure. Results demonstrated the potential of outlier loci to unveil fine scale genetic structure, possibly identifying locally adapted populations, despite the weak signal showed from putative neutral SNPs. The application of outlier SNPs within the framework of fishery resources management was also explored. A minimum panel of SNP markers showing maximum discriminatory power was selected and applied to a traceability scenario aiming at identifying the basin (and hence the stock) of origin, Atlantic or Mediterranean, of individual fish. This case study illustrates how molecular analytical technologies have operational potential in real-world contexts, and more specifically, potential to support fisheries control and enforcement and fish and fish product traceability.
Resumo:
Tumor necrosis factor receptor p75/80 ((TNF-R p75/80) is a 75 kDa type 1 transmembrane protein expressed predominately on cells of hematopoietic lineage. TNF-R p75/80 belongs to the TNF receptor superfamily characterized by cysteine-rich extracellular regions composed of three to six disulfide-linked domains. In the present report, we have characterized, for the first time, the complete gene structure for human TNF-R p75/80 which spans approximately 43 kbp. The gene consists of 10 exons (ranging from 34 bp to 2.5 kbp) and 9 introns (343 bp to 19 kbp). Consensus elements for transcription factors involved in T cell development and activation were noted in the 5$\sp\prime$ flanking region including TCF-1, Ikaros, AP-1, CK-2, IL-6RE, ISRE, GAS, NF-$\kappa$B and SP1, as well as an unusually high GC content and CpG frequency that appears characteristic of some TNF-R family members. The unusual (GATA)$\sb{\rm n}$ and (GAA)(GGA) repeats found within intron 1 may prove useful for further genome analysis within the 1p36 chromosomal locus. The human TNF-R p75/80 gene structure will permit further assessment of its involvement in normal hematopoietic cell development and function, autoimmune disease, and non-random translocations in hematopoietic malignancies. The region 1.8 kb 5$\sp\prime$ of the ATG was able to drive luciferase expression when transfected into cell lines expressing TNF-R p75/80. Further characterization of the 5$\sp\prime$-regulatory region will aid in determining factors and signal transduction pathways involved in regulating TNF-R p75/80 expression. ^
Resumo:
To investigate the evolution of globin genes in the genus Xenopus, we have determined the primary structure of the related adult alpha I- and alpha II-globin genes of X. laevis and of the adult alpha-globin gene of X. tropicalis, including their 5'-flanking regions. All three genes are comprised of three exons and two introns at homologous positions. The exons are highly conserved and code for 141 amino acids. By contrast, the corresponding introns vary in length and show considerable divergence. Comparison of 900 bp of the 5'-flanking region revealed that the X. tropicalis gene contains a conserved proximal 310-bp promoter sequence, comprised of the canonical TATA and CCAAT motifs at homologous positions, and five conserved elements in the same order and at similar positions as previously shown for the corresponding genes of X. laevis. We therefore conclude that these conserved upstream elements may represent regulatory sequences for cell-specific regulation of the adult Xenopus globin genes.
Resumo:
Classical swine fever virus (CSFV) causes a highly contagious disease in pigs that can range from a severe haemorrhagic fever to a nearly unapparent disease, depending on the virulence of the virus strain. Little is known about the viral molecular determinants of CSFV virulence. The nonstructural protein NS4B is essential for viral replication. However, the roles of CSFV NS4B in viral genome replication and pathogenesis have not yet been elucidated. NS4B of the GPE- vaccine strain and of the highly virulent Eystrup strain differ by a total of seven amino acid residues, two of which are located in the predicted trans-membrane domains of NS4B and were described previously to relate to virulence, and five residues clustering in the N-terminal part. In the present study, we examined the potential role of these five amino acids in modulating genome replication and determining pathogenicity in pigs. A chimeric low virulent GPE- -derived virus carrying the complete Eystrup NS4B showed enhanced pathogenicity in pigs. The in vitro replication efficiency of the NS4B chimeric GPE- replicon was significantly higher than that of the replicon carrying only the two Eystrup-specific amino acids in NS4B. In silico and in vitro data suggest that the N-terminal part of NS4B forms an amphipathic α-helix structure. The N-terminal NS4B with these five amino acid residues is associated with the intracellular membranes. Taken together, this is the first gain-of-function study showing that the N-terminal domain of NS4B can determine CSFV genome replication in cell culture and viral pathogenicity in pigs.
Resumo:
Cell signaling by nitric oxide (NO) through soluble guanylyl cyclase (sGC) and cGMP production regulates physiological responses such as smooth muscle relaxation, neurotransmission, and cell growth and differentiation. Although the NO receptor, sGC, has been studied extensively at the protein level, information on regulation of the sGC genes remains elusive. In order to understand the molecular mechanisms involved at the level of gene expression, cDNA and genomic fragments of the murine sGCα1 subunit gene were obtained through library screenings. Using the acquired clones, the sGCα 1 gene structure was determined following primer extension, 3 ′RACE and intron/exon boundary analyses. The basal activity of several 5′-flanking regions (putative promoter regions) for both the α1 and β1 sGC subunits were determined following their transfection into mouse N1E-115 neuroblastoma and rat RENE1Δ14 uterine epithelial cells using a luciferase reporter plasmid. Using the sGC sequences, real-time RT-PCR assays were designed to measure mRNA levels of the sGC α1 and β1 genes in rat, mouse and human. Subsequent studies found that uterine sGC mRNA and protein levels decreased rapidly in response to 17β-estradiol (estrogen) in an in vivo rat model. As early as 1 hour following treatment, mRNA levels of both sGC mRNAs decreased, and reached their lowest level of expression after 3 hours. This in vivo response was completely blocked by the pure estrogen receptor antagonist, ICI 182,780, was not seen in several other tissues examined, did not occur in response to other steroid hormones, and was due to a post-transcriptional mechanism. Additional studies ex vivo and in various cell culture models suggested that the estrogen-mediated decreased sGC mRNA expression did not require signals from other tissues, but may require cell communication or paracrine factors between different cell types within the uterus. Using chemical inhibitors and molecular targeting in other related studies, it was revealed that c-Jun-N-terminal kinase (JNK) signaling was responsible for decreased sGC mRNA expression in rat PC12 and RFL-6 cells, two models previously determined to exhibit rapid decreased sGC mRNA expression in response to different stimuli. To further investigate the post-transcriptional gene regulation, the full length sGCα1 3′-untranslated region (3′UTR) was cloned from rat uterine tissue and ligated downstream of the rabbit β-globin gene and expressed as a chimeric mRNA in the rat PC12 and RFL-6 cell models. Expression studies with the chimeric mRNA showed that the sGCα 1 3′UTR was not sufficient to mediate the post-transcriptional regulation of its mRNA by JNK or cAMP signaling in PC12 and RFL-6 cells. This study has provided numerous valuable tools for future studies involving the molecular regulation of the sGC genes. Importantly, the present results identified a novel paradigm and a previously unknown signaling pathway for sGC mRNA regulation that could potentially be exploited to treat diseases such as uterine cancers, neuronal disorders, hypertension or various inflammatory conditions. ^
Resumo:
The capital structure and regulation of financial intermediaries is an important topic for practitioners, regulators and academic researchers. In general, theory predicts that firms choose their capital structures by balancing the benefits of debt (e.g., tax and agency benefits) against its costs (e.g., bankruptcy costs). However, when traditional corporate finance models have been applied to insured financial institutions, the results have generally predicted corner solutions (all equity or all debt) to the capital structure problem. This paper studies the impact and interaction of deposit insurance, capital requirements and tax benefits on a bankÇs choice of optimal capital structure. Using a contingent claims model to value the firm and its associated claims, we find that there exists an interior optimal capital ratio in the presence of deposit insurance, taxes and a minimum fixed capital standard. Banks voluntarily choose to maintain capital in excess of the minimum required in order to balance the risks of insolvency (especially the loss of future tax benefits) against the benefits of additional debt. Because we derive a closed- form solution, our model provides useful insights on several current policy debates including revisions to the regulatory framework for GSEs, tax policy in general and the tax exemption for credit unions.
Resumo:
It has been assumed that constitutive and regulated splicing of RNA polymerase II transcripts depends exclusively on signals present in the RNA molecule. Here we show that changes in promoter structure strongly affect splice site selection. We investigated the splicing of the ED I exon, which encodes a facultative type III repeat of fibronectin, whose inclusion is regulated during development and in proliferative processes. We used an alternative splicing assay combined with promoter swapping to demonstrate that the extent of ED I splicing is dependent on the promoter structure from which the transcript originated and that this regulation is independent of the promoter strength. Thus, these results provide the first evidence for coupling between alternative splicing and promoter-specific transcription, which agrees with recent cytological and biochemical evidence of coordination between splicing and transcription.
Resumo:
As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing ‘global views’ of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g. it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) protein–protein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g. most structurally variable folds). The integration of whole-genome expression and protein–protein interaction data with structural information is a particularly novel feature of our system. We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g. expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e. according to V–b, for attribute value V and constant exponent b), with a few folds having large values and most having small values.
Resumo:
Msx1 is a key factor for the development of tooth and craniofacial skeleton and has been proposed to play a pivotal role in terminal cell differentiation. In this paper, we demonstrated the presence of an endogenous Msx1 antisense RNA (Msx1-AS RNA) in mice, rats, and humans. In situ analysis revealed that this RNA is expressed only in differentiated dental and bone cells with an inverse correlation with Msx1 protein. These in vivo data and overexpression of Msx1 sense and AS RNA in an odontoblastic cell line (MO6-G3) showed that the balance between the levels of the two Msx1 RNAs is related to the expression of Msx1 protein. To analyze the impact of this balance in the Msx-Dlx homeoprotein pathway, we analyzed the effect of Msx1, Msx2, and Dlx5 overexpression on proteins involved in skeletal differentiation. We showed that the Msx1-AS RNA is involved in crosstalk between the Msx-Dlx pathways because its expression was abolished by Dlx5. Msx1 was shown to down-regulate a master gene of skeletal cells differentiation, Cbfa1. All these data strongly suggest that the ratio between Msx1 sense and antisense RNAs is a very important factor in the control of skeletal terminal differentiation. Finally, the initiation site for Msx1-AS RNA transcription was located by primer extension in both mouse and human in an identical region, including a consensus TATA box, suggesting an evolutionary conservation of the AS RNA-mediated regulation of Msx1 gene expression.
Resumo:
Using allozymes and mtDNA sequences from the cytochrome b gene, we report that the brown kiwi has the highest levels of genetic structuring observed in birds. Moreover, the mtDNA sequences are, with two minor exceptions, diagnostic genetic markers for each population investigated, even though they are among the more slowly evolving coding regions in this genome. A major unexpected finding was the concordant split in molecular phylogenies between brown kiwis in the southern South Island and elsewhere in New Zealand. This basic phylogeographic boundary halfway down the South Island coincides with a fixed allele difference in the Hb nuclear locus and strongly suggests that two morphologically cryptic species are currently merged under one polytypic species. This is another striking example of how molecular genetic assays can detect phylogenetic discontinuities that are not reflected in traditional morphologically based taxonomies. However, reanalysis of the morphological characters by using phylogenetic methods revealed that the reason for this discordance is that most are primitive and thus are phylogenetically uninformative. Shared-derived morphological characters support the same relationships evident in the molecular phylogenies and, in concert with the molecular data, suggest that as brown kiwis colonized northward from the southern South Island, they retained many primitive characters that confounded earlier systematists. Strong subdivided population structure and cryptic species in brown kiwis seem to have evolved relatively recently as a consequence of Pleistocene range disjunctions, low dispersal power, and genetic drift in small populations.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-06
Resumo:
We have determined the crystal structure of the core (C) protein from the Kunjin subtype of West Nile virus (WNV), closely related to the NY99 strain of WNV, currently a major health threat in the U.S. WNV is a member of the Flaviviridae family of enveloped RNA viruses that contains many important human pathogens. The C protein is associated with the RNA genome and forms the internal core which is surrounded by the envelope in the virion. The C protein structure contains four a. helices and forms dimers that are organized into tetramers. The tetramers form extended filamentous ribbons resembling the stacked alpha helices seen in HEAT protein structures.