998 resultados para Genomic Structure
Resumo:
An entire gene encoding wheat (var. Hard Red Winter Tam 107) acetyl-CoA carboxylase [ACCase; acetyl-CoA:carbon-dioxide ligase (ADP-forming), EC 6.4.1.2] has been cloned and sequenced. Comparison of the 12-kb genomic sequence with the 7.4-kb cDNA sequence reported previously revealed 29 introns. Within the coding region, the exon sequence is 98% identical to the known wheat cDNA sequence. A second ACCase gene was identified by sequencing fragments of genomic clones that include the first two exons and the first intron. Additional transcripts were detected by 5' and 3' RACE analysis (rapid amplification of cDNA ends). One set of transcripts had a 5' end sequence identical to the cDNA found previously and another set was identical to the gene reported here. The 3' RACE clones fall into four distinguishable sequence sets, bringing the number of ACCase sequences to six. None of these cDNA or genomic clones encodes a chloroplast targeting signal. Identification of six different sequences suggests that either the cytosolic ACCase genes are duplicated in the three chromosome sets in hexaploid wheat or that each of the six alleles of the cytosolic ACCase gene has a readily distinguishable DNA sequence.
Resumo:
PR-39 is a porcine 39-aa peptide antibiotic composed of 49% proline and 24% arginine, with an activity against Gram-negative bacteria comparable to that of tetracycline. In Escherichia coli, it inhibits DNA and protein synthesis. PR-39 was originally isolated from pig small intestine, but subsequent cDNA cloning showed that the gene is expressed in the bone marrow. The open reading frame of the clone showed that PR-39 is made as 173-aa precursor whose proregion belongs to the cathelin family. The PR39 gene, which is rather compact and spans only 1784 bp has now been sequenced. The coding information is split into four exons. The first exon contains the signal sequence of 29 residues and the first 37 residues of the cathelin propart. Exons 2 and 3 contain only cathelin information, while exon 4 codes for the four C-terminal cathelin residues and the mature PR-39 peptide extended by three residues. The sequenced upstream region (1183 bp) contains four potential recognition sites for NF-IL6 and three for APRF, transcription factors known to regulate genes for both cytokines and acute phase response factors. Genomic hybridizations revealed a fairly high level of restriction fragment length polymorphism and indicated that there are at least two copies of the PR39 gene in the pig genome. PR39 was mapped to pig chromosome 13 by linkage and in situ hybridization mapping. The gene for the human peptide antibiotic FALL-39 (also a member of the cathelin family) was mapped to human chromosome 3, which is homologous to pig chromosome 13.
Resumo:
The genome of some icosahedral RNA viruses plays an essential role in capsid assembly and structure. In T=3 particles of the nodavirus Pariacoto virus (PaV), a remarkable 35% of the single-stranded RNA genome is icosahedrally ordered. This ordered RNA can be visualized at high resolution by X-ray crystallography as a dodecahedral cage consisting of 30 24-nucleotide A-form RNA duplex segments that each underlie a twofold icosahedral axis of the virus particle and interact extensively with the basic N-terminal region of 60 subunits of the capsid protein. To examine whether the PaV genome is a specific determinant of the RNA structure, we produced virus-like particles (VLPs) by expressing the wild-type capsid protein open reading frame from a recombinant baculovirus. VLPs produced by this system encapsidated similar total amounts of RNA as authentic virus particles, but only about 6% of this RNA was PaV specific, the rest being of cellular or baculovirus origin. Examination of the VLPs by electron cryomicroscopy and image reconstruction at 15.4-Angstrom resolution showed that the encapsidated RNA formed a dodecahedral cage similar to that of wild-type particles. These results demonstrate that the specific nucleotide sequence of the PaV genome is not required to form the dodecahedral cage of ordered RNA.
Resumo:
The EF-hand superfamily of calcium binding proteins includes the S100, calcium binding protein, and troponin subfamilies. This study represents a genome, structure, and expression analysis of the S100 protein family, in mouse, human, and rat. We confirm the high level of conservation between mammalian sequences but show that four members, including S100A12, are present only in the human genome. We describe three new members of the S100 family in the three species and their locations within the S100 genomic clusters and propose a revised nomenclature and phylogenetic relationship between members of the EF-hand superfamily. Two of the three new genes were induced in bone-marrow-derived macrophages activated with bacterial lipopolysaccharide, suggesting a role in inflammation. Normal human and murine tissue distribution profiles indicate that some members of the family are expressed in a specific manner, whereas others are more ubiquitous. Structure-function analysis of the chemotactic properties of murine S100A8 and human S100A12, particularly within the active hinge domain, suggests that the human protein is the functional homolog of the murine protein. Strong similarities between the promoter regions of human S100A12 and murine S100A8 support this possibility. This study provides insights into the possible processes of evolution of the EF-hand protein superfamily. Evolution of the S100 proteins appears to have occurred in a modular fashion, also seen in other protein families such as the C2H2-type zinc-finger family. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
Lines of transgenic tobacco have been generated that are transformed with either the wild-type peanut peroxidase prxPNC2 cDNA, driven by the CaMV3 5S promoter (designated 35S::prxPNC2-WT) or a mutated PNC2 cDNA in which the asparagine residue (Asn(189)) associated with the point of glycan attachment (Asn(189)) has been replaced with alanine (designated 35S::prxPNC2-M). PCR, using genomic DNA as template, has confirmed the integration of the 35S::prxPNC2-WT and 35::prxPNC2-M constructs into the tobacco genome, and western analysis using anti-PNC2 antibodies has revealed that the prxPNC2-WT protein product (PNC2-WT) accumulates with a molecular mass of 34,670 Da, while the prxPNC2-M protein product (PNC2-M) accumulates with a molecular mass of 32,600 Da. Activity assays have shown that both PNC2-WT and PNC2-M proteins accumulate preferentially in the ionically-bound cell wall fraction, with a significantly higher relative accumulation of the PNC2-WT isoenzyme in the ionically-bound fraction when compared with the PNC2-M isoform. Kinetic analysis of the partially purified PNC2-WT isozyme revealed an affinity constant (apparent K-m) of 11.2 mM for the reductor substrate guaiacol and 1.29 mM for H2O2, while values of 11.9 mM and 1.12 mM were determined for the PNC2-M isozyme. A higher Arrenhius activation energy (E,,) was determined for the PNC2-M isozyme (22.9 kJ mol(-1)), when compared with the PNC2-WT isozyme (17.6 kJ mol(-1)), and enzyme assays have determined that the absence of the glycan influences the thermostability of the PNC2-M isozyme. These results are discussed with respect to the proposed roles of N-linked glycans attached to plant peroxidases. (c) 2005 Elsevier Ltd. All rights reserved.
Resumo:
In humans, a polymorphic gene encodes the drug-metabolizing enzyme NATI (arylamine N-acetyltransferase Type 1), which is widely expressed throughout the body. While the protein-coding region of NATI is contained within a single exon, examination of the human EST (expressed sequence tag) database at the NCBI revealed the presence of nine separate exons, eight of which were located in the 5'non-coding region of NATI. Differential splicing produced at least eight unique mRNA isoforms that could be grouped according to the location of the first exon, which suggested that NATI expression occurs from three alternative promoters. Using RT (reverse transcriptase)-PCR, we identified one major transcript in various epithelial cells derived from different tissues. In contrast, multiple transcripts were observed in blood-derived cell lines (CEM, THP-1 and Jurkat), with a novel variant, not identified in the EST database, found in CEM cells only. The major splice variant increased gene expression 9-11-fold in a luciferase reporter assay, while the other isoforrns were similar or slightly greater than the control. We examined the upstream region of the most active splice variant in a promoter-reporter assay, and isolated a 257 bp sequence that produced maximal promoter activity. This sequence lacked a TATA box, but contained a consensus Sp1 site and a CAAT box, as well as several other putative transcription-factor-binding sites. Cell-specific expression of the different NATI transcripts may contribute to the variation in NATI activity in vivo.
Resumo:
Sulfate plays an essential role in human growth and development. Here, we characterized the functional properties of the human Na+-sulfate cotransporter (hNaS2), determined its tissue distribution, and identified its gene (SLC13A4) structure. Expression of hNaS2 protein in Xenopus oocytes led to a Na+-dependent transport of sulfate that was inhibited by thiosulfate, phosphate, molybdate. selenate and tungstate, but not by oxalate, citrate, succinate, phenol red or DIDS. Transport kinetics of hNaS2 determined a K, for sulfate of 0.38 mM, suggestive of a high affinity sulfate transporter. Na+ kinetics determined a Hill coefficient of 1.6 +/- 0.6, suggesting a Na: SO42- stoichiometry of 2:1. hNaS2 mRNA was highly expressed in placenta and testis, with intermediate levels in brain and lower levels found in the heart, thymus, and liver. The SLC13A4 gene contains 16 exons, spanning over 47 kb in length. Its 5'-flanking region contains CAAT- and GC-box motifs, and a number of putative transcription factor binding sites, including GATA-1, AP-1, and AP-2 consensus sequences. This is the first study to characterize hNaS2 transport kinetics, define its tissue distribution, and resolve its gene (SLC13A4) structure and 5' flanking region. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
In Mesoamerica, tropical dry forest is a highly threatened habitat, and species endemic to this environment are under extreme pressure. The tree species, Lonchocarpus costaricensis is endemic to the dry northwest of Costa Rica and southwest Nicaragua. It is a locally important species but, as land has been cleared for agriculture, populations have experienced considerable reduction and fragmentation. To assess current levels and distribution of genetic diversity in the species, a combination of chloroplast-specific (cpDNA) and whole genome DNA markers (amplified fragment length polymorphism, AFLP) were used to fingerprint 121 individual trees in 6 populations. Two cpDNA haplotypes were identified, distributed among populations such that populations at the extremes of the distribution showed lowest diversity. A large number (487) of AFLP markers were obtained and indicated that diversity levels were highest in the two coastal populations (Cobano, Matapalo, H = 0.23, 0.28 respectively). Population differentiation was low overall, F-ST = 0.12, although Matapalo was strongly differentiated from all other populations (F-ST = 0.16-0.22), apart from Cobano (F., = 0.11). Spatial genetic structure was present in both datasets at different scales: cpDNA was structured at a range-wide distribution scale, whilst AFLP data revealed genetic neighbourhoods on a population scale. In general, the habitat degradation of recent times appears not to have yet impacted diversity levels in mature populations. However, although no data on seed or saplings were collected, it seems likely that reproductive mechanisms in the species will have been affected by land clearance. It is recommended that efforts should be made to conserve the extant genetic resource base and further research undertaken to investigate diversity levels in the progeny generation.
Resumo:
Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteosome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly to appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
Resumo:
The advent of next-generation sequencing has significantly reduced the cost of obtaining large-scale genetic resources, opening the door for genomic studies of non-model but ecologically interesting species. The shift in mating system, from outcrossing to selfing, has occurred thousands of times in angiosperms and is accompanied by profound changes in the population genetics and ecology of a species. A large body of work has been devoted to understanding why the shift occurs and the impact of the shift on the genetics of the resulting selfing populations, however, the causes and consequences of the transition to selfing involve a complicated interaction of genetic and demographic factors which are difficult to untangle. Abronia umbellata is a Pacific coastal dune endemic which displays a striking shift in mating system across its geographic range, with large-flowered outcrossing populations south of San Francisco and small-flowered selfing populations to the north. Abronia umbellata is an attractive model system for the study of mating system transitions because the shift appears to be recent and therefore less obscured by post-shift processes, it has a near one-dimensional geographic range which simplifies analysis and interpretation, and demographic data has been collected for many of the populations. In this study, we generated transcriptome-level data for 12 plants including individuals from both subspecies, along with a resequencing study of 48 individuals from populations across the range. The genetic analysis revealed a recent transition to selfing involving a drastic reduction in genetic diversity in the selfing lineage, potentially indicative of a recent population bottleneck and a transition to selfing due to reproductive assurance. Interestingly, the genetic structure of the populations was not coincident with the current subspecies demarcation, and two large-flowered populations were classified with the selfing subspecies, suggesting a potential need for re-evaluation of the current subspecies classification. Our finding of low diversity in selfing populations may also have implications for the conservation value of the threatened selfing subspecies.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-08
Resumo:
Neogobius caspius is a small benthic fish that is native to the Caspian Sea. The importance of this fish is because of it is role as a main food resource of the sturgeon fish. The genetic diversity of N. caspius population in the Caspian Sea was studied using PCR- RFLP technique. A total of 135 samples of N. caspius were collected from coastal line in the north Caspian sea, including specimens from coasts of Anzali , Torkman Port and Chalus. Genomic DNA was extracted by phenol-chloroform method and then was amplified using a pair primer of cytochrom b gene, 2 tRNA gene and the control region sequences by a thermal cycler. D2 (5'-CCGGAGTATGTAGGGCATTCTCAC-3'), CY1 (5'-YYTAACCRRGACYAATGACTTGA-3') 12 restriction enzyme were used to digest the target gene region including: Alul HincII —Tas1 —Rsa1 -MboI -DraI -BSeNI(BSRI) Alw261(BsmAI). Bsul 51 Hin11 Bsh12851- BsuRI(HaeIII) digested PCR products were observed by silver staining method followed by Polyacrylamide gel electrophoresis (PAGE). The results were shown the same pattern among the species. There was no polymorphism and no differentiation in population in the Neogobius caspius fish and all individuals have shown homogenous genotype.
Resumo:
Investigating stock identity of marine species in a multidisciplinary holistic approach can reveal patterns of complex spatial population structure and signatures of potential local adaptation. The population structure of common sole (Solea solea) in the Mediterranean Sea was delineated using genomic and otolith data, including single nucleotide polymorphisms (SNPs) markers and otolith data. SNPs were correlated with environmental and spatial variables to evaluate the impact of these features on the actual genetic population structure. Integrated holistic approach was applied to combine the tracers with different spatio-temporal scales. SNPs data was also used to illustrate the population structure of European hake (Merluccius merluccius) within the Alboran Sea, extending into the neighboring Mediterranean Sea and Atlantic Ocean. The aim was to identify patterns of neutral and potential adaptive genetic variation by applying seascape genomic framework. Results from both genetic and otolith data suggested significant divergence among putative populations of common sole, confirming a clear separation between Western, Adriatic Sea and Eastern Mediterranean Sea. Evidence of fine-scale population structure in the Western Mediterranean Sea was observed at outlier loci level and in the Adriatic. Our study not only indicates that separation among Mediterranean sole population is led primarily by neutral processes, but it also suggests the presence of local adaptation influenced by environmental and spatial factors. The holistic approach by considering the spatio-temporal scales of variation confirmed that the same pattern of separation between these geographical sites is currently occurring and has occurred for many generations. Results showed the occurrence of population structure in Merluccius merluccius by detecting westward–eastward differentiation among populations and distinct subgroups at a fine geographical scale using outlier SNPs. These results enhance the knowledge of the population structure of commercially relevant species to support the application of spatial stock assessment models, including a redefinition of fishery management units.
Resumo:
A global italian pharmaceutical company has to provide two work environments that favor different needs. The environments will allow to develop solutions in a controlled, secure and at the same time in an independent manner on a state-of-the-art enterprise cloud platform. The need of developing two different environments is dictated by the needs of the working units. Indeed, the first environment is designed to facilitate the creation of application related to genomics, therefore, designed more for data-scientists. This environment is capable of consuming, producing, retrieving and incorporating data, furthermore, will support the most used programming languages for genomic applications (e.g., Python, R). The proposal was to obtain a pool of ready-togo Virtual Machines with different architectures to provide best performance based on the job that needs to be carried out. The second environment has more of a traditional trait, to obtain, via ETL (Extract-Transform-Load) process, a global datamodel, resembling a classical relational structure. It will provide major BI operations (e.g., analytics, performance measure, reports, etc.) that can be leveraged both for application analysis or for internal usage. Since, both architectures will maintain large amounts of data regarding not only pharmaceutical informations but also internal company informations, it would be possible to digest the data by reporting/ analytics tools and also apply data-mining, machine learning technologies to exploit intrinsic informations. The thesis work will introduce, proposals, implementations, descriptions of used technologies/platforms and future works of the above discussed environments.