990 results for 270202 Genome Structure
Abstract:
The full sequence of the genome-linked viral protein (VPg) cistron located in the central part of potato virus Y (common strain) genome has been identified. The VPg gene codes for a protein of 188 amino acids, with significant homology to other known potyviral VPg polypeptides. A three-dimensional model structure of VPg is proposed on the basis of similarity of hydrophobic-hydrophilic residue distribution to the sequence of malate dehydrogenase of known crystal structure. The 5' end of the viral RNA can be fitted to interact with the protein through the exposed hydroxyl group of Tyr-64, in agreement with experimental data. The complex favors stereochemically the formation of a phosphodiester bond [5'-(O4-tyrosylphospho)adenylate] typical for representatives of picornavirus-like viruses. The chemical mechanisms of viral RNA binding to VPg are discussed on the basis of the model structure of protein-RNA complex.
Abstract:
The nucleotide sequence of the human alpha-albumin gene, including 887 bp of the 5'-flanking region and 1311 bp of the 3'-flanking region (24,454 bp in total), was determined from three overlapping lambda phage clones. The sequence spans 22,256 bp from the cap site to the polyadenylylation site, revealing a gene structure of 15 exons separated by 14 introns. The methionine initiation codon ATG is within exon 1; the termination codon TGA is within exon 14. Exon 15 is entirely untranslated and contains the polyadenylylation signal AATAAA. The deduced polypeptide chain is composed of a 21-amino-acid leader peptide, followed by 578 amino acids of the mature protein. There are seven repetitive DNA elements (Alu and Kpn) in the introns and 3'-flanking region. The sizes of the 15 alpha-albumin exons match closely those of the albumin, alpha-fetoprotein, and vitamin D-binding protein genes. The exons are symmetrically placed within the three domains of the individual proteins, and they share a characteristic codon splitting pattern that is conserved among members of the gene family. The results provide strong evidence that alpha-albumin belongs to, and most likely completes, the serum albumin gene family. Based on structural similarity, alpha-albumin appears to be most closely related to alpha-fetoprotein. The complete structure of this family of four tandemly linked genes provides a well-characterized approximately 200 kb locus in the 4q subcentromeric region of the human genome.
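The lengths quoted in this abstract can be cross-checked with simple arithmetic. The short sketch below sums the three stated segments and derives the intron count from the exon count; the constant and function names are illustrative only and do not come from the paper.

```python
# Segment lengths quoted in the abstract, in base pairs.
FIVE_PRIME_FLANK_BP = 887    # 5'-flanking region
CAP_TO_POLYA_BP = 22_256     # cap site to polyadenylylation site
THREE_PRIME_FLANK_BP = 1_311 # 3'-flanking region
EXON_COUNT = 15


def total_sequenced_bp(five_flank, transcribed, three_flank):
    """Total length of a contiguous sequence made of three adjacent segments."""
    return five_flank + transcribed + three_flank


def intron_count(exons):
    """A gene with n exons in a row has n - 1 introns separating them."""
    return exons - 1


total = total_sequenced_bp(FIVE_PRIME_FLANK_BP, CAP_TO_POLYA_BP, THREE_PRIME_FLANK_BP)
introns = intron_count(EXON_COUNT)
print(total, introns)
```

The sum agrees with the 24,454 bp total reported in the abstract, and the intron count agrees with the 14 introns it states.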
Abstract:
Chlorarachniophyte algae contain a complex, multi-membraned chloroplast derived from the endosymbiosis of a eukaryotic alga. The vestigial nucleus of the endosymbiont, called the nucleomorph, contains only three small linear chromosomes with a haploid genome size of 380 kb and is the smallest known eukaryotic genome. Nucleotide sequence data from a subtelomeric fragment of chromosome III were analyzed as a preliminary investigation of the coding capacity of this vestigial genome. Several housekeeping genes including U6 small nuclear RNA (snRNA), ribosomal proteins S4 and S13, a core protein of the spliceosome [small nuclear ribonucleoprotein (snRNP) E], and a clp-like protease (clpP) were identified. Expression of these genes was confirmed by combinations of Northern blot analysis, in situ hybridization, immunocytochemistry, and cDNA analysis. The protein-encoding genes are typically eukaryotic in overall structure and their messenger RNAs are polyadenylylated. A novel feature is the abundance of 18-, 19-, or 20-nucleotide introns, the smallest spliceosomal introns known. Two of the genes, U6 and S13, overlap, while another two genes, snRNP E and clpP, are cotranscribed in a single mRNA. The overall gene organization is extraordinarily compact, making the nucleomorph a unique model for eukaryotic genomics.
Abstract:
The mouse is the best model system for the study of mammalian genetics and physiology. Because of the feasibility and importance of studying genetic crosses, the mouse genetic map has received tremendous attention in recent years. It currently contains over 14,000 genetically mapped markers, including 700 mutant loci, 3500 genes, and 6500 simple sequence length polymorphisms (SSLPs). The mutant loci and genes allow insights and correlations concerning physiology and development. The SSLPs provide highly polymorphic anchor points that allow inheritance to be traced in any cross and provide a scaffold for assembling physical maps. Adequate physical mapping resources--notably large-insert yeast artificial chromosome (YAC) libraries--are available to support positional cloning projects based on the genetic map, but a comprehensive physical map is still a few years away. Large-scale sequencing efforts have not yet begun in mouse, but comparative sequence analysis between mouse and human is likely to provide tremendous information about gene structure and regulation.
Abstract:
High-resolution physical maps of the genomes of three Rhodobacter capsulatus strains, derived from ordered cosmid libraries, were aligned. The 1.2-Mb segment of the SB1003 genome studied here is adjacent to a 1-Mb region analyzed previously [Fonstein, M., Nikolskaya, T. & Haselkorn, H. (1995) J. Bacteriol. 177, 2368-2372]. Probes derived from the ordered cosmid set of R. capsulatus SB1003 were used to link cosmids from the St. Louis and 2.3.1 strain libraries. Cosmids selected this way did not merge into a single contig but formed several unlinked groups. EcoRV restriction maps of the ordered cosmids were then constructed using lambda terminase and fused to derive fragments of the chromosomal map. In order to link these fragments, their ends were transcribed to produce secondary probes for hybridization to gridded cosmid libraries of the same strains. This linking reduced the number of subcontigs to three for the St. Louis strain and one for the 2.3.1 strain. Hybridization of the same probes back to the ordered cosmid set of SB1003 positioned the subcontigs on the high-resolution physical map of SB1003. The final alignment of the restriction maps shows numerous large and small translocations in this 1.2-Mb chromosomal region of the three Rhodobacter strains. In addition, the chromosomes of the three strains, whose fine-structure maps can now be compared over 2.2 Mb, are seen to contain regions of 15-80 kb in which restriction sites are highly polymorphic, interspersed among regions in which the positions of restriction sites are highly conserved.
Abstract:
Using allozymes and mtDNA sequences from the cytochrome b gene, we report that the brown kiwi has the highest levels of genetic structuring observed in birds. Moreover, the mtDNA sequences are, with two minor exceptions, diagnostic genetic markers for each population investigated, even though they are among the more slowly evolving coding regions in this genome. A major unexpected finding was the concordant split in molecular phylogenies between brown kiwis in the southern South Island and elsewhere in New Zealand. This basic phylogeographic boundary halfway down the South Island coincides with a fixed allele difference in the Hb nuclear locus and strongly suggests that two morphologically cryptic species are currently merged under one polytypic species. This is another striking example of how molecular genetic assays can detect phylogenetic discontinuities that are not reflected in traditional morphologically based taxonomies. However, reanalysis of the morphological characters by using phylogenetic methods revealed that the reason for this discordance is that most are primitive and thus are phylogenetically uninformative. Shared-derived morphological characters support the same relationships evident in the molecular phylogenies and, in concert with the molecular data, suggest that as brown kiwis colonized northward from the southern South Island, they retained many primitive characters that confounded earlier systematists. Strong subdivided population structure and cryptic species in brown kiwis seem to have evolved relatively recently as a consequence of Pleistocene range disjunctions, low dispersal power, and genetic drift in small populations.
Abstract:
PR-39 is a porcine 39-aa peptide antibiotic composed of 49% proline and 24% arginine, with an activity against Gram-negative bacteria comparable to that of tetracycline. In Escherichia coli, it inhibits DNA and protein synthesis. PR-39 was originally isolated from pig small intestine, but subsequent cDNA cloning showed that the gene is expressed in the bone marrow. The open reading frame of the clone showed that PR-39 is made as a 173-aa precursor whose proregion belongs to the cathelin family. The PR39 gene, which is rather compact and spans only 1784 bp, has now been sequenced. The coding information is split into four exons. The first exon contains the signal sequence of 29 residues and the first 37 residues of the cathelin propart. Exons 2 and 3 contain only cathelin information, while exon 4 codes for the four C-terminal cathelin residues and the mature PR-39 peptide extended by three residues. The sequenced upstream region (1183 bp) contains four potential recognition sites for NF-IL6 and three for APRF, transcription factors known to regulate genes for both cytokines and acute phase response factors. Genomic hybridizations revealed a fairly high level of restriction fragment length polymorphism and indicated that there are at least two copies of the PR39 gene in the pig genome. PR39 was mapped to pig chromosome 13 by linkage and in situ hybridization mapping. The gene for the human peptide antibiotic FALL-39 (also a member of the cathelin family) was mapped to human chromosome 3, which is homologous to pig chromosome 13.
Abstract:
Elongated particles of simple RNA viruses of plants are composed of an RNA molecule coated with numerous identical capsid protein subunits to form a regular helical structure, of which tobacco mosaic virus is the archetype. Filamentous particles of the closterovirus beet yellow virus (BYV) reportedly contain approximately 4000 identical 22-kDa (p22) capsid protein subunits. The BYV genome encodes a 24-kDa protein (p24) that is structurally related to the p22. We searched for the p24 in BYV particles by using immunoelectron microscopy with specific antibodies against the recombinant p24 protein and its N-terminal peptide. A 75-nm segment at one end of the 1370-nm filamentous viral particle was found to be consistently labeled with both types of antibodies, thus indicating that p24 is indeed the second capsid protein and that the closterovirus particle, unlike those of other plant viruses with helical symmetry, has a "rattlesnake" rather than uniform structure.
Abstract:
We sampled leaves from 678 individuals in 21 natural populations (30-36 individuals per population), covering the entire distribution of Euptelea pleiospermum in China. Total DNA was isolated from about 50 mg of powdered leaf tissue following the protocol of a DNA extraction kit (Tiangen Biotech Co., LTD., Beijing, China). We used seven fluorescence-labeled microsatellite loci (EP036, EP059, EP081, EP087, EP091, EP278 and EP294; Zhang et al., 2008) to genotype our 678 DNA samples.
Abstract:
We have determined the crystal structure of the core (C) protein from the Kunjin subtype of West Nile virus (WNV), closely related to the NY99 strain of WNV, currently a major health threat in the U.S. WNV is a member of the Flaviviridae family of enveloped RNA viruses that contains many important human pathogens. The C protein is associated with the RNA genome and forms the internal core which is surrounded by the envelope in the virion. The C protein structure contains four alpha helices and forms dimers that are organized into tetramers. The tetramers form extended filamentous ribbons resembling the stacked alpha helices seen in HEAT protein structures.
Abstract:
The SOX family of transcription factors is found throughout the animal kingdom and is important in a variety of developmental contexts. Genome analysis has identified 20 Sox genes in human and mouse, which can be subdivided into 8 groups, based on sequence comparison and intron-exon structure. Most of the SOX groups identified in mammals are represented by a single SOX sequence in invertebrate model organisms, suggesting a duplication and divergence mechanism has operated during vertebrate evolution. We have now analysed the Sox gene complement in the pufferfish, Fugu rubripes, in order to shed further light on the diversity and origins of the Sox gene family. Major differences were found between the Sox family in Fugu and those in humans and mice. In particular, Fugu does not have orthologues of Sry, Sox15 and Sox30, which appear to be specific to mammals, while Sox19, found in Fugu and zebrafish but absent in mammals, seems to be specific to fishes. Six mammalian Sox genes are represented by two copies each in Fugu, indicating a large-scale gene duplication in the fish lineage. These findings point to recent Sox gene loss, duplication and divergence occurring during the evolution of tetrapod and teleost lineages, and provide further evidence for large-scale segmental or a whole-genome duplication occurring early in the radiation of teleosts. (C) 2004 Elsevier B.V. All rights reserved.
Abstract:
Lines of transgenic tobacco have been generated that are transformed with either the wild-type peanut peroxidase prxPNC2 cDNA, driven by the CaMV 35S promoter (designated 35S::prxPNC2-WT), or a mutated PNC2 cDNA in which the asparagine residue at the point of glycan attachment (Asn(189)) has been replaced with alanine (designated 35S::prxPNC2-M). PCR, using genomic DNA as template, has confirmed the integration of the 35S::prxPNC2-WT and 35S::prxPNC2-M constructs into the tobacco genome, and western analysis using anti-PNC2 antibodies has revealed that the prxPNC2-WT protein product (PNC2-WT) accumulates with a molecular mass of 34,670 Da, while the prxPNC2-M protein product (PNC2-M) accumulates with a molecular mass of 32,600 Da. Activity assays have shown that both PNC2-WT and PNC2-M proteins accumulate preferentially in the ionically-bound cell wall fraction, with a significantly higher relative accumulation of the PNC2-WT isoenzyme in the ionically-bound fraction when compared with the PNC2-M isoform. Kinetic analysis of the partially purified PNC2-WT isozyme revealed an affinity constant (apparent K_m) of 11.2 mM for the reducing substrate guaiacol and 1.29 mM for H2O2, while values of 11.9 mM and 1.12 mM were determined for the PNC2-M isozyme. A higher Arrhenius activation energy (E_a) was determined for the PNC2-M isozyme (22.9 kJ mol⁻¹), when compared with the PNC2-WT isozyme (17.6 kJ mol⁻¹), and enzyme assays have determined that the absence of the glycan influences the thermostability of the PNC2-M isozyme. These results are discussed with respect to the proposed roles of N-linked glycans attached to plant peroxidases. (c) 2005 Elsevier Ltd. All rights reserved.
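To give a feel for what the two activation energies imply, the sketch below applies the standard Arrhenius law, k = A exp(-E_a / (R T)), to the reported values. It is an illustration only: the two temperatures (25 °C and 35 °C) are assumptions chosen for the example and are not taken from the abstract, and the function name is hypothetical.

```python
from math import exp

R = 8.314  # molar gas constant, J mol^-1 K^-1


def rate_ratio(ea_kj_per_mol, t1_kelvin, t2_kelvin):
    """Return k(T2) / k(T1) under the Arrhenius law k = A * exp(-Ea / (R * T)).

    The pre-exponential factor A cancels in the ratio, so only Ea is needed.
    """
    ea_joules = ea_kj_per_mol * 1000.0
    return exp(ea_joules / R * (1.0 / t1_kelvin - 1.0 / t2_kelvin))


# Activation energies reported in the abstract (kJ mol^-1).
EA_PNC2_WT = 17.6
EA_PNC2_M = 22.9

# Assumed temperatures for illustration only (25 degC and 35 degC).
T1, T2 = 298.15, 308.15

ratio_wt = rate_ratio(EA_PNC2_WT, T1, T2)
ratio_m = rate_ratio(EA_PNC2_M, T1, T2)
print(f"PNC2-WT rate increase: {ratio_wt:.3f}x")
print(f"PNC2-M rate increase:  {ratio_m:.3f}x")
```

Under this law, the isozyme with the higher activation energy shows the larger relative change in rate for the same temperature step.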
Abstract:
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms. (c) 2005 Elsevier Ltd. All rights reserved.
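The abstract does not say which information-theoretic quantity was used. As one plausible illustration, the sketch below computes the mutual information between the codon chosen for a cysteine (TGT or TGC) and a second categorical variable such as its disulfide bonding state. The choice of mutual information, the function name, and the toy observations are all assumptions made for this example; none of them are data or methods from the paper.

```python
from collections import Counter
from math import log2


def mutual_information(pairs):
    """Mutual information, in bits, between the two members of each (x, y) pair.

    I(X; Y) = sum over (x, y) of p(x, y) * log2( p(x, y) / (p(x) * p(y)) ),
    with all probabilities estimated as plain frequencies.
    """
    pairs = list(pairs)
    n = len(pairs)
    if n == 0:
        raise ValueError("need at least one observation")
    joint = Counter(pairs)
    x_counts = Counter(x for x, _ in pairs)
    y_counts = Counter(y for _, y in pairs)
    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # p_xy / (p_x * p_y) simplifies to count * n / (x_count * y_count).
        mi += p_xy * log2(count * n / (x_counts[x] * y_counts[y]))
    return mi


# Made-up toy observations: (cysteine codon, bonding state).
dependent = [("TGT", "bonded"), ("TGC", "free")] * 5
independent = [
    ("TGT", "bonded"), ("TGT", "free"),
    ("TGC", "bonded"), ("TGC", "free"),
]

mi_dependent = mutual_information(dependent)
mi_independent = mutual_information(independent)
print(mi_dependent, mi_independent)
```

In the first toy set the codon fully determines the bonding state, so the mutual information equals the one-bit entropy of a balanced binary variable; in the second the two variables are independent and the value is zero. Real codon-usage data would fall between these extremes and would need a correction for small-sample bias.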
Abstract:
Our previous studies using trans-complementation analysis of Kunjin virus (KUN) full-length cDNA clones harboring in-frame deletions in the NS3 gene demonstrated the inability of these defective complemented RNAs to be packaged into virus particles (W. J. Liu, P. L. Sedlak, N. Kondratieva, and A. A. Khromykh, J. Virol. 76:10766-10775). In this study we aimed to establish whether this requirement for NS3 in RNA packaging is determined by the secondary RNA structure of the NS3 gene or by the essential role of the translated NS3 gene product. Multiple silent mutations of three computer-predicted stable RNA structures in the NS3 coding region of KUN replicon RNA aimed at disrupting RNA secondary structure without affecting amino acid sequence did not affect RNA replication and packaging into virus-like particles in the packaging cell line, thus demonstrating that the predicted conserved RNA structures in the NS3 gene do not play a role in RNA replication and/or packaging. In contrast, double frameshift mutations in the NS3 coding region of full-length KUN RNA, producing scrambled NS3 protein but retaining secondary RNA structure, resulted in the loss of ability of these defective RNAs to be packaged into virus particles in complementation experiments in KUN replicon-expressing cells. Furthermore, the more robust complementation-packaging system based on established stable cell lines producing large amounts of complemented replicating NS3-deficient replicon RNAs and infection with KUN virus to provide structural proteins also failed to detect any secreted virus-like particles containing packaged NS3-deficient replicon RNAs. These results have now firmly established the requirement of KUN NS3 protein translated in cis for genome packaging into virus particles.
Abstract:
As advances in molecular biology continue to reveal additional layers of complexity in gene regulation, computational models need to incorporate additional features to explore the implications of new theories and hypotheses. It has recently been suggested that eukaryotic organisms owe their phenotypic complexity and diversity to the exploitation of small RNAs as signalling molecules. Previous models of genetic systems are, for several reasons, inadequate to investigate this theory. In this study, we present an artificial genome model of genetic regulatory networks based upon previous work by Torsten Reil, and demonstrate how this model generates networks with biologically plausible structural and dynamic properties. We also extend the model to explore the implications of incorporating regulation by small RNA molecules in a gene network. We demonstrate how, using these signals, highly connected networks can display dynamics that are more stable than expected given their level of connectivity.
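The abstract gives no implementation details, so the following is only a loose sketch of how an artificial genome can be turned into a regulatory network. Every specific here is an assumption for illustration: the base-4 alphabet, the promoter motif `0101`, the gene length of 6, the rule that a gene product is the gene sequence with each digit incremented modulo 4, and binding by exact substring match in the upstream region. Regulation by small RNAs, which the study adds, is not modelled.

```python
import random

PROMOTER = "0101"  # assumed promoter motif marking the start of a gene
GENE_LEN = 6       # assumed number of digits in each gene


def random_genome(length, seed=0):
    """Random string over the digits 0-3, reproducible through the seed."""
    rng = random.Random(seed)
    return "".join(rng.choice("0123") for _ in range(length))


def find_genes(genome):
    """Return (upstream_region, gene_sequence) pairs in genome order.

    The upstream region of a gene runs from the end of the previous gene
    (or the genome start) to the promoter of this gene.
    """
    genes = []
    region_start = 0
    i = 0
    while i <= len(genome) - len(PROMOTER) - GENE_LEN:
        if genome.startswith(PROMOTER, i):
            gene_start = i + len(PROMOTER)
            gene_end = gene_start + GENE_LEN
            genes.append((genome[region_start:i], genome[gene_start:gene_end]))
            region_start = gene_end
            i = gene_end
        else:
            i += 1
    return genes


def gene_product(gene_sequence):
    """Assumed translation rule: increment every digit by one, modulo 4."""
    return "".join(str((int(d) + 1) % 4) for d in gene_sequence)


def regulatory_edges(genes):
    """Edge (a, b) means the product of gene a matches the upstream region of gene b."""
    edges = set()
    for source, (_, sequence) in enumerate(genes):
        product = gene_product(sequence)
        for target, (region, _) in enumerate(genes):
            if product in region:
                edges.add((source, target))
    return edges


# Hand-built genome: gene 0 (012301) yields product 123012, which is
# planted in the upstream region of gene 1.
demo_genome = "33" + "0101" + "012301" + "22" + "123012" + "3" + "0101" + "222222"
demo_genes = find_genes(demo_genome)
demo_edges = regulatory_edges(demo_genes)
print(demo_genes, demo_edges)
```

Calling `regulatory_edges(find_genes(random_genome(5000)))` builds a network from a random genome in the same way; the connectivity then depends on the assumed motif and gene lengths.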