10 resultados para Geographical positions
em National Center for Biotechnology Information - NCBI
Resumo:
Alignments of homologous genes typically reveal a great diversity of intron locations, far more than could fit comfortably in a single gene. Thus, a minority of these intron positions could be inherited from a single ancestral gene, but the larger share must be attributed to subsequent events of intron gain or intron “sliding” (movement from one position to another within a gene). Intron sliding has been argued from cases of discordant introns and from putative spatial clustering of intron positions. A list of 32 cases of discordant introns is presented here. Most of these cases are found to be artefactual. The spatial and phylogenetic distributions of intron positions from five published compilations of gene data, comprising 205 intron positions, have been examined systematically for evidence of intron sliding. The results suggest that sliding, if it occurs at all, has contributed little to the diversity of intron positions.
Resumo:
We analyze the three-dimensional structure of proteins by a computer program that finds regions of sequence that contain module boundaries, defining a module as a segment of polypeptide chain bounded in space by a specific given distance. The program defines a set of “linker regions” that have the property that if an intron were to be placed into each linker region, the protein would be dissected into a set of modules all less than the specified diameter. We test a set of 32 proteins, all of ancient origin, and a corresponding set of 570 intron positions, to ask if there is a statistically significant excess of intron positions within the linker regions. For 28-Å modules, a standard size used historically, we find such an excess, with P < 0.003. This correlation is neither due to a compositional or sequence bias in the linker regions nor to a surface bias in intron positions. Furthermore, a subset of 20 introns, which can be putatively identified as old, lies even more explicitly within the linker regions, with P < 0.0003. Thus, there is a strong correlation between intron positions and three-dimensional structural elements of ancient proteins as expected by the introns-early approach. We then study a range of module diameters and show that, as the diameter varies, significant peaks of correlation appear for module diameters centered at 21.7, 27.6, and 32.9 Å. These preferred module diameters roughly correspond to predicted exon sizes of 15, 22, and 30 residues. Thus, there are significant correlations between introns, modules, and a quantized pattern of the lengths of polypeptide chains, which is the prediction of the “Exon Theory of Genes.”
Resumo:
Objective: To investigate the geographical variation and clustering of congenital anophthalmia and microphthalmia in England, in response to media reports of clusters.
Resumo:
The Conserved Key Amino Acid Positions DataBase (CKAAPs DB) provides access to an analysis of structurally similar proteins with dissimilar sequences where key residues within a common fold are identified. The derivation and significance of CKAAPs starting from pairwise structure alignments is described fully in Reddy et al. [Reddy,B.V.B., Li,W.W., Shindyalov,I.N. and Bourne,P.E. (2000) Proteins, in press]. The CKAAPs identified from this theoretical analysis are provided to experimentalists and theoreticians for potential use in protein engineering and modeling. It has been suggested that CKAAPs may be crucial features for protein folding, structural stability and function. Over 170 substructures, as defined by the Combinatorial Extension (CE) database, which are found in approximately 3000 representative polypeptide chains have been analyzed and are available in the CKAAPs DB. CKAAPs DB also provides CKAAPs of the representative set of proteins derived from the CE and FSSP databases. Thus the database contains over 5000 representative polypeptide chains, covering all known structures in the PDB. A web interface to a relational database permits fast retrieval of structure-sequence alignments, CKAAPs and associated statistics. Users may query by PDB ID, protein name, function and Enzyme Classification number. Users may also submit protein alignments of their own to obtain CKAAPs. An interface to display CKAAPs on each structure from a web browser is also being implemented. CKAAPs DB is maintained by the San Diego Supercomputer Center and accessible at the URL http://ckaaps.sdsc.edu.
Resumo:
The effects of the rotational information of DNA in determining the in vitro localization of nucleosomal core particles (ncps) have been studied in the Saccharomyces cerevisiae 5S rRNA repeat gene. We have altered the distribution of the phased series of flexibility signals present on this DNA by inserting a 25-bp tract, and we have analyzed the effects of this mutation on the distribution and on the frequencies of ncps, as compared with the wild type and a reference 21-bp insertion mutant. The variation of the standard free energy of nucleosome reconstitution was determined. The results show that the DNA rotational information is a major determinant of ncps positioning, define how many rotationally phased signals are required for the formation of a stable particle, and teach how to modify their distribution through the alteration of the rotational signals.
Resumo:
Geographical patterns of mtDNA variation were studied in 12 Italian samples (1072 individuals) by two different spatial autocorrelation methods. Separate analyses of the frequencies of 12 restriction morphs show North-South clines, differences between Sardinia and the mainland populations, and the effects of isolation by distance. A recently developed autocorrelation statistic summarizing molecular similarity at all sites (AIDA; autocorrelation index for DNA analysis) confirms the presence of a clinical pattern; differences between random pairs of haplotypes tend to increase with their geographical distance. The partition of gene diversity, however, reveals that most variability occurs within populations, whereas differences between populations are minor (GST = 0.057). When the data from the 12 samples are pooled, two descriptors of genetic variability (number of polymorphic sites and average sequence difference between pairs of individuals) do not behave as expected under neutrality. The presence of clinal patterns, Tajima's tests, and a simulation experiment agree in suggesting that population sizes increased rapidly in Italy and Sicily but not necessarily so in Sardinia. The distribution of pairwise sequence differences in the Italian peninsula (excluding Sardinia) permits a tentative location of the demographic increase between 8000 and 20,500 years ago. These dates are consistent with archaeological estimates of two distinct expansion processes, occurring, respectively, in the Neolithic and after the last glacial maximum in the Paleolithic. Conversely, there is no genetic evidence that such processes have had a major impact on the Sardinian population.
Resumo:
The gene encoding the glycolytic enzyme triose-phosphate isomerase (TPI; EC 5.3.1.1) has been central to the long-standing controversy on the origin and evolutionary significance of spliceosomal introns by virtue of its pivotal support for the introns-early view, or exon theory of genes. Putative correlations between intron positions and TPI protein structure have led to the conjecture that the gene was assembled by exon shuffling, and five TPI intron positions are old by the criterion of being conserved between animals and plants. We have sequenced TPI genes from three diverse eukaryotes--the basidiomycete Coprinus cinereus, the nematode Caenorhabditis elegans, and the insect Heliothis virescens--and have found introns at seven novel positions that disrupt previously recognized gene/protein structure correlations. The set of 21 TPI introns now known is consistent with a random model of intron insertion. Twelve of the 21 TPI introns appear to be of recent origin since each is present in but a single examined species. These results, together with their implication that as more TPI genes are sequenced more intron positions will be found, render TPI untenable as a paradigm for the introns-early theory and, instead, support the introns-late view that spliceosomal introns have been inserted into preexisting genes during eukaryotic evolution.
Resumo:
By using taxonomic characters derived from EcoRI restriction endonuclease digestion of genomic DNA and hybridization with a labeled rRNA operon from Escherichia coli, a polymorphic structure of Listeria monocytogenes, characterized by fragments with different frequencies of occurrence, was observed. This structure was expanded by creating predicted patterns through a recursive process of observation, expectation, prediction, and assessment of completeness. This process was applied, in turn, to normalized strain patterns, fragment bands, and positions of EcoRI recognition sites relative to rRNA regions. Analysis of 1346 strains provided observed patterns, fragment sizes, and their frequencies of occurrence in the patterns. Fragment size statistics led to the creation of unobserved combinations of bands, predicted pattern types. The observed fragment bands revealed positions of EcoRI sites relative to rRNA sequences. Each EcoRI site had a frequency of occurrence, and unobserved fragment sizes were postulated on the basis of knowing the restriction site locations. The result of the recursion process applied to the components of the strain data was an extended classification with observed and predicted members.
Resumo:
Sequence analysis of peptides naturally presented by major histocompatibility complex (MHC) class I molecules has revealed allele-specific motifs in which the peptide length and the residues observed at certain positions are restricted. Nevertheless, peptides containing the standard motif often fail to bind with high affinity or form physiologically stable complexes. Here we present the crystal structure of a well-characterized antigenic peptide from ovalbumin [OVA-8, ovalbumin-(257-264), SIINFEKL] in complex with the murine MHC class I H-2Kb molecule at 2.5-A resolution. Hydrophobic peptide residues Ile-P2 and Phe-P5 are packed closely together into binding pockets B and C, suggesting that the interplay of peptide anchor (P5) and secondary anchor (P2) residues can couple the preferred sequences at these positions. Comparison with the crystal structures of H-2Kb in complex with peptides VSV-8 (RGYVYQGL) and SEV-9 (FAPGNYPAL), where a Tyr residue is used as the C pocket anchor, reveals that the conserved water molecule that binds into the B pocket and mediates hydrogen bonding from the buried anchor hydroxyl group could not be likewise positioned if the P2 side chain were of significant size. Based on this structural evidence, H-2Kb has at least two submotifs: one with Tyr at P5 (or P6 for nonamer peptides) and a small residue at P2 (i.e., Ala or Gly) and another with Phe at P5 and a medium-sized hydrophobic residue at P2 (i.e., Ile). Deciphering of these secondary submotifs from both crystallographic and immunological studies of MHC peptide binding should increase the accuracy of T-cell epitope prediction.