9 resultados para evolutionary computation
em DigitalCommons@The Texas Medical Center
Resumo:
POLN is a nuclear A-family DNA polymerase encoded in vertebrate genomes. POLN has unusual fidelity and DNA lesion bypass properties, including strong strand displacement activity, low fidelity favoring incorporation of T for template G and accurate translesion synthesis past a 5S-thymine glycol (5S-Tg). We searched for conserved features of the polymerase domain that distinguish it from prokaryotic pol I-type DNA polymerases. A Lys residue (679 in human POLN) of particular interest was identified in the conserved 'O-helix' of motif 4 in the fingers sub-domain. The corresponding residue is one of the most important for controlling fidelity of prokaryotic pol I and is a nonpolar Ala or Thr in those enzymes. Kinetic measurements show that K679A or K679T POLN mutant DNA polymerases have full activity on nondamaged templates, but poorly incorporate T opposite template G and do not bypass 5S-Tg efficiently. We also found that a conserved Tyr residue in the same motif not only affects sensitivity to dideoxynucleotides, but also greatly influences enzyme activity, fidelity and bypass. Protein sequence alignment reveals that POLN has three specific insertions in the DNA polymerase domain. The results demonstrate that residues have been strictly retained during evolution that confer unique bypass and fidelity properties on POLN.
Resumo:
Molluscan preparations have yielded seminal discoveries in neuroscience, but the experimental advantages of this group have not, until now, been complemented by adequate molecular or genomic information for comparisons to genetically defined model organisms in other phyla. The recent sequencing of the transcriptome and genome of Aplysia californica, however, will enable extensive comparative studies at the molecular level. Among other benefits, this will bring the power of individually identifiable and manipulable neurons to bear upon questions of cellular function for evolutionarily conserved genes associated with clinically important neural dysfunction. Because of the slower rate of gene evolution in this molluscan lineage, more homologs of genes associated with human disease are present in Aplysia than in leading model organisms from Arthropoda (Drosophila) or Nematoda (Caenorhabditis elegans). Research has hardly begun in molluscs on the cellular functions of gene products that in humans are associated with neurological diseases. On the other hand, much is known about molecular and cellular mechanisms of long-term neuronal plasticity. Persistent nociceptive sensitization of nociceptors in Aplysia displays many functional similarities to alterations in mammalian nociceptors associated with the clinical problem of chronic pain. Moreover, in Aplysia and mammals the same cell signaling pathways trigger persistent enhancement of excitability and synaptic transmission following noxious stimulation, and these highly conserved pathways are also used to induce memory traces in neural circuits of diverse species. This functional and molecular overlap in distantly related lineages and neuronal types supports the proposal that fundamental plasticity mechanisms important for memory, chronic pain, and other lasting alterations evolved from adaptive responses to peripheral injury in the earliest neurons. Molluscan preparations should become increasingly useful for comparative studies across phyla that can provide insight into cellular functions of clinically important genes.
Resumo:
A series of human-rodent somatic cell hybrids were investigated by Southern blot analysis for the presence or absence of twenty-six molecular markers and three isozyme loci from human chromosome 19. Based on the co-retention of these markers in the various independent hybrid clones containing portions of human chromosome 19 and on pulsed field mapping, chromosome 19 is divided into twenty ordered regions. The most likely marker order for the chromosome is: (LDLR, C3)-(cen-MANNB)-D19S7-PEPD-D19S9-GPI-TGF$ \beta$-(CYP2A, NCA, CGM2, BCKAD)-PSG1a-(D19S8, XRCC1)-(D19S19, ATP1A3)-(D19S37, APOC2)-CKMM-ERCC2-ERCC1-(D19S62, D19S51)-D19S6-D19S50-D19S22-(CGB, FTL)-qter.^ The region of 19q between the proximal marker D19S7 and the distal gene coding for the beta subunit of chorionic gonadotropin (CGB) is about 37 Mb in size and covers about 37 cM genetic distance. The ration of genetic to physical distance on 19q is therefore very close to the genomic average OF 1 cM/Mb. Estimates of physical distances for intervals between chromosome 19 markers were calculated using a mapping function which estimates distances based on the number of breaks in hybrid clone panels. The consensus genetic distances between individual markers (established at HBM10) were compared to these estimates of physical distances. The close agreement between the two estimates suggested that spontaneously broken hybrids are as appropriate for this type of study as radiation hybrids.^ All three DNA repair genes located on chromosome 19 were found to have homologues on Chinese hamster chromosome 9, which is hemizygous in CHO cells, providing an explanation for the apparent ease with which mutations at these loci were identified in CHO cells. Homologues of CKMM and TGF$\beta$ (from human chromosome 19q) and a mini-satellite DNA specific to the distal region of human chromosome 19q were also mapped to Chinese hamster 9. Markers from 19p did not map to this hamster chromosome. Thus the q-arm of chromosome 19, at least between the genes PEPD and ERCC1, appears to be a linkage group which is conserved intact between humans and Chinese hamsters. ^
Resumo:
Variable number of tandem repeats (VNTR) are genetic loci at which short sequence motifs are found repeated different numbers of times among chromosomes. To explore the potential utility of VNTR loci in evolutionary studies, I have conducted a series of studies to address the following questions: (1) What are the population genetic properties of these loci? (2) What are the mutational mechanisms of repeat number change at these loci? (3) Can DNA profiles be used to measure the relatedness between a pair of individuals? (4) Can DNA fingerprint be used to measure the relatedness between populations in evolutionary studies? (5) Can microsatellite and short tandem repeat (STR) loci which mutate stepwisely be used in evolutionary analyses?^ A large number of VNTR loci typed in many populations were studied by means of statistical methods developed recently. The results of this work indicate that there is no significant departure from Hardy-Weinberg expectation (HWE) at VNTR loci in most of the human populations examined, and the departure from HWE in some VNTR loci are not solely caused by the presence of population sub-structure.^ A statistical procedure is developed to investigate the mutational mechanisms of VNTR loci by studying the allele frequency distributions of these loci. Comparisons of frequency distribution data on several hundreds VNTR loci with the predictions of two mutation models demonstrated that there are differences among VNTR loci grouped by repeat unit sizes.^ By extending the ITO method, I derived the distribution of the number of shared bands between individuals with any kinship relationship. A maximum likelihood estimation procedure is proposed to estimate the relatedness between individuals from the observed number of shared bands between them.^ It was believed that classical measures of genetic distance are not applicable to analysis of DNA fingerprints which reveal many minisatellite loci simultaneously in the genome, because the information regarding underlying alleles and loci is not available. I proposed a new measure of genetic distance based on band sharing between individuals that is applicable to DNA fingerprint data.^ To address the concern that microsatellite and STR loci may not be useful for evolutionary studies because of the convergent nature of their mutation mechanisms, by a theoretical study as well as by computer simulation, I conclude that the possible bias caused by the convergent mutations can be corrected, and a novel measure of genetic distance that makes the correction is suggested. In summary, I conclude that hypervariable VNTR loci are useful in evolutionary studies of closely related populations or species, especially in the study of human evolution and the history of geographic dispersal of Homo sapiens. (Abstract shortened by UMI.) ^
Resumo:
D1S1, an anonymous human DNA clone originally called (lamda)Ch4-H3 or (lamda)H3, was the first single copy mapped to a human chromosome (1p36) by in situ hybridization. The chromosomal assignment has been confirmed in other laboratories by repeating the in situ hybridization but not by another method. In the present study, hybridization to a panel of hamster-human somatic cell hybrids revealed copies of D1S1 on both chromosomes 1 and 3. Subcloning D1S1 showed that the D1S1 clone itself is from chromosome 3, and the sequence detected by in situ hybridization is at least two copies of part of the chromosome 3 copy. This finding demonstrates the importance of verifying gene mapping with two methods and questions the accuracy of in situ hybridization mapping.^ Non-human mammals have only one copy of D1S1, and the non-human primate D1S1 map closely resembles the human chromosome 3 copy. Thus, the human chromosome 1 copies appear to be part of a very recent duplication that occurred after the divergence between humans and the other great apes.^ A moderately informative HindIII D1S1 RFLP was mapped to chromosome 3. This marker and 12 protein markers were applied to a linkage study of autosomal dominant retinitis pigmentosa (ADRP). None of the markers proved linkage, but adding the three families examined to previously published data raises the ADRP:Rh lod score to 1.92 at (THETA) = 0.30. ^
Resumo:
With the aim of understanding the mechanism of molecular evolution, mathematical problems on the evolutionary change of DNA sequences are studied. The problems studied and the results obtained are as follows: (1) Estimation of evolutionary distance between nucleotide sequences. Studying the pattern of nucleotide substitution for the case of unequal substitution rates, a new mathematical formula for estimating the average number of nucleotide substitutions per site between two homologous DNA sequences is developed. It is shown that this formula has a wider applicability than currently available formulae. A statistical method for estimating the number of nucleotide changes due to deletion and insertion is also developed. (2) Biases of the estimates of nucleotide substitutions obtained by the restriction enzyme method. The deviation of the estimate of nucleotide substitutions obtained by the restriction enzyme method from the true value is investigated theoretically. It is shown that the amount of the deviation depends on the nucleotides in the recognition sequence of the restriction enzyme used, unequal rates of substitution among different nucleotides, and nucleotide frequences, but the primary factor is the unequal rates of nucleotide substitution. When many different kinds of enzymes are used, however, the amount of average deviation is generally small. (3) Distribution of restriction fragment lengths. To see the effect of undetectable restriction fragments and fragment differences on the estimate of nucleotide differences, the theoretical distribution of fragment lengths is studied. This distribution depends on the type of restriction enzymes used as well as on the relative frequencies of four nucleotides. It is shown that undetectability of small fragments or fragment differences gives a serious underestimate of nucleotide substitutions when the length-difference method of estimation is used, but the extent of underestimation is small when the site-difference method is used. (4) Evolutionary relationships of DNA sequences in finite populations. A mathematical theory on the expected evolutionary relationships among DNA sequences (nucleons) randomly chosen from the same or different populations is developed under the assumption that the evolutionary change of nucleons is determined solely by mutation and random genetic drift. . . . (Author's abstract exceeds stipulated maximum length. Discontinued here with permission of author). UMI ^
Resumo:
Essential biological processes are governed by organized, dynamic interactions between multiple biomolecular systems. Complexes are thus formed to enable the biological function and get dissembled as the process is completed. Examples of such processes include the translation of the messenger RNA into protein by the ribosome, the folding of proteins by chaperonins or the entry of viruses in host cells. Understanding these fundamental processes by characterizing the molecular mechanisms that enable then, would allow the (better) design of therapies and drugs. Such molecular mechanisms may be revealed trough the structural elucidation of the biomolecular assemblies at the core of these processes. Various experimental techniques may be applied to investigate the molecular architecture of biomolecular assemblies. High-resolution techniques, such as X-ray crystallography, may solve the atomic structure of the system, but are typically constrained to biomolecules of reduced flexibility and dimensions. In particular, X-ray crystallography requires the sample to form a three dimensional (3D) crystal lattice which is technically di‑cult, if not impossible, to obtain, especially for large, dynamic systems. Often these techniques solve the structure of the different constituent components within the assembly, but encounter difficulties when investigating the entire system. On the other hand, imaging techniques, such as cryo-electron microscopy (cryo-EM), are able to depict large systems in near-native environment, without requiring the formation of crystals. The structures solved by cryo-EM cover a wide range of resolutions, from very low level of detail where only the overall shape of the system is visible, to high-resolution that approach, but not yet reach, atomic level of detail. In this dissertation, several modeling methods are introduced to either integrate cryo-EM datasets with structural data from X-ray crystallography, or to directly interpret the cryo-EM reconstruction. Such computational techniques were developed with the goal of creating an atomic model for the cryo-EM data. The low-resolution reconstructions lack the level of detail to permit a direct atomic interpretation, i.e. one cannot reliably locate the atoms or amino-acid residues within the structure obtained by cryo-EM. Thereby one needs to consider additional information, for example, structural data from other sources such as X-ray crystallography, in order to enable such a high-resolution interpretation. Modeling techniques are thus developed to integrate the structural data from the different biophysical sources, examples including the work described in the manuscript I and II of this dissertation. At intermediate and high-resolution, cryo-EM reconstructions depict consistent 3D folds such as tubular features which in general correspond to alpha-helices. Such features can be annotated and later on used to build the atomic model of the system, see manuscript III as alternative. Three manuscripts are presented as part of the PhD dissertation, each introducing a computational technique that facilitates the interpretation of cryo-EM reconstructions. The first manuscript is an application paper that describes a heuristics to generate the atomic model for the protein envelope of the Rift Valley fever virus. The second manuscript introduces the evolutionary tabu search strategies to enable the integration of multiple component atomic structures with the cryo-EM map of their assembly. Finally, the third manuscript develops further the latter technique and apply it to annotate consistent 3D patterns in intermediate-resolution cryo-EM reconstructions. The first manuscript, titled An assembly model for Rift Valley fever virus, was submitted for publication in the Journal of Molecular Biology. The cryo-EM structure of the Rift Valley fever virus was previously solved at 27Å-resolution by Dr. Freiberg and collaborators. Such reconstruction shows the overall shape of the virus envelope, yet the reduced level of detail prevents the direct atomic interpretation. High-resolution structures are not yet available for the entire virus nor for the two different component glycoproteins that form its envelope. However, homology models may be generated for these glycoproteins based on similar structures that are available at atomic resolutions. The manuscript presents the steps required to identify an atomic model of the entire virus envelope, based on the low-resolution cryo-EM map of the envelope and the homology models of the two glycoproteins. Starting with the results of the exhaustive search to place the two glycoproteins, the model is built iterative by running multiple multi-body refinements to hierarchically generate models for the different regions of the envelope. The generated atomic model is supported by prior knowledge regarding virus biology and contains valuable information about the molecular architecture of the system. It provides the basis for further investigations seeking to reveal different processes in which the virus is involved such as assembly or fusion. The second manuscript was recently published in the of Journal of Structural Biology (doi:10.1016/j.jsb.2009.12.028) under the title Evolutionary tabu search strategies for the simultaneous registration of multiple atomic structures in cryo-EM reconstructions. This manuscript introduces the evolutionary tabu search strategies applied to enable a multi-body registration. This technique is a hybrid approach that combines a genetic algorithm with a tabu search strategy to promote the proper exploration of the high-dimensional search space. Similar to the Rift Valley fever virus, it is common that the structure of a large multi-component assembly is available at low-resolution from cryo-EM, while high-resolution structures are solved for the different components but lack for the entire system. Evolutionary tabu search strategies enable the building of an atomic model for the entire system by considering simultaneously the different components. Such registration indirectly introduces spatial constrains as all components need to be placed within the assembly, enabling the proper docked in the low-resolution map of the entire assembly. Along with the method description, the manuscript covers the validation, presenting the benefit of the technique in both synthetic and experimental test cases. Such approach successfully docked multiple components up to resolutions of 40Å. The third manuscript is entitled Evolutionary Bidirectional Expansion for the Annotation of Alpha Helices in Electron Cryo-Microscopy Reconstructions and was submitted for publication in the Journal of Structural Biology. The modeling approach described in this manuscript applies the evolutionary tabu search strategies in combination with the bidirectional expansion to annotate secondary structure elements in intermediate resolution cryo-EM reconstructions. In particular, secondary structure elements such as alpha helices show consistent patterns in cryo-EM data, and are visible as rod-like patterns of high density. The evolutionary tabu search strategy is applied to identify the placement of the different alpha helices, while the bidirectional expansion characterizes their length and curvature. The manuscript presents the validation of the approach at resolutions ranging between 6 and 14Å, a level of detail where alpha helices are visible. Up to resolution of 12 Å, the method measures sensitivities between 70-100% as estimated in experimental test cases, i.e. 70-100% of the alpha-helices were correctly predicted in an automatic manner in the experimental data. The three manuscripts presented in this PhD dissertation cover different computation methods for the integration and interpretation of cryo-EM reconstructions. The methods were developed in the molecular modeling software Sculptor (http://sculptor.biomachina.org) and are available for the scientific community interested in the multi-resolution modeling of cryo-EM data. The work spans a wide range of resolution covering multi-body refinement and registration at low-resolution along with annotation of consistent patterns at high-resolution. Such methods are essential for the modeling of cryo-EM data, and may be applied in other fields where similar spatial problems are encountered, such as medical imaging.
Resumo:
Normal humans have one red and at least one green visual pigment genes. These genes are tightly linked as tandem repeats on the X chromosome and each of them has six exons. There is only one X-linked visual pigment gene in New World monkeys (NWMs) but the locus has three polymorphic alleles encoding red, yellow and green visual pigments, respectively. The spectral properties of the squirrel monkey and the marmoset (both NWMs) have been studied and partial sequences of the three alleles are available. To study the evolutionary history of these X-linked opsin genes in humans and NWMs, coding and intron sequences of the three squirrel monkey alleles and the three marmoset alleles were amplified by PCR followed by subcloning and sequencing. Introns 2 and 4 of the human red and green pigment genes were also sequenced. The results obtained are as follows: (1) The sequences of introns 2 and 4 of the human red and green opsin genes are significantly more similar between the two genes than are coding sequences, contrary to the usual situation where coding regions are better conserved in evolution than are introns. The high similarities in the two introns are probably due to recent gene conversion events during evolution of the human lineage. (2) Phylogenetic analysis of both intron and exon sequences indicates that the phylogenetic tree of the available primate opsin genes is the same as the species tree. The two human genes were derived from a gene duplication event after the divergence of the human and NWM lineages. The three alleles in each of the two NWM species diverged after the split of the two NWMs but have persisted in the population for at least 5 million years. (3) Allelic gene conversion might have occurred between the three squirrel monkey alleles. (4) A model of additive effect of hydroxyl-bearing amino acids on spectral tuning is proposed by treating some unknown variables as groups. Under the assumption that some residues have no effect, it is found that at least five amino acid residues, at positions 178 (3 nm), 180 (5 nm), 230 ($-$4 nm), 277 (9 nm) and 285 (13 nm), have linear spectral tuning effects. (5) Adaptive evolution of the opsin genes to different spectral peaks was observed at four residues that are important for spectral tuning. ^