171 resultados para Biology, Molecular|Biology, Genetics|Computer Science
em DigitalCommons@The Texas Medical Center
Resumo:
Historically morphological features were used as the primary means to classify organisms. However, the age of molecular genetics has allowed us to approach this field from the perspective of the organism's genetic code. Early work used highly conserved sequences, such as ribosomal RNA. The increasing number of complete genomes in the public data repositories provides the opportunity to look not only at a single gene, but at organisms' entire parts list. ^ Here the Sequence Comparison Index (SCI) and the Organism Comparison Index (OCI), algorithms and methods to compare proteins and proteomes, are presented. The complete proteomes of 104 sequenced organisms were compared. Over 280 million full Smith-Waterman alignments were performed on sequence pairs which had a reasonable expectation of being related. From these alignments a whole proteome phylogenetic tree was constructed. This method was also used to compare the small subunit (SSU) rRNA from each organism and a tree constructed from these results. The SSU rRNA tree by the SCI/OCI method looks very much like accepted SSU rRNA trees from sources such as the Ribosomal Database Project, thus validating the method. The SCI/OCI proteome tree showed a number of small but significant differences when compared to the SSU rRNA tree and proteome trees constructed by other methods. Horizontal gene transfer does not appear to affect the SCI/OCI trees until the transferred genes make up a large portion of the proteome. ^ As part of this work, the Database of Related Local Alignments (DaRLA) was created and contains over 81 million rows of sequence alignment information. DaRLA, while primarily used to build the whole proteome trees, can also be applied shared gene content analysis, gene order analysis, and creating individual protein trees. ^ Finally, the standard BLAST method for analyzing shared gene content was compared to the SCI method using 4 spirochetes. The SCI system performed flawlessly, finding all proteins from one organism against itself and finding all the ribosomal proteins between organisms. The BLAST system missed some proteins from its respective organism and failed to detect small ribosomal proteins between organisms. ^
Resumo:
(1) A mathematical theory for computing the probabilities of various nucleotide configurations is developed, and the probability of obtaining the correct phylogenetic tree (model tree) from sequence data is evaluated for six phylogenetic tree-making methods (UPGMA, distance Wagner method, transformed distance method, Fitch-Margoliash's method, maximum parsimony method, and compatibility method). The number of nucleotides (m*) necessary to obtain the correct tree with a probability of 95% is estimated with special reference to the human, chimpanzee, and gorilla divergence. m* is at least 4,200, but the availability of outgroup species greatly reduces m* for all methods except UPGMA. m* increases if transitions occur more frequently than transversions as in the case of mitochondrial DNA. (2) A new tree-making method called the neighbor-joining method is proposed. This method is applicable either for distance data or character state data. Computer simulation has shown that the neighbor-joining method is generally better than UPGMA, Farris' method, Li's method, and modified Farris method on recovering the true topology when distance data are used. A related method, the simultaneous partitioning method, is also discussed. (3) The maximum likelihood (ML) method for phylogeny reconstruction under the assumption of both constant and varying evolutionary rates is studied, and a new algorithm for obtaining the ML tree is presented. This method gives a tree similar to that obtained by UPGMA when constant evolutionary rate is assumed, whereas it gives a tree similar to that obtained by the maximum parsimony tree and the neighbor-joining method when varying evolutionary rate is assumed. ^
Resumo:
A strain of Saccaromyces cerevisiae (SC3B) with a temperature sensitive defect in the synthesis of DNA has been isolated. This defect is due to a single recessive mutation in a gene named INS1 required for the initiation of S phase. Arrested cells carrying the ins1$\sp{ts}$ allele are defective in the completion of G1 to S phase transition events including SPB duplication or separation, initiation of DNA synthesis, normal control of budding, and bud neck stability. The mutation and a gene which complements the mutation were mapped to chromosome IV. The complementing gene was proved to be the wild type allele of the temperature sensitive mutation by genetic linkage of an integrated clone. A very low abundance 4.2 kb RNA message was observed in the strain SC3B which increased greatly in this strain transformed with a multiple copy plasmid carrying the complementing clone. The wild type gene was sequenced and found to encode a 1268 amino acid protein of with a molecular weight of 142,655 Daltons. Computer assisted searches for similar DNA sequences revealed no significant homology matches. However, searches for protein sequence homology revealed a protein (the DIS3 gene product of S. pombe) with a similar sequence over a 534 amino acid stretch to the predicted INS1 gene product. A later search revealed a near identical sequence for a gene (SRK1) also isolated from S. cerevisiae. ^
Resumo:
Primate immunodeficiency viruses, or lentiviruses (HIV-1, HIV-2, and SIV), and hepatitis delta virus (HDV) are RNA viruses characterized by rapid evolution. Infection by primate immunodeficiency viruses usually results in the development of acquired immunodeficiency syndrome (AIDS) in humans and AIDS-like illnesses in Asian macaques. Similarly, hepatitis delta virus infection causes hepatitis and liver cancer in humans. These viruses are heterogeneous within an infected patient and among individuals. Substitution rates in the virus genomes are high and vary in different lineages and among sites. Methods of phylogenetic analysis were applied to study the evolution of primate lentiviruses and the hepatitis delta virus. The following results have been obtained: (1) The substitution rate varies among sites of primate lentivirus genes according to the two parameter gamma distribution, with the shape parameter $\alpha$ being close to 1. (2) Primate immunodeficiency viruses fall into species-specific lineages. Therefore, viral transmissions across primate species are not as frequent as suggested by previous authors. (3) Primate lentiviruses have acquired or lost their pathogenicity several times in the course of evolution. (4) Evidence was provided for multiple infections of a North American patient by distinct HIV-1 strains of the B subtype. (5) Computer simulations indicate that the probability of committing an error in testing HIV transmission depends on the number of virus sequences and their length, the divergence times among sequences, and the model of nucleotide substitution. (6) For future investigations of HIV-1 transmissions, using longer virus sequences and avoiding the use of distant outgroups is recommended. (7) Hepatitis delta virus strains are usually related according to the geographic region of isolation. (8) Evolution of HDV is characterized by the rate of synonymous substitution being lower than the nonsynonymous substitution rate and the rate of evolution of the noncoding region. (9) There is a strong preference for G and C nucleotides at the third codon positions of the HDV coding region. ^
Resumo:
Normal humans have one red and at least one green visual pigment genes. These genes are tightly linked as tandem repeats on the X chromosome and each of them has six exons. There is only one X-linked visual pigment gene in New World monkeys (NWMs) but the locus has three polymorphic alleles encoding red, yellow and green visual pigments, respectively. The spectral properties of the squirrel monkey and the marmoset (both NWMs) have been studied and partial sequences of the three alleles are available. To study the evolutionary history of these X-linked opsin genes in humans and NWMs, coding and intron sequences of the three squirrel monkey alleles and the three marmoset alleles were amplified by PCR followed by subcloning and sequencing. Introns 2 and 4 of the human red and green pigment genes were also sequenced. The results obtained are as follows: (1) The sequences of introns 2 and 4 of the human red and green opsin genes are significantly more similar between the two genes than are coding sequences, contrary to the usual situation where coding regions are better conserved in evolution than are introns. The high similarities in the two introns are probably due to recent gene conversion events during evolution of the human lineage. (2) Phylogenetic analysis of both intron and exon sequences indicates that the phylogenetic tree of the available primate opsin genes is the same as the species tree. The two human genes were derived from a gene duplication event after the divergence of the human and NWM lineages. The three alleles in each of the two NWM species diverged after the split of the two NWMs but have persisted in the population for at least 5 million years. (3) Allelic gene conversion might have occurred between the three squirrel monkey alleles. (4) A model of additive effect of hydroxyl-bearing amino acids on spectral tuning is proposed by treating some unknown variables as groups. Under the assumption that some residues have no effect, it is found that at least five amino acid residues, at positions 178 (3 nm), 180 (5 nm), 230 ($-$4 nm), 277 (9 nm) and 285 (13 nm), have linear spectral tuning effects. (5) Adaptive evolution of the opsin genes to different spectral peaks was observed at four residues that are important for spectral tuning. ^
Resumo:
Pedigree analysis of certain families with a high incidence of tumors suggests a genetic predisposition to cancer. Li and Fraumeni described a familial cancer syndrome that is characterized by multiple primary tumors, early age of onset, and marked variation in tumor type. Williams and Strong (1) demonstrated that at least 7% of childhood soft tissue sarcoma patients had family histories that is readily explained by a highly penetrant autosomal dominant gene. To characterize the mechanism for genetic predisposition to many tumor types in these families, we have studied genetic alterations in fibroblasts, a target tissue from patients with the Li-Fraumeni Syndrome (LFS).^ We have observed spontaneous changes in initially normal dermal fibroblasts from LFS patients as they are cultured in vitro. The cells acquire an altered morphology, chromosomal anomalies, and anchorage-independent growth. This aberrant behavior of fibroblasts from LFS patients had never been observed in fibroblasts from normal donors. In addition to these phenotypic alterations, patient fibroblasts spontaneously immortalize by 50 population doublings (pd) in culture; unlike controls that remain normal and senesce by 30-35 (2). At 50 pd, immortal fibroblasts from two patients were found to be susceptible to tumorigenic transformation by an activated T24 H-ras oncogene (3). Approximately 80% of the oncogene expressing transfectants were capable of forming tumors in nude mice within 2-3 weeks. p53 has been previously associated with immortalization of cells in culture and cooperation with ras in transfection assays. Therefore, patients' fibroblast and lymphocyte derived DNA was tested for point mutations in p53. It was shown that LFS patients inherited certain point mutations in one of the two p53 alleles (4). Further studies on the above LFS immortal fibroblasts have demonstrated loss of the remaining p53 allele concomitant with escape from senescence. While the loss of the second allele correlates with immortalization it is not sufficient to transformation by an activated H-ras or N-ras oncogene. These immortal fibroblasts are resistant to tumorigenic transformation by v-abl, v-src, c-neu or v-mos oncogene; implying that additional steps are required in the tumorigenic progression of LFS patients' fibroblasts.^ References. (1) Williams et al., J. Natl. Cancer Inst. 79:1213, 1987. (2) Bischoff et al., Cancer Res. 50:7979, 1990. (3) Bischoff et al., Oncogene 6:183, 1991. (4) Malkin et al., Science 250:1233, 1990. ^
Resumo:
Genetic evidence has indicated that the segmentation gene runt plays a key role in regulating gene expression of the pair-rule genes hairy, even-skipped, and fushi tarazu. In contrast to other pair-rule genes, sequence data of the runt open reading frame did not reveal homologies to DNA-binding motifs of known transcriptional regulatory proteins. This thesis project examined several properties of the runt gene based on the sequence of the transcription unit, including the subcellular localization of the protein in vivo, its ability to bind DNA, and the functionality of a putative nucleotide binding domain.^ A runt-specific antibody was generated and used to demonstrate that runt is localized in the nucleus. Since the precise overlap of the pair-rule stripes is thought to be critical for the determination of cellular identity along the anterior-posterior axis, phasing of early runt expression in the blastoderm was examined with regard to the segmentation genes hairy, even-skipped, and fushi tarazu. runt was also expressed at later stages of embryogenesis, including expression in neuroblasts, and ganglion mother cells of the developing nervous system. Expression at this stage was required for the subsequent formation of specific neurons and runt was extensively expressed in the central and peripheral nervous systems.^ Several experiments were done to address the biochemical function of the runt protein. A direct interaction of runt with DNA was first examined. Although bacterial expressed runt was found to bind dsDNA-cellulose, subsequent experiments failed to detect sequence-specific interactions with DNA. Inter-species conservation of the putative nucleotide binding domain suggested that this region was functionally important, and runt protein bound a labeled ATP analog with high affinity in vitro. Finally, the effect of substitution of a critical residue of the nucleotide binding domain on runt activity was examined in vivo. Ectopic expression of the mutant protein indicated that this conserved substitution altered, but did not eliminate, runt activity as evaluated by segmentation phenotype and viability. ^
Resumo:
I studied the apolipoprotein (apo) B 3$\sp\prime$ variable number tandem repeat (VNTR) and did computer simulations of the stepwise mutation model to address four questions: (1) How did the apo B VNTR originate? (2) What is the mutational mechanism of repeat number change at the apo B VNTR? (3) To what extent are population and molecular level events responsible for the determination of the contemporary apo B allele frequency distribution? (4) Can VNTR allele frequency distributions be explained by a simple and conservative mutation-drift model? I used three general approaches to address these questions: (1) I characterized the apo B VNTR region in non-human primate species; (2) I constructed haplotypes of polymorphic markers flanking the apo B VNTR in a sample of individuals from Lorrain, France and studied the associations between the flanking-marker haplotypes and apo B VNTR size; (3) I did computer simulations of the one-step stepwise mutation model and compared the results to real data in terms of four allele frequency distribution characteristics.^ The results of this work have allowed me to conclude that the apo B VNTR originated after an initial duplication of a sequence which is still present as a single copy sequence in New World monkey species. I conclude that this locus did not originate by the transposition of an array of repeats from somewhere else in the genome. It is unlikely that recombination is the primary mutational mechanism. Furthermore, the clustered nature of these associations implicates a stepwise mutational mechanism. From the high frequencies of certain haplotype-allele size combinations, it is evident that population level events have also been important in the determination of the apo B VNTR allele frequency distribution. Results from computer simulations of the one-step stepwise mutation model have allowed me to conclude that bimodal and multimodal allele frequency distributions are not unexpected at loci evolving via stepwise mutation mechanisms. Short tandem repeat loci fit the stepwise mutation model best, followed by microsatellite loci. I therefore conclude that there are differences in the mutational mechanisms of VNTR loci as classed by repeat unit size. (Abstract shortened by UMI.) ^
Resumo:
Molecular and cytogenetic analyses of human glioblastomas have revealed frequent genetic alterations, including major deletions in chromosomes 9, 10, and 17, suggesting the presence of glioma-associated tumor suppressor genes on these chromosomes. To examine this hypothesis, copies of chromosomes 2, 4, and 10 derived from a human fibroblast cell line were independently introduced into a human glioma cell line, U251, by microcell-mediated chromosomal transfer. Successful transfer of chromosomes in each case was confirmed by resistance to the drug G418, indicating the presence of the neomycin-resistance gene previously integrated into each transferred chromosome. The presence of novel chromosomes and or chromosomal fragments was also demonstrated by molecular and karyotypic analyses. The hybrid clones containing either a novel chromosome 4 or chromosome 10 displayed suppression of the tumorigenic phenotype in vivo and suppression of the transformed phenotype in vitro, while cells containing a transferred chromosome 2 failed to alter their tumorigenic phenotype. The hybrid cells containing chromosome 4 or 10 exhibited a significant decrease in their saturation density, altered cellular morphology at high cell density, but only a slight decrease in their exponential growth rate. A dramatic decrease was observed in growth of cells with chromosome 4 or 10 in soft agarose, with the number and size of the colonies being greatly reduced, compared to the parental or chromosome 2 containing cells. The introduction of chromosome 4 or 10 also completely suppressed tumor formation in nude mice. These studies indicate that chromosome 10, as hypothesized, and chromosome 4, a novel finding for gliomas, harbor tumor suppressor loci that may be directly involved in the initiation or progression of normal glial precursors to human glioblastoma multiforme. ^
Resumo:
Variable number of tandem repeats (VNTR) are genetic loci at which short sequence motifs are found repeated different numbers of times among chromosomes. To explore the potential utility of VNTR loci in evolutionary studies, I have conducted a series of studies to address the following questions: (1) What are the population genetic properties of these loci? (2) What are the mutational mechanisms of repeat number change at these loci? (3) Can DNA profiles be used to measure the relatedness between a pair of individuals? (4) Can DNA fingerprint be used to measure the relatedness between populations in evolutionary studies? (5) Can microsatellite and short tandem repeat (STR) loci which mutate stepwisely be used in evolutionary analyses?^ A large number of VNTR loci typed in many populations were studied by means of statistical methods developed recently. The results of this work indicate that there is no significant departure from Hardy-Weinberg expectation (HWE) at VNTR loci in most of the human populations examined, and the departure from HWE in some VNTR loci are not solely caused by the presence of population sub-structure.^ A statistical procedure is developed to investigate the mutational mechanisms of VNTR loci by studying the allele frequency distributions of these loci. Comparisons of frequency distribution data on several hundreds VNTR loci with the predictions of two mutation models demonstrated that there are differences among VNTR loci grouped by repeat unit sizes.^ By extending the ITO method, I derived the distribution of the number of shared bands between individuals with any kinship relationship. A maximum likelihood estimation procedure is proposed to estimate the relatedness between individuals from the observed number of shared bands between them.^ It was believed that classical measures of genetic distance are not applicable to analysis of DNA fingerprints which reveal many minisatellite loci simultaneously in the genome, because the information regarding underlying alleles and loci is not available. I proposed a new measure of genetic distance based on band sharing between individuals that is applicable to DNA fingerprint data.^ To address the concern that microsatellite and STR loci may not be useful for evolutionary studies because of the convergent nature of their mutation mechanisms, by a theoretical study as well as by computer simulation, I conclude that the possible bias caused by the convergent mutations can be corrected, and a novel measure of genetic distance that makes the correction is suggested. In summary, I conclude that hypervariable VNTR loci are useful in evolutionary studies of closely related populations or species, especially in the study of human evolution and the history of geographic dispersal of Homo sapiens. (Abstract shortened by UMI.) ^
Resumo:
DNA for this study was collected from a sample of 133 retinitis pigmentosa (RP) patients and the rhodopsin locus molecularly analyzed by linkage and for disease specific mutations. The cohort of patients consisted of 85 individuals diagnosed with autosomal dominant RP (adRP), and 48 patients representing other forms of retinitis pigmentosa or retinal dystrophy related disease. In three large families with adRP rhodopsin was excluded from linkage to the disease locus. A search for subtle mutations in the rhodopsin coding region using single strand conformational polymorphisms (SSCP) and sequencing detected a total of 14 unique sequence variants in 24 unrelated patients. These variants included one splicing variant, 5168 -1G-A, one deletion variant of 17 base pairs causing a frame shift at codon 332, and 12 misense variants: Pro23His, Leu46Arg, Gly106Trp, Arg135Pro, Pro171Glu, Pro180Ala, Glu181Lys, Asp190Asn, His211Arg, Ser270Arg, Leu328Pro and Pro347Thr. All but three of the missense variants change amino acids that are evolutionarily conserved. The Pro23His mutation was found in 10 unrelated individuals with family histories of adRP and not in any normal controls (over 80 chromosomes tested). The Pro180Ala mutation was present in a patient with simplex RP and probably represents a new mutation. Three normal polymorphic nucleotide substitutions, A-269-G, T-3982-C, and G-5145-A, were also identified. We conclude, based on this study, that 25% of adRP cases are attributable to rhodopsin mutations.^ Clinical data, including ERG results and visual field testing, was available for patients with eleven different mutations. The eleven patients were all diagnosed with RP, however the severity of the disease varied with five patients mildly affected and diagnosed with type II adRP and 5 patients severely affected and diagnosed with type I adRP. The patient with simplex RP was mildly affected. The location of the mutations within the rhodopsin protein was randomly associated with the severity of the disease in those patients evaluated. However, four mutations, Pro23His, Leu46Arg, Pro347Thr, and 5168 -1G-A, are particularly interesting. The Pro23His mutation appears to have radiated from a recent common ancestor of the affected patients as all of them share a common haplotype at the rhodopsin locus. The Leu46Arg mutation causes an unusually severe form of RP. Hydropathy analysis of the mutated sequence revealed a marked change in the hydrophobicity of this first transmembrane spanning region. Codon 347 has been the target of multiple mutations with at least six documented changes at the position, significantly more than expected by a random distribution of mutations. Finally the splice-site variant is extremely variable in its expression in the family studied. Similar mutations have been reported in other cases of adRP and postulated to be involved in autosomal recessive RP (arRP). Mechanisms to account for the variable expression of rhodopsin mutations in relation to RP heterogeneity are discussed. (Abstract shortened by UMI.) ^
Resumo:
The myocyte enhancer factor (MEF)-2 family of transcription factors has been implicated in the regulation of muscle transcription in vertebrates, but the precise position of these regulators within the genetic hierarchy leading to myogenesis is unclear. The MEF2 proteins bind to a conserved A/T-rich DNA sequence present in numerous muscle-specific genes, and they are expressed in the cells of the developing somites and in the embryonic heart at the onset of muscle formation in mammals. The MEF2 genes belong to the MADS box family of transcription factors, which control specific programs of gene expression in species ranging from yeast to humans. Each MEF2 family member contains two highly conserved protein motifs, the MADS domain and the MEF2-specific domain, which together provide the MEF2 factors with their unique DNA binding and dimerization properties. In an effort to further define the function of the MEF2 proteins, and to evaluate the degree of conservation shared among these factors and the phylogenetic pathways that they regulate, we sought to identify MEF2 family members in other species. In Drosophila, a homolog of the vertebrate MEF2 genes was identified and termed D-mef2. The D-MEF2 protein binds to the consensus MEF2 element and can activate transcription through tandem copies of that site. During Drosophila embryogenesis, D-MEF2 is specific to the mesoderm germ layer of the developing embryo and becomes expressed in all muscle cell types within the embryo. The role of D-mef2 in Drosophila embryogenesis was examined by generating a loss-of-function mutation in the D-mef2 gene. In embryos homozygous for this mutant allele, somatic, cardiac, and visceral muscles fail to differentiate, but precursors of these myogenic lineages are normally specified and positioned. These results demonstrate that different muscle cell types share a common myogenic differentiation program controlled by MEF2 and suggest that this program has been conserved from Drosophila to mammals. ^
Resumo:
Models of DNA sequence evolution and methods for estimating evolutionary distances are needed for studying the rate and pattern of molecular evolution and for inferring the evolutionary relationships of organisms or genes. In this dissertation, several new models and methods are developed.^ The rate variation among nucleotide sites: To obtain unbiased estimates of evolutionary distances, the rate heterogeneity among nucleotide sites of a gene should be considered. Commonly, it is assumed that the substitution rate varies among sites according to a gamma distribution (gamma model) or, more generally, an invariant+gamma model which includes some invariable sites. A maximum likelihood (ML) approach was developed for estimating the shape parameter of the gamma distribution $(\alpha)$ and/or the proportion of invariable sites $(\theta).$ Computer simulation showed that (1) under the gamma model, $\alpha$ can be well estimated from 3 or 4 sequences if the sequence length is long; and (2) the distance estimate is unbiased and robust against violations of the assumptions of the invariant+gamma model.^ However, this ML method requires a huge amount of computational time and is useful only for less than 6 sequences. Therefore, I developed a fast method for estimating $\alpha,$ which is easy to implement and requires no knowledge of tree. A computer program was developed for estimating $\alpha$ and evolutionary distances, which can handle the number of sequences as large as 30.^ Evolutionary distances under the stationary, time-reversible (SR) model: The SR model is a general model of nucleotide substitution, which assumes (i) stationary nucleotide frequencies and (ii) time-reversibility. It can be extended to SRV model which allows rate variation among sites. I developed a method for estimating the distance under the SR or SRV model, as well as the variance-covariance matrix of distances. Computer simulation showed that the SR method is better than a simpler method when the sequence length $L>1,000$ bp and is robust against deviations from time-reversibility. As expected, when the rate varies among sites, the SRV method is much better than the SR method.^ The evolutionary distances under nonstationary nucleotide frequencies: The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, I developed formulas for correcting the estimation biases of the paralinear and LogDet distances. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, I developed a method for estimating the variance-covariance matrix of the paralinear distance, so that statistical tests of phylogenies can be conducted when the nucleotide frequencies are nonstationary. Third, a new method for testing the molecular clock hypothesis was developed in the nonstationary case. ^
Resumo:
PAX6 is a transcription activator that regulates eye development in animals ranging from Drosophila to human. The C-terminal region of PAX6 is proline/serine/threonine-rich (PST) and functions as a potent transactivation domain when attached to a heterologous DNA-binding domain of the yeast transcription factor, GAL4. The PST region comprises 152 amino acids encoded by four exons. The transactivation function of the PST region has not been defined and characterized in detail by in vitro mutagenesis. I dissected the PST domain in two independent systems, a heterologous system using a GAL4 DNA-binding site and the native system of PAX6. In both systems, the results show consistently that all four constituent exons of the PST domain are responsible for the transactivation function. The four exon fragments act cooperatively to stimulate transcription, although none of them can function individually as an independent transactivation domain. Combinations of two or more exon fragments can reconstitute substantial transactivation activity when fused to the DNA-binding domain of GAL4, but they surprisingly do not produce much activity in the context of native PAX6 even though the mutant PAX6 proteins are stable and their DNA-binding function remains unaffected. I conclude that the PAX6 protein contains an unusually large transactivation domain that is evolutionarily conserved to a high degree, and that its full transactivation activity relies on the cooperative action of the four exon fragments.^ Most PAX6 mutations detected in patients with aniridia result in truncations of the protein. Some of the truncation mutations occur in the PST region of PAX6, resulting in mutant proteins that retain their DNA-binding ability but have no significant transactivation activity. It is not clear whether such mutants are true loss-of-function or dominant-negative mutants. I show that these mutants are dominant-negative if they are coexpressed with wild-type PAX6 in cultured cells and that the dominant-negative effects result from enhanced DNA-binding ability of these mutants due to removal of the PST domain. These mutants are able to repress the wild-type PAX6 activity not only at target genes with paired domain binding sites but also at target genes with homeodomain binding sites.^ Mutations in the human PAX6 gene produce various phenotypes, including aniridia, Peters' anomaly, autosomal dominant keratitis, and familial foveal dysplasia. The various phenotypes may arise from different mutations in the same gene. To test this theory, I performed a functional analysis of two missense mutations in the paired domain: the R26G mutation reported in a case of Peters' anomaly, and the I87R mutation identified in a patient with aniridia. While both the R26 and the I87 positions are conserved in the paired boxes of all known PAX genes, X-ray crystallography has shown that only R26 makes contact with DNA. I found that the R26G mutant failed to bind a subset of paired domain binding sites but, surprisingly, bound other sites and successfully transactivated promoters containing those sites. In contrast, the I87R mutant had lost the ability to bind DNA at all tested sites and failed to transactivate promoters. My data support the haploinsufficiency hypothesis of aniridia, and the hypothesis that R26G is a hypomorphic allele. ^
Resumo:
Human pigmentation is a complex trait with the observed variation caused by the varied production of eumelanin (brown/black melanins) and phaeomelanin (red/yellow melanins) by the melanocytes. The melanocortin 1 receptor (MC1R), a G protein-coupled receptor expressed in the melanocytes, is a regulator eu- and phaeomelanin synthesis, and MC1R mutations causing skin and coat color changes are known in many mammals. To understand the role of MC1R in human pigmentation variation, I have sequenced the MC1R gene in 121 individuals sampled from world populations. In addition, I have sequenced the MC1R gene in common and pygmy chimpanzees, gorilla, orangutan, and baboon to study the evolution of MC1R and to infer the ancestral human MC1R sequence. The ancestral MC1R sequence is observed in all 25 African individuals studied, but at lower frequencies in the other populations examined, especially in East and Southeast Asians. The Arg163Gln variant is absent in the Africans studied, almost absent in Europeans, and at a low frequency in Indians, but is at an exceptionally high frequency (70%) in East and Southeast Asians. To further evaluate the role of MC1R variants in human pigmentation variation, I have combined these molecular evolution and population studies with functional assays on MC1R variants and primate MC1Rs. ^