618 resultados para no duplication
Resumo:
Eight novel families of miniature inverted repeat transposable elements (MITEs) were discovered in the African malaria mosquito, Anopheles gambiae, by using new software designed to rapidly identify MITE-like sequences based on their structural characteristics. Divergent subfamilies have been found in two families. Past mobility was demonstrated by evidence of MITE insertions that resulted in the duplication of specific TA, TAA, or 8-bp targets. Some of these MITEs share the same target duplications and similar terminal sequences with MITEs and other DNA transposons in human and other organisms. MITEs in A. gambiae range from 40 to 1340 copies per genome, much less abundant than MITEs in the yellow fever mosquito, Aedes aegypti. Statistical analyses suggest that most A. gambiae MITEs are in highly AT-rich regions, many of which are closely associated with each other. The analyses of these novel MITEs underscored interesting questions regarding their diversity, origin, evolution, and relationships to the host genomes. The discovery of diverse families of MITEs in A. gambiae has important practical implications in light of current efforts to control malaria by replacing vector mosquitoes with genetically modified refractory mosquitoes. Finally, the systematic approach to rapidly identify novel MITEs should have broad applications for the analysis of the ever-growing sequence databases of a wide range of organisms.
Resumo:
ERCC1–XPF is a structure-specific nuclease with two subunits, ERCC1 and XPF. The enzyme cuts DNA at junctions where a single strand moves 5′ to 3′ away from a branch point with duplex DNA. This activity has a central role in nucleotide excision repair (NER), DNA cross-link repair and recombination. To dissect the activities of the nuclease it is necessary to investigate the subunits individually, as studies of the enzyme so far have only used the heterodimeric complex. We produced recombinant ERCC1 and XPF separately in Escherichia coli as soluble proteins. Activity was monitored by a sensitive dual incision assay for NER by complementation of cell extracts. XPF and ERCC1 are unstable in mammalian cells in the absence of their partners but we found, surprisingly, that ERCC1 alone could confer some repair to extracts from ERCC1-defective cells. A version of ERCC1 lacking the first 88 non-conserved amino acids was also functional. This indicated that a small amount of active XPF was present in ERCC1 extracts, and immunoassays showed this to be the case. Some repair in XPF-defective extracts could be achieved by adding ERCC1 and XPF proteins together, but not by adding only XPF. The results show for the first time that functional ERCC1–XPF can be formed from separately produced subunits. Protein sequence comparison revealed similarity between the ERCC1 family and the C-terminal region of the XPF family, including the regions of both proteins that are necessary for the ERCC1–XPF heterodimeric interaction. This suggests that the ERCC1 and XPF families are related via an ancient duplication.
Resumo:
We have developed a novel cost-effective procedure, namely ‘chemical nanoprinting’, for oligonucleotide or cDNA chips manufacture. In this thermo-controlled process, the oligonucleotides, covalently attached to a highly loaded ‘master-chip’ through disulfide bonds, are chemically transferred to the acrylamide layer mounted on a ‘print-chip’. It is demonstrated here that multiple identical print-chips can be produced from a single master-chip. This duplication process is a few hundreds of times faster than any existing methods and the speed of process and cost incurred are independent of the scale of the DNA chips.
Resumo:
The downstream prion-like protein (doppel, or Dpl) is a paralog of the cellular prion protein, PrPC. The two proteins have ≈25% sequence identity, but seem to have distinct physiologic roles. Unlike PrPC, Dpl does not support prion replication; instead, overexpression of Dpl in the brain seems to cause a completely different neurodegenerative disease. We report the solution structure of a fragment of recombinant mouse Dpl (residues 26–157) containing a globular domain with three helices and a small amount of β-structure. Overall, the topology of Dpl is very similar to that of PrPC. Significant differences include a marked kink in one of the helices in Dpl, and a different orientation of the two short β-strands. Although the two proteins most likely arose through duplication of a single ancestral gene, the relationship is now so distant that only the structures retain similarity; the functions have diversified along with the sequence.
Resumo:
Pseudogenes are non-functioning copies of genes in genomic DNA, which may either result from reverse transcription from an mRNA transcript (processed pseudogenes) or from gene duplication and subsequent disablement (non-processed pseudogenes). As pseudogenes are apparently ‘dead’, they usually have a variety of obvious disablements (e.g., insertions, deletions, frameshifts and truncations) relative to their functioning homologs. We have derived an initial estimate of the size, distribution and characteristics of the pseudogene population in the Caenorhabditis elegans genome, performing a survey in ‘molecular archaeology’. Corresponding to the 18 576 annotated proteins in the worm (i.e., in Wormpep18), we have found an estimated total of 2168 pseudogenes, about one for every eight genes. Few of these appear to be processed. Details of our pseudogene assignments are available from http://bioinfo.mbb.yale.edu/genome/worm/pseudogene. The population of pseudogenes differs significantly from that of genes in a number of respects: (i) pseudogenes are distributed unevenly across the genome relative to genes, with a disproportionate number on chromosome IV; (ii) the density of pseudogenes is higher on the arms of the chromosomes; (iii) the amino acid composition of pseudogenes is midway between that of genes and (translations of) random intergenic DNA, with enrichment of Phe, Ile, Leu and Lys, and depletion of Asp, Ala, Glu and Gly relative to the worm proteome; and (iv) the most common protein folds and families differ somewhat between genes and pseudogenes—whereas the most common fold found in the worm proteome is the immunoglobulin fold and the most common ‘pseudofold’ is the C-type lectin. In addition, the size of a gene family bears little overall relationship to the size of its corresponding pseudogene complement, indicating a highly dynamic genome. There are in fact a number of families associated with large populations of pseudogenes. For example, one family of seven-transmembrane receptors (represented by gene B0334.7) has one pseudogene for every four genes, and another uncharacterized family (represented by gene B0403.1) is approximately two-thirds pseudogenic. Furthermore, over a hundred apparent pseudogenic fragments do not have any obvious homologs in the worm.
Resumo:
Plant chloroplasts originated from an endosymbiotic event by which an ancestor of contemporary cyanobacteria was engulfed by an early eukaryotic cell and then transformed into an organelle. Oxygenic photosynthesis is the specific feature of cyanobacteria and chloroplasts, and the photosynthetic machinery resides in an internal membrane system, the thylakoids. The origin and genesis of thylakoid membranes, which are essential for oxygenic photosynthesis, are still an enigma. Vipp1 (vesicle-inducing protein in plastids 1) is a protein located in both the inner envelope and the thylakoids of Pisum sativum and Arabidopsis thaliana. In Arabidopsis disruption of the VIPP1 gene severely affects the plant's ability to form properly structured thylakoids and as a consequence to carry out photosynthesis. In contrast, Vipp1 in Synechocystis appears to be located exclusively in the plasma membrane. Yet, as in higher plants, disruption of the VIPP1 gene locus leads to the complete loss of thylakoid formation. So far VIPP1 genes are found only in organisms carrying out oxygenic photosynthesis. They share sequence homology with a subunit encoded by the bacterial phage shock operon (PspA) but differ from PspA by a C-terminal extension of about 30 amino acids. In two cyanobacteria, Synechocystis and Anabaena, both a VIPP1 and a pspA gene are present, and phylogenetic analysis indicates that VIPP1 originated from a gene duplication of the latter and thereafter acquired its new function. It also appears that the C-terminal extension that discriminates VIPP1 proteins from PspA is important for its function in thylakoid formation.
Resumo:
Many DNA polymerases (Pol) have an intrinsic 3′→5′ exonuclease (Exo) activity which corrects polymerase errors and prevents mutations. We describe a role of the 3′→5′ Exo of Pol δ as a supplement or backup for the Rad27/Fen1 5′ flap endonuclease. A yeast rad27 null allele was lethal in combination with Pol δ mutations in Exo I, Exo II, and Exo III motifs that inactivate its exonuclease, but it was viable with mutations in other parts of Pol δ. The rad27-p allele, which has little phenotypic effect by itself, was also lethal in combination with mutations in the Pol δ Exo I and Exo II motifs. However, rad27-p Pol δ Exo III double mutants were viable. They exhibited strong synergistic increases in CAN1 duplication mutations, intrachromosomal and interchromosomal recombination, and required the wild-type double-strand break repair genes RAD50, RAD51, and RAD52 for viability. Observed effects were similar to those of the rad27-null mutant deficient in the removal of 5′ flaps in the lagging strand. These results suggest that the 3′→5′ Exo activity of Pol δ is redundant with Rad27/Fen1 for creating ligatable nicks between adjacent Okazaki fragments, possibly by reducing the amount of strand-displacement in the lagging strand.
Resumo:
Sm proteins form the core of small nuclear ribonucleoprotein particles (snRNPs), making them key components of several mRNA-processing assemblies, including the spliceosome. We report the 1.75-Å crystal structure of SmAP, an Sm-like archaeal protein that forms a heptameric ring perforated by a cationic pore. In addition to providing direct evidence for such an assembly in eukaryotic snRNPs, this structure (i) shows that SmAP homodimers are structurally similar to human Sm heterodimers, (ii) supports a gene duplication model of Sm protein evolution, and (iii) offers a model of SmAP bound to single-stranded RNA (ssRNA) that explains Sm binding-site specificity. The pronounced electrostatic asymmetry of the SmAP surface imparts directionality to putative SmAP–RNA interactions.
Resumo:
We analyze the evolutionary dynamics of three of the best-studied plant nuclear multigene families. The data analyzed derive from the genes that encode the small subunit of ribulose-1,5-bisphosphate carboxylase (rbcS), the gene family that encodes the enzyme chalcone synthase (Chs), and the gene family that encodes alcohol dehydrogenases (Adh). In addition, we consider the limited evolutionary data available on plant transposable elements. New Chs and rbcS genes appear to be recruited at about 10 times the rate estimated for Adh genes, and this is correlated with a much smaller average gene family size for Adh genes. In addition, duplication and divergence in function appears to be relatively common for Chs genes in flowering plant evolution. Analyses of synonymous nucleotide substitution rates for Adh genes in monocots reject a linear relationship with clock time. Replacement substitution rates vary with time in a complex fashion, which suggests that adaptive evolution has played an important role in driving divergence following gene duplication events. Molecular population genetic studies of Adh and Chs genes reveal high levels of molecular diversity within species. These studies also reveal that inter- and intralocus recombination are important forces in the generation allelic novelties. Moreover, illegitimate recombination events appear to be an important factor in transposable element loss in plants. When we consider the recruitment and loss of new gene copies, the generation of allelic diversity within plant species, and ectopic exchange among transposable elements, we conclude that recombination is a pervasive force at all levels of plant evolution.
Resumo:
Concerted evolution is often invoked to explain the diversity and evolution of the multigene families of major histocompatibility complex (MHC) genes and immunoglobulin (Ig) genes. However, this hypothesis has been controversial because the member genes of these families from the same species are not necessarily more closely related to one another than to the genes from different species. To resolve this controversy, we conducted phylogenetic analyses of several multigene families of the MHC and Ig systems. The results show that the evolutionary pattern of these families is quite different from that of concerted evolution but is in agreement with the birth-and-death model of evolution in which new genes are created by repeated gene duplication and some duplicate genes are maintained in the genome for a long time but others are deleted or become nonfunctional by deleterious mutations. We found little evidence that interlocus gene conversion plays an important role in the evolution of MHC and Ig multigene families.
Resumo:
Genetic mapping of wheat, maize, and rice and other grass species with common DNA probes has revealed remarkable conservation of gene content and gene order over the 60 million years of radiation of Poaceae. The linear organization of genes in some nine different genomes differing in basic chromosome number from 5 to 12 and nuclear DNA amount from 400 to 6,000 Mb, can be described in terms of only 25 “rice linkage blocks.” The extent to which this intergenomic colinearity is confounded at the micro level by gene duplication and micro-rearrangements is still an open question. Nevertheless, it is clear that the elucidation of the organization of the economically important grasses with larger genomes, such as maize (2n = 10, 4,500 Mb DNA), will, to a greater or lesser extent, be predicted from sequence analysis of smaller genomes such as rice, with only 400 Mb, which in turn may be greatly aided by knowledge of the entire sequence of Arabidopsis, which may be available as soon as the turn of the century. Comparative genetics will provide the key to unlock the genomic secrets of crop plants with bigger genomes than Homo sapiens.
Resumo:
In previous research, we have found a steep learning curve in the production of semiconductors. We estimated that most production knowledge remains internal to the firm, but that a significant fraction “spills over” to other firms. The existence of such spillovers may justify government actions to stimulate research on semiconductor manufacturing technology. The fact that not all production knowledge spills over, meanwhile, creates opportunities for firms to form joint ventures and slide down their learning curves more efficiently. With these considerations in mind, in 1987 14 leading U.S. semiconductor producers, with the assistance of the U.S. government in the form of $100 million in annual subsidies, formed a research and development (R&D) consortium called Sematech. In previous research, we estimated that Sematech has induced its member firms to lower their R&D spending. This may reflect more sharing and less duplication of research, i.e., more research being done with each R&D dollar. If this is the case, then Sematech members may wish to replace any funding withdrawn by the U.S. government. This in turn would imply that the U.S. government’s contributions to Sematech do not induce more semiconductor research than would otherwise occur.
Resumo:
Plasmodium falciparum is the agent of malignant malaria, one of mankind's most severe maladies. The parasite exhibits antigenic polymorphisms that have been postulated to be ancient. We have proposed that the extant world populations of P. falciparum have derived from one single parasite, a cenancestor, within the last 5,000–50,000 years. This inference derives from the virtual or complete absence of synonymous nucleotide polymorphisms at genes not involved in immune or drug responses. Seeking to conciliate this claim with extensive antigenic polymorphism, we first note that allele substitutions or polymorphisms can arise very rapidly, even in a single generation, in large populations subject to strong natural selection. Second, new alleles can arise not only by single-nucleotide mutations, but also by duplication/deletion of short simple-repeat DNA sequences, a process several orders of magnitude faster than single-nucleotide mutation. We analyze three antigenic genes known to be extremely polymorphic: Csp, Msp-1, and Msp-2. We identify regions consisting of tandem or proximally repetitive short DNA sequences, including some previously unnoticed. We conclude that the antigenic polymorphisms are consistent with the recent origin of the world populations of P. falciparum inferred from the analysis of nonantigenic genes.
Resumo:
The maize genome is replete with chromosomal duplications and repetitive DNA. The duplications resulted from an ancient polyploid event that occurred over 11 million years ago. Based on DNA sequence data, the polyploid event occurred after the divergence between sorghum and maize, and hence the polyploid event explains some of the difference in DNA content between these two species. Genomic rearrangement and diploidization followed the polyploid event. Most of the repetitive DNA in the maize genome is retrotransposable elements, and they comprise 50% of the genome. Retrotransposon multiplication has been relatively recent—within the last 5–6 million years—suggesting that the proliferation of retrotransposons has also contributed to differences in DNA content between sorghum and maize. There are still unanswered questions about repetitive DNA, including the distribution of repetitive DNA throughout the genome, the relative impacts of retrotransposons and chromosomal duplication in plant genome evolution, and the hypothesized correlation of duplication events with transposition. Population genetic processes also affect the evolution of genomes. We discuss how centromeric genes should, in theory, contain less genetic diversity than noncentromeric genes. In addition, studies of diversity in the wild relatives of maize indicate that different genes have different histories and also show that domestication and intensive breeding have had heterogeneous effects on genetic diversity across genes.
Resumo:
The fate of redundant genes resulting from genome duplication is poorly understood. Previous studies indicated that ribosomal RNA genes from one parental origin are epigenetically silenced during interspecific hybridization or polyploidization. Regulatory mechanisms for protein-coding genes in polyploid genomes are unknown, partly because of difficulty in studying expression patterns of homologous genes. Here we apply amplified fragment length polymorphism (AFLP)–cDNA display to perform a genome-wide screen for orthologous genes silenced in Arabidopsis suecica, an allotetraploid derived from Arabidopsis thaliana and Cardaminopsis arenosa. We identified ten genes that are silenced from either A. thaliana or C. arenosa origin in A. suecica and located in four of the five A. thaliana chromosomes. These genes represent a variety of RNA and predicted proteins including four transcription factors such as TCP3. The silenced genes in the vicinity of TCP3 are hypermethylated and reactivated by blocking DNA methylation, suggesting epigenetic regulation is involved in the expression of orthologous genes in polyploid genomes. Compared with classic genetic mutations, epigenetic regulation may be advantageous for selection and adaptation of polyploid species during evolution and development.