35 results for Repetitive Sequences

in National Center for Biotechnology Information - NCBI


Relevance:

70.00%

Publisher:

Abstract:

We present statistical methods for analyzing replicated cDNA microarray expression data and report the results of a controlled experiment. The study was conducted to investigate inherent variability in gene expression data and the extent to which replication in an experiment produces more consistent and reliable findings. We introduce a statistical model to describe the probability that mRNA is contained in the target sample tissue, converted to probe, and ultimately detected on the slide. We also introduce a method to analyze the combined data from all replicates. Of the 288 genes considered in this controlled experiment, 32 would be expected to produce strong hybridization signals because of the known presence of repetitive sequences within them. Results based on individual replicates, however, show that there are 55, 36, and 58 highly expressed genes in replicates 1, 2, and 3, respectively. On the other hand, an analysis by using the combined data from all 3 replicates reveals that only 2 of the 288 genes are incorrectly classified as expressed. Our experiment shows that any single microarray output is subject to substantial variability. By pooling data from replicates, we can provide a more reliable analysis of gene expression data. Therefore, we conclude that designing experiments with replications will greatly reduce misclassification rates. We recommend that at least three replicates be used in designing experiments by using cDNA microarrays, particularly when gene expression data from single specimens are being analyzed.

Relevance:

70.00%

Publisher:

Abstract:

The representational difference analysis (RDA) and other subtraction techniques are used to enrich sample-specific sequences by elimination of ubiquitous sequences existing in both the sample of interest (tester) and the subtraction partner (driver). While applying the RDA to genomic DNA of cutaneous lymphoma cells in order to identify tumor relevant alterations, we predominantly isolated repetitive sequences and artificial repeat-mediated fusion products of otherwise independent PCR fragments (PCR hybrids). Since these products severely interfered with the isolation of tester-specific fragments, we developed a considerably more robust and efficient approach, termed ligation-mediated subtraction (Limes). In first applications of Limes, genomic sequences and/or transcripts of genes involved in the regulation of transcription, such as transforming growth factor β stimulated clone 22 related gene (TSC-22R), cell death and cytokine production (caspase-1) or antigen presentation (HLA class II sequences), were found to be completely absent in a cutaneous lymphoma line. On the assumption that mutations in tumor-relevant genes can affect their transcription pattern, a protocol was developed and successfully applied that allows the identification of such sequences. Due to these results, Limes may substitute/supplement other subtraction/comparison techniques such as RDA or DNA microarray techniques in a variety of different research fields.

Relevance:

60.00%

Publisher:

Abstract:

Instability of repetitive sequences, both in intronic sequences and within coding regions, has been demonstrated to be a hallmark of genomic instability in human cancer. Understanding how these mutational events arise may provide an opportunity for prevention or early intervention in cancer development. To study the source of this instability, we have identified a region of the β-lactamase gene that is tolerant to the insertion of fragments of exogenous DNA as large as 1,614 bp with minimal loss of enzyme activity, as determined by antibiotic resistance. Fragments inserted out-of-frame render Escherichia coli sensitive to antibiotic, and compensatory frameshift mutations that restore the reading frame of β-lactamase can be selected on the basis of antibiotic resistance. We have utilized this site to insert a synthetic microsatellite sequence within the β-lactamase gene and selected for mutations yielding frameshifts. This assay provides for detection of one frameshift mutation in a background of 10⁶ wild-type sequences. Mismatch repair deficiency increased the observed frameshift frequency ≈300-fold. Exposure of plasmid containing microsatellite sequences to hydrogen peroxide resulted in frameshift mutations that were localized exclusively to the microsatellite sequences, whereas DNA damage by UV or N-methyl-N′-nitro-N-nitrosoguanidine did not result in enhanced mutagenesis. We postulate that in tumor cells, endogenous production of oxygen free radicals may be a major factor in promoting instability of microsatellite sequences. This β-lactamase assay may provide a sensitive methodology for the detection and quantitation of mutations associated with the development of cancer.
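
The reading-frame logic this abstract relies on can be illustrated with a short sketch. It assumes only that an insert leaves the downstream codons in frame when the net change in length is a multiple of three. The function names and the 20-bp example insert are hypothetical and are not taken from the published assay.

```python
def frame_offset(net_length_change: int) -> int:
    """Return the reading-frame offset (0, 1, or 2) caused by a net length change in bp."""
    return net_length_change % 3


def is_in_frame(insert_length: int, indel: int = 0) -> bool:
    """True if an insert, plus a later insertion (+) or deletion (-), preserves the frame."""
    return frame_offset(insert_length + indel) == 0


# The 1,614-bp fragment mentioned in the abstract is a multiple of three (3 * 538).
print(is_in_frame(1614))          # True

# Hypothetical 20-bp out-of-frame insert, then two compensatory frameshifts.
print(is_in_frame(20))            # False: frame is disrupted
print(is_in_frame(20, indel=+1))  # True: a +1 frameshift restores the frame
print(is_in_frame(20, indel=-2))  # True: a -2 frameshift also restores it
```

Under these assumptions, only mutations that bring the total inserted length back to a multiple of three would be recovered by selection for antibiotic resistance.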

Relevance:

60.00%

Publisher:

Abstract:

Translesion synthesis at replication-blocking lesions requires the induction of proteins that are controlled by the SOS system in Escherichia coli. Of the proteins identified so far, UmuD′, UmuC, and RecA* were shown to facilitate replication across UV-light-induced lesions, yielding both error-free and mutagenic translesion-synthesis products. Similar to UV lesions, N-2-acetylaminofluorene (AAF), a chemical carcinogen that forms covalent adducts at the C8 position of guanine residues, is a strong replication-blocking lesion. Frameshift mutations are induced efficiently by AAF adducts when located within short repetitive sequences in a two-step mechanism; AAF adducts incorporate a cytosine across from the lesion and then form a primer-template misaligned intermediate that, upon elongation, yields frameshift mutations. Recently, we have shown that although elongation from the nonslipped intermediate depends on functional umuDC+ gene products, elongation from the slipped intermediate is umuDC+-independent but requires another, as yet biochemically uncharacterized, SOS function. We now show that in DNA Polymerase III-proofreading mutant strains (dnaQ49 and mutD5 strains), elongation from the slipped intermediate is highly efficient in the absence of SOS induction—in contrast to elongation from the nonslipped intermediate, which still requires UmuDC functions.

Relevance:

60.00%

Publisher:

Abstract:

We have previously shown that both a centromere (CEN) and a replication origin are necessary for plasmid maintenance in the yeast Yarrowia lipolytica (Vernis et al., 1997). Because of this requirement, only a small number of centromere-proximal replication origins have been isolated from Yarrowia. We used a CEN-based plasmid to obtain noncentromeric origins, and several new fragments, some unique and some repetitive sequences, were isolated. Some of them were analyzed by two-dimensional gel electrophoresis and correspond to actual sites of initiation (ORI) on the chromosome. We observed that a 125-bp fragment is sufficient for a functional ORI on plasmid, and that chromosomal origins moved to ectopic sites on the chromosome continue to act as initiation sites. These Yarrowia origins share an 8-bp motif, which is not essential for origin function on plasmids. The Yarrowia origins do not display any obvious common structural features, like bent DNA or DNA unwinding elements, generally present at or near eukaryotic replication origins. Y. lipolytica origins thus share features of those in the unicellular Saccharomyces cerevisiae and in multicellular eukaryotes: they are discrete and short genetic elements without sequence similarity.

Relevance:

60.00%

Publisher:

Abstract:

For many agronomically important plant genes, only their position on a genetic map is known. In the absence of an efficient transposon tagging system, such genes have to be isolated by map-based cloning. In bread wheat Triticum aestivum, the genome is hexaploid, has a size of 1.6 × 10¹⁰ bp, and contains more than 80% of repetitive sequences. So far, this genome complexity has not allowed chromosome walking and positional cloning. Here, we demonstrate that chromosome walking using bacterial artificial chromosome (BAC) clones is possible in the diploid wheat Triticum monococcum (Aᵐ genome). BAC end sequences were mostly repetitive and could not be used for the first walking step. New probes corresponding to rare low-copy sequences were efficiently identified by low-pass DNA sequencing of the BACs. Two walking steps resulted in a physical contig of 450 kb on chromosome 1AᵐS. Genetic mapping of the probes derived from the BAC contig demonstrated perfect colinearity between the physical map of T. monococcum and the genetic map of bread wheat on chromosome 1AS. The contig genetically spans the Lr10 leaf rust disease resistance locus in bread wheat, with 0.13 centimorgans corresponding to 300 kb between the closest flanking markers. Comparison of the genetic to physical distances has shown large variations within 350 kb of the contig. The physical contig can now be used for the isolation of the orthologous regions in bread wheat. Thus, subgenome chromosome walking in wheat can produce large physical contigs and saturate genomic regions to support positional cloning.
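
The relation between genetic and physical distance quoted in this abstract can be worked through as a simple ratio. The sketch below uses only the two figures given in the abstract (0.13 centimorgans and 300 kb). The function name is hypothetical, and the result is a local average for this interval, not a genome-wide constant.

```python
def kb_per_centimorgan(physical_kb: float, genetic_cm: float) -> float:
    """Average physical distance (kb) per unit of genetic distance (cM) over an interval."""
    if genetic_cm <= 0:
        raise ValueError("genetic distance must be positive")
    return physical_kb / genetic_cm


# Interval between the closest markers flanking Lr10, as stated in the abstract.
ratio = kb_per_centimorgan(physical_kb=300, genetic_cm=0.13)
print(f"{ratio:.0f} kb per cM")  # about 2308 kb per cM
```

Since the abstract reports large variation in this ratio within 350 kb of the contig, a figure computed for one interval should not be extrapolated to neighbouring ones.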

Relevance:

60.00%

Publisher:

Abstract:

Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000–100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.

Relevance:

60.00%

Publisher:

Abstract:

An integrated map of the genome of the tubercle bacillus, Mycobacterium tuberculosis, was constructed by using a twin-pronged approach. Pulsed-field gel electrophoretic analysis enabled cleavage sites for Asn I and Dra I to be positioned on the 4.4-Mb circular chromosome, while, in parallel, clones from two cosmid libraries were ordered into contigs by means of fingerprinting and hybridization mapping. The resultant contig map was readily correlated with the physical map of the genome via the landmarked restriction sites. Over 165 genes and markers were localized on the integrated map, thus enabling comparisons with the leprosy bacillus, Mycobacterium leprae, to be undertaken. Mycobacterial genomes appear to have evolved as mosaic structures since extended segments with conserved gene order and organization are interspersed with different flanking regions. Repetitive sequences and insertion elements are highly abundant in M. tuberculosis, but the distribution of IS6110 is apparently nonrandom.

Relevance:

40.00%

Publisher:

Abstract:

Although integration of viral DNA into host chromosomes occurs regularly in bacteria and animals, there are few reported cases in plants, and these involve insertion at only one or a few sites. Here, we report that pararetrovirus-like sequences have integrated repeatedly into tobacco chromosomes, attaining a copy number of ≈10³. Insertion apparently occurred by illegitimate recombination. From the sequences of 22 independent insertions recovered from a healthy plant, an 8-kilobase genome encoding a previously uncharacterized pararetrovirus that does not contain an integrase function could be assembled. Preferred boundaries of the viral inserts may correspond to recombinogenic gaps in open circular viral DNA. An unusual feature of the integrated viral sequences is a variable tandem repeat cluster, which might reflect defective genomes that preferentially recombine into plant DNA. The recurrent invasion of pararetroviral DNA into tobacco chromosomes demonstrates that viral sequences can contribute significantly to plant genome evolution.

Relevance:

40.00%

Publisher:

Abstract:

Rearrangements between tandem sequence homologies of various lengths are a major source of genomic change and can be deleterious to the organism. These rearrangements can result in either deletion or duplication of genetic material flanked by direct sequence repeats. Molecular genetic analysis of repetitive sequence instability in Escherichia coli has provided several clues to the underlying mechanisms of these rearrangements. We present evidence for three mechanisms of RecA-independent sequence rearrangements: simple replication slippage, sister-chromosome exchange-associated slippage, and single-strand annealing. We discuss the constraints of these mechanisms and contrast their properties with RecA-dependent homologous recombination. Replication plays a critical role in the two slipped misalignment mechanisms, and difficulties in replication appear to trigger rearrangements via all these mechanisms.

Relevance:

30.00%

Publisher:

Abstract:

We have implemented an approach for the detection of DNA alterations in cancer by means of computerized analysis of end-labeled genomic fragments, separated in two dimensions. Analysis of two-dimensional patterns of neuroblastoma tumors, prepared by first digesting DNA with the methylation-sensitive restriction enzyme Not I, yielded a multicopy fragment which was detected in some tumor patterns but not in normal controls. Cloning and sequencing of the fragment, isolated from two-dimensional gels, yielded a sequence with a strong homology to a subtelomeric sequence in chimpanzees and which was previously reported to be undetectable in humans. Fluorescence in situ hybridization indicated the occurrence of this sequence in normal tissue, for the most part in the satellite regions of acrocentric chromosomes. A product containing this sequence was obtained by telomere-anchored PCR using as a primer an oligonucleotide sequence from the cloned fragment. Our data suggest demethylation of cytosines at the cloned Not I site and in neighboring DNA in some tumors, compared with normal tissue, and suggest a greater similarity between human and chimpanzee subtelomeric sequences than was previously reported.

Relevance:

30.00%

Publisher:

Abstract:

We have characterized a family of repetitive DNA elements with homology to the MgPa cellular adhesion operon of Mycoplasma genitalium, a bacterium that has the smallest known genome of any free-living organism. One element, 2272 bp in length and flanked by DNA with no homology to MgPa, was completely sequenced. At least four others were partially sequenced. The complete element is a composite of six regions. Five of these regions show sequence similarity with nonadjacent segments of genes of the MgPa operon. The sixth region, located near the center of the element, is an A+T-rich sequence that has only been found in this repeat family. Open reading frames are present within the five individual regions showing sequence homology to MgPa and the adjacent open reading frame 3 (ORF3) gene. However, termination codons are found between adjacent regions of homology to the MgPa operon and in the A+T-rich sequence. Thus, these repetitive elements do not appear to be directly expressible protein coding sequences. The sequence of one region from five different repetitive elements was compared with the homologous region of the MgPa gene from the type strain G37 and four newly isolated M. genitalium strains. Recombination between repetitive elements of strain G37 and the MgPa operon can explain the majority of polymorphisms within our partial sequences of the MgPa genes of the new isolates. Therefore, we propose that the repetitive elements of M. genitalium provide a reservoir of sequence that contributes to antigenic variation in proteins of the MgPa cellular adhesion operon.

Relevance:

30.00%

Publisher:

Abstract:

Eukaryotic genomes contain tracts of DNA in which a single base or a small number of bases are repeated (microsatellites). Mutations in the yeast DNA mismatch repair genes MSH2, PMS1, and MLH1 increase the frequency of mutations for normal DNA sequences and destabilize microsatellites. Mutations of human homologs of MSH2, PMS1, and MLH1 also cause microsatellite instability and result in certain types of cancer. We find that a mutation in the yeast gene MSH3 that does not substantially affect the rate of spontaneous mutations at several loci increases microsatellite instability about 40-fold, preferentially causing deletions. We suggest that MSH3 has different substrate specificities than the other mismatch repair proteins and that the human MSH3 homolog (MRP1) may be mutated in some tumors with microsatellite instability.
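
The definition of a microsatellite given in this abstract (a single base or a small number of bases repeated in tandem) can be made concrete with a minimal sketch. The thresholds (repeat unit of at most 4 bases, at least 5 tandem copies), the function name, and the example sequence are all assumptions chosen for illustration. They do not come from the abstract and are not a standard definition.

```python
import re


def find_microsatellites(seq: str, max_unit: int = 4, min_repeats: int = 5):
    """Return (start, unit, copies) for each tandem repeat of a short unit.

    The lazy quantifier makes the regex try the shortest repeat unit first,
    so a run of eight A's is reported as unit 'A', not 'AA' or 'AAAA'.
    """
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        hits.append((match.start(), unit, copies))
    return hits


# Hypothetical sequence with a (CA)6 dinucleotide tract and an (A)8 homopolymer run.
example = "GGT" + "CA" * 6 + "TTG" + "A" * 8 + "C"
print(find_microsatellites(example))  # [(3, 'CA', 6), (18, 'A', 8)]
```

A scan like this only locates candidate tracts. Measuring instability, as in the abstract, would require comparing tract lengths between strains or samples.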

Relevance:

30.00%

Publisher:

Abstract:

The ability to carry out high-resolution genetic mapping at high throughput in the mouse is a critical rate-limiting step in the generation of genetically anchored contigs in physical mapping projects and the mapping of genetic loci for complex traits. To address this need, we have developed an efficient, high-resolution, large-scale genome mapping system. This system is based on the identification of polymorphic DNA sites between mouse strains by using interspersed repetitive sequence (IRS) PCR. Individual cloned IRS PCR products are hybridized to a DNA array of IRS PCR products derived from the DNA of individual mice segregating DNA sequences from the two parent strains. Since gel electrophoresis is not required, large numbers of samples can be genotyped in parallel. By using this approach, we have mapped > 450 polymorphic probes with filters containing the DNA of up to 517 backcross mice, potentially allowing resolution of 0.14 centimorgan. This approach also carries the potential for a high degree of efficiency in the integration of physical and genetic maps, since pooled DNAs representing libraries of yeast artificial chromosomes or other physical representations of the mouse genome can be addressed by hybridization of filter representations of the IRS PCR products of such libraries.

Relevance:

20.00%

Publisher:

Abstract:

The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50°C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83–92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.