993 resultados para Dispersed repetitive sequence family
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
The nucleotide sequences of the 5S rRNA multigene family and their distribution across the karyotypes in 2 species of Gymnotiformes, genus Gymnotus (G. sylvius and G. inaequilabiatus) were investigated by means of fluorescence in situ hybridization (FISH). The results showed the existence of 2 distinct classes of 5S rDNA sequences in both species: class I and class II. A high conservative pattern of the codifying region of the 5S rRNA gene was identified, contrasting with significant alterations detected in the nontranscribed spacer (NTS). The presence of TATA-like sequences along the NTS of both species was an expected occurrence, since such sequences have been associated with the regulation of the gene expression. FISH using 5S rDNA class I and class II probes revealed that both gene classes were collocated in the same chromosome pair in the genome of G. sylvius, while in that of G. inaequilabiatus, class II appeared more disperse than class I. Copyright (C) 2012 S. Karger AG, Basel
Resumo:
The creation, preservation, and degeneration of cis-regulatory elements controlling developmental gene expression are fundamental genome-level evolutionary processes about which little is known. In this study, critical differences in cis-regulatory elements controlling the expression of the sea urchin aboral ectoderm-specific spec genes were identified and explored. In genomes of species within the Strongylocentrotidae family, multiple copies of a repetitive sequence element termed RSR were present, but RSRs were not detected in genomes of species outside Strongylocentrotidae. RSRs are invariably associated with spec genes, and in Strongylocentrotus purpuratus, the spec2a RSR functioned as a transcriptional enhancer displaying greater activity than RSRs from the spec1 or spec2c paralogs. Single base-pair differences at two cis-regulatory elements within the spec2a RSR greatly increased the binding affinities of four transcription factors: SpCCAAT-binding factor at one element and SpOtx, SpGoosecoid, and SpGATA-E at another. The cis-regulatory elements to which SpCCAAT-binding factor, SpOtx, SpGoosecoid, and SpGATA-E bound were recent evolutionary acquisitions that could act either to activate or repress transcription, depending on the cell type. These elements were found in the spec2a RSR ortholog in Strongylocentrotus pallidus but not in the RSR orthologs of Strongylocentrotus droebachiensis or Hemicentrotus pulcherrimus. These results indicate that spec genes exhibit a dynamic pattern of cis-regulatory element evolution while stabilizing selection preserves their aboral ectoderm expression domain. ^
Resumo:
Synthetic peptides containing a repetitive hexapeptide sequence (Ala-His-His-Ala-Ala-Asp) of malarial histidine-rich protein II were evaluated for binding with haem in vitro. The pattern of haem binding suggested that each repeat unit of this sequence provides one binding site for haem. Chloroquine inhibited the haem-peptide complex formation with preferential formation of a haem chloroquine complex. In vitro studies on haem polymerisation showed that none of the peptides could initiate haemozoin formation. However, they could inhibit haemozoin formation promoted by a malarial parasite extract, possibly by competitively binding free haem. These results indicate this hexapeptide sequence represents the haem binding site of the malarial histidine-rich protein and possibly the site of nucleation for haem polymerisation.
Resumo:
Although integration of viral DNA into host chromosomes occurs regularly in bacteria and animals, there are few reported cases in plants, and these involve insertion at only one or a few sites. Here, we report that pararetrovirus-like sequences have integrated repeatedly into tobacco chromosomes, attaining a copy number of ≈103. Insertion apparently occurred by illegitimate recombination. From the sequences of 22 independent insertions recovered from a healthy plant, an 8-kilobase genome encoding a previously uncharacterized pararetrovirus that does not contain an integrase function could be assembled. Preferred boundaries of the viral inserts may correspond to recombinogenic gaps in open circular viral DNA. An unusual feature of the integrated viral sequences is a variable tandem repeat cluster, which might reflect defective genomes that preferentially recombine into plant DNA. The recurrent invasion of pararetroviral DNA into tobacco chromosomes demonstrates that viral sequences can contribute significantly to plant genome evolution.
Resumo:
We have implemented an approach for the detection of DNA alterations in cancer by means of computerized analysis of end-labeled genomic fragments, separated in two dimensions. Analysis of two-dimensional patterns of neuroblastoma tumors, prepared by first digesting DNA with the methylation-sensitive restriction enzyme Not I, yielded a multicopy fragment which was detected in some tumor patterns but not in normal controls. Cloning and sequencing of the fragment, isolated from two-dimensional gels, yielded a sequence with a strong homology to a subtelomeric sequence in chimpanzees and which was previously reported to be undetectable in humans. Fluorescence in situ hybridization indicated the occurrence of this sequence in normal tissue, for the most part in the satellite regions of acrocentric chromosomes. A product containing this sequence was obtained by telomere-anchored PCR using as a primer an oligonucleotide sequence from the cloned fragment. Our data suggest demethylation of cytosines at the cloned Not I site and in neighboring DNA in some tumors, compared with normal tissue, and suggest a greater similarity between human and chimpanzee subtelomeric sequences than was previously reported.
Resumo:
The ability to carry out high-resolution genetic mapping at high throughput in the mouse is a critical rate-limiting step in the generation of genetically anchored contigs in physical mapping projects and the mapping of genetic loci for complex traits. To address this need, we have developed an efficient, high-resolution, large-scale genome mapping system. This system is based on the identification of polymorphic DNA sites between mouse strains by using interspersed repetitive sequence (IRS) PCR. Individual cloned IRS PCR products are hybridized to a DNA array of IRS PCR products derived from the DNA of individual mice segregating DNA sequences from the two parent strains. Since gel electrophoresis is not required, large numbers of samples can be genotyped in parallel. By using this approach, we have mapped > 450 polymorphic probes with filters containing the DNA of up to 517 backcross mice, potentially allowing resolution of 0.14 centimorgan. This approach also carries the potential for a high degree of efficiency in the integration of physical and genetic maps, since pooled DNAs representing libraries of yeast artificial chromosomes or other physical representations of the mouse genome can be addressed by hybridization of filter representations of the IRS PCR products of such libraries.
Resumo:
Tandemly repeated DNA sequences are found in the genome of higher eukaryotes, and have also been demonstrated in Trypanosoma cruzi. Repeated DNA sequences are potentially useful for the diagnostic detection of T. cruzi (A. Gonzales et al., 1984, Proc. Natl. Acad. Sci. USA, 81: 3356-3360). We have isoleted two clones from a genomic library of T. cruzi (Y strain) that contain, in one clone a family of at least seven copies of a repetitive sequence of approximately 600 base pairs, and in the other an independent copy of the same sequence. One copy of the repetition (HSP) and the independent clone (HCR) were sequenced by the Sanger procedure (Fig.). This sequence hybridized to four strains of T. cruzi tested and did not hybridize to eleven species of trypanosotids from five different Genera, being a good candidate for diagnostic assays. GenBank accession numbers: HSP#m31919, HCR#31920.
Resumo:
We report the molecular characterization of a novel reiterated family of transcribed oligo(A)-terminated, interspersed DNA elements in the genome of Trypanosoma cruzi. Steady-state level of transcripts of this sequence family appeared to be developmentally regulated, since only in the replicative forms the parasite showed expression of related sequences with a major band around 3 kb. The presence of frame shifts or premature stop codons predicts that transcripts are not translated. The sequence family also contains truncated forms of retrotransposons elements that may become potential hot spots for retroelement insertion. Sequences homologous to this family are interspersed at many chromosomes including the subtelomeric regions.
Resumo:
Immune evasion by Plasmodium falciparum is favored by extensive allelic diversity of surface antigens. Some of them, most notably the vaccine-candidate merozoite surface protein (MSP)-1, exhibit a poorly understood pattern of allelic dimorphism, in which all observed alleles group into two highly diverged allelic families with few or no inter-family recombinants. Here we describe contrasting levels and patterns of sequence diversity in genes encoding three MSP-1-associated surface antigens of P. falciparum, ranging from an ancient allelic dimorphism in the Msp-6 gene to a near lack of allelic divergence in Msp-9 to a more classical multi-allele polymorphism in Msp-7 Other members of the Msp-7 gene family exhibit very little polymorphism in non-repetitive regions. A comparison of P. falciparum Msp-6 sequences to an orthologous sequence from P. reichenowi provided evidence for distinct evolutionary histories of the 5` and 3` segments of the dimorphic region in PfMsp-6, consistent with one dimorphic lineage having arisen from recombination between now-extinct ancestral alleles. In addition. we uncovered two surprising patterns of evolution in repetitive sequence. Firsts in Msp-6, large deletions are associated with (nearly) identical sequence motifs at their borders. Second, a comparison of PfMsp-9 with the P. reichenowi ortholog indicated retention of a significant inter-unit diversity within an 18-base pair repeat within the coding region of P. falciparum, but homogenization in P. reichenowi. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000–100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.
Resumo:
We describe here two new transposable elements, CemaT4 and CemaT5, that were identified within the sequenced genome of Caenorhabditis elegans using homology based searches. Five variants of CemaT4 were found, all non-autonomous and sharing 26 bp inverted terminal repeats (ITRs) and segments (152-367 bp) of sequence with similarity to the CemaT1 transposon of C. elegans. Sixteen copies of a short, 30 bp repetitive sequence, comprised entirely of an inverted repeat of the first 15 bp of CemaT4's ITR, were also found, each flanked by TA dinucleotide duplications, which are hallmarks of target site duplications of mariner-Tc transposon transpositions. The CemaT5 transposable element had no similarity to maT elements, except for sharing identical ITR sequences with CemaT3. We provide evidence that CemaT5 and CemaT3 are capable of excising from the C. elegans genome, despite neither transposon being capable of encoding a functional transposase enzyme. Presumably, these two transposons are cross-mobilised by an autonomous transposon that recognises their shared ITRs. The excisions of these and other non-autonomous elements may provide opportunities for abortive gap repair to create internal deletions and/or insert novel sequence within these transposons. The influence of non-autonomous element mobility and structural diversity on genome variation is discussed.
Resumo:
HAMAP (High-quality Automated and Manual Annotation of Proteins-available at http://hamap.expasy.org/) is a system for the automatic classification and annotation of protein sequences. HAMAP provides annotation of the same quality and detail as UniProtKB/Swiss-Prot, using manually curated profiles for protein sequence family classification and expert curated rules for functional annotation of family members. HAMAP data and tools are made available through our website and as part of the UniRule pipeline of UniProt, providing annotation for millions of unreviewed sequences of UniProtKB/TrEMBL. Here we report on the growth of HAMAP and updates to the HAMAP system since our last report in the NAR Database Issue of 2013. We continue to augment HAMAP with new family profiles and annotation rules as new protein families are characterized and annotated in UniProtKB/Swiss-Prot; the latest version of HAMAP (as of 3 September 2014) contains 1983 family classification profiles and 1998 annotation rules (up from 1780 and 1720). We demonstrate how the complex logic of HAMAP rules allows for precise annotation of individual functional variants within large homologous protein families. We also describe improvements to our web-based tool HAMAP-Scan which simplify the classification and annotation of sequences, and the incorporation of an improved sequence-profile search algorithm.
Resumo:
Sugar beet (Beta vulgaris ssp. vulgaris) is an important crop of temperate climates which provides nearly 30% of the world's annual sugar production and is a source for bioethanol and animal feed. The species belongs to the order of Caryophylalles, is diploid with 2n = 18 chromosomes, has an estimated genome size of 714-758 megabases and shares an ancient genome triplication with other eudicot plants. Leafy beets have been cultivated since Roman times, but sugar beet is one of the most recently domesticated crops. It arose in the late eighteenth century when lines accumulating sugar in the storage root were selected from crosses made with chard and fodder beet. Here we present a reference genome sequence for sugar beet as the first non-rosid, non-asterid eudicot genome, advancing comparative genomics and phylogenetic reconstructions. The genome sequence comprises 567 megabases, of which 85% could be assigned to chromosomes. The assembly covers a large proportion of the repetitive sequence content that was estimated to be 63%. We predicted 27,421 protein-coding genes supported by transcript data and annotated them on the basis of sequence homology. Phylogenetic analyses provided evidence for the separation of Caryophyllales before the split of asterids and rosids, and revealed lineage-specific gene family expansions and losses. We sequenced spinach (Spinacia oleracea), another Caryophyllales species, and validated features that separate this clade from rosids and asterids. Intraspecific genomic variation was analysed based on the genome sequences of sea beet (Beta vulgaris ssp. maritima; progenitor of all beet crops) and four additional sugar beet accessions. We identified seven million variant positions in the reference genome, and also large regions of low variability, indicating artificial selection. The sugar beet genome sequence enables the identification of genes affecting agronomically relevant traits, supports molecular breeding and maximizes the plant's potential in energy biotechnology.