916 resultados para Genome-specific Sequence
Resumo:
Importin-alpha is the nuclear import receptor that recognizes cargo proteins carrying conventional basic monopartite and bipartite nuclear localization sequences (NLSs) and facilitates their transport into the nucleus. Bipartite NLSs contain two clusters of basic residues, connected by linkers of variable lengths. To determine the structural basis of the recognition of diverse bipartite NLSs by mammalian importin-alpha, we co-crystallized a non-autoinhibited mouse receptor protein with peptides corresponding to the NLSs from human retinoblastoma protein and Xenopus laevis phosphoprotein N1N2, containing diverse sequences and lengths of the linker. We show that the basic clusters interact analogously in both NLSs, but the linker sequences adopt different conformations, whereas both make specific contacts with the receptor. The available data allow us to draw general conclusions about the specificity of NLS binding by importin-alpha and facilitate an improved definition of the consensus sequence of a conventional basic/bipartite NLS (KRX10-12KRRK) that can be used to identify novel nuclear proteins.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Cultivated peanut (Arachis hypogaea) is an important crop, widely grown in tropical and subtropical regions of the world. It is highly susceptible to several biotic and abiotic stresses to which wild species are resistant. As a first step towards the introgression of these resistance genes into cultivated peanut, a linkage map based on microsatellite markers was constructed, using an F-2 population obtained from a cross between two diploid wild species with AA genome (A. duranensis and A. stenosperma). A total of 271 new microsatellite markers were developed in the present study from SSR-enriched genomic libraries, expressed sequence tags (ESTs), and by data-mining sequences available in GenBank. of these, 66 were polymorphic for cultivated peanut. The 271 new markers plus another 162 published for peanut were screened against both progenitors and 204 of these (47.1%) were polymorphic, with 170 codominant and 34 dominant markers. The 80 codominant markers segregating 1:2:1 (P < 0.05) were initially used to establish the linkage groups. Distorted and dominant markers were subsequently included in the map. The resulting linkage map consists of 11 linkage groups covering 1,230.89 cM of total map distance, with an average distance of 7.24 cM between markers. This is the first microsatellite-based map published for Arachis, and the first map based on sequences that are all currently publicly available. Because most markers used were derived from ESTs and genomic libraries made using methylation-sensitive restriction enzymes, about one-third of the mapped markers are genic. Linkage group ordering is being validated in other mapping populations, with the aim of constructing a transferable reference map for Arachis.
Resumo:
Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTEs were assembled into 81,429 contigs. of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTEs sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTEs coincided with DNA regions predicted as encoding exons by GENSCAN.
Resumo:
Over 40,000 sugarcane (Saccharum officinarum) consensus sequences assembled from 237,954 expressed sequence tags were compared with the protein and DNA sequences from other angiosperms, including the genomes of Arabidopsis and rice (Oryza sativa). Approximately two-thirds of the sugarcane transcriptome have similar sequences in Arabidopsis. These sequences may represent a core set of proteins or protein domains that are conserved among monocots and eudicots and probably encode for essential angiosperm. functions. The remaining sequences represent putative monocot-specific genetic material, one-half of which were found only in sugarcane. These monocot-specific cDNAs represent either novelties or, in many cases, fast-evolving sequences that diverged substantially from their eudicot homologs. The wide comparative genome analysis presented here provides information on the evolutionary changes that underlie the divergence of monocots and eudicots. Our comparative analysis also led to the identification of several not yet annotated putative genes and possible gene loss events in Arabidopsis.
Resumo:
The data mining of Eucalyptus ESTs genome finds four clusters (EGCEST2257E11.g, EGBGRT3213F11.g, and EGCCFB1223H11.g) from highly conservative 14-3-3 protein family which modulates a wide variety of cellular processes. Multiple alignments were built from twenty four sequences of 14-3-3 proteins searched into the GenBank databases and into the four pools of Eucalyptus genome programs. The alignment has shown two regions highly conservative on the sequences corresponding to the motifs of protein phosphorylation and nine highly conservative regions on the sequence corresponding to the linkage regions of alpha helices structure based on three dimensional of dimer functional structure. The differences of amino acid into the structural and functional domains of 14-3-3 plant protein were identified and can explain the functional diversity of different isoforms. The phylogenic protein trees were built by the maximum parsimony and neighborjoining procedures of Clustal X alignments and PAUP software for phylogenic analysis.
Resumo:
Studies of telomere structure and maintenance in trypanosomatids have provided insights into the evolutionary origin and conservation of some telomeric components shared by trypanosomes and vertebrates. For example, trypanosomatid telomeres are maintained by telomerase and consist of the canonical TTAGGG repeats, which in Trypanosoma brucei can form telomeric loops (t-loops). However, the telomeric chromatin of trypanosomatids is composed of organ ism-specific proteins and other proteins that share little sequence similarity with their vertebrate counterparts. Because telomere maintenance mechanisms are essential for genome stability, we propose that the particular features shown by the trypanosome telomeric chromatin hold the key for the design of antiparasitic drugs.
Resumo:
O objetivo deste trabalho foi caracterizar biológica e molecularmente três isolados de Sugarcane mosaic virus (SCMV) de lavouras de milho, analisá-los filogeneticamente e discriminar polimorfismos do genoma. Plantas com sintomas de mosaico e nanismo foram coletadas em lavouras de milho, no Estado de São Paulo e no Município de Rio Verde, GO, e seus extratos foliares foram inoculados em plantas indicadoras e submetidos à análise sorológica com antissoros contra o SCMV, contra o Maize dwarf mosaic virus (MDMV) e contra o Johnsongrass mosaic virus (JGMV). Mudas de sorgo 'Rio' e 'TX 2786' apresentaram sintomas de mosaico após a inoculação dos três isolados, e o DAS-ELISA confirmou a infecção pelo SCMV. O RNA total foi extraído e usado para amplificação por transcriptase reversa seguida de reação em cadeia de polimerase (RT-PCR). Fragmentos específicos foram amplificados, submetidos à análise por polimorfismo de comprimento de fragmento de restrição (RFLP) e sequenciados. Foi possível discriminar os genótipos de SCMV isolados de milho de outros isolados brasileiros do vírus. Alinhamentos múltiplos e análises dos perfis filogenéticos corroboram esses dados e mostram diversidade nas sequências de nucleotídeos que codificam para a proteína capsidial, o que explica o agrupamento separado desses isolados e sugere sua classificação como estirpes distintas, em lugar de simples isolados geográficos.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Phenotypically discordant monozygotic twins offer the possibility of gene discovery through delineation of molecular abnormalities in one member of the twin pair. One proposed mechanism of discordance is postzygotically occurring genomic alterations resulting from mitotic recombination and other somatic changes. Detection of altered genomic fragments can reveal candidate gene loci that can be verified through additional analyses. We investigated this hypothesis using array comparative genomic hybridization; the 50K and 250K Affymetrix GeneChip (R) SNP arrays and an Illumina custom array consisting of 1,536 SNPs, to scan for genomic alterations in a sample of monozygotic twin pairs with discordant cleft lip and/or palate phenotypes. Paired analysis for deletions, amplifications and loss of heterozygosity, along with sequence verification of SNPs with discordant genotype calls did not reveal any genomic discordance between twin pairs in lymphocyte DNA samples. Our results demonstrate that postzygotic genomic alterations are not a common cause of monozygotic twin discordance for isolated cleft lip and/or palate. However, rare or balanced genomic alterations, tissue-specific events and small aberrations beyond the detection level of our experimental approach cannot be ruled out. The stability of genomes we observed in our study samples also suggests that detection of discordant events in other monozygotic twin pairs would be remarkable and of potential disease significance.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
A substantial fraction of the eukaryotic genome consists of repetitive DNA sequences that include satellites, minisatellites, microsatellites, and transposable elements. Although extensively studied for the past three decades, the molecular forces that generate, propagate and maintain repetitive DNAs in the genomes are still discussed. To further understand the dynamics and the mechanisms of evolution of repetitive DNAs in vertebrate genome, we searched for repetitive sequences in the genome of the fish species Hoplias malabaricus. A satellite sequence, named 5SHindIII-DNA, which has a conspicuous similarity with 5S rRNA genes and spacers was identified. FISH experiments showed that the 5S rRNA bona fide gene repeats were clustered in the interstitial position of two chromosome pairs of H. malabaricus, while the satellite 5SHindIII-DNA sequences were clustered in the centromeric position in nine chromosome pairs of the species. The presence of the 5SHindIII-DNA sequences in the centromeres of several chromosomes indicates that this satellite family probably escaped from the selective pressure that maintains the structure and organization of the 5S rDNA repeats and become disperse into the genome. Although it is not feasible to explain how this sequence has been maintained in the centromeric regions, it is possible to hypothesize that it may be involved in some structural or functional role of the centromere organization.
Resumo:
In higher eukaryotes, the 5S ribosomal DNA (5S rDNA) is organized in tandem arrays with repeat units composed of a coding region and a non-transcribed spacer sequence (NTS). These tandem arrays can be found on either one or more chromosome pairs. 5S rDNA copies from the tilapia fish. Oreochromis niloticus, were cloned and the nucleotide sequences of the coding region and of the non-transcribed spacer were deter-mined. Moreover, the genomic organization of the 5S rDNA tandem repeats was investigated by fluorescence in situ hybridization (FISH) and Southern blot hybridization. Two 5S rDNA classes, one consisting of 1.4-kb repeats and another one with 0.5-kb repeats were identified and designated 5S rDNA type I and type II, respectively, An inverted 5S rRNA gene and a 5S rRNA putative pseudogene were also identified inside the tandem repeats of 5S rDNA type I. FISH permitted the visualization of the 5S rRNA genes at three chromosome loci, one of them consisting of arrays of the 5S rDNA type I, and the two others corresponding to arrays of the 5S rDNA type II. The two classes of the 5S rDNA. The presence of pseudogenes, and the inverted genes observed in the O. niloticus genome might be a consequence of the intense dynamics of the evolution of these tandem repeat elements. Copyright (C) 2002 S. Karger AG, Basel.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)