84 resultados para sequence database
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
The availaibilty of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N tabacum, N tomentosiformis, Solanum bulbocastanum, S lycopersicum and S tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions In general, the number of cpSSRs in cpDNA ranged from 161 in S tuberosum to 226 in N tabacum, and the number of intergenic cpSSRs was higher than genic cpSSRs The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats Multiple alignments of all cpSSRs sequence from Solanaceae species made the identification of nucleotide variability possible and the phylogeny was estimated by maximum parsimony Our study showed that the plastome database can be exploited for phylogenetic analyses and biotechnological approaches
Resumo:
Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/.S3T source code and datasets can also be downloaded from the aforementioned website.
Resumo:
Context. A sample of 27 sources, cataloged as pre-main sequence stars by the Pico dos Dias Survey (PDS), is analyzed to investigate a possible contamination by post-AGB stars. The far-infrared excess due to dust present in the circumstellar envelope is typical of both categories: young stars and objects that have already left the main sequence and are suffering severe mass loss. Aims. The two known post-AGB stars in our sample inspired us to seek for other very likely or possible post-AGB objects among PDS sources previously suggested to be Herbig Ae/Be stars, by revisiting the observational database of this sample. Methods. In a comparative study with well known post-AGBs, several characteristics were evaluated: (i) parameters related to the circumstellar emission; (ii) spatial distribution to verify the background contribution from dark clouds; (iii) spectral features; and (iv) optical and infrared colors. Results. These characteristics suggest that seven objects of the studied sample are very likely post-AGBs, five are possible post-AGBs, eight are unlikely post-AGBs, and the nature of seven objects remains unclear.
Resumo:
Staphylococcus aureus is one of the most important infectious mastitis causative agents in small ruminants. In order to know the distribution of Staph. aureus strains associated with infectious mastitis in flocks of sheep in the northeast of Brazil and establish whether these clones are related to the strains distributed internationally, this study analysed the genetic diversity of Staph. aureus isolates from cases of clinical and subclinical mastitis in ewes by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST). In this research, 135 ewes with mastitis from 31 sheep flocks distributed in 15 districts were examined. Staph. aureus was isolated from sheep milk in 9 (29%) out of 31 herds located in 47% of the districts surveyed. MLST analysis allowed the identification of four STs (ST750, ST1728, ST1729 and ST1730). The last three with their respective novel alleles (g/p-220; pta-182 and yqil-180) were recently reported in the Staph. aureus MLST database (http://www.mlst.net). Each novel allele showed only a nucleotide different from those already described. The occurrence of CC133 (ST750 and ST1729) in this study is in agreement with other reports that only a few clones of Staph. aureus seem to be responsible for most cases of mastitis in dairy farms and that some of these clones may have broad geographic distribution. However, the prevalence of CC5 (ST1728 and ST1730)-an important group related to cases of colonization or infection in humans-differs from previous studies by its widespread occurrence and may suggest human contamination followed by selective pressures of the allelic diversifications presented for these STs.
Resumo:
Enteropathogenic Escherichia coli (EPEC) infections are a leading cause of infantile diarrhea in developing nations. Multilocus sequence typing (MLST) characterizes bacterial strains based on the sequences of internal fragments in housekeeping genes. Little is known about strains of EPEC analyzed by MLST from Brazil. In this study, a diverse collection of 29 EPEC strains isolated from patients with diarrhea, admitted to the University Hospital of Ribeirao Preto, was characterized by MLST. Strain analysis demonstrated 22 different sequence types (STs), of which almost half (48%) were new, indicating a high genotype diversity. The 22 STs were divided by eBURST into 12 clonal complexes. It was not possible to correlate typical and atypical EPEC with other strains in the MLST database. This is the first study that analyzed EPEC strains from South America that are included in the E. coli MLST database. Nine (31%) out of 29 strains are part of the CC10 clonal complex, the major clonal complex in the database, which comprises 174 strains and 86 different STs, suggesting that these strains might be the most important intestinal pathogenic E. coli worldwide. Genetic relationships between typical and atypical EPEC, enterohemorrhagic E. coli, and enteroaggregative E. coli strains were not established by MLST.
Resumo:
Hepatitis C virus (HCV) infection frequently persists despite substantial virus-specific immune responses and the combination of pegylated interferon (INF)-alpha and ribavirin therapy. Major histocompatibility complex class I restricted CD8+ T cells are responsible for the control of viraemia in HCV infection, and several studies suggest protection against viral infection associated with specific HLAs. The reason for low rates of sustained viral response (SVR) in HCV patients remains unknown. Escape mutations in response to cytotoxic T lymphocyte are widely described; however, its influence in the treatment outcome is ill understood. Here, we investigate the differences in CD8 epitopes frequencies from the Los Alamos database between groups of patients that showed distinct response to pegylated alpha-INF with ribavirin therapy and test evidence of natural selection on the virus in those who failed treatment, using five maximum likelihood evolutionary models from PAML package. The group of sustained virological responders showed three epitopes with frequencies higher than Non-responders group, all had statistical support, and we observed evidence of selection pressure in the last group. No escape mutation was observed. Interestingly, the epitope VLSDFKTWL was 100% conserved in SVR group. These results suggest that the response to treatment can be explained by the increase in immune pressure, induced by interferon therapy, and the presence of those epitopes may represent an important factor in determining the outcome of therapy.
Resumo:
OBJECTIVE: To determine the timing and sequence of eruption of primary teeth in children with complete bilateral cleft lip and palate. MATERIAL AND METHODS: This cross-sectional study was conducted at the Hospital for Rehabilitation of Craniofacial Anomalies of the University of São Paulo, Bauru, SP, Brazil, with a sample of 395 children (128 girls and 267 boys) aged 0 to 48 months, with complete bilateral cleft lip and palate. RESULTS: Children with complete bilateral clefts presented a higher mean age of eruption of all primary teeth for both arches and both genders, compared to children without clefts. This difference was statistically signifcant for all teeth, except for the maxillary first molar. Mean age of eruption of most teeth was lower for girls compared to boys. The greatest delay was found for the maxillary lateral incisor, which was the eighth tooth of children with clefts of both genders. Analyzing by gender, the maxillary lateral incisor was the eighth tooth to erupt in girls and the last in boys. CONCLUSION: The results suggest an interference of the cleft on the timing and sequence of eruption of primary teeth.
Resumo:
At present a complete mtDNA sequence has been reported for only two hymenopterans, the Old World honey bee, Apis mellifera and the sawfly Perga condei. Among the bee group, the tribe Meliponini (stingless bees) has some distinction due to its Pantropical distribution, great number of species and large importance as main pollinators in several ecosystems, including the Brazilian rain forest. However few molecular studies have been conducted on this group of bees and few sequence data from mitochondrial genomes have been described. In this project, we PCR amplified and sequenced 78% of the mitochondrial genome of the stingless bee Melipona bicolor (Apidae, Meliponini). The sequenced region contains all of the 13 mitochondrial protein-coding genes, 18 of 22 tRNA genes, and both rRNA genes (one of them was partially sequenced). We also report the genome organization (gene content and order), gene translation, genetic code, and other molecular features, such as base frequencies, codon usage, gene initiation and termination. We compare these characteristics of M. bicolor to those of the mitochondrial genome of A. mellifera and other insects. A highly biased A+T content is a typical characteristic of the A. mellifera mitochondrial genome and it was even more extreme in that of M. bicolor. Length and compositional differences between M. bicolor and A. mellifera genes were detected and the gene order was compared. Eleven tRNA gene translocations were observed between these two species. This latter finding was surprising, considering the taxonomic proximity of these two bee tribes. The tRNA Lys gene translocation was investigated within Meliponini and showed high conservation across the Pantropical range of the tribe.
Resumo:
Intergenic spacers of chloroplast DNA (cpDNA) are very useful in phylogenetic and population genetic studies of plant species, to study their potential integration in phylogenetic analysis. The non-coding trnE-trnT intergenic spacer of cpDNA was analyzed to assess the nucleotide sequence polymorphism of 16 Solanaceae species and to estimate its ability to contribute to the resolution of phylogenetic studies of this group. Multiple alignments of DNA sequences of trnE-trnT intergenic spacer made the identification of nucleotide variability in this region possible and the phylogeny was estimated by maximum parsimony and rooted with Convolvulaceae Ipomoea batalas, the most closely related family. Besides, this intergenic spacer was tested for the phylogenetic ability to differentiate taxonomic levels. For this purpose, species from four other families were analyzed and compared with Solanaceae species. Results confirmed polymorphism in the trnE-trnT region at different taxonomic levels.
Resumo:
Macro- and microarrays are well-established technologies to determine gene functions through repeated measurements of transcript abundance. We constructed a chicken skeletal muscle-associated array based on a muscle-specific EST database, which was used to generate a tissue expression dataset of similar to 4500 chicken genes across 5 adult tissues (skeletal muscle, heart, liver, brain, and skin). Only a small number of ESTs were sufficiently well characterized by BLAST searches to determine their probable cellular functions. Evidence of a particular tissue-characteristic expression can be considered an indication that the transcript is likely to be functionally significant. The skeletal muscle macroarray platform was first used to search for evidence of tissue-specific expression, focusing on the biological function of genes/transcripts, since gene expression profiles generated across tissues were found to be reliable and consistent. Hierarchical clustering analysis revealed consistent clustering among genes assigned to 'developmental growth', such as the ontology genes and germ layers. Accuracy of the expression data was supported by comparing information from known transcripts and tissue from which the transcript was derived with macroarray data. Hybridization assays resulted in consistent tissue expression profile, which will be useful to dissect tissue-regulatory networks and to predict functions of novel genes identified after extensive sequencing of the genomes of model organisms. Screening our skeletal-muscle platform using 5 chicken adult tissues allowed us identifying 43 'tissue-specific' transcripts, and 112 co-expressed uncharacterized transcripts with 62 putative motifs. This platform also represents an important tool for functional investigation of novel genes; to determine expression pattern according to developmental stages; to evaluate differences in muscular growth potential between chicken lines, and to identify tissue-specific genes.
Resumo:
Background: High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results: We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions: This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity >= 2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus.
Resumo:
In Brazil, human T-lymphotropic virus type 2 (HTLV-2) is endemic in Amerindians and epidemic in intravenous drug users (IDUs). The long terminal repeat (LTR) is the most divergent genomic region of HTLV-2, therefore useful to characterize subtypes. Nucleotide sequence and restriction fragment length polymorphism (RFLP) analysis of LTR genomic segments of fourteen HTLV-2 strains isolated from HIV-infected patients of Londrina, Southern Brazil, were carried out. Molecular analysis disclosed that all HTLV-2 strains belonged to 2a subtype, and RFLP detected the presence of the a4, a5, and a6 subgroups according to Switzer's nomenclature. RFLP correlated with nucleotide sequence, and phylogenetic analysis clustered HTLV-2 sequences of IDUs into subgroups a5 and a6. HTLV-2 sequences from individuals of sexual risk factor clustered into the a4 subgroup. These results extend the knowledge of the genetic diversity of HTLV-2 circulating in Brazil and provide insights into HTLV-2 transmission and virus movement in this geographic area.
Resumo:
In this study, 222 genome survey sequences were generated for Trypanosoma rangeli strain P07 isolated from an opossum (Didelphis albiventris) in Minas Gerais State, Brazil. T. rangeli sequences were compared by BLASTX (Basic Local Alignment Search Tool X) analysis with the assembled contigs of Leishmania braziliensis, Leishmania infantum, Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi. Results revealed that 82% (182/222) of the sequences were associated with predicted proteins described, whereas 18% (40/222) of the sequences did not show significant identity with sequences deposited in databases, suggesting that they may represent T. rangeli-specific sequences. Among the 182 predicted sequences, 179 (80.6%) had the highest similarity with T. cruzi, 2 (0.9%) with T. brucei, and 1 (0.5%) with L. braziliensis. Computer analysis permitted the identification of members of various gene families described for trypanosomatids in the genome of T. rangeli, such as trans-sialidases, mucin-associated surface proteins, and major surface proteases (MSP or gp63). This is the first report identifying sequences of the MSP family in T. rangeli. Multiple sequence alignments showed that the predicted MSP of T. rangeli presented the typical characteristics of metalloproteases, such as the presence of the HEXXH motif, which corresponds to a region previously associated with the catalytic site of the enzyme, and various cysteine and proline residues, which are conserved among MSPs of different trypanosomatid species. Reverse transcriptase-polymerase chain reaction analysis revealed the presence of MSP transcripts in epimastigote forms of T. rangeli.
Resumo:
Melanoma is a highly aggressive and therapy resistant tumor for which the identification of specific markers and therapeutic targets is highly desirable. We describe here the development and use of a bioinformatic pipeline tool, made publicly available under the name of EST2TSE, for the in silico detection of candidate genes with tissue-specific expression. Using this tool we mined the human EST (Expressed Sequence Tag) database for sequences derived exclusively from melanoma. We found 29 UniGene clusters of multiple ESTs with the potential to predict novel genes with melanoma-specific expression. Using a diverse panel of human tissues and cell lines, we validated the expression of a subset of three previously uncharacterized genes (clusters Hs.295012, Hs.518391, and Hs.559350) to be highly restricted to melanoma/melanocytes and named them RMEL1, 2 and 3, respectively. Expression analysis in nevi, primary melanomas, and metastatic melanomas revealed RMEL1 as a novel melanocytic lineage-specific gene up-regulated during melanoma development. RMEL2 expression was restricted to melanoma tissues and glioblastoma. RMEL3 showed strong up-regulation in nevi and was lost in metastatic tumors. Interestingly, we found correlations of RMEL2 and RMEL3 expression with improved patient outcome, suggesting tumor and/or metastasis suppressor functions for these genes. The three genes are composed of multiple exons and map to 2q12.2, 1q25.3, and 5q11.2, respectively. They are well conserved throughout primates, but not other genomes, and were predicted as having no coding potential, although primate-conserved and human-specific short ORFs could be found. Hairpin RNA secondary structures were also predicted. Concluding, this work offers new melanoma-specific genes for future validation as prognostic markers or as targets for the development of therapeutic strategies to treat melanoma.