51 results for Genomic sequence database
Abstract:
Because of the wide diversity of unknown organisms in the environment, 99% of them cannot be grown in traditional laboratory culture media. Metagenomics projects have therefore been proposed to study the microbial communities present in the environment using molecular techniques, especially sequencing. An accumulation of sequences produced by these projects is thus expected in the coming years. The sequences produced by genomics and metagenomics projects pose several challenges for processing, storage, and analysis, such as the search for clones containing genes of interest. This work presents OCI Metagenomics, which allows the rules for selecting clones in metagenomic libraries to be defined and managed dynamically through an algebraic approach based on process algebra. Furthermore, a web interface was developed to allow researchers to easily create and execute their own rules for selecting clones in a genomic sequence database. The software was tested on a metagenomic cosmid library and was able to select clones containing genes of interest. Copyright 2010 ACM.
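The idea of researcher-defined, composable clone selection rules can be illustrated with a minimal sketch. It uses plain boolean predicate combinators as a simpler stand-in; it is not the process algebra used by OCI Metagenomics, and the `Rule` class, the `has_hit` and `min_length` helpers, and the clone record fields (`id`, `length`, `hits`) are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Clone = Dict[str, Any]  # hypothetical clone record: id, insert length, annotated hits


@dataclass(frozen=True)
class Rule:
    """A selection rule wrapping a predicate over a clone record."""
    test: Callable[[Clone], bool]

    def __and__(self, other: "Rule") -> "Rule":
        # Both rules must accept the clone
        return Rule(lambda c: self.test(c) and other.test(c))

    def __or__(self, other: "Rule") -> "Rule":
        # Either rule may accept the clone
        return Rule(lambda c: self.test(c) or other.test(c))

    def __invert__(self) -> "Rule":
        # Negate the rule
        return Rule(lambda c: not self.test(c))

    def select(self, clones: List[Clone]) -> List[Clone]:
        # Keep only the clones accepted by the rule
        return [c for c in clones if self.test(c)]


def has_hit(gene: str) -> Rule:
    """Accept clones whose annotated hits include `gene` (hypothetical field)."""
    return Rule(lambda c: gene in c["hits"])


def min_length(bp: int) -> Rule:
    """Accept clones whose insert is at least `bp` base pairs long."""
    return Rule(lambda c: c["length"] >= bp)


# Made-up example library, for illustration only
library = [
    {"id": "c1", "length": 35000, "hits": {"lipase", "esterase"}},
    {"id": "c2", "length": 28000, "hits": {"lipase"}},
    {"id": "c3", "length": 40000, "hits": {"amylase"}},
]

rule = (has_hit("lipase") | has_hit("amylase")) & min_length(30000)
selected_ids = [c["id"] for c in rule.select(library)]
```

Because rules are ordinary values, they can be built, combined, and re-run at any time without changing the selection code, which is the property the abstract attributes to dynamically managed rules.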
Abstract:
We report the cloning and characterization of a long interspersed nucleotide element (LINE) from a cichlid fish, Oreochromis niloticus, and show the distribution of this element, called CiLINE2 for cichlid LINE2, in the chromosomes of this species. The identification of an open reading frame in CiLINE2 with amino acid sequence similarity to reverse transcriptases encoded by LINE-like elements in Caenorhabditis elegans, Platemys spixii, Schistosoma mansoni, Gallus gallus (CR1), Drosophila melanogaster (I factor), and Homo sapiens (LINE2), as well as the structure of the element, suggests it is a member of this family of non-long terminal repeat-containing retrotransposons. A search of a DNA sequence database identified sequences similar to CiLINE2 in four other fish species (Haplotaxodon microlepis, Oreochromis mossambicus, Pseudotropheus zebra, and Fugu rubripes). Southern blot hybridization experiments revealed the presence of sequences similar to CiLINE2 in all Tilapiini species analyzed from the genera Oreochromis, Tilapia, and Sarotherodon, and gave an estimated copy number of about 5500 for the haploid genome of O. niloticus. Fluorescent in situ hybridization showed that CiLINE2 sequences were organized in small clusters dispersed over all chromosomes of O. niloticus, with a higher concentration near chromosome ends. Furthermore, the long arm of chromosome 1 was strikingly enriched with this sequence. The distribution of LINE2-related elements might underlie the difference in chromosome banding patterns observed between cold-blooded vertebrates and mammals.
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Abstract:
We report the results of a transcript finishing initiative, undertaken for the purpose of identifying and characterizing novel human transcripts, in which RT-PCR was used to bridge gaps between paired EST clusters mapped against the genomic sequence. Each pair of EST clusters selected for experimental validation was designated a transcript finishing unit (TFU). A total of 489 TFUs were selected for validation, and an overall efficiency of 43.1% was achieved. We generated a total of 59,975 bp of transcribed sequences organized into 432 exons, contributing to the definition of the structure of 211 human transcripts. The structure of several transcripts reported here was confirmed during the course of this project through the generation of their corresponding full-length cDNA sequences. Nevertheless, for 21% of the validated TFUs, a full-length cDNA sequence is not yet available in public databases, and the structure of 69.2% of these TFUs was not correctly predicted by computer programs. The TF strategy provides a significant contribution to the definition of the complete catalog of human genes and transcripts, because it appears to be particularly useful for the identification of low-abundance transcripts expressed in a restricted set of tissues, as well as for the delineation of gene boundaries and alternatively spliced isoforms.
Abstract:
The importance of studying acetic acid bacteria, especially those of the genus Gluconobacter, rests on their industrial applications, since they are able to bioconvert sorbitol to sorbose, which makes the vitamin C production process viable. The study involved collecting samples from soft drink plants, flowers, fruits, and honey, followed by purification, phenotypic identification, and molecular identification using a primer designed after consulting the Nucleotide Sequence Database. Strains identified as members of the family Acetobacteraceae, genus Gluconobacter, were preserved. A total of 110 strains were isolated from the following substrates: Pyrostegia venusta (Cipó de São João), honey, Vitis vinifera (grape), Pyrus communis (pear), Malus sp. (apple), and two samples of soft drinks bottled in 2 L PET containers. Of this total, 57 strains were recovered on MYP medium (mannitol, yeast extract, peptone), 12 on YGM medium (glucose, mannitol, yeast extract, ethanol, acetic acid), and 41 in enrichment medium and subsequently on GYC medium (glucose, yeast extract, and calcium carbonate). Sixty-eight strains were identified as Gram-negative rods. Of these, 31 were characterized biochemically as belonging to the family Acetobacteraceae because they were catalase positive, oxidase negative, and produced acid from glucose. The characterization of these strains was complemented with the following biochemical tests: gelatin liquefaction, nitrate reduction, indole and H2S formation, and oxidation of ethanol to acetic acid. Molecular methods were applied to identify the genus Gluconobacter. Finally, eight strains were characterized as belonging to the genus Gluconobacter. The strains are deposited in the culture collection of the Microbiology laboratory of the Department of Biology at UNESP, Assis campus, stored in malt extract 20 at -196 °C.
Abstract:
Oxidative stress, which generates active oxygen species, has been shown to be one of the underlying causes of tissue injury after the exposure of eucalyptus (Eucalyptus spp.) plants to a wide variety of stress conditions. The objective of this study was to perform data mining to identify favorable genes and alleles associated with the enzyme systems superoxide dismutase, catalase, peroxidases, and glutathione S-transferase, which are related to tolerance of environmental stresses and of damage caused by pests, diseases, herbicides, and weeds themselves. This was done using the eucalyptus expressed-sequence database (https://forests.esalq.usp.br). The alignment results between amino acid and nucleotide sequences indicated that the studied enzymes were adequately represented in the EST database of the FORESTs project.
Abstract:
The N-linked glycosylation of secretory and membrane proteins is the most complex posttranslational modification known to occur in eukaryotic cells. It has been shown to play critical roles in modulating protein function. Although this important biological process has been extensively studied in mammals, much less is known about this biosynthetic pathway in plants. The enzymes involved in plant N-glycan biosynthesis and processing are still not well defined and the mechanism of their genetic regulation is almost completely unknown. In this paper we describe our first attempt to understand the N-linked glycosylation mechanism in a plant species by using the data generated by the Sugarcane Expressed Sequence Tag (SUCEST) project. The SUCEST database was mined for sugarcane gene products potentially involved in the N-glycosylation pathway. This approach has led to the identification and functional assignment of 90 expressed sequence tag (EST) clusters sharing significant sequence similarity with the enzymes involved in N-glycan biosynthesis and processing. The ESTs identified were also analyzed to establish their relative abundance.
Abstract:
Genomic sequence comparison across species has enabled the elucidation of important coding and regulatory sequences encoded within DNA. Of particular interest are the noncoding regulatory sequences, which influence gene transcriptional and posttranscriptional processes. A phylogenetic footprinting strategy was employed to identify noncoding conservation patterns of 39 human and bovine orthologous genes. Seventy-three conserved noncoding sequences were identified that shared greater than 70% identity over at least 100 bp. Thirteen of these conserved sequences were also identified in the mouse genome. Evolutionary conservation of noncoding sequences across diverse species may have functional significance, and these conserved sequences may be good candidates for regulatory elements.
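The conservation criterion stated above (greater than 70% identity over at least 100 bp) can be illustrated with a minimal sliding-window sketch over a gapped pairwise alignment. This is an assumption about how such a filter might be applied, not the authors' pipeline; the function name `conserved_windows` and its parameters are hypothetical, and the two input strings are assumed to be already aligned, with `-` marking gaps.

```python
def conserved_windows(aligned_a, aligned_b, window=100, min_identity=0.70):
    """Return merged (start, end) alignment-column intervals covered by
    windows of `window` columns whose identity exceeds `min_identity`.

    Both inputs must be rows of the same pairwise alignment ('-' for gaps).
    Intervals are half-open and expressed in alignment columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    n = len(aligned_a)
    if n < window:
        return []

    # 1 where both rows hold the same non-gap base, otherwise 0
    match = [
        1 if a == b and a != "-" else 0
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
    ]

    regions = []
    count = sum(match[:window])  # matches in the first window
    for start in range(n - window + 1):
        if start > 0:
            # Slide by one column: add the entering column, drop the leaving one
            count += match[start + window - 1] - match[start - 1]
        if count / window > min_identity:
            end = start + window
            if regions and start <= regions[-1][1]:
                # Overlaps or touches the previous region: extend it
                regions[-1] = (regions[-1][0], end)
            else:
                regions.append((start, end))
    return regions
```

Note that merged regions are unions of passing windows, so their edges can include some mismatched columns. A production analysis would typically trim region boundaries and map alignment columns back to genomic coordinates.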
Abstract:
The publication of the human genome sequence in 2001 was a major step forward in the knowledge necessary to understand the variations between individuals. For farmed species, genomic sequence information will facilitate the selection of animals optimised to live, and be productive, in particular environments. The availability of the cattle genome sequence has allowed the breeding industry to take the first steps towards predicting phenotypes from genotypes by estimating a genomic breeding value (gEBV) for bulls using genome-wide DNA markers. The sequencing of the buffalo genome and creation of a panel of DNA markers has created the opportunity to apply molecular selection approaches for this species. The genomes of several buffalo of different breeds were sequenced and aligned with the bovine genome, which facilitated the identification of millions of sequence variants in the buffalo genomes. Based on frequencies of variants within and among buffalo breeds, and their distribution across the genome compared with the bovine genome, 90,000 putative single nucleotide polymorphisms (SNP) were selected to create an Axiom® Buffalo Genotyping Array 90K. This SNP chip was tested in buffalo populations from Italy and Brazil and found to have at least 75% high quality and polymorphic markers in these populations. The 90K SNP chip was then used to investigate the structure of buffalo populations, and to localise the variations having a major effect on milk production.
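The notion of "high quality and polymorphic" markers can be made concrete with a minimal sketch that computes the call rate and minor allele frequency (MAF) per marker and reports the usable fraction of a panel. The thresholds, the 0/1/2 genotype coding (`None` for a missing call), and the names `marker_stats` and `usable_fraction` are assumptions for illustration; the abstract does not state the criteria actually applied to the array.

```python
def marker_stats(genotypes):
    """Return (call_rate, minor_allele_frequency) for one marker.

    `genotypes` holds one value per animal: 0, 1, or 2 copies of the
    alternate allele, or None when the genotype call is missing.
    """
    called = [g for g in genotypes if g is not None]
    call_rate = len(called) / len(genotypes) if genotypes else 0.0
    if not called:
        return call_rate, 0.0
    # Each called diploid genotype contributes two alleles
    alt_freq = sum(called) / (2 * len(called))
    return call_rate, min(alt_freq, 1 - alt_freq)


def usable_fraction(panel, min_call_rate=0.95, min_maf=0.01):
    """Fraction of markers passing both thresholds (assumed values)."""
    usable = 0
    for genotypes in panel.values():
        call_rate, maf = marker_stats(genotypes)
        if call_rate >= min_call_rate and maf >= min_maf:
            usable += 1
    return usable / len(panel) if panel else 0.0


# Made-up genotypes for four markers in four animals, for illustration only
panel = {
    "snp1": [0, 1, 2, 1],        # polymorphic, fully called
    "snp2": [0, 0, 0, 0],        # monomorphic
    "snp3": [1, None, None, 2],  # low call rate
    "snp4": [2, 1, 1, 0],        # polymorphic, fully called
}
fraction = usable_fraction(panel)
```

Running the same calculation separately for each population would show whether a marker that is polymorphic in one population remains informative in another.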
Abstract:
Background: Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results: Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions: While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
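To make the idea of a tandem repeat monomer concrete, the following toy sketch scans a single sequence for the smallest period at which it largely matches a shifted copy of itself. It is a simplified self-comparison scan under stated assumptions, not the method used in the study; the function name `best_tandem_period` and the 0.9 match threshold are hypothetical.

```python
def best_tandem_period(seq, max_period=None, min_match=0.9):
    """Return (period, monomer) for the smallest period at which the
    sequence matches itself shifted by that period in at least
    `min_match` of the compared positions, or None if no period qualifies.
    """
    seq = seq.upper()
    n = len(seq)
    if max_period is None:
        # Require at least two copies of the monomer
        max_period = n // 2
    for period in range(1, max_period + 1):
        compared = n - period
        if compared <= 0:
            break
        # Count positions that agree with the base one period downstream
        matches = sum(1 for i in range(compared) if seq[i] == seq[i + period])
        if matches / compared >= min_match:
            return period, seq[:period]
    return None


# Ten exact copies of an 8 bp monomer, made up for illustration
array = "ACGTTGCA" * 10
result = best_tandem_period(array)
```

The scan costs time proportional to sequence length times the maximum period, so it only suits short inputs; genome-scale work relies on dedicated repeat finders. The monomer is reported in the phase at which the input happens to start.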
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Graduate Program in Agronomy (Plant Protection) - FCA
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)