956 results for Genomic sequence database
Abstract:
Genomic sequence analysis tools allow biologists to identify and understand key regions implicated in genetic diseases. The scientific community currently needs efficient analysis tools. This project characterizes and analyzes the performance of algorithms used to compare complete genomic sequences when executed on multicore and manycore architectures. Based on this analysis, the suitability of these architectures for the genomic sequence comparison problem is evaluated. Finally, a series of modifications to the implementations of these algorithms is proposed with the aim of improving performance.
Abstract:
PURPOSE: The aim of this study was to test whether oligonucleotide-targeted gene repair can correct the point mutation in genomic DNA of PDE6b(rd1) (rd1) mouse retinas in vivo. METHODS: Oligonucleotides (ODNs) 25 nucleotides in length, complementary to the genomic sequence subsuming the rd1 point mutation in the gene encoding the beta-subunit of rod photoreceptor cGMP-phosphodiesterase (beta-PDE), were synthesized with a wild type nucleotide base at the rd1 point mutation position. Control ODNs contained the same nucleotide bases as the wild type ODNs but with varying degrees of sequence mismatch. We previously developed a repeatable and relatively non-invasive technique to enhance ODN delivery to photoreceptor nuclei using transpalpebral iontophoresis prior to intravitreal ODN injection. Three such treatments were performed on C3H/henJ (rd1) mouse pups before postnatal day (PN) 9. Treatment outcomes were evaluated at PN28 or PN33, when retinal degeneration was nearly complete in the untreated rd1 mice. The effect of treatment on photoreceptor survival was evaluated by counting the number of nuclei of photoreceptor cells and by assessing rhodopsin immunohistochemistry on flat-mount retinas and sections. Gene repair in the retina was quantified by allele-specific real time PCR and by detection of beta-PDE-immunoreactive photoreceptors. Confirmatory experiments were conducted using independent rd1 colonies in separate laboratories. These experiments had an additional negative control ODN that contained the rd1 mutant nucleotide base at the rd1 point mutation site such that the sole difference between treatment with wild type and control ODN was the single base at the rd1 point mutation site. RESULTS: Iontophoresis enhanced the penetration of intravitreally injected ODNs in all retinal layers.
Using this delivery technique, significant survival of photoreceptors was observed in retinas from eyes treated with wild type ODNs but not control ODNs as demonstrated by cell counting and rhodopsin immunoreactivity at PN28. Beta-PDE immunoreactivity was present in retinas from eyes treated with wild type ODN but not from those treated with control ODNs. Gene correction demonstrated by allele-specific real time PCR and by counts of beta-PDE-immunoreactive cells was estimated at 0.2%. Independent confirmatory experiments showed that retinas from eyes treated with wild type ODN contained many more rhodopsin immunoreactive cells compared to retinas treated with control (rd1 sequence) ODN, even when harvested at PN33. CONCLUSIONS: Short ODNs can be delivered with repeatable efficiency to mouse photoreceptor cells in vivo using a combination of intravitreal injection and iontophoresis. Delivery of therapeutic ODNs to rd1 mouse eyes resulted in genomic DNA conversion from mutant to wild type sequence, low but observable beta-PDE immunoreactivity, and preservation of rhodopsin immunopositive cells in the outer nuclear layer, suggesting that ODN-directed gene repair occurred and preserved rod photoreceptor cells. Effects were not seen in eyes treated with buffer or with ODNs having the rd1 mutant sequence, a definitive control for this therapeutic approach. Importantly, critical experiments were confirmed in two laboratories by several different researchers using independent mouse colonies and ODN preparations from separate sources. These findings suggest that targeted gene repair can be achieved in the retina following enhanced ODN delivery.
Abstract:
The current drug options for the treatment of chronic Chagas disease have not been sufficient, and high hopes have been placed on the use of genomic data from the human parasite Trypanosoma cruzi to identify new drug targets and develop appropriate treatments for both acute and chronic Chagas disease. However, the lack of a complete assembly of the genomic sequence and the presence of many predicted proteins with unknown or uncertain functions have hampered a complete view of the parasite's metabolic pathways. Moreover, pinpointing new drug targets has proven to be more complex than anticipated and has revealed large gaps in our understanding of metabolic pathways and their integrated regulation, not only for this parasite, but for many other similar pathogens. Using an in silico comparative study on pathway annotation and searching for analogous and specific enzymes, we have been able to predict a considerable number of additional enzymatic functions in T. cruzi. Here we focus on the energetic pathways, such as glycolysis, the pentose phosphate shunt, the Krebs cycle and lipid metabolism. We point out many enzymes that are analogous to those of the human host, which could be potential new therapeutic targets.
Abstract:
As in perhaps all eukaryotes, schistosomes use a supplementary information transmitting system, the epigenetic inheritance system, to shape genetic information and to produce different phenotypes. In contrast to other important parasites, the study of epigenetic phenomena in schistosomes is still in its infancy. Nevertheless, we are beginning to grasp what goes on behind the epigenetic scene in this parasite. We have developed techniques of native chromatin immunoprecipitation (N-ChIP) and the associated bioinformatics tools that allow us to run genome-wide comparative chromatin studies on Schistosoma mansoni at different stages of its life cycle, on different strains and on different sexes. We present here an application of such an approach to study the genetic and epigenetic basis for a phenotypic trait, the compatibility of S. mansoni with its invertebrate host Biomphalaria glabrata. We have applied the ChIP procedure to two strains that are either compatible or incompatible with their intermediate host. The precipitated DNA was sequenced and aligned to a reference genome, and this information was used to determine regions in which the two strains differ in their genomic sequence and/or chromatin structure. This procedure allowed us to identify candidate genes that display either genetic or epigenetic differences between the two strains.
Abstract:
The hepatitis C virus (HCV) NS3 protease has been one of the molecular targets of new therapeutic approaches. Its genomic sequence variability in Brazilian HCV isolates is poorly documented. To obtain more information on the magnitude of its genetic diversity, 114 Brazilian HCV samples were sequenced and analysed together with global reference sequences. Genetic distance (d) analyses revealed that subtype 1b had a higher degree of heterogeneity (d = 0.098) than subtypes 1a (d = 0.060) and 3a (d = 0.062). Brazilian isolates of subtype 1b were distributed in the phylogenetic tree among sequences from other countries, whereas most subtype 1a and 3a sequences clustered into a single branch. Additional characterisation of subtype 1a in clades 1 and 2 revealed that all but two Brazilian subtype 1a sequences formed a distinct and strongly supported (approximate likelihood-ratio test = 93) group of sequences inside clade 1. Moreover, this subcluster inside clade 1 presented an unusual phenotypic characteristic in relation to the presence of resistance mutations for macrocyclic inhibitors. In particular, the mutation Q80K was found in the majority of clade 1 sequences, but not in the Brazilian isolates. These data demonstrate that Brazilian HCV subtypes display a distinct pattern of genetic diversity and reinforce the importance of sequence information in future therapeutic approaches.
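The genetic distance values (d) quoted above summarize how much the sequences within a subtype differ from one another. The abstract does not say which distance model was used, so the following is only a minimal sketch that assumes the simplest one, the uncorrected p-distance (proportion of differing sites), averaged over all pairs of pre-aligned sequences. The function names and the toy sequences are hypothetical and are not taken from the study.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Uncorrected p-distance: fraction of compared sites that differ.

    Assumes the two sequences are already aligned (same length).
    Sites where either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def mean_pairwise_distance(seqs) -> float:
    """Average p-distance over all unordered pairs of aligned sequences."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Toy alignment, purely illustrative.
    toy_subtype = ["AAAA", "AAAT", "AATT"]
    print(mean_pairwise_distance(toy_subtype))
```

Published analyses usually apply a substitution model that corrects for multiple hits, so values from this sketch would not be directly comparable to the d values reported above.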
Abstract:
The primary mission of UniProt is to support biological research by maintaining a stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces freely accessible to the scientific community. UniProt is produced by the UniProt Consortium which consists of groups from the European Bioinformatics Institute (EBI), the Swiss Institute of Bioinformatics (SIB) and the Protein Information Resource (PIR). UniProt is comprised of four major components, each optimized for different uses: the UniProt Archive, the UniProt Knowledgebase, the UniProt Reference Clusters and the UniProt Metagenomic and Environmental Sequence Database. UniProt is updated and distributed every 3 weeks and can be accessed online for searches or download at http://www.uniprot.org.
Abstract:
The primary mission of Universal Protein Resource (UniProt) is to support biological research by maintaining a stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces freely accessible to the scientific community. UniProt is produced by the UniProt Consortium which consists of groups from the European Bioinformatics Institute (EBI), the Swiss Institute of Bioinformatics (SIB) and the Protein Information Resource (PIR). UniProt is comprised of four major components, each optimized for different uses: the UniProt Archive, the UniProt Knowledgebase, the UniProt Reference Clusters and the UniProt Metagenomic and Environmental Sequence Database. UniProt is updated and distributed every 4 weeks and can be accessed online for searches or download at http://www.uniprot.org.
Abstract:
One of the first useful products from the human genome will be a set of predicted genes. Besides its intrinsic scientific interest, the accuracy and completeness of this data set is of considerable importance for human health and medicine. Though progress has been made on computational gene identification in terms of both methods and accuracy evaluation measures, most of the sequence sets in which the programs are tested are short genomic sequences, and there is concern that these accuracy measures may not extrapolate well to larger, more challenging data sets. Given the absence of experimentally verified large genomic data sets, we constructed a semiartificial test set comprising a number of short single-gene genomic sequences with randomly generated intergenic regions. This test set, which should still present an easier problem than real human genomic sequence, mimics the approximately 200 kb long BACs being sequenced. In our experiments with these longer genomic sequences, the accuracy of GENSCAN, one of the most accurate ab initio gene prediction programs, dropped significantly, although its sensitivity remained high. Conversely, the accuracy of similarity-based programs, such as GENEWISE, PROCRUSTES, and BLASTX, was not affected significantly by the presence of random intergenic sequence, but depended on the strength of the similarity to the protein homolog. As expected, the accuracy dropped if the models were built using more distant homologs, and we were able to quantitatively estimate this decline. However, the specificities of these techniques are still rather good even when the similarity is weak, which is a desirable characteristic for driving expensive follow-up experiments.
Our experiments suggest that though gene prediction will improve with every new protein that is discovered and through improvements in the current set of tools, we still have a long way to go before we can decipher the precise exonic structure of every gene in the human genome using purely computational methodology.
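The distinction this abstract draws between sensitivity and specificity can be made concrete at the nucleotide level. The sketch below assumes the convention common in gene-finding evaluation, where sensitivity is TP / (TP + FN) and specificity is TP / (TP + FP), that is, the fraction of predicted coding bases that are truly coding. The abstract does not spell out its formulas, so this convention is an assumption, and the interval coordinates are invented for illustration.

```python
def exon_positions(exons):
    """Expand half-open (start, end) exon intervals into a set of positions."""
    positions = set()
    for start, end in exons:
        positions.update(range(start, end))
    return positions


def nucleotide_accuracy(true_exons, predicted_exons):
    """Return (sensitivity, specificity) at the nucleotide level.

    sensitivity = TP / (TP + FN): share of true coding bases that were predicted
    specificity = TP / (TP + FP): share of predicted coding bases that are true
    """
    truth = exon_positions(true_exons)
    predicted = exon_positions(predicted_exons)
    tp = len(truth & predicted)
    fn = len(truth - predicted)
    fp = len(predicted - truth)
    sensitivity = tp / (tp + fn) if truth else float("nan")
    specificity = tp / (tp + fp) if predicted else float("nan")
    return sensitivity, specificity


if __name__ == "__main__":
    # Hypothetical annotation: two real exons.
    true_exons = [(100, 200), (300, 400)]
    # Hypothetical prediction: both real exons plus one spurious exon
    # placed in intergenic sequence.
    predicted_exons = [(100, 200), (300, 400), (500, 600)]
    sn, sp = nucleotide_accuracy(true_exons, predicted_exons)
    print(f"sensitivity={sn:.3f} specificity={sp:.3f}")
```

In this toy case every true coding base is recovered, so sensitivity stays at 1.0, while the spurious intergenic exon lowers specificity to 2/3. That is the pattern the abstract reports for an ab initio predictor on long sequences with added intergenic regions.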
Abstract:
The complete sequence of the 7.07 Mb genome of the biological control agent Pseudomonas fluorescens Pf-5 is now available, providing a new opportunity to advance knowledge of biological control through genomics. P. fluorescens Pf-5 is a rhizosphere bacterium that suppresses seedling emergence diseases and produces a spectrum of antibiotics toxic to plant-pathogenic fungi and oomycetes. In addition to six known secondary metabolites produced by Pf-5, three novel secondary metabolite biosynthesis gene clusters identified in the genome could also contribute to biological control. The genomic sequence provides numerous clues as to mechanisms used by the bacterium to survive in the spermosphere and rhizosphere. These features include broad catabolic and transport capabilities for utilizing seed and root exudates, an expanded collection of efflux systems for defense against environmental stress and microbial competition, and the presence of 45 outer membrane receptors that should allow for the uptake of iron from a wide array of siderophores produced by soil microorganisms. As expected for a bacterium with a large genome that lives in a rapidly changing environment, Pf-5 has an extensive collection of regulatory genes, only some of which have been characterized for their roles in regulation of secondary metabolite production or biological control. Consistent with its commensal lifestyle, Pf-5 appears to lack a number of virulence and pathogenicity factors found in plant pathogens.
Abstract:
The question of where retroviral DNA becomes integrated in chromosomes is important for (i) understanding the mechanisms of viral growth, (ii) devising new antiretroviral therapy, (iii) understanding how genomes evolve, and (iv) developing safer methods for gene therapy. With the completion of genome sequences for many organisms, it has become possible to study integration targeting by cloning and sequencing large numbers of host-virus DNA junctions, then mapping the host DNA segments back onto the genomic sequence. This allows statistical analysis of the distribution of integration sites relative to the myriad types of genomic features that are also being mapped onto the sequence scaffold. Here we present methods for recovering and analyzing integration site sequences.
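Once integration sites have been mapped onto the genome, one simple way to relate them to an annotated feature is to measure the distance from each site to the nearest feature and count how many sites fall within a chosen window. The sketch below illustrates that idea only. It is not the method described in the chapter, it assumes all coordinates lie on a single chromosome, and the function names, feature positions, and window size are hypothetical.

```python
import bisect


def distance_to_nearest(position, sorted_features):
    """Distance from a position to the closest coordinate in a sorted list."""
    if not sorted_features:
        raise ValueError("no features supplied")
    i = bisect.bisect_left(sorted_features, position)
    candidates = []
    if i < len(sorted_features):
        candidates.append(sorted_features[i] - position)  # feature at or right of site
    if i > 0:
        candidates.append(position - sorted_features[i - 1])  # feature left of site
    return min(candidates)


def fraction_within(sites, features, window):
    """Fraction of sites lying within `window` bases of any feature."""
    if not sites:
        raise ValueError("no sites supplied")
    sorted_features = sorted(features)
    hits = sum(
        1 for site in sites
        if distance_to_nearest(site, sorted_features) <= window
    )
    return hits / len(sites)


if __name__ == "__main__":
    # Hypothetical feature coordinates (for example, transcription start sites).
    features = [1000, 5000, 9000]
    # Hypothetical mapped integration sites on the same chromosome.
    integration_sites = [900, 1200, 3000, 8990]
    print(fraction_within(integration_sites, features, window=250))
```

A statistical analysis would typically compare this fraction with the same quantity computed for a set of matched control positions. How such controls are chosen is outside the scope of this sketch.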
Abstract:
Gene correction at the site of the mutation in the chromosome is the definitive way to cure a genetic disease. The oligonucleotide (ODN)-mediated gene repair technology uses an ODN perfectly complementary to the genomic sequence except for a mismatch at the base that is mutated. The endogenous repair machinery of the targeted cell then mediates substitution of the desired base in the gene, resulting in a completely normal sequence. Theoretically, it avoids the potential gene silencing or random integration associated with common viral gene augmentation approaches and allows intact regulation of expression of the therapeutic protein. The eye is a particularly attractive target for gene repair because of its unique features (small organ, easily accessible, low diffusion into systemic circulation). Moreover, therapeutic effects on visual impairment could be obtained with modest levels of repair. This chapter describes in detail the optimized method to target active ODNs to the nuclei of photoreceptors in neonatal mice using (1) an electric current applied at the eye surface (saline transpalpebral iontophoresis) combined with (2) an intravitreous injection of ODNs, as well as the experimental methods for (3) the dissection of adult neural retinas, (4) their immuno-labelling, and (5) flat-mounting for direct observation of photoreceptor survival, a relevant criterion of treatment outcome for retinal degeneration.
Abstract:
BACKGROUND: Understanding cancer-related modifications to transcriptional programs requires detailed knowledge about the activation of signal-transduction pathways and gene expression programs. To investigate the mechanisms of target gene regulation by human estrogen receptor alpha (hERalpha), we combine extensive location and expression datasets with genomic sequence analysis. In particular, we study the influence of patterns of DNA occupancy by hERalpha on expression phenotypes. RESULTS: We find that strong ChIP-chip sites co-localize with strong hERalpha consensus sites and detect nucleotide bias near hERalpha sites. The localization of ChIP-chip sites relative to annotated genes shows that weak sites are enriched near transcription start sites, while stronger sites show no positional bias. Assessing the relationship between binding configurations and expression phenotypes, we find binding sites downstream of the transcription start site (TSS) to be equally good or better predictors of hERalpha-mediated expression than upstream sites. The study of FOX and SP1 cofactor sites near hERalpha ChIP sites shows that induced genes frequently have FOX or SP1 sites. Finally, we integrate these multiple datasets to define a high-confidence set of primary hERalpha target genes. CONCLUSION: Our results support the model of long-range interactions of hERalpha with the cofactor SP1 bound at the promoters of hERalpha target genes. FOX motifs co-occur with hERalpha motifs along responsive genes. Importantly, we show that the spatial arrangement of sites near the start sites and within the full transcript is important in determining response to estrogen signaling.
Abstract:
Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes.
Abstract:
The seven-subunit Arp2/3 complex is a highly conserved nucleation factor of actin microfilaments. We have isolated the genomic sequence encoding a putative Arp3a protein of the moss Physcomitrella patens. The disruption of this ARP3A gene by allele replacement has generated loss-of-function mutants displaying a complex developmental phenotype. Loss of function of the ARP3A gene results in shortened, almost cubic chloronemal cells displaying affected tip growth and lacking differentiation to caulonemal cells. In moss arp3a mutants, buds differentiate directly from chloronemata to form stunted leafy shoots having differentiated leaves similar to wild type. Yet, rhizoids never differentiate from stem epidermal cells. To characterize the F-actin organization in the arp3a-mutated cells, we disrupted the ARP3A gene in the previously described HGT1 strain, which conditionally expresses the GFP-talin marker. In vivo observation of the F-actin cytoskeleton during P. patens development demonstrated that loss of function of Arp3a is associated with the disappearance of specific F-actin cortical structures associated with the establishment of localized cellular growth domains. Finally, we show that constitutive expression of the P. patens Arp3a and its Arabidopsis thaliana orthologs efficiently complements the mutated phenotype, indicating a high degree of evolutionary conservation of the Arp3 function in land plants.
Abstract:
DEAD-box proteins comprise a family of ATP-dependent RNA helicases involved in several aspects of RNA metabolism. Here we report the characterization of the human DEAD-box RNA helicase DDX26. The gene is composed of 14 exons distributed over an extension of 8,123 bp of genomic sequence and encodes a transcript of 1.8 kb that is expressed in all tissues evaluated. The predicted amino acid sequence shows a high similarity to a yeast DEAD-box RNA helicase (Dbp9b) involved in ribosome biogenesis. The new helicase maps to 7p12, a region of frequent chromosome amplifications in glioblastomas involving the epidermal growth factor receptor (EGFR) gene. Nevertheless, co-amplification of DDX26 with EGFR was not detected in nine tumors analyzed.