205 results for Bioinformatic
Summary:
World Congress of Malacology, Ponta Delgada, July 22-28, 2013.
Summary:
Thesis submitted to the Universidade Nova de Lisboa, Faculdade de Ciências e Tecnologia, for the degree of Doctor of Philosophy in Biochemistry
Summary:
The isolation of the bartolosides, unprecedented cyanobacterial glycolipids featuring aliphatic chains with chlorine substituents and C-glycosyl moieties, is reported. Their chlorinated dialkylresorcinol (DAR) core presented a major structural-elucidation challenge. To overcome this, we discovered the bartoloside (brt) biosynthetic gene cluster and linked it to the natural products through in vitro characterization of the DAR-forming ketosynthase and aromatase. Bioinformatic analysis also revealed a novel potential halogenase. Knowledge of the bartoloside biosynthesis constrained the DAR core structure by defining key pathway intermediates, ultimately allowing us to determine the full structures of the bartolosides. This work illustrates the power of genomics to enable the use of biosynthetic information for structure elucidation.
Summary:
Dissertation presented to obtain the Ph.D. degree in Biology
Summary:
A thesis to obtain a Master's degree in Structural and Functional Biochemistry
Summary:
Part of this thesis will be published in the following: Gomes, B.C., Santos, B. 2015. Methods for studying microRNAs expression and their targets in formalin-fixed, paraffin-embedded (FFPE) breast cancer tissues. In Methods in Molecular Biology: Cancer Drug Resistance (Rueff, J. & Rodrigues, A.S. eds), Springer Protocols.
Summary:
The chemical composition of propolis is affected by environmental factors and harvest season, making it difficult to standardize its extracts for medicinal use. By detecting a typical chemical profile associated with propolis from a specific production region or season, certain types of propolis may be used to obtain a specific pharmacological activity. In this study, propolis samples from three agroecological regions (plain, plateau, and highlands) of southern Brazil, collected over the four seasons of 2010, were investigated through a novel NMR-based metabolomics data analysis workflow. Chemometrics and machine learning algorithms (PLS-DA and RF), including methods to estimate variable importance in classification, were used. The machine learning and feature selection methods permitted construction of models for propolis sample classification with high accuracy (>75%, reaching 90% in the best case), and discriminated samples better by collection season than by harvest region. PLS-DA and RF allowed the identification of biomarkers for sample discrimination, expanding the set of discriminating features and adding relevant information for the identification of the class-determining metabolites. The NMR-based metabolomics analytical platform, coupled to bioinformatic tools, allowed characterization and classification of Brazilian propolis samples regarding the metabolite signature of important compounds, i.e., chemical fingerprint, harvest seasons, and production regions.
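The idea of estimating variable importance in a classifier can be illustrated with a minimal sketch. It does not reproduce the study's PLS-DA or RF models: a nearest-centroid classifier is swapped in to keep the example dependency-free, importance is measured by permutation (the drop in accuracy when one feature is shuffled), and the two-feature data set, the class labels and all function names are invented for illustration. Accuracy is computed on the training data itself, which a real analysis would avoid.

```python
import random

def centroids(X, y):
    """Mean feature vector per class label."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, v in enumerate(row):
            acc[j] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [s / counts[c] for s in acc] for c, acc in sums.items()}

def predict(model, row):
    """Assign the class whose centroid is nearest (squared Euclidean distance)."""
    return min(model, key=lambda c: sum((a - b) ** 2 for a, b in zip(row, model[c])))

def accuracy(model, X, y):
    """Fraction of rows whose predicted class equals the true class."""
    return sum(predict(model, r) == t for r, t in zip(X, y)) / len(y)

def permutation_importance(model, X, y, repeats=20, seed=0):
    """Mean drop in accuracy when one feature column is shuffled."""
    rng = random.Random(seed)
    base = accuracy(model, X, y)
    drops = []
    for j in range(len(X[0])):
        total = 0.0
        for _ in range(repeats):
            col = [r[j] for r in X]
            rng.shuffle(col)
            shuffled = [r[:j] + [v] + r[j + 1:] for r, v in zip(X, col)]
            total += base - accuracy(model, shuffled, y)
        drops.append(total / repeats)
    return drops

# Invented toy data: feature 0 separates the two classes, feature 1 does not.
X = [[0.0 + 0.1 * (i % 3), 0.1 * (i % 2)] for i in range(20)] + \
    [[5.0 + 0.1 * (i % 3), 0.1 * (i % 2)] for i in range(20)]
y = ["summer"] * 20 + ["winter"] * 20

model = centroids(X, y)
importance = permutation_importance(model, X, y)
```

In this toy setting, shuffling the informative feature degrades accuracy while shuffling the uninformative one leaves it unchanged. Ranking spectral variables as candidate biomarkers follows the same logic.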
Summary:
OBJECTIVES: It is still debated if pre-existing minority drug-resistant HIV-1 variants (MVs) affect the virological outcomes of first-line NNRTI-containing ART. METHODS: This Europe-wide case-control study included ART-naive subjects infected with drug-susceptible HIV-1 as revealed by population sequencing, who achieved virological suppression on first-line ART including one NNRTI. Cases experienced virological failure and controls were subjects from the same cohort whose viraemia remained suppressed at a matched time since initiation of ART. Blinded, centralized 454 pyrosequencing with parallel bioinformatic analysis in two laboratories was used to identify MVs in the 1%-25% frequency range. ORs of virological failure according to MV detection were estimated by logistic regression. RESULTS: Two hundred and sixty samples (76 cases and 184 controls), mostly subtype B (73.5%), were used for the analysis. Identical MVs were detected in the two laboratories. 31.6% of cases and 16.8% of controls harboured pre-existing MVs. Detection of at least one MV versus no MVs was associated with an increased risk of virological failure (OR = 2.75, 95% CI = 1.35-5.60, P = 0.005); similar associations were observed for at least one MV versus no NRTI MVs (OR = 2.27, 95% CI = 0.76-6.77, P = 0.140) and at least one MV versus no NNRTI MVs (OR = 2.41, 95% CI = 1.12-5.18, P = 0.024). A dose-effect relationship between virological failure and mutational load was found. CONCLUSIONS: Pre-existing MVs more than double the risk of virological failure to first-line NNRTI-based ART.
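As a rough plausibility check on the figures above, a crude (unadjusted) odds ratio can be computed from a 2x2 table. This is only a sketch: the counts are back-calculated from the reported percentages (31.6% of 76 cases is about 24, 16.8% of 184 controls is about 31), which is an assumption, and the function name is hypothetical. The abstract's own estimate came from logistic regression, so the crude value is not expected to match it exactly.

```python
import math

def odds_ratio_ci(exposed_cases, unexposed_cases, exposed_controls, unexposed_controls, z=1.96):
    """Crude odds ratio with a Woolf (log-normal) confidence interval."""
    a, b, c, d = exposed_cases, unexposed_cases, exposed_controls, unexposed_controls
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    low = math.exp(math.log(odds_ratio) - z * se_log)
    high = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, low, high

# Counts are assumptions reconstructed from the reported percentages.
cases_with_mv, cases_total = 24, 76
controls_with_mv, controls_total = 31, 184

crude_or, ci_low, ci_high = odds_ratio_ci(
    cases_with_mv, cases_total - cases_with_mv,
    controls_with_mv, controls_total - controls_with_mv,
)
print(f"crude OR = {crude_or:.2f} (95% CI {ci_low:.2f}-{ci_high:.2f})")
```

The crude estimate comes out at roughly 2.3, somewhat below the reported 2.75 but pointing in the same direction.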
Summary:
BACKGROUND: Retinitis pigmentosa and other hereditary retinal degenerations (HRD) are rare genetic diseases leading to progressive blindness. Recessive HRD are caused by mutations in more than 100 different genes. Laws of population genetics predict that, on purely theoretical grounds, such a high number of genes should translate into an extremely elevated frequency of unaffected carriers of mutations. In this study we estimate the proportion of these individuals within the general population through the analysis of data from whole-genome sequencing. METHODOLOGY/PRINCIPAL FINDINGS: We screened complete and high-quality genome sequences from 46 control individuals from various world populations for HRD mutations, using bioinformatic tools developed in-house. All mutations detected in silico were validated by Sanger sequencing. We identified clear-cut, null recessive HRD mutations in 10 out of the 46 unaffected individuals analyzed (∼22%). CONCLUSIONS/SIGNIFICANCE: Based on our data, approximately one in 4-5 individuals from the general population may be a carrier of null mutations that are responsible for HRD. This would be the highest mutation carrier frequency so far measured for a class of Mendelian disorders, especially considering that missense mutations and other forms of pathogenic changes were not included in our assessment. Among other things, our results indicate that the risk for a consanguineous couple of generating a child with a blinding disease is particularly high, compared to other genetic conditions.
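Because the carrier estimate rests on 10 carriers among 46 genomes, it helps to see how wide the sampling uncertainty is. The sketch below computes a Wilson score interval for that proportion. The choice of the Wilson method and the function name are assumptions made for illustration; the abstract does not say how, or whether, the authors computed an interval.

```python
import math

def wilson_interval(successes, n, z=1.96):
    """Wilson score confidence interval for a binomial proportion."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half_width, centre + half_width

carriers, screened = 10, 46  # figures taken from the abstract
low, high = wilson_interval(carriers, screened)
print(f"carrier frequency = {carriers / screened:.1%}, 95% CI {low:.1%} to {high:.1%}")
```

With a sample this small, the interval spans roughly 12% to 36%, which comfortably contains the "one in 4-5" figure but also shows that the point estimate is imprecise.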
Summary:
BACKGROUND: The availability of the P. falciparum genome has led to novel ways to identify potential vaccine candidates. A new approach for antigen discovery based on the bioinformatic selection of heptad repeat motifs corresponding to alpha-helical coiled coil structures yielded promising results. To clarify the relationship between the coiled coil motifs and their sequence conservation, we have assessed the extent of polymorphism in putative alpha-helical coiled coil domains in culture strains, in natural populations and in the single nucleotide polymorphism data available at PlasmoDB. METHODOLOGY/PRINCIPAL FINDINGS: 14 alpha-helical coiled coil domains were selected based on preclinical experimental evaluation. They were tested by PCR amplification and sequencing of different P. falciparum culture strains and field isolates. We found that only 3 out of 14 alpha-helical coiled coils showed point mutations and/or length polymorphisms. Based on promising immunological results, 5 of these peptides were selected for further analysis. Direct sequencing of field samples from Papua New Guinea and Tanzania showed that 3 out of these 5 peptides were completely conserved. An in silico analysis of polymorphism was performed for all 166 putative alpha-helical coiled coil domains originally identified in the P. falciparum genome. We found that 82% (137/166) of these peptides were conserved, and for only one peptide did the detected SNPs substantially decrease the probability score for alpha-helical coiled coil formation. More SNPs were found in arrays of almost perfect tandem repeats. In summary, the coiled coil structure prediction was rarely modified by SNPs. The analysis revealed a number of peptides with strictly conserved alpha-helical coiled coil motifs. CONCLUSION/SIGNIFICANCE: We conclude that the selection of alpha-helical coiled coil structural motifs is a valuable approach to identify potential vaccine targets showing a high degree of conservation.
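The bioinformatic selection of heptad repeats can be pictured with a deliberately simplified scan. In a coiled coil, residues follow a seven-position pattern (a to g) in which positions a and d are typically hydrophobic. The sketch below scores a window by the fraction of hydrophobic residues at a and d in the best of the seven possible registers. The hydrophobic residue set, the 28-residue window, the 0.8 threshold and the function names are all assumptions. This is not the prediction tool or probability score used in the study.

```python
# Residues treated as hydrophobic in this sketch (an assumption, not a standard).
HYDROPHOBIC = set("AILMFVWY")

def heptad_score(window):
    """Best fraction of hydrophobic residues at heptad positions a and d
    over the seven possible registers of the window."""
    best = 0.0
    for register in range(7):
        core = [aa for i, aa in enumerate(window) if (i - register) % 7 in (0, 3)]
        if core:
            best = max(best, sum(aa in HYDROPHOBIC for aa in core) / len(core))
    return best

def scan(sequence, window=28, threshold=0.8):
    """Return (start, score) for every window whose score reaches the threshold."""
    hits = []
    for start in range(len(sequence) - window + 1):
        score = heptad_score(sequence[start:start + window])
        if score >= threshold:
            hits.append((start, score))
    return hits

# Invented test sequence: four idealized heptads flanked by glycine linkers.
idealized = "LEELKKK" * 4
hits = scan("G" * 10 + idealized + "G" * 10)
```

A SNP that replaces a residue at an a or d position lowers this score, which is the intuition behind asking whether observed polymorphisms change the coiled coil prediction.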
Summary:
Abstract: Copy number variation (CNV) of DNA segments has recently gained considerable interest as a source of genetic variation likely to play a role in phenotypic diversity and evolution. Much effort has been put into the identification and mapping of regions that vary in copy number among seemingly normal individuals, both in humans and in a number of model organisms, using both bioinformatic and hybridization-based methods. Synteny studies suggest the existence of CNV hotspots in mammalian genomes, often in connection with regions of segmental duplication. CNV alleles can be in equilibrium within a population, but can also arise de novo between generations, illustrating the highly dynamic nature of these regions. A small number of studies have assessed the effect of CNV on single loci; at the genome-wide scale, however, the functional impact of CNV remains poorly studied. We have explored the influence of CNV on gene expression, first using the Williams-Beuren syndrome (WBS) associated deletion as a model, and second at the genome-wide scale in inbred mouse strains. We found that the WBS deletion influences the expression levels not only of the hemizygous genes, but also of the euploid genes mapping nearby. Consistently, on a genome-wide scale we observe that CNV genes are expressed at more variable levels than genes that do not vary in copy number. Likewise, CNVs influence the relative expression levels of genes that map to the flanks of the genomic rearrangements, thus globally influencing tissue transcriptomes. Further studies are warranted to complete the cataloguing and fine mapping of CNV regions, as well as to elucidate the different mechanisms by which CNVs influence gene expression.
Résumé: Copy number variation (CNV) of DNA segments is attracting interest as a form of genetic variation likely to play a role in phenotypic diversity and evolution. Regions that vary in copy number among seemingly normal individuals have been mapped and catalogued using DNA microarrays and bioinformatic analysis. The study of synteny across several mammalian species suggests the existence of regions with a high rate of variation, often linked to segmental duplications. CNV alleles can be in equilibrium within a population or can arise de novo. These facts illustrate the highly dynamic nature of these regions. A few studies have examined the effect of copy number variation at isolated loci; however, the impact of this phenomenon has not been studied at the genomic scale. We examined the influence of CNVs on gene expression. We first used the deletion associated with Williams-Beuren syndrome (WBS) and then extended our study to the genome scale in inbred mouse strains. We established that the WBS deletion influences the expression not only of the hemizygous genes but also of neighbouring euploid genes. At the genomic scale, we observe concordant phenomena: the expression of genes that vary in copy number is more variable than that of genes that do not. Moreover, like the WBS deletion, CNVs influence the expression of adjacent genes, thereby exerting a global impact on expression profiles in tissues.
Summary for a general audience: Many diseases are caused by a genetic defect. Among the types of mutation are the loss (deletion) of a part of our genome or its duplication. Although the anomalies associated with certain diseases are known, the molecular mechanisms by which these rearrangements of our genetic material cause disease are still poorly understood. This is why we studied gene regulation in regions prone to deletion or duplication. In this work, we showed that deletions and duplications influence the regulation of nearby genes, and that these changes occur in several organs.
Summary:
The drivers of species diversification and persistence are of great interest to current biogeography, especially in those global biodiversity hotspots harbouring most of Earth's animal and plant life. Classical multispecies biogeographical work has yielded fascinating insights into broad-scale patterns of diversification, and DNA-based intraspecific phylogeographical studies have started to complement this picture at much finer temporal and spatial scales. The advent of novel next-generation sequencing (NGS) technologies provides the opportunity to greatly scale up the numbers of individuals, populations and species sampled, potentially merging intraspecific and interspecific approaches to biogeographical inference. Here, we outline these prospects and issues by using the example of an undisputed hotspot, the Cape of southern Africa. We outline the current state of knowledge on the biogeography of species diversification within the Cape, review the literature for phylogeographical evidence of its likely drivers and mechanisms, and suggest possible ways forward based on NGS approaches. We demonstrate the potential of these methods and current bioinformatic issues with the help of restriction-site-associated DNA (RAD) sequencing data for three highly divergent species of the Restionaceae, an important plant radiation in the Cape. A thorough understanding of the mechanisms that facilitate species diversification and persistence in spatially structured, species-rich environments will require the adoption of novel genomic and bioinformatic tools in biogeographical studies.
Summary:
We had previously identified a 45 kD form of IKKα that is expressed specifically in the nucleus of colorectal cancer cells. We had also shown that this form is phosphorylated, which suggests that it is an active form of IKKα. One of our objectives was to determine the mechanism by which this form of the kinase is generated. Using bioinformatic tools, we identified possible proteolytic processing sites in the IKKα sequence that could be responsible for generating the 45 kD fragment present in tumour cells. We then showed that the proteases identified in silico are able to process IKKα in vitro, specifically at the predicted sites. By mutating the candidate processing sites, we established that only one of the bioinformatically identified sites, corresponding to Cathepsin B/L, was functional in vitro, whereas the other predicted sites were not. Likewise, ectopic expression of Cathepsin B or L is able to produce IKKα processing. Conversely, inhibiting protease activity with specific inhibitors blocks IKKα processing in tumour cells. Finally, we showed that Cathepsin B and L, the proteases identified as responsible for processing IKKα, are overexpressed in most of the human colon cancer samples analysed compared with adjacent normal tissue from the same patients.
Summary:
The analysis of genetic data for human immunodeficiency virus type 1 (HIV-1) and human T-cell lymphotropic virus type 1 (HTLV-1) is essential to improve treatment and public health strategies as well as to select strains for vaccine programs. However, the analysis of large quantities of genetic data requires collaborative efforts in bioinformatics, computational biology, molecular biology, evolution, and medical science. The objective of this study was to review and improve the molecular epidemiology of HIV-1 and HTLV-1 viruses isolated in Brazil using bioinformatic tools available in the Laboratório Avançado de Saúde Pública (Lasp) bioinformatics unit. The analysis of HIV-1 isolates confirmed a heterogeneous distribution of the viral genotypes circulating in the country. The Brazilian HIV-1 epidemic is characterized by the presence of multiple subtypes (B, F1, C) and B/F1 recombinant viruses, whereas most of the HTLV-1 sequences were classified as the Transcontinental subgroup of the Cosmopolitan subtype. Despite the high variation among HIV-1 subtypes, protein glycosylation and phosphorylation domains were conserved in the pol, gag, and env genes of the Brazilian HIV-1 strains, suggesting constraints in the HIV-1 evolution process. As expected, the functional protein sites were highly conserved in the HTLV-1 env gene sequences. Furthermore, the presence of these functional sites in HIV-1 and HTLV-1 strains could help in the development of vaccines that pre-empt the viral escape process.
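One way to check whether a functional site is conserved across strains is to scan each protein sequence for the site's motif and intersect the positions. The sketch below does this for the N-glycosylation sequon, written in PROSITE style as N-{P}-[ST]-{P}. It assumes ungapped sequences of equal length, the three short sequences are invented, and the function names are hypothetical. It is not the Lasp unit's actual tooling.

```python
import re

# N-glycosylation sequon N-{P}-[ST]-{P}; the lookahead lets overlapping sites be found.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")

def sequon_positions(protein):
    """0-based start positions of every N-glycosylation sequon in a protein."""
    return [m.start() for m in SEQUON.finditer(protein)]

def conserved_sequons(aligned):
    """Positions carrying a sequon in every sequence of an ungapped,
    equal-length alignment."""
    shared = set(sequon_positions(aligned[0]))
    for seq in aligned[1:]:
        shared &= set(sequon_positions(seq))
    return sorted(shared)

# Invented toy sequences standing in for aligned strain proteins.
strains = [
    "MNGTAANPSQNKTA",
    "MNGSAAQPSQNKTA",
    "MDGTAANPSQNRSA",
]
print(conserved_sequons(strains))
```

Here only the site at position 10 survives in all three toy strains, even though the residues inside the motif differ. That is the sense in which a functional site can stay conserved despite sequence variation.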
Summary:
Résumé: As an important source of energy, plants are constantly attacked by pathogens. Unable to move, they have developed sophisticated defense systems to fight these predators. Among these systems, signalling pathways involving endogenous elicitors such as jasmonates induce the production of defense proteins such as the so-called pathogenesis-related proteins. The genes encoding these proteins belong to multigene families. The first goal of this thesis is to evaluate the number of these genes in the genome of Arabidopsis thaliana and to estimate the share of this defense system that depends on the jasmonate signalling pathway. We defined a cluster of only 15 out of 266 pathogenesis-related genes that are regulated exclusively by jasmonates. Multiple members of the jacalin-type lectin and trypsin inhibitor families appear to depend on jasmonate. Present in all eukaryotic immune systems, the defensin family is a very interesting one. In Arabidopsis thaliana, 317 defensin-like proteins have been defined, but only 15 defensins (PDF) are well annotated. These 15 defensins are split into two groups, one of which appears to have evolved more recently. The second goal of this thesis is to study this group of defensins using bioinformatics and molecular biology techniques (reporter gene, real-time PCR). We showed that this group contains an interesting acidic defensin, PDF1.5, which appears to have undergone positive selection. This protein had never been studied before. Contrary to what we expected, we established that this protein may have a defense-related biological activity. This thesis work clarified the number of pathogenesis-related genes induced by the jasmonate pathway and provided initial answers to the question of defense gene redundancy. In conclusion, although many gene families involved in defense are well defined in Arabidopsis, many studies remain to be done on each of their members.
Abstract: Being an important source of energy, plants are constantly attacked by herbivores and pathogens. As sessile organisms, they have developed sophisticated defense responses to cope with attack. Among these responses, signalling pathways using endogenous elicitors, including jasmonates (JA), allow the plant to induce the production of defense proteins such as pathogenesis-related (PR) proteins. The genes encoding these proteins belong to multigenic families. The first goal of this thesis was to evaluate the number of PR genes in the genome of Arabidopsis thaliana and estimate how much of this plant defense system was dependent on the jasmonate signalling pathway in leaves. Surprisingly, a cluster of only 15 genes out of 266 PR genes was exclusively regulated by JA. Multiple members of the jacalin lectin and trypsin inhibitor gene families were shown to be regulated by JA. Present in all eukaryotic immune systems, defensins are an attractive PR family to study. In Arabidopsis thaliana, 317 defensin-related proteins have been found but just 15 defensins (i.e. the PDF family) are well annotated. These defensins are split into 2 groups. One of these groups may have appeared and diversified recently. The second goal of this thesis was to study this defensin gene group by combining bioinformatic, reporter gene and quantitative PCR techniques. We have shown that this group contains an interesting acidic defensin, PDF1.5, which seems to have undergone positive selection. No information was available on this protein. We have established that this protein may have a biological activity in plant defense. This thesis allowed us to define the number of PR genes induced by the jasmonate pathway and gave initial leads to explain the redundancy of the PR genes in the genome of Arabidopsis. In conclusion, even if many defense gene families are already defined in the Arabidopsis genome, much work remains to be done on individual members.