931 results for Bioinformatics


Relevance: 10.00%

Abstract:

Dissertation for Ph.D. degree in Biomedical Engineering.

Relevance: 10.00%

Abstract:

In mammalian circadian clockwork, the CLOCK-BMAL1 complex binds to DNA enhancers of target genes and drives circadian oscillation of transcription. Here we identified 7,978 CLOCK-binding sites in mouse liver by chromatin immunoprecipitation-sequencing (ChIP-Seq), and a newly developed bioinformatics method, motif centrality analysis of ChIP-Seq (MOCCS), revealed a genome-wide distribution of previously unappreciated noncanonical E-boxes targeted by CLOCK. In vitro promoter assays showed that CACGNG, CACGTT, and CATG(T/C)G are functional CLOCK-binding motifs. Furthermore, we extensively revealed rhythmically expressed genes by poly(A)-tailed RNA-Seq and identified 1,629 CLOCK target genes within 11,926 genes expressed in the liver. Our analysis also revealed rhythmically expressed genes that have no apparent CLOCK-binding site, indicating the importance of indirect transcriptional and posttranscriptional regulations. Indirect transcriptional regulation is represented by rhythmic expression of CLOCK-regulated transcription factors, such as Krüppel-like factors (KLFs). Indirect posttranscriptional regulation involves rhythmic microRNAs that were identified by small-RNA-Seq. Collectively, CLOCK-dependent direct transactivation through multiple E-boxes and indirect regulations polyphonically orchestrate dynamic circadian outputs.
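The three motifs reported in this abstract (CACGNG, CACGTT and CATG(T/C)G) can be located in a sequence with a simple pattern scan. The sketch below is only an illustration under stated assumptions: it is not the MOCCS method, the function names `reverse_complement` and `find_motifs` are hypothetical, and it assumes plain A/C/G/T input with `N` meaning any base.

```python
import re

# Motifs named in the abstract, written as regular expressions.
# N is expanded to any base; (T/C) becomes a two-base character class.
MOTIF_PATTERNS = {
    "CACGNG": "CACG[ACGT]G",
    "CACGTT": "CACGTT",
    "CATG(T/C)G": "CATG[TC]G",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_motifs(seq):
    """Return sorted (position, strand, motif) hits on both strands.

    Positions are 0-based start coordinates on the forward strand.
    """
    seq = seq.upper()
    rc = reverse_complement(seq)
    hits = []
    for name, pattern in MOTIF_PATTERNS.items():
        # A lookahead reports overlapping occurrences as well.
        regex = re.compile("(?=(" + pattern + "))")
        for m in regex.finditer(seq):
            hits.append((m.start(), "+", name))
        for m in regex.finditer(rc):
            # Map the reverse-strand coordinate back to the forward strand.
            start = len(seq) - m.start() - len(m.group(1))
            hits.append((start, "-", name))
    return sorted(hits)


if __name__ == "__main__":
    for hit in find_motifs("TTCACGTTAAGGCATGCGAA"):
        print(hit)
```

Because some E-box-like motifs are palindromic, the same site can be reported on both strands; a real analysis would decide how to merge such hits.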

Relevance: 10.00%

Abstract:

Despite the common assumption that orthologs usually share the same function, there have been various reports of divergence between orthologs, even among species as close as mammals. The comparison of mouse and human is of special interest, because mouse is often used as a model organism to understand human biology. We review the literature on evidence for divergence between human and mouse orthologous genes, and discuss it in the context of biomedical research.

Relevance: 10.00%

Abstract:

Among the largest resources for biological sequence data is the large number of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts, but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error-prone coding regions have shown good performance in detecting and predicting these regions while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with error correction based on codon usage bias into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.

Relevance: 10.00%

Abstract:

Phosphate is transferred from the roots to the shoot via the xylem. The AtPHO1 protein has previously been shown to be required for the transfer of phosphate to the xylem vessels of the root in the model plant Arabidopsis thaliana. Sequencing and annotation of the Arabidopsis genome allowed the identification of ten sequences that show a significant level of similarity with the AtPHO1 gene. These ten genes, of unknown function, constitute a new gene family called the AtPHO1 gene family. Based on a molecular and genetic study, this thesis provides information needed to understand the role of the AtPHO1 family members in Arabidopsis. First, a bioinformatics analysis of the protein sequences revealed that the AtPHO sequences contain, in the N-terminal hydrophilic region, a domain called SPX that is conserved among multiple proteins involved in phosphate homeostasis in yeast. This finding reinforces the hypothesis that all AtPHO1 family members have, like AtPHO1, a role in phosphate homeostasis. In parallel, the expression pattern of the AtPHO genes in Arabidopsis was identified by analysing transgenic plants expressing the uidA reporter gene under the control of the respective AtPHO promoter regions, which gave an expression profile of each AtPHO gene over the course of plant development. The results show predominant expression of AtPHO genes in the vascular tissues of roots, leaves, stems and flowers, implying that these genes could have redundant functions in the transfer of phosphate to the vascular cylinder of these organs.
GUS expression driven by several AtPHO promoter regions was also detected in non-vascular tissue, indicating a putative broader role of AtPHO genes in the acquisition or recycling of phosphate in the plant. In a second step, analysis of AtPHO gene expression during phosphate starvation established that only the expression of AtPHO1, AtPHO1;H1 and AtPHO1;H10 is regulated by Pi starvation. Interestingly, a closer study of their expression under various treatments affecting Pi homeostasis in the plant showed that these three genes are regulated by different signalling pathways. A detailed analysis of the signalling pathways regulating the expression of the AtPHO1;H10 gene in wounded or dehydrated Arabidopsis leaves followed.
Surprisingly, the expression of AtPHO1;H10 was found to be regulated by OPDA (the precursor of JA) but not by JA itself, and to depend on the COI1 protein (the central regulator of the JA signalling pathway), making this gene the first marker gene of a new OPDA-induced, COI1-dependent signalling pathway. These results demonstrate for the first time that OPDA and JA can activate distinct genes via COI1-dependent pathways. Finally, this thesis identifies a novel role of the AtPHO1 protein in the regulation of ABA action in Arabidopsis during stomatal closure and seed germination. Although the exact functions of the AtPHO proteins remain to be determined, these findings suggest that AtPHO1, and by extension other AtPHO proteins, could mediate the propagation of various signals in the plant by modulating the membrane potential and/or by affecting cellular ion composition, as is the case for many ion transporters and regulators of ion transport.

Relevance: 10.00%

Abstract:

The identification and quantification of proteins and lipids is of major importance for the diagnosis, prognosis and understanding of the molecular mechanisms involved in disease development. Owing to its selectivity and sensitivity, mass spectrometry has become a key technique in analytical platforms for proteomic and lipidomic investigations. Using this technique, many strategies have been developed based on unbiased or targeted approaches to highlight or monitor molecules of interest from biomatrices. Although these approaches have largely been employed in cancer research, this type of investigation has been met by a growing interest in the field of cardiovascular disorders, potentially leading to the discovery of novel biomarkers and the development of new therapies. In this paper, we will review the different mass spectrometry-based proteomic and lipidomic strategies applied in cardiovascular diseases, especially atherosclerosis. Particular attention will be given to recent developments and the role of bioinformatics in data treatment. This review will be of broad interest to the medical community by providing a tutorial of how mass spectrometric strategies can support clinical trials.

Relevance: 10.00%

Abstract:

MOTIVATION: Microarray results accumulated in public repositories are widely reused in meta-analytical studies and secondary databases. The quality of the data obtained with this technology varies from experiment to experiment, and an efficient method for quality assessment is necessary to ensure their reliability. RESULTS: The lack of a good benchmark has hampered evaluation of existing methods for quality control. In this study, we propose a new independent quality metric that is based on evolutionary conservation of expression profiles. We show, using 11 large organ-specific datasets, that IQRray, a new quality metric that we developed, exhibits the highest correlation with this reference metric among the 14 metrics tested. IQRray outperforms other methods in identifying poor-quality arrays in datasets composed of arrays from many independent experiments. In contrast, the performance of methods designed for detecting outliers in a single experiment, such as Normalized Unscaled Standard Error and Relative Log Expression, was low because these methods cannot detect datasets containing only low-quality arrays and because their scores cannot be directly compared between experiments. AVAILABILITY AND IMPLEMENTATION: The R implementation of IQRray is available at: ftp://lausanne.isb-sib.ch/pub/databases/Bgee/general/IQRray.R. CONTACT: Marta.Rosikiewicz@unil.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Relevance: 10.00%

Abstract:

Drug delivery is one of the most common clinical routines in hospitals, and is critical to patients' health and recovery. It includes a decision-making process in which a medical doctor decides the amount (dose) and frequency (dose interval) on the basis of a set of available patient feature data and the doctor's clinical experience (a priori adaptation). This process can be computerized to make the prescription procedure fast, objective, inexpensive, non-invasive and accurate. This paper proposes a Drug Administration Decision Support System (DADSS) to help clinicians and patients compute the initial dose. The system is based on a Support Vector Machine (SVM) algorithm that estimates the potential drug concentration in a patient's blood, from which the best combination of dose and dose interval is selected at the level of the DSS. Adding the RANdom SAmple Consensus (RANSAC) technique enhances prediction accuracy by selecting inliers for SVM modeling. Experiments on a case study of the drug imatinib show more than 40% improvement in prediction accuracy compared with previous work. An important extension to the patient feature data is also proposed in this paper.

Relevance: 10.00%

Abstract:

BACKGROUND: Superinfection with drug resistant HIV strains could potentially contribute to compromised therapy in patients initially infected with drug-sensitive virus and receiving antiretroviral therapy. To investigate the importance of this potential route to drug resistance, we developed a bioinformatics pipeline to detect superinfection from routinely collected genotyping data, and assessed whether superinfection contributed to increased drug resistance in a large European cohort of viremic, drug treated patients. METHODS: We used sequence data from routine genotypic tests spanning the protease and partial reverse transcriptase regions in the Virolab and EuResist databases that collated data from five European countries. Superinfection was indicated when sequences of a patient failed to cluster together in phylogenetic trees constructed with selected sets of control sequences. A subset of the indicated cases was validated by re-sequencing pol and env regions from the original samples. RESULTS: 4425 patients had at least two sequences in the database, with a total of 13816 distinct sequence entries (of which 86% belonged to subtype B). We identified 107 patients with phylogenetic evidence for superinfection. In 14 of these cases, we analyzed newly amplified sequences from the original samples for validation purposes: only 2 cases were verified as superinfections in the repeated analyses, the other 12 cases turned out to involve sample or sequence misidentification. Resistance to drugs used at the time of strain replacement did not change in these two patients. A third case could not be validated by re-sequencing, but was supported as superinfection by an intermediate sequence with high degenerate base pair count within the time frame of strain switching. Drug resistance increased in this single patient. 
CONCLUSIONS: Routine genotyping data are informative for the detection of HIV superinfection; however, most cases of non-monophyletic clustering in patient phylogenies arise from sample or sequence mix-up rather than from superinfection, which emphasizes the importance of validation. Non-transient superinfection was rare in our mainly treatment experienced cohort, and we found a single case of possible transmitted drug resistance by this route. We therefore conclude that in our large cohort, superinfection with drug resistant HIV did not compromise the efficiency of antiretroviral treatment.

Relevance: 10.00%

Abstract:

As processing nodes gain capacity in terms of computing power, more and more data-intensive applications, such as bioinformatics applications, will be run on non-dedicated clusters. Non-dedicated clusters are characterized by their ability to combine the execution of local users' applications with scientific or commercial applications executed in parallel. Knowing what effect data-intensive applications have on a mix of other application types (batch, interactive, SRT, etc.) in non-dedicated environments allows more efficient scheduling policies to be developed. Some I/O-intensive applications are based on the MapReduce paradigm, in which the frameworks that run them, such as Hadoop, handle data locality and load balancing automatically and work with distributed file systems. Hadoop's performance can be improved without increasing hardware costs by tuning several key configuration parameters to the cluster's specifications, the size of the input data and the complexity of the processing. Tuning these parameters may be too complex for the user and/or administrator, but it aims to ensure more suitable performance. This work proposes evaluating the impact of I/O-intensive applications on job scheduling in non-dedicated clusters under the MPI and MapReduce paradigms.

Relevance: 10.00%

Abstract:

Given the rapid increase of species with a sequenced genome, the need to identify orthologous genes between them has emerged as a central bioinformatics task. Many different methods exist for orthology detection, which makes it difficult to decide which one to choose for a particular application. Here, we review the latest developments and issues in the orthology field, and summarize the most recent results reported at the third 'Quest for Orthologs' meeting. We focus on community efforts such as the adoption of reference proteomes, standard file formats and benchmarking. Progress in these areas is good, and they are already beneficial to both orthology consumers and providers. However, a major current issue is that the massive increase in complete proteomes poses computational challenges to many of the ortholog database providers, as most orthology inference algorithms scale at least quadratically with the number of proteomes. The Quest for Orthologs consortium is an open community with a number of working groups that join efforts to enhance various aspects of orthology analysis, such as defining standard formats and datasets, documenting community resources and benchmarking. AVAILABILITY AND IMPLEMENTATION: All such materials are available at http://questfororthologs.org. CONTACT: erik.sonnhammer@scilifelab.se or c.dessimoz@ucl.ac.uk.
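The scaling remark in this abstract can be made concrete with a little arithmetic. The sketch below assumes an all-against-all strategy in which every pair of proteomes is compared exactly once; that strategy and the function name `pairwise_comparisons` are illustrative assumptions, not something the abstract specifies for any particular database.

```python
def pairwise_comparisons(n_proteomes):
    """Number of unordered proteome pairs, n * (n - 1) / 2."""
    return n_proteomes * (n_proteomes - 1) // 2


if __name__ == "__main__":
    # A tenfold increase in proteomes gives roughly a hundredfold
    # increase in pairwise comparisons.
    for n in (10, 100, 1000):
        print(n, pairwise_comparisons(n))
    # prints:
    # 10 45
    # 100 4950
    # 1000 499500
```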

Relevance: 10.00%

Abstract:

High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits.isb-sib.ch).

Relevance: 10.00%

Abstract:

Many significant advances in dermatology were published during 2009, focusing on infectious diseases, inflammatory disorders and oncology. Molecular medicine, as a result of the Human Genome Project, is also changing the field of dermatology. Bioinformatics and biotechnology are revolutionizing daily clinical practice in dermatology. A paradigm shift is occurring, notably in infectious diseases.

Relevance: 10.00%

Abstract:

Nowadays biology yields large amounts of data that only computing can handle. Bioinformatics applications are the most important analysis and comparison tools we have for understanding life and deciphering these data. This project focuses on the study of applications dedicated to aligning genetic sequences, and more specifically on two optimal algorithms based on dynamic programming: Needleman-Wunsch and Smith-Waterman. With the aim of improving the performance of these algorithms when aligning large sequences, we propose several implementation versions. We seek to improve performance in both time and space. To achieve better results we exploit parallelism. We compare the results of the analyses of these versions to obtain the data needed to assess cost, gain and performance.
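The two dynamic-programming algorithms named in this abstract can be sketched in a few lines. The version below is a minimal illustration, not the project's implementation: it computes only the optimal score (no traceback), keeps just two rows of the matrix to save space, runs sequentially rather than in parallel, and uses an assumed scoring scheme (match +1, mismatch -1, linear gap penalty -2).

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-2):
    """Optimal global alignment score using two rows of the DP matrix."""
    # Row 0: aligning the empty prefix of a against prefixes of b.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr[j] = max(diag, prev[j] + gap, curr[j - 1] + gap)
        prev = curr
    return prev[-1]


def smith_waterman_score(a, b, match=1, mismatch=-1, gap=-2):
    """Optimal local alignment score using two rows of the DP matrix."""
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Scores are floored at zero so an alignment can start anywhere.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(needleman_wunsch_score("ACGT", "AGT"))         # global score
    print(smith_waterman_score("XXACGTXX", "YYACGTYY"))  # local score
```

Both functions take time proportional to the product of the sequence lengths but memory proportional to only one sequence length. Recovering the alignment itself in linear space needs a further technique such as Hirschberg's divide-and-conquer method.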

Relevance: 10.00%

Abstract:

Amino acids form the building blocks of all proteins. Naturally occurring amino acids are restricted to a few tens of sidechains, even when considering post-translational modifications and rare amino acids such as selenocysteine and pyrrolysine. However, the potential chemical diversity of amino acid sidechains is nearly infinite. Exploiting this diversity by using non-natural sidechains to expand the building blocks of proteins and peptides has recently found widespread applications in biochemistry, protein engineering and drug design. Despite these applications, there is currently no unified online bioinformatics resource for non-natural sidechains. With the SwissSidechain database (http://www.swisssidechain.ch), we offer a central and curated platform about non-natural sidechains for researchers in biochemistry, medicinal chemistry, protein engineering and molecular modeling. SwissSidechain provides biophysical, structural and molecular data for hundreds of commercially available non-natural amino acid sidechains, both in L- and D-configurations. The database can be easily browsed by sidechain names, families or physico-chemical properties. We also provide plugins to seamlessly insert non-natural sidechains into peptides and proteins using molecular visualization software, as well as topologies and parameters compatible with molecular mechanics software.