911 resultados para Molecular Sequence Data
Resumo:
Les trichothécènes de Fusarium appartiennent au groupe des sesquiterpènes qui sont des inhibiteurs la synthèse des protéines des eucaryotes. Les trichothécènes causent d’une part de sérieux problèmes de santé aux humains et aux animaux qui ont consommé des aliments infectés par le champignon et de l’autre part, elles sont des facteurs importants de la virulence chez plantes. Dans cette étude, nous avons isolé et caractérisé seize isolats de Fusarium de la pomme de terre infectée naturellement dans un champs. Les tests de pathogénicité ont été réalisés pour évaluer la virulence des isolats sur la pomme de terre ainsi que leur capacité à produire des trichothécènes. Nous avons choisi F. sambucinum souche T5 comme un modèle pour cette étude parce qu’il était le plus agressif sur la pomme de terre en serre en induisant un flétrissement rapide, un jaunissement suivi de la mort des plantes. Cette souche produit le 4,15-diacétoxyscirpénol (4,15-DAS) lorsqu’elle est cultivée en milieu liquide. Nous avons amplifié et caractérisé cinq gènes de biosynthèse trichothécènes (TRI5, TRI4, TRI3, TRI11, et TRI101) impliqués dans la production du 4,15-DAS. La comparaison des séquences avec les bases de données a montré 98% et 97% d'identité de séquence avec les gènes de la biosynthèse des trichothécènes chez F. sporotrichioides et Gibberella zeae, respectivement. Nous avons confrenté F. sambucinum avec le champignon mycorhizien à arbuscule Glomus irregulare en culture in vitro. Les racines de carotte et F. sambucinum seul, ont été utilisés comme témoins. Nous avons observé que la croissance de F. sambucinum a été significativement réduite avec la présence de G. irregulare par rapport aux témoins. Nous avons remarqué que l'inhibition de la croissance F. sambucinum a été associée avec des changements morphologiques, qui ont été observés lorsque les hyphes de G. irregulare ont atteint le mycélium de F. sambucinum. Ceci suggère que G. irregulare pourrait produire des composés qui inhibent la croissance de F. sambucinum. Nous avons étudié les patrons d’expression des gènes de biosynthèse de trichothécènes de F. sambucinum en présence ou non de G. irregulare, en utilisant le PCR en temps-réel. Nous avons observé que TRI5 et TRI6 étaient sur-exprimés, tandis que TRI4, TRI13 et TRI101 étaient en sous-exprimés en présence de G. irregulare. Des analyses par chromatographie en phase-gazeuse (GC-MS) montrent clairement que la présence de G. irregulare réduit significativement la production des trichothécènes par F. sambucinum. Le dosage du 4,15-DAS a été réduit à 39 μg/ml milieu GYEP par G. irregulare, comparativement à 144 μg/ml milieu GYEP quand F. sambucinum est cultivé sans G. irregulare. Nous avons testé la capacité de G. irregulare à induire la défense des plants de pomme de terre contre l'infection de F. sambucinum. Des essais en chambre de croissance montrent que G. irregulare réduit significativement l’incidence de la maladie causée par F. sambucinum. Nous avons aussi observé que G. irregulare augmente la biomasse des racines, des feuilles et des tubercules. En utilisant le PCR en temps-réel, nous avons étudié les niveaux d’expression des gènes impliqué dans la défense des plants de pommes de terre tels que : chitinase class II (ChtA3), 1,3-β-glucanase (Glub), peroxidase (CEVI16), osmotin-like protéin (OSM-8e) et pathogenèses-related protein (PR-1). Nous avons observé que G. irregulare a induit une sur-expression de tous ces gènes dans les racines après 72 heures de l'infection avec F. sambucinum. Nous avons également trové que la baisse provoquée par F. sambucinum des gènes Glub et CEVI16 dans les feuilles pourrait etre bloquée par le traitement AMF. Ceci montre que l’inoculation avec G. irregulare constitut un bio-inducteur systémique même dans les parties non infectées par F. sambucinum. En conclusion, cette étude apporte de nouvelles connaissances importantes sur les interactions entre les plants et les microbes, d’une part sur les effets directs des champignons mycorhiziens sur l’inhibition de la croissance et la diminution de la production des mycotoxines chez Fusarium et d’autre part, l’atténuation de la sévérité de la maladie dans des plantes par stimulation leur défense. Les données présentées ouvrent de nouvelles perspectives de bio-contrôle contre les pathogènes mycotoxinogènes des plantes.
Resumo:
Computational Biology is the research are that contributes to the analysis of biological data through the development of algorithms which will address significant research problems.The data from molecular biology includes DNA,RNA ,Protein and Gene expression data.Gene Expression Data provides the expression level of genes under different conditions.Gene expression is the process of transcribing the DNA sequence of a gene into mRNA sequences which in turn are later translated into proteins.The number of copies of mRNA produced is called the expression level of a gene.Gene expression data is organized in the form of a matrix. Rows in the matrix represent genes and columns in the matrix represent experimental conditions.Experimental conditions can be different tissue types or time points.Entries in the gene expression matrix are real values.Through the analysis of gene expression data it is possible to determine the behavioral patterns of genes such as similarity of their behavior,nature of their interaction,their respective contribution to the same pathways and so on. Similar expression patterns are exhibited by the genes participating in the same biological process.These patterns have immense relevance and application in bioinformatics and clinical research.Theses patterns are used in the medical domain for aid in more accurate diagnosis,prognosis,treatment planning.drug discovery and protein network analysis.To identify various patterns from gene expression data,data mining techniques are essential.Clustering is an important data mining technique for the analysis of gene expression data.To overcome the problems associated with clustering,biclustering is introduced.Biclustering refers to simultaneous clustering of both rows and columns of a data matrix. Clustering is a global whereas biclustering is a local model.Discovering local expression patterns is essential for identfying many genetic pathways that are not apparent otherwise.It is therefore necessary to move beyond the clustering paradigm towards developing approaches which are capable of discovering local patterns in gene expression data.A biclusters is a submatrix of the gene expression data matrix.The rows and columns in the submatrix need not be contiguous as in the gene expression data matrix.Biclusters are not disjoint.Computation of biclusters is costly because one will have to consider all the combinations of columans and rows in order to find out all the biclusters.The search space for the biclustering problem is 2 m+n where m and n are the number of genes and conditions respectively.Usually m+n is more than 3000.The biclustering problem is NP-hard.Biclustering is a powerful analytical tool for the biologist.The research reported in this thesis addresses the problem of biclustering.Ten algorithms are developed for the identification of coherent biclusters from gene expression data.All these algorithms are making use of a measure called mean squared residue to search for biclusters.The objective here is to identify the biclusters of maximum size with the mean squared residue lower than a given threshold. All these algorithms begin the search from tightly coregulated submatrices called the seeds.These seeds are generated by K-Means clustering algorithm.The algorithms developed can be classified as constraint based,greedy and metaheuristic.Constarint based algorithms uses one or more of the various constaints namely the MSR threshold and the MSR difference threshold.The greedy approach makes a locally optimal choice at each stage with the objective of finding the global optimum.In metaheuristic approaches particle Swarm Optimization(PSO) and variants of Greedy Randomized Adaptive Search Procedure(GRASP) are used for the identification of biclusters.These algorithms are implemented on the Yeast and Lymphoma datasets.Biologically relevant and statistically significant biclusters are identified by all these algorithms which are validated by Gene Ontology database.All these algorithms are compared with some other biclustering algorithms.Algorithms developed in this work overcome some of the problems associated with the already existing algorithms.With the help of some of the algorithms which are developed in this work biclusters with very high row variance,which is higher than the row variance of any other algorithm using mean squared residue, are identified from both Yeast and Lymphoma data sets.Such biclusters which make significant change in the expression level are highly relevant biologically.
Resumo:
The major objective of the thesis is essentially to evolve and apply certain computational procedures to evaluate the structure and properties of some simple polyatomic molecules making use of spectroscopic data available from the literature. It must be said that though there is dwindling interest in recent times in such analyses, there exists tremendous scope and utility for attempting such calculations as the precision and reliability of'experimental techniques in spectroscopy have increased vastly due to enormous sophistication of the instruments used for these measurements. In the present thesis an attempt is made to extract maximum amount of information regarding the geometrical structure and interatmic forces of simple molecules from the experimental data on microwave and infrared spectra of these molecules
Resumo:
This thesis deals with some studies in molecular mechanic using spectroscopic data. It includes an improvement in the parameter technique for the evaluation of exact force fields, the introduction of a new and simple algebraic method for the force field calculation and a study of asymmetric variation of bonding forces along a bond.
Resumo:
Resolving the relationships between Metazoa and other eukaryotic groups as well as between metazoan phyla is central to the understanding of the origin and evolution of animals. The current view is based on limited data sets, either a single gene with many species (e.g., ribosomal RNA) or many genes but with only a few species. Because a reliable phylogenetic inference simultaneously requires numerous genes and numerous species, we assembled a very large data set containing 129 orthologous proteins (similar to30,000 aligned amino acid positions) for 36 eukaryotic species. Included in the alignments are data from the choanoflagellate Monosiga ovata, obtained through the sequencing of about 1,000 cDNAs. We provide conclusive support for choanoflagellates as the closest relative of animals and for fungi as the second closest. The monophyly of Plantae and chromalveolates was recovered but without strong statistical support. Within animals, in contrast to the monophyly of Coelomata observed in several recent large-scale analyses, we recovered a paraphyletic Coelamata, with nematodes and platyhelminths nested within. To include a diverse sample of organisms, data from EST projects were used for several species, resulting in a large amount of missing data in our alignment (about 25%). By using different approaches, we verify that the inferred phylogeny is not sensitive to these missing data. Therefore, this large data set provides a reliable phylogenetic framework for studying eukaryotic and animal evolution and will be easily extendable when large amounts of sequence information become available from a broader taxonomic range.
Resumo:
Analysis of X-ray powder data for the melt-crystallisable aromatic poly(thioether thioether ketone) [-S-Ar-S-Ar-CO-Ar](n), ('PTTK', Ar= 1,4-phenylene), reveals that it adopts a crystal structure very different from that established for its ether-analogue PEEK. Molecular modelling and diffraction-simulation studies of PTTK show that the structure of this polymer is analogous to that of melt-crystallised poly(thioetherketone) [-SAr-CO-Ar](n) in which the carbonyl linkages in symmetry-related chains are aligned anti-parallel to one another. and that these bridging units are crystallographically interchangeable. The final model for the crystal structure of PTTK is thus disordered, in the monoclinic space group 121a (two chains per unit cell), with cell dimensions a = 7.83, b = 6.06, c = 10.35 angstrom, beta = 93.47 degrees. (c) 2005 Elsevier Ltd. All rights reserved.
Resumo:
Polycondensation of 2,6-dihydroxynaphthalene with 4,4'-bis(4"-fluorobenzoyl)biphenyl affords a novel, semicrystalline poly(ether ketone) with a melting point of 406 degreesC and glass transition temperature (onset) of 168 degreesC. Molecular modeling and diffraction-simulation studies of this polymer, coupled with data from the single-crystal structure of an oligomer model, have enabled the crystal and molecular structure of the polymer to be determined from X-ray powder data. This structure-the first for any naphthalene-containing poly(ether ketone)-is fully ordered, in monoclinic space group P2(1)/b, with two chains per unit cell. Rietveld refinement against the experimental powder data gave a final agreement factor (R-wp) of 6.7%.
Resumo:
Specific monomer sequences in aromatic copolyimides are recognized through their -stacking and hydrogen-bonding interactions with a sterically and electronically complementary molecular tweezer. These interactions enable the tweezer molecule to read monomer sequences comprising up to 27 aromatic rings by multiple adjacent binding to neighboring sites on the polymer chain.
Resumo:
The compounds chlorothiazide and hydrochlorothiazide (crystalline form II) have been studied in their fully hydrogenous forms by powder neutron diffraction on the GEM diffractometer. The results of joint Rietveld refinement of the structures against multi-bank neutron and single-bank X-ray powder data are reported and show that accurate and precise structural information can be obtained from polycrystalline molecular organic materials by this route.
Resumo:
This paper describes a prototype grid infrastructure, called the eMinerals minigrid, for molecular simulation scientists. which is based on an integration of shared compute and data resources. We describe the key components, namely the use of Condor pools, Linux/Unix clusters with PBS and IBM's LoadLeveller job handling tools, the use of Globus for security handling, the use of Condor-G tools for wrapping globus job submit commands, Condor's DAGman tool for handling workflow, the Storage Resource Broker for handling data, and the CCLRC dataportal and associated tools for both archiving data with metadata and making data available to other workers.
Resumo:
Nonstructural protein 3 of the severe acute respiratory syndrome (SARS) coronavirus includes a "SARS-unique domain" (SUD) consisting of three globular domains separated by short linker peptide segments. This work reports NMR structure determinations of the C-terminal domain (SUD-C) and a two-domain construct (SUD-MC) containing the middle domain (SUD-M) and the C-terminal domain, and NMR data on the conformational states of the N-terminal domain (SUD-N) and the SUD-NM two-domain construct. Both SUD-N and SUD-NM are monomeric and globular in solution; in SUD-NM, there is high mobility in the two-residue interdomain linking sequence, with no preferred relative orientation of the two domains. SUD-C adopts a frataxin like fold and has structural similarity to DNA-binding domains of DNA-modifying enzymes. The structures of both SUD-M (previously determined) and SUD-C (from the present study) are maintained in SUD-MC, where the two domains are flexibly linked. Gel-shift experiments showed that both SUD-C and SUD-MC bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases, whereby SUD-MC binds to a more restricted set of purine-containing RNA sequences than SUD-M. NMR chemical shift perturbation experiments with observations of (15)N-labeled proteins further resulted in delineation of RNA binding sites (i.e., in SUD-M, a positively charged surface area with a pronounced cavity, and in SUD-C, several residues of an anti-parallel beta-sheet). Overall, the present data provide evidence for molecular mechanisms involving the concerted actions of SUD-M and SUD-C, which result in specific RNA binding that might be unique to the SUD and, thus, to the SARS coronavirus.
Resumo:
Antimicrobial drug resistance is a global challenge for the 21st century with the emergence of resistant bacterial strains worldwide. Transferable resistance to beta-lactam antimicrobial drugs, mediated by production of extended-spectrum beta-lactamases (ESBLs), is of particular concern. In 2004, an ESBL-carrying IncK plasmid (pCT) was isolated from cattle in the United Kingdom. The sequence was a 93,629-bp plasmid encoding a single antimicrobial drug resistance gene, bla(CTX-M-14). From this information, PCRs identifying novel features of pCT were designed and applied to isolates from several countries, showing that the plasmid has disseminated worldwide in bacteria from humans and animals. Complete DNA sequences can be used as a platform to develop rapid epidemiologic tools to identify and trace the spread of plasmids in clinically relevant pathogens, thus facilitating a better understanding of their distribution and ability to transfer between bacteria of humans and animals.