962 resultados para Molecular Structure
Resumo:
The activation of the specific immune response against tumor cells is based on the recognition by the CD8+ Cytotoxic Τ Lymphocytes (CTL), of antigenic peptides (p) presented at the surface of the cell by the class I major histocompatibility complex (MHC). The ability of the so-called T-Cell Receptors (TCR) to discriminate between self and non-self peptides constitutes the most important specific control mechanism against infected cells. The TCR/pMHC interaction has been the subject of much attention in cancer therapy since the design of the adoptive transfer approach, in which Τ lymphocytes presenting an interesting response against tumor cells are extracted from the patient, expanded in vitro, and reinfused after immunodepletion, possibly leading to cancer regression. In the last decade, major progress has been achieved by the introduction of engineered lypmhocytes. In the meantime, the understanding of the molecular aspects of the TCRpMHC interaction has become essential to guide in vitro and in vivo studies. In 1996, the determination of the first structure of a TCRpMHC complex by X-ray crystallography revealed the molecular basis of the interaction. Since then, molecular modeling techniques have taken advantage of crystal structures to study the conformational space of the complex, and understand the specificity of the recognition of the pMHC by the TCR. In the meantime, experimental techniques used to determine the sequences of TCR that bind to a pMHC complex have been used intensively, leading to the collection of large repertoires of TCR sequences that are specific for a given pMHC. There is a growing need for computational approaches capable of predicting the molecular interactions that occur upon TCR/pMHC binding without relying on the time consuming resolution of a crystal structure. This work presents new approaches to analyze the molecular principles that govern the recognition of the pMHC by the TCR and the subsequent activation of the T-cell. We first introduce TCRep 3D, a new method to model and study the structural properties of TCR repertoires, based on homology and ab initio modeling. We discuss the methodology in details, and demonstrate that it outperforms state of the art modeling methods in predicting relevant TCR conformations. Two successful applications of TCRep 3D that supported experimental studies on TCR repertoires are presented. Second, we present a rigid body study of TCRpMHC complexes that gives a fair insight on the TCR approach towards pMHC. We show that the binding mode of the TCR is correctly described by long-distance interactions. Finally, the last section is dedicated to a detailed analysis of an experimental hydrogen exchange study, which suggests that some regions of the constant domain of the TCR are subject to conformational changes upon binding to the pMHC. We propose a hypothesis of the structural signaling of TCR molecules leading to the activation of the T-cell. It is based on the analysis of correlated motions in the TCRpMHC structure. - L'activation de la réponse immunitaire spécifique dirigée contre les cellules tumorales est basée sur la reconnaissance par les Lymphocytes Τ Cytotoxiques (CTL), d'un peptide antigénique (p) présenté à la suface de la cellule par le complexe majeur d'histocompatibilité de classe I (MHC). La capacité des récepteurs des lymphocytes (TCR) à distinguer les peptides endogènes des peptides étrangers constitue le mécanisme de contrôle le plus important dirigé contre les cellules infectées. L'interaction entre le TCR et le pMHC est le sujet de beaucoup d'attention dans la thérapie du cancer, depuis la conception de la méthode de transfer adoptif: les lymphocytes capables d'une réponse importante contre les cellules tumorales sont extraits du patient, amplifiés in vitro, et réintroduits après immunosuppression. Il peut en résulter une régression du cancer. Ces dix dernières années, d'importants progrès ont été réalisés grâce à l'introduction de lymphocytes modifiés par génie génétique. En parallèle, la compréhension du TCRpMHC au niveau moléculaire est donc devenue essentielle pour soutenir les études in vitro et in vivo. En 1996, l'obtention de la première structure du complexe TCRpMHC à l'aide de la cristallographie par rayons X a révélé les bases moléculaires de l'interaction. Depuis lors, les techniques de modélisation moléculaire ont exploité les structures expérimentales pour comprendre la spécificité de la reconnaissance du pMHC par le TCR. Dans le même temps, de nouvelles techniques expérimentales permettant de déterminer la séquence de TCR spécifiques envers un pMHC donné, ont été largement exploitées. Ainsi, d'importants répertoires de TCR sont devenus disponibles, et il est plus que jamais nécessaire de développer des approches informatiques capables de prédire les interactions moléculaires qui ont lieu lors de la liaison du TCR au pMHC, et ce sans dépendre systématiquement de la résolution d'une structure cristalline. Ce mémoire présente une nouvelle approche pour analyser les principes moléculaires régissant la reconnaissance du pMHC par le TCR, et l'activation du lymphocyte qui en résulte. Dans un premier temps, nous présentons TCRep 3D, une nouvelle méthode basée sur les modélisations par homologie et ab initio, pour l'étude de propriétés structurales des répertoires de TCR. Le procédé est discuté en détails et comparé à des approches standard. Nous démontrons ainsi que TCRep 3D est le plus performant pour prédire des conformations pertinentes du TCR. Deux applications à des études expérimentales des répertoires TCR sont ensuite présentées. Dans la seconde partie de ce travail nous présentons une étude de complexes TCRpMHC qui donne un aperçu intéressant du mécanisme d'approche du pMHC par le TCR. Finalement, la dernière section se concentre sur l'analyse détaillée d'une étude expérimentale basée sur les échanges deuterium/hydrogène, dont les résultats révèlent que certaines régions clés du domaine constant du TCR sont sujettes à un changement conformationnel lors de la liaison au pMHC. Nous proposons une hypothèse pour la signalisation structurelle des TCR, menant à l'activation du lymphocyte. Celle-ci est basée sur l'analyse des mouvements corrélés observés dans la structure du TCRpMHC.
Resumo:
Desmosomes are intercellular adhesive complexes that anchor the intermediate filament cytoskeleton to the cell membrane in epithelia and cardiac muscle cells. The desmosomal component desmoplakin plays a key role in tethering various intermediate filament networks through its C-terminal plakin repeat domain. To gain better insight into the cytoskeletal organization of cardiomyocytes, we investigated the association of desmoplakin with desmin by cell transfection, yeast two-hybrid, and/or in vitro binding assays. The results indicate that the association of desmoplakin with desmin depends on sequences within the linker region and C-terminal extremity of desmoplakin, where the B and C subdomains contribute to efficient binding; a potentially phosphorylatable serine residue in the C-terminal extremity of desmoplakin affects its association with desmin; the interaction of desmoplakin with non-filamentous desmin requires sequences contained within the desmin C-terminal rod portion and tail domain in yeast, whereas in in vitro binding studies the desmin tail is dispensable for association; and mutations in either the C-terminus of desmoplakin or the desmin tail linked to inherited cardiomyopathy seem to impair desmoplakindesmin interaction. These studies increase our understanding of desmoplakin-intermediate filament interactions, which are important for maintenance of cytoarchitecture in cardiomyocytes, and give new insights into the molecular basis of desmoplakin- and desmin-related human diseases.
Resumo:
Genetic structure of populations of Pissodes castaneus (De Geer) (Coleoptera, Curculionidae) using amplified fragment length polymorphism. The objective of this study was to determine the genetic structure of populations of Pissodes castaneus from different areas and on different species of Pinus using the PCR-AFLP technique. Twenty samples were analyzed, representing 19 populations from Brazil and one from Florence, Italy, which is the region of origin of P. castaneus. The four combinations of primers generated a total of 367 fragments of DNA, and 100% of polymorphic loci, indicating high degree of molecular polymorphism. The dendrogram did not reveal trends for grouping the populations in relation to origin. The low genetic similarity (0.11 between the most distant groups) and genetic distances of 0.13 and 0.44 for 10 out of the 20 samples may indicate several founding events or multiple introductions of heterogeneous strains into Brazil. The allelic fixation index (Fst) was 0.3851, considered high, and the number of migrants (Nm) was 0.3991, indicating low gene flow among populations. The highest genetic distances were between the population from Irani, SC and Cambará do Sul, RS and Bituruna, PR, indicating an independent founding event or a particular allelic fixation in the former location. The high genetic diversity among populations points out that the populations are genetically heterogeneous with a diverse gene pool in the surveyed areas, what makes them to respond differently to control measures.
Resumo:
Distinct genetic structure in populations of Chrysoperla externa (Hagen) (Neuroptera, Chrysopidae) shown by genetic markers ISSR and COI gene. Green lacewings are generalist predators, and the species Chrysoperla externa presents a great potential for use in biological control of agricultural pests due to its high predation and reproduction capacities, as well as its easy mass rearing in the laboratory. The adaptive success of a species is related to genetic variability, so that population genetic studies are extremely important in order to maximize success of the biological control. Thus, the present study used nuclear (Inter Simple Sequence Repeat - ISSR) and mitochondrial (Cytochrome Oxidase I - COI) molecular markers to estimate the genetic variability of 12 populations in the São Paulo State, Brazil, as well as the genetic relationships between populations. High levels of genetic diversity were observed for both markers, and the highest values of genetic diversity appear associated with municipalities that have the greatest areas of native vegetation. There was high haplotype sharing, and there was no correlation between the markers and the geographic distribution of the populations. The AMOVA indicated absence of genetic structure for the COI gene, suggesting that the sampled areas formed a single population unit. However, the great genetic differentiation among populations showed by ISSR demonstrates that these have been under differentiation after their expansion or may also reflect distinct dispersal behavior between males and females.
Resumo:
Epithelial Na(+) channel (ENaC)/degenerin family members are involved in mechanosensation, blood pressure control, pain sensation, and the expression of fear. Several of these channel types display a form of desensitization that allows the channel to limit Na(+) influx during prolonged stimulation. We used site-directed mutagenesis and chemical modification, functional analysis, and molecular dynamics simulations to investigate the role of the lower palm domain of the acid-sensing ion channel 1, a member of the ENaC/degenerin family. The lower palm domains of this trimeric channel are arranged around a central vestibule, at ∼20 Å above the plasma membrane and are covalently linked to the transmembrane channel parts. We show that the lower palm domains approach one another during desensitization. Residues in the palm co-determine the pH dependence of desensitization, its kinetics, and the stability of the desensitized state. Mutations of palm residues impair desensitization by preventing the closing movement of the palm. Overexpression of desensitization-impaired channel mutants in central neurons allowed--in contrast to overexpression of wild type--a sustained signaling response to rapid pH fluctuations. We identify and describe here the function of an important regulatory domain that most likely has a conserved role in ENaC/degenerin channels.
Resumo:
Crushed seeds of the Moringa oleifera tree have been used traditionally as natural flocculants to clarify drinking water. We previously showed that one of the seed peptides mediates both the sedimentation of suspended particles such as bacterial cells and a direct bactericidal activity, raising the possibility that the two activities might be related. In this study, the conformational modeling of the peptide was coupled to a functional analysis of synthetic derivatives. This indicated that partly overlapping structural determinants mediate the sedimentation and antibacterial activities. Sedimentation requires a positively charged, glutamine-rich portion of the peptide that aggregates bacterial cells. The bactericidal activity was localized to a sequence prone to form a helix-loop-helix structural motif. Amino acid substitution showed that the bactericidal activity requires hydrophobic proline residues within the protruding loop. Vital dye staining indicated that treatment with peptides containing this motif results in bacterial membrane damage. Assembly of multiple copies of this structural motif into a branched peptide enhanced antibacterial activity, since low concentrations effectively kill bacteria such as Pseudomonas aeruginosa and Streptococcus pyogenes without displaying a toxic effect on human red blood cells. This study thus identifies a synthetic peptide with potent antibacterial activity against specific human pathogens. It also suggests partly distinct molecular mechanisms for each activity. Sedimentation may result from coupled flocculation and coagulation effects, while the bactericidal activity would require bacterial membrane destabilization by a hydrophobic loop.
Resumo:
BAFF, APRIL and their receptors play important immunological roles, especially in the B cell arm of the immune system. A number of splice isoforms have been described for both ligands and receptors in this subfamily, some of which are conserved between mouse and human, while others are species-specific. Structural and mutational analyses have revealed key determinants of receptor-ligand specificity. BAFF-R has a strong selectivity for BAFF; BCMA has a higher affinity for APRIL than for BAFF, while TACI binds both ligands equally well. The molecular signaling events downstream of BAFF-R, BCMA and TACI are still incompletely characterized. Survival appears to be mediated by upregulation of Bcl-2 family members through NF-kappaB activation, degradation of the pro-apototic Bim protein, and control of subcellular localization of PCKdelta. Very little is known about other signaling events associated with receptor engagement by BAFF and APRIL that lead for example to B cell activation or to CD40L-independent Ig switch.
Resumo:
(3R)-hydroxyacyl-CoA dehydrogenase is part of multifunctional enzyme type 2 (MFE-2) of peroxisomal fatty acid beta-oxidation. The MFE-2 protein from yeasts contains in the same polypeptide chain two dehydrogenases (A and B), which possess difference in substrate specificity. The crystal structure of Candida tropicalis (3R)-hydroxyacyl-CoA dehydrogenase AB heterodimer, consisting of dehydrogenase A and B, determined at the resolution of 2.2A, shows overall similarity with the prototypic counterpart from rat, but also important differences that explain the substrate specificity differences observed. Docking studies suggest that dehydrogenase A binds the hydrophobic fatty acyl chain of a medium-chain-length ((3R)-OH-C10) substrate as bent into the binding pocket, whereas the short-chain substrates are dislocated by two mechanisms: (i) a short-chain-length 3-hydroxyacyl group ((3R)-OH-C4) does not reach the hydrophobic contacts needed for anchoring the substrate into the active site; and (ii) Leu44 in the loop above the NAD(+) cofactor attracts short-chain-length substrates away from the active site. Dehydrogenase B, which can use a (3R)-OH-C4 substrate, has a more shallow binding pocket and the substrate is correctly placed for catalysis. Based on the current structure, and together with the structure of the 2-enoyl-CoA hydratase 2 unit of yeast MFE-2 it becomes obvious that in yeast and mammalian MFE-2s, despite basically identical functional domains, the assembly of these domains into a mature, dimeric multifunctional enzyme is very different.
Resumo:
Temocapril is a prodrug whose hydrolysis by carboxylesterase 1 (CES1) yields the active ACE inhibitor temocaprilat. This molecular-dynamics (MD) study uses a resolved structure of the human CES1 (hCES1) to investigate some mechanistic details of temocapril hydrolysis. The ionization constants of temocapril (pK1 and pK3) and temocaprilat (pK1, pK2, and pK3) were determined experimentally and computationally using commercial algorithms. The constants so obtained were in good agreement and revealed that temocapril exists mainly in three ionic forms (a cation, a zwitterion, and an anion), whereas temocaprilat exists in four major ionic forms (a cation, a zwitterion, an anion, and a dianion). All these ionic forms were used as ligands in 5-ns MS simulations. While the cationic and zwitterionic forms of temocapril were involved in an ion-pair bond with Glu255 suggestive of an inhibitor behavior, the anionic form remained in a productive interaction with the catalytic center. As for temocaprilat, its cation appeared trapped by Glu255, while its zwitterion and anion made a slow departure from the catalytic site and a partial egress from the protein. Only its dianion was effectively removed from the catalytic site and attracted to the protein surface by Lys residues. A detailed mechanism of product egress emerges from the simulations.
Resumo:
Summary The specific CD8+ T cell immune response against tumors relies on the recognition by the T cell receptor (TCR) on cytotoxic T lymphocytes (CTL) of antigenic peptides bound to the class I major histocompatibility complex (MHC) molecule. Such tumor associated antigenic peptides are the focus of tumor immunotherapy with peptide vaccines. The strategy for obtaining an improved immune response often involves the design of modified tumor associated antigenic peptides. Such modifications aim at creating higher affinity and/or degradation resistant peptides and require precise structures of the peptide-MHC class I complex. In addition, the modified peptide must be cross-recognized by CTLs specific for the parental peptide, i.e. preserve the structure of the epitope. Detailed structural information on the modified peptide in complex with MHC is necessary for such predictions. In this thesis, the main focus is the development of theoretical in silico methods for prediction of both structure and cross-reactivity of peptide-MHC class I complexes. Applications of these methods in the context of immunotherapy are also presented. First, a theoretical method for structure prediction of peptide-MHC class I complexes is developed and validated. The approach is based on a molecular dynamics protocol to sample the conformational space of the peptide in its MHC environment. The sampled conformers are evaluated using conformational free energy calculations. The method, which is evaluated for its ability to reproduce 41 X-ray crystallographic structures of different peptide-MHC class I complexes, shows an overall prediction success of 83%. Importantly, in the clinically highly relevant subset of peptide-HLAA*0201 complexes, the prediction success is 100%. Based on these structure predictions, a theoretical approach for prediction of cross-reactivity is developed and validated. This method involves the generation of quantitative structure-activity relationships using three-dimensional molecular descriptors and a genetic neural network. The generated relationships are highly predictive as proved by high cross-validated correlation coefficients (0.78-0.79). Together, the here developed theoretical methods open the door for efficient rational design of improved peptides to be used in immunotherapy. Résumé La réponse immunitaire spécifique contre des tumeurs dépend de la reconnaissance par les récepteurs des cellules T CD8+ de peptides antigéniques présentés par les complexes majeurs d'histocompatibilité (CMH) de classe I. Ces peptides sont utilisés comme cible dans l'immunothérapie par vaccins peptidiques. Afin d'augmenter la réponse immunitaire, les peptides sont modifiés de façon à améliorer l'affinité et/ou la résistance à la dégradation. Ceci nécessite de connaître la structure tridimensionnelle des complexes peptide-CMH. De plus, les peptides modifiés doivent être reconnus par des cellules T spécifiques du peptide natif. La structure de l'épitope doit donc être préservée et des structures détaillées des complexes peptide-CMH sont nécessaires. Dans cette thèse, le thème central est le développement des méthodes computationnelles de prédiction des structures des complexes peptide-CMH classe I et de la reconnaissance croisée. Des applications de ces méthodes de prédiction à l'immunothérapie sont également présentées. Premièrement, une méthode théorique de prédiction des structures des complexes peptide-CMH classe I est développée et validée. Cette méthode est basée sur un échantillonnage de l'espace conformationnel du peptide dans le contexte du récepteur CMH classe I par dynamique moléculaire. Les conformations sont évaluées par leurs énergies libres conformationnelles. La méthode est validée par sa capacité à reproduire 41 structures des complexes peptide-CMH classe I obtenues par cristallographie aux rayons X. Le succès prédictif général est de 83%. Pour le sous-groupe HLA-A*0201 de complexes de grande importance pour l'immunothérapie, ce succès est de 100%. Deuxièmement, à partir de ces structures prédites in silico, une méthode théorique de prédiction de la reconnaissance croisée est développée et validée. Celle-ci consiste à générer des relations structure-activité quantitatives en utilisant des descripteurs moléculaires tridimensionnels et un réseau de neurones couplé à un algorithme génétique. Les relations générées montrent une capacité de prédiction remarquable avec des valeurs de coefficients de corrélation de validation croisée élevées (0.78-0.79). Les méthodes théoriques développées dans le cadre de cette thèse ouvrent la voie du design de vaccins peptidiques améliorés.
Resumo:
284 million people worldwide suffered from type 2 diabetes mellitus (T2DM) in 2010, which will, in approximately half of them, lead to the development of diabetic peripheral neuropathy (DPN). Although DPN is the most common complication of diabetes mellitus and the leading cause of non-traumatic amputations its pathophysiology is still poorly understood. To get more insight into the molecular mechanism underlying DPN in T2DM, I used a rodent model of T2DM, the db/db mice.¦ln vivo electrophysiological recordings of diabetic animals indicated that in addition to reduced nerve conduction velocity db/db mice also present increased nerve excitability. Further ex vivo evaluation of the electrophysiological properties of db/db nerves clearly established a presence of the peripheral nerve hyperexcitability (PNH) phenotype in diabetic animals. Using pharmacological inhibitors we demonstrated that PNH is mostly mediated by the decreased activity of Kv1 channels. ln agreement with these data 1 observed that the diabetic condition led to a reduced presence of the Kv1.2 subunits in juxtaparanodal regions of db/db peripheral nerves whereas its mANA and protein expression levels were not affected. Lmportantly, I confirmed a loss of juxtaparanodal Kv1.2 subunits in nerve biopsies from type 2 diabetic patients. Together these observations indicate that the type 2 diabetic condition leads to potassium-channel mediated changes of nerve excitability thus identifying them as potential drug targets to treat sorne of the DPN related symptoms.¦Schwann cells ensheath and isolate peripheral axons by the production of myelin, which consists of lipids and proteins in a ratio of 2:1. Peripheral myelin protein 2 (= P2, Pmp2 or FABP8) was originally described as one of the most abundant myelin proteins in the peripheral nervous system. P2, which is a member of the fatty acid binding protein (FABP) family, is a 14.8 kDa cytosolic protein expressed on the cytoplasmic side of compact myelin membranes. As indicated by their name, the principal role of FABPs is thought to be the binding and transport of fatty acids.¦To study its role in myelinating glial cells I have recently generated a complete P2 knockout mouse model (P2-/-). I confirmed the loss of P2 in the sciatic nerve of P2-/- mice at the mRNA and protein level. Electrophysiological analysis of the adult (P56) mutant mice revealed a mild but significant reduction in the motor nerve conduction velocity. lnterestingly, this functional change was not accompanied by any detectable alterations in general myelin structure. However, I have observed significant alterations in the mRNA expression level of other FABPs, predominantly FABP9, in the PNS of P2-/- mice as compared to age-matched P2+/+ mice indicating a role of P2 in the glial myelin lipid metabolism.¦Le diabète de type 2 touche 284 million de personnes dans le monde en 2010 et son évolution conduit dans la moitié des cas à une neuropathie périphérique diabétique. Bien que la neuropathie périphérique soit la complication la plus courante du diabète pouvant conduire jusqu'à l'amputation, sa physiopathologie est aujourd'hui encore mal comprise. Dans le but d'améliorer les connaissances moléculaires expliquant les mécanismes de la neuropathie liée au diabète de type 2, j'ai utilisé un modèle murin du diabète de type 2, les souris db/db.¦ln vivo, les enregistrements éléctrophysiologiques des animaux diabétiques montrent qu'en plus d'une diminution de la vitesse de conduction nerveuse, les souris db/db présentent également une augmentation de l'excitabilité nerveuse. Des mesures menées Ex vivo ont montré l'existence d'un phénotype d'hyperexcitabilité sur les nerfs périphériques isolés d'animaux diabétiques. Grâce à l'utilisation d'inhibiteurs pharmacologiques, nous avons pu démontrer que l'hyperexcitabilité démontrée était due à une réduction d'activité des canaux Kv1. En accord avec ces données, j'ai observé qu'une situation de diabète conduisait à une diminution des canaux Kv1.2 aux régions juxta-paranodales des nerfs périphériques db/db, alors que l'expression du transcrit et de la protéine restait stable. J'ai également confirmé l'absence de canaux Kv1.2 aux juxta-paranoeuds de biopsies de nerfs de patients diabétiques. L'ensemble de ces observations montrent que les nerfs périphériques chez les patients atteints de diabète de type 2 est due à une diminution des canaux potassiques rapides juxtaparanodaux les identifiant ainsi comme des cibles thérapeutiques potentielles.¦Les cellules de Schwann enveloppent et isolent les axones périphériques d'une membrane spécialisée, la myéline, composée de deux fois plus de lipides que de protéines. La protéine P2 (Pmp2 "peripheral myelin protein 2" ou FABP8 "fatty acid binding protein") est l'une des protéines les plus abondantes au système nerveux périphérique. P2 appartient à la famille de protéines FABP liant et transportant les acides gras et est une protéine cytosolique de 14,8 kDa exprimée du côté cytoplasmique de la myéline compacte.¦Afin d'étudier le rôle de P2 dans les cellules de Schwann myélinisantes, j'ai généré une souris knockout (P2-/-). Après avoir validé l'absence de transcrit et de protéine P2 dans les nerfs sciatiques P2-/-, des mesures électrophysiologiques ont montré une réduction modérée mais significative de la vitesse de conduction du nerf moteur périphérique. Il est important de noter que ces changements fonctionnels n'ont pas pu être associés à quelconque changement dans la structure de la myéline. Cependant, j'ai observé dans les nerfs périphériques P2-/-, une altération significative du niveau d'expression d'ARNm d'autres FABPs et en particulier FABP9. Ce dernier résultat démontre l'importance du rôle de la protéine P2 dans le métabolisme lipidique de la myéline.
Resumo:
Molecular shape has long been known to be an important property for the process of molecular recognition. Previous studies postulated the existence of a drug-like shape space that could be used to artificially bias the composition of screening libraries, with the aim to increase the chance of success in Hit Identification. In this work, it was analysed to which extend this assumption holds true. Normalized Principal Moments of Inertia Ratios (NPRs) have been used to describe the molecular shape of small molecules. It was investigated, whether active molecules of diverse targets are located in preferred subspaces of the NPR shape space. Results illustrated a significantly stronger clustering than could be expected by chance, with parts of the space unlikely to be occupied by active compounds. Furthermore, a strong enrichment of elongated, rather flat shapes could be observed, while globular compounds were highly underrepresented. This was confirmed for a wide range of small molecule datasets from different origins. Active compounds exhibited a high overlap in their shape distributions across different targets, making a purely shape based discrimination very difficult. An additional perspective was provided by comparing the shapes of protein binding pockets with those of their respective ligands. Although more globular than their ligands, it was observed that binding sites shapes exhibited a similarly skewed distribution in shape space: spherical shapes were highly underrepresented. This was different for unoccupied binding pockets of smaller size. These were on the contrary identified to possess a more globular shape. The relation between shape complementarity and exhibited bioactivity was analysed; a moderate correlation between bioactivity and parameters including pocket coverage, distance in shape space, and others could be identified, which reflects the importance of shape complementarity. However, this also suggests that other aspects are of relevance for molecular recognition. A subsequent analysis assessed if and how shape and volume information retrieved from pocket or respective reference ligands could be used as a pre-filter in a virtual screening approach. ln Lead Optimization compounds need to get optimized with respect to a variety of pararneters. Here, the availability of past success stories is very valuable, as they can guide medicinal chemists during their analogue synthesis plans. However, although of tremendous interest for the public domain, so far only large corporations had the ability to mine historical knowledge in their proprietary databases. With the aim to provide such information, the SwissBioisostere database was developed and released during this thesis. This database contains information on 21,293,355 performed substructural exchanges, corresponding to 5,586,462 unique replacements that have been measured in 35,039 assays against 1,948 molecular targets representing 30 target classes, and on their impact on bioactivity . A user-friendly interface was developed that provides facile access to these data and is accessible at http//www.swissbioisostere.ch. The ChEMBL database was used as primary data source of bioactivity information. Matched molecular pairs have been identified in the extracted and cleaned data. Success-based scores were developed and integrated into the database to allow re-ranking of proposed replacements by their past outcomes. It was analysed to which degree these scores correlate with chemical similarity of the underlying fragments. An unexpectedly weak relationship was detected and further investigated. Use cases of this database were envisioned, and functionalities implemented accordingly: replacement outcomes are aggregatable at the assay level, and it was shawn that an aggregation at the target or target class level could also be performed, but should be accompanied by a careful case-by-case assessment. It was furthermore observed that replacement success depends on the activity of the starting compound A within a matched molecular pair A-B. With increasing potency the probability to lose bioactivity through any substructural exchange was significantly higher than in low affine binders. A potential existence of a publication bias could be refuted. Furthermore, often performed medicinal chemistry strategies for structure-activity-relationship exploration were analysed using the acquired data. Finally, data originating from pharmaceutical companies were compared with those reported in the literature. It could be seen that industrial medicinal chemistry can access replacement information not available in the public domain. In contrast, a large amount of often-performed replacements within companies could also be identified in literature data. Preferences for particular replacements differed between these two sources. The value of combining different endpoints in an evaluation of molecular replacements was investigated. The performed studies highlighted furthermore that there seem to exist no universal substructural replacement that always retains bioactivity irrespective of the biological environment. A generalization of bioisosteric replacements seems therefore not possible. - La forme tridimensionnelle des molécules a depuis longtemps été reconnue comme une propriété importante pour le processus de reconnaissance moléculaire. Des études antérieures ont postulé que les médicaments occupent préférentiellement un sous-ensemble de l'espace des formes des molécules. Ce sous-ensemble pourrait être utilisé pour biaiser la composition de chimiothèques à cribler, dans le but d'augmenter les chances d'identifier des Hits. L'analyse et la validation de cette assertion fait l'objet de cette première partie. Les Ratios de Moments Principaux d'Inertie Normalisés (RPN) ont été utilisés pour décrire la forme tridimensionnelle de petites molécules de type médicament. Il a été étudié si les molécules actives sur des cibles différentes se co-localisaient dans des sous-espaces privilégiés de l'espace des formes. Les résultats montrent des regroupements de molécules incompatibles avec une répartition aléatoire, avec certaines parties de l'espace peu susceptibles d'être occupées par des composés actifs. Par ailleurs, un fort enrichissement en formes allongées et plutôt plates a pu être observé, tandis que les composés globulaires étaient fortement sous-représentés. Cela a été confirmé pour un large ensemble de compilations de molécules d'origines différentes. Les distributions de forme des molécules actives sur des cibles différentes se recoupent largement, rendant une discrimination fondée uniquement sur la forme très difficile. Une perspective supplémentaire a été ajoutée par la comparaison des formes des ligands avec celles de leurs sites de liaison (poches) dans leurs protéines respectives. Bien que plus globulaires que leurs ligands, il a été observé que les formes des poches présentent une distribution dans l'espace des formes avec le même type d'asymétrie que celle observée pour les ligands: les formes sphériques sont fortement sous représentées. Un résultat différent a été obtenu pour les poches de plus petite taille et cristallisées sans ligand: elles possédaient une forme plus globulaire. La relation entre complémentarité de forme et bioactivité a été également analysée; une corrélation modérée entre bioactivité et des paramètres tels que remplissage de poche, distance dans l'espace des formes, ainsi que d'autres, a pu être identifiée. Ceci reflète l'importance de la complémentarité des formes, mais aussi l'implication d'autres facteurs. Une analyse ultérieure a évalué si et comment la forme et le volume d'une poche ou de ses ligands de référence pouvaient être utilisés comme un pré-filtre dans une approche de criblage virtuel. Durant l'optimisation d'un Lead, de nombreux paramètres doivent être optimisés simultanément. Dans ce contexte, la disponibilité d'exemples d'optimisations réussies est précieuse, car ils peuvent orienter les chimistes médicinaux dans leurs plans de synthèse par analogie. Cependant, bien que d'un extrême intérêt pour les chercheurs dans le domaine public, seules les grandes sociétés pharmaceutiques avaient jusqu'à présent la capacité d'exploiter de telles connaissances au sein de leurs bases de données internes. Dans le but de remédier à cette limitation, la base de données SwissBioisostere a été élaborée et publiée dans le domaine public au cours de cette thèse. Cette base de données contient des informations sur 21 293 355 échanges sous-structuraux observés, correspondant à 5 586 462 remplacements uniques mesurés dans 35 039 tests contre 1948 cibles représentant 30 familles, ainsi que sur leur impact sur la bioactivité. Une interface a été développée pour permettre un accès facile à ces données, accessible à http:/ /www.swissbioisostere.ch. La base de données ChEMBL a été utilisée comme source de données de bioactivité. Une version modifiée de l'algorithme de Hussain et Rea a été implémentée pour identifier les Matched Molecular Pairs (MMP) dans les données préparées au préalable. Des scores de succès ont été développés et intégrés dans la base de données pour permettre un reclassement des remplacements proposés selon leurs résultats précédemment observés. La corrélation entre ces scores et la similarité chimique des fragments correspondants a été étudiée. Une corrélation plus faible qu'attendue a été détectée et analysée. Différents cas d'utilisation de cette base de données ont été envisagés, et les fonctionnalités correspondantes implémentées: l'agrégation des résultats de remplacement est effectuée au niveau de chaque test, et il a été montré qu'elle pourrait également être effectuée au niveau de la cible ou de la classe de cible, sous réserve d'une analyse au cas par cas. Il a en outre été constaté que le succès d'un remplacement dépend de l'activité du composé A au sein d'une paire A-B. Il a été montré que la probabilité de perdre la bioactivité à la suite d'un remplacement moléculaire quelconque est plus importante au sein des molécules les plus actives que chez les molécules de plus faible activité. L'existence potentielle d'un biais lié au processus de publication par articles a pu être réfutée. En outre, les stratégies fréquentes de chimie médicinale pour l'exploration des relations structure-activité ont été analysées à l'aide des données acquises. Enfin, les données provenant des compagnies pharmaceutiques ont été comparées à celles reportées dans la littérature. Il a pu être constaté que les chimistes médicinaux dans l'industrie peuvent accéder à des remplacements qui ne sont pas disponibles dans le domaine public. Par contre, un grand nombre de remplacements fréquemment observés dans les données de l'industrie ont également pu être identifiés dans les données de la littérature. Les préférences pour certains remplacements particuliers diffèrent entre ces deux sources. L'intérêt d'évaluer les remplacements moléculaires simultanément selon plusieurs paramètres (bioactivité et stabilité métabolique par ex.) a aussi été étudié. Les études réalisées ont souligné qu'il semble n'exister aucun remplacement sous-structural universel qui conserve toujours la bioactivité quel que soit le contexte biologique. Une généralisation des remplacements bioisostériques ne semble donc pas possible.
Resumo:
The study of the ecology of soil microbial communities at relevant spatial scales is primordial in the wide Amazon region due to the current land use changes. In this study, the diversity of the Archaea domain (community structure) and ammonia-oxidizing Archaea (richness and community composition) were investigated using molecular biology-based techniques in different land-use systems in western Amazonia, Brazil. Soil samples were collected in two periods with high precipitation (March 2008 and January 2009) from Inceptisols under primary tropical rainforest, secondary forest (5-20 year old), agricultural systems of indigenous people and cattle pasture. Denaturing gradient gel electrophoresis of polymerase chain reaction-amplified DNA (PCR-DGGE) using the 16S rRNA gene as a biomarker showed that archaeal community structures in crops and pasture soils are different from those in primary forest soil, which is more similar to the community structure in secondary forest soil. Sequence analysis of excised DGGE bands indicated the presence of crenarchaeal and euryarchaeal organisms. Based on clone library analysis of the gene coding the subunit of the enzyme ammonia monooxygenase (amoA) of Archaea (306 sequences), the Shannon-Wiener function and Simpson's index showed a greater ammonia-oxidizing archaeal diversity in primary forest soils (H' = 2.1486; D = 0.1366), followed by a lower diversity in soils under pasture (H' = 1.9629; D = 0.1715), crops (H' = 1.4613; D = 0.3309) and secondary forest (H' = 0.8633; D = 0.5405). All cloned inserts were similar to the Crenarchaeota amoA gene clones (identity > 95 %) previously found in soils and sediments and distributed primarily in three major phylogenetic clusters. The findings indicate that agricultural systems of indigenous people and cattle pasture affect the archaeal community structure and diversity of ammonia-oxidizing Archaea in western Amazon soils.
Resumo:
The species of the common shrew (Sorex araneus) group are morphologically very similar but exhibit high levels of karyotypic variation. Here we used genetic variation at 10 microsatellite markers in a data set of 212 individuals mostly sampled in the western Alps and composed of five karyotypic taxa (Sorex coronatus, Sorex antinorii and the S. araneus chromosome races Cordon, Bretolet and Vaud) to investigate the concordance between genetic and karyotypic structure. Bayesian analysis confirmed the taxonomic status of the three sampled species since individuals consistently grouped according to their taxonomical status. However, introgression can still be detected between S. antinorii and the race Cordon of S. araneus. This observation is consistent with the expected low karyotypic complexity of hybrids between these two taxa. Geographically based cryptic substructure was discovered within S. antinorii, a pattern consistent with the different postglaciation recolonization routes of this species. Additionally, we detected two genetic groups within S. araneus notwithstanding the presence of three chromosome races. This pattern can be explained by the probable hybrid status of the Bretolet race but also suggests a relatively low impact of chromosomal differences on genetic structure compared to historical factors. Finally, we propose that the current data set (available at http://www.unil.ch/dee/page7010_en.html#1) could be used as a reference by those wanting to identify Sorex individuals sampled in the western Alps.