965 resultados para Floristic Similarity
Resumo:
Segmenting ultrasound images is a challenging problemwhere standard unsupervised segmentation methods such asthe well-known Chan-Vese method fail. We propose in thispaper an efficient segmentation method for this class ofimages. Our proposed algorithm is based on asemi-supervised approach (user labels) and the use ofimage patches as data features. We also consider thePearson distance between patches, which has been shown tobe robust w.r.t speckle noise present in ultrasoundimages. Our results on phantom and clinical data show avery high similarity agreement with the ground truthprovided by a medical expert.
Resumo:
Introduction: Intoxications with colchicine usually occur by ingestion of meadow saffron leaves (Colchicum autumnale) which are mistakenly collected for alimentary purposes instead of the leaves of crow garlic (Allium ursinum). Colchicine, the main alkaloid of Colchicum autumnale, is present in all parts of the plant. We report a rarer source of mistake, i.e. between the flowers of Colchicum autumnale and Crocus sativus. The similarity in this case is limited to the appearance of the flowers, but Colchicum autumnale, which is also flowering in autumn, lacks the crimson stigma from which the saffron spice is derived from Crocus sativus. Case report: A 47-year-old woman collected the stamens of a flower resembling Crocus sativus for use as saffron. Her knowledge about Crocus sativus was limited to having seen this plant previously at a museum of saffron (Mund, Switzerland). She prepared a meal with rice using three pinches of ''saffron'' for ten tablespoons of rice. She and her 8-year-old child, both ate the usual amount of rice (6 and 2 tablespoons, respectively). The 2 brothers (4- and 9-years-old) only ate 3 teaspoons of rice each. A slightly bitter taste and the absence of a yellow colouration were peculiar. Three to four hours after the meal, the mother developed nausea and contacted the Swiss Toxicological Information Centre, suspecting a plant misidentification. All family members were referred to the regional university hospital for administration of oral activated charcoal. No other symptoms were reported, notably no symptoms in the 8-year-old boy and his brothers. Colchicine serum concentration (blood sample obtained 15 hours after ingestion) measured by HPLC-mass spectrometry was 0.36 mg/L for the mother, and 0.13 mg/L for the 8-year-old child, respectively (therapeutic levels: 0.30-2.5 mg/L). Conclusion: This report demonstrates that a significant amount of colchicine may be absorbed even after ingestion of very small quantities of Colchicum autumnale, which in this case was confused with Crocus sativus. Serum colchicine concentrations in the sub-/therapeutic range can be quantified by HPLC-mass spectrometry, which allows a very sensitive and specific detection of this alkaloid in blood and urine.
Resumo:
A new species of the spider genus Loxosceles, L. mrazig sp. n., found in Tunisia is described and illustrated. The male bulb shows a high degree of morphological similarity to that of L. gaucho from Brazil, but the pro- portions of the palpal segments and the general colouration of the body reveal significant differences between the two species. A distance analysis of the sequences of the mitochondrial gene cox1 reveals that the specimen from Tunisia shows high genetic distance from L. gaucho (more than 20%). The American species L. gaucho and L. laeta form a sister group to the Mediterranean representatives (L. rufescens and the Tunisian specimen). Taxonomy, Araneae, Loxosceles, new species, Tunisia.
Resumo:
Persistent infection induces an adaptive immune response that is mediated by T and B lymphocytes. Upon triggering with an antigen, these cells become activated and turn into fast expanding cells able to efficiently defend the host. Lymphocyte activation is controlled by a complex composed of CARMA1, BCL10 and MALT1 which regulates the NF-KB signaling pathway upon antigen triggering. Abnormally high expression or activity of either one of these three proteins can favor the development of lymphomas, while genetic defects in the pathway are associated with immunodeficiency. MALT1 was identified as a paracaspase sharing homology with other cysteine proteases, namely caspases and metacaspases. In order to be active, caspases need to dimerize. Based on their sequence similarity with MALT1, we hypothesized that dimerization might also be a mechanism of activation employed by MALT1. To address this assumption, we performed a bioinformatics modelling based on the crystal structures of several caspases. Our model suggested that the MALT1 caspase-like domain can indeed form dimers. This finding was later confirmed by several published crystal structures of MALT1. In the dimer interface of our model, we noticed the presence of charged amino acids that could potentially form salt bridges and thereby hold both monomers together. Mutation of one of these residues, E549, into alanine completely blocked the catalytic activity of MALT1. Additionally, we provided evidence for a role of E549 in promoting the MALTl-dependent growth of cells derived from diffuse large B cell lymphoma (DLBCL) of the aggressive B cell-like type (ABC). To our initial surprise, the E549A mutation showed only a partial defect in dimerization, indicating that additional residues are essential to form a stable dimer. The MALT1 crystal structures revealed a key function for E549 in stabilizing the catalytic site of the protease via its interaction with an arginine which is located next to the catalytic active cysteine. In an additional study, we discovered that MALT1 monoubiquitination is required for the catalytic activity of the protease. Interestingly, we found that the MALT1 dimer interface mutant E549A could not be monoubiquitinated. Based on these findings, we suggest that correct formation of the dimer interface is a prerequisite for monoubiquitination. In a second project, we discovered a novel target of the protease MALT1, the ribonuclease Regnase¬la It was described that the RNase activity of Regnase-1 negatively regulates immune responses. We could show that in ABC DLBCL cell lines, Regnase-1 is not only cleaved by MALT1 but also phosphorylated, at least in part, by the inhibitor of KB kinase (IKK). Both regulations appear to restrain the RNase function of Regnase-1 and thereby allow the production of pro-survival proteins. In conclusion, our studies further highlight and explain the importance of the catalytic activity of MALT1 for the activation of lymphocytes and provide additional knowledge for the development of specific drugs targeting the catalytic activity of MALT1 for immunomodulation and treatment of lymphomas. SUMMARY IN FRENCH PhD Thesis Katrin Cabalzar 2 SUMMARY IN FRENCH Une infection persistante induit une réponse immunitaire adaptative par l'intermédiaire des lymphocytes T et B. Quand elles reconnaissent l'antigène, ces cellules sont activées et se multiplient très rapidement pour défendre efficacement l'hôte. L'activation des lymphocytes est transmise par un complexe composé de trois protéines, CARMA1, BCL10 et MALT1, qui régule la voie de signalisation NF-KB lorsque l'antigène est reconnu. L'expression ou l'activité anormalement élevée de l'une de ces trois protéines peut favoriser le développement de lymphomes, tandis que des défauts génétiques de cette voie de signalisation sont associés à l'immunodéficience. MALT1 a été identifiée comme étant une paracaspase qui partage des séquences homologues avec d'autres protéases à cystéine, comme les caspases et les métacaspases. Pour être actives, les caspases ont besoin de dimériser. Etant donné leur similarité de séquence avec MALT1, nous avons supposé que la dimérisation pouvait aussi être un mécanisme d'activation utilisé par MALT1. Pour vérifier cette hypothèse, nous avons conçu un modèle bioinformatique à partir des structures cristallographiques de plusieurs caspases. Et notre modèle a suggéré que le domaine catalytique de MALT1 était effectivement capable de former des dimères. Cette découverte a été confirmée plus tard par des publications qui montrent des structures cristallographiques dimériques de MALT1. Dans l'interface du dimère de notre modèle, nous avons remarqué la présence d'acides aminés chargés qui pouvaient former des liaisons ioniques et ainsi réunir les deux monomères. La mutation de l'un de ces résidus, E549, pour une alanine, a complètement inhibé l'activité catalytique de MALT1. De plus, nous avons mis en évidence un rôle d'E549 dans la croissance dépendante de MALT1, des cellules dérivées de lymphomes B diffus à grandes cellules (DLBCL) de sous-type cellules B actives (ABC). Dans un premier temps nous avons été surpris de constater que cette mutation révélait seulement un défaut partiel de dimérisation, ce qui indique que des acides aminés supplémentaires sont indispensables pour former un dimère stable. Les structures cristallographiques de MALT1 ont révélé un rôle primordial d'E549 dans la stabilisation du site catalytique de la protéase via son interaction avec une arginine qui se trouve à côté de la cystéine du site actif. Dans une autre étude, nous avons découvert que la monoubiquitination de MALT1 est requise pour l'activité catalytique de la protéase. A remarquer que nous avons trouvé que le mutant E549A de l'interface dimère de MALT1 n'a pas pu être monoubiquitiné. Sur la base de ces résultats, nous suggérons que la formation correcte de l'interface du dimère est une condition préalable pour la monoubiquitination. Dans un second projet, nous avons découvert une nouvelle cible de la protéase MALT1, la ribonucléase Regnase-1. Il a été décrit que l'activité RNase de Regnase-1 régulait négativement les réponses immunitaires. Nous avons pu montrer que dans les lignées cellulaires ABC DLBCL, la Regnase-1 n'était pas seulement clivée par MALT1 mais également phosphorylée, au moins en partie, par la kinase de l'inhibiteur de KB (IKK). Les deux régulations semblent supprimer la fonction RNase de Regnase-1 et permettre ainsi la stabilisation de certains ARN messagers et la production de protéines favorisant la survie. En conclusion, nos études mettent en évidence le rôle-clé de la dimérisation de MALT1 et expliquent l'importance de l'activité catalytique de MALT1 pour l'activation des lymphocytes. Ainsi, nos résultats apportent des connaissances supplémentaires pour le développement de médicaments spécifiques ciblant l'activité catalytique de MALT1, qui pourraient être utiles pour modifier les réponses immunitaires et traiter des lymphomes.
Resumo:
This article designs what it calls a Credit-Risk Balance Sheet (the risk being that of default by customers), a tool which, in principle, can contribute to revealing, controlling and managing the bad debt risk arising from a company¿s commercial credit, whose amount can represent a significant proportion of both its current and total assets.To construct it, we start from the duality observed in any credit transaction of this nature, whose basic identity can be summed up as Credit = Risk. ¿Credit¿ is granted by a company to its customer, and can be ranked by quality (we suggest the credit scoring system) and ¿risk¿ can either be assumed (interiorised) by the company itself or transferred to third parties (exteriorised).What provides the approach that leads to us being able to talk with confidence of a real Credit-Risk Balance Sheet with its methodological robustness is that the dual vision of the credit transaction is not, as we demonstrate, merely a classificatory duality (a double risk-credit classification of reality) but rather a true causal relationship, that is, a risk-credit causal duality.Once said Credit-Risk Balance Sheet (which bears a certain structural similarity with the classic net asset balance sheet) has been built, and its methodological coherence demonstrated, its properties ¿static and dynamic¿ are studied.Analysis of the temporal evolution of the Credit-Risk Balance Sheet and of its applications will be the object of subsequent works.
Resumo:
Although research on influenza lasted for more than 100 years, it is still one of the most prominent diseases causing half a million human deaths every year. With the recent observation of new highly pathogenic H5N1 and H7N7 strains, and the appearance of the influenza pandemic caused by the H1N1 swine-like lineage, a collaborative effort to share observations on the evolution of this virus in both animals and humans has been established. The OpenFlu database (OpenFluDB) is a part of this collaborative effort. It contains genomic and protein sequences, as well as epidemiological data from more than 27,000 isolates. The isolate annotations include virus type, host, geographical location and experimentally tested antiviral resistance. Putative enhanced pathogenicity as well as human adaptation propensity are computed from protein sequences. Each virus isolate can be associated with the laboratories that collected, sequenced and submitted it. Several analysis tools including multiple sequence alignment, phylogenetic analysis and sequence similarity maps enable rapid and efficient mining. The contents of OpenFluDB are supplied by direct user submission, as well as by a daily automatic procedure importing data from public repositories. Additionally, a simple mechanism facilitates the export of OpenFluDB records to GenBank. This resource has been successfully used to rapidly and widely distribute the sequences collected during the recent human swine flu outbreak and also as an exchange platform during the vaccine selection procedure. Database URL: http://openflu.vital-it.ch.
Resumo:
ABSTRACT: BACKGROUND: The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. RESULTS: Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. CONCLUSIONS: Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
Resumo:
A novel two-component system, CbrA-CbrB, was discovered in Pseudomonas aeruginosa; cbrA and cbrB mutants of strain PAO were found to be unable to use several amino acids (such as arginine, histidine and proline), polyamines and agmatine as sole carbon and nitrogen sources. These mutants were also unable to use, or used poorly, many other carbon sources, including mannitol, glucose, pyruvate and citrate. A 7 kb EcoRI fragment carrying the cbrA and cbrB genes was cloned and sequenced. The cbrA and cbrB genes encode a sensor/histidine kinase (Mr 108 379, 983 residues) and a cognate response regulator (Mr 52 254, 478 residues) respectively. The amino-terminal half (490 residues) of CbrA appears to be a sensor membrane domain, as predicted by 12 possible transmembrane helices, whereas the carboxy-terminal part shares homology with the histidine kinases of the NtrB family. The CbrB response regulator shows similarity to the NtrC family members. Complementation and primer extension experiments indicated that cbrA and cbrB are transcribed from separate promoters. In cbrA or cbrB mutants, as well as in the allelic argR9901 and argR9902 mutants, the aot-argR operon was not induced by arginine, indicating an essential role for this two-component system in the expression of the ArgR-dependent catabolic pathways, including the aruCFGDB operon specifying the major aerobic arginine catabolic pathway. The histidine catabolic enzyme histidase was not expressed in cbrAB mutants, even in the presence of histidine. In contrast, proline dehydrogenase, responsible for proline utilization (Pru), was expressed in a cbrB mutant at a level comparable with that of the wild-type strain. When succinate or other C4-dicarboxylates were added to proline medium at 1 mM, the cbrB mutant was restored to a Pru+ phenotype. Such a succinate-dependent Pru+ property was almost abolished by 20 mM ammonia. In conclusion, the CbrA-CbrB system controls the expression of several catabolic pathways and, perhaps together with the NtrB-NtrC system, appears to ensure the intracellular carbon: nitrogen balance in P. aeruginosa.
Resumo:
Aim To improve our understanding of how biological communities assemble, we investigated changes in bumblebee communities in space along an elevation gradient. We assessed how much deterministic abiotic and biotic factors shape community assembly. We focused on proboscis length (influencing the species' dietary regime) and phylogenetic relatedness to investigate if competition and environmental filtering occur in more and less productive climates, respectively. Location Western Swiss Alps. Methods We recorded bumblebee species in 149 plots along a 1800-m wide elevation gradient. We contrasted two major clades of bumblebees, a short-tongued and a long-tongued clade. We calculated the phylogenetic and proboscis-length diversity of the bumblebee communities and compared these observed data with a random distribution to detect clustering likely to be caused by environmental filtering or overdispersion likely to be caused by competition. We compared the prevalence of clustered and overdispersed communities along the gradients of plant species richness (biotic) and temperature (abiotic). Results Under colder conditions, where plant species richness is lower and floral resources are scarcer, the clade with shorter proboscides prevails over the clade with longer proboscides, and communities are functionally and phylogenetic clustered. Under warmer conditions, we found phylogenetic but not functional overdispersion in communities. Main conclusions We show for the first time a strong correlation between phylogenetic relatedness, proboscis length and species distribution along temperature and plant richness gradients shaping bumblebee communities. The low temperatures and low levels of plant species richness limit the dispersal of the species from the long-tongued clade, which have more specialized diets, into high-elevation areas. Competition under warmer conditions may produce communities composed of less closely related species that share distinct ecological preferences. Our empirical results corroborate theoretical expectation as well as experiments on the prevalence of deterministic processes in the most severe and most productive parts of environmental gradients.
Resumo:
Aim. To predict the fate of alpine interactions involving specialized species, using a monophagous beetle and its host-plant as a case study. Location. The Alps. Methods. We investigated genetic structuring of the herbivorous beetle Oreina gloriosa and its specific host-plant Peucedanum ostruthium. We used genome fingerprinting (in the insect and the plant) and sequence data (in the insect) to compare the distribution of the main gene pools in the two associated species and to estimate divergence time in the insect, a proxy for the temporal origin of the interaction. We quantified the similarity in spatial genetic structures by performing a Procrustes analysis, a tool from the shape theory. Finally, we simulated recolonization of an empty space analogous to the deglaciated Alps just after ice retreat by two lineages from two species showing unbalanced dependence, to examine how timing of the recolonization process, as well as dispersal capacities of associated species, could explain the observed pattern. Results. Contrasting with expectations based on their asymmetrical dependence, patterns in the beetle and plant were congruent at a large scale. Exceptions occurred at a regional scale in areas of admixture, matching known suture zones in Alpine plants. Simulations using a lattice-based model suggested these empirical patterns arose during or soon after recolonization, long after the estimated origin of the interaction c. 0.5 million years ago. Main conclusions. Species-specific interactions are scarce in alpine habitats because glacial cycles have limited opportunities for coevolution. Their fate, however, remains uncertain under climate change. Here we show that whereas most dispersal routes are paralleled at large scale, regional incongruence implies that the destinies of the species might differ under changing climate. This may be a consequence of the host-dependence of the beetle that locally limits the establishment of dispersing insects.
Resumo:
This article designs what it calls a Credit-Risk Balance Sheet (the risk being that of default by customers), a tool which, in principle, can contribute to revealing, controlling and managing the bad debt risk arising from a company¿s commercial credit, whose amount can represent a significant proportion of both its current and total assets.To construct it, we start from the duality observed in any credit transaction of this nature, whose basic identity can be summed up as Credit = Risk. ¿Credit¿ is granted by a company to its customer, and can be ranked by quality (we suggest the credit scoring system) and ¿risk¿ can either be assumed (interiorised) by the company itself or transferred to third parties (exteriorised).What provides the approach that leads to us being able to talk with confidence of a real Credit-Risk Balance Sheet with its methodological robustness is that the dual vision of the credit transaction is not, as we demonstrate, merely a classificatory duality (a double risk-credit classification of reality) but rather a true causal relationship, that is, a risk-credit causal duality.Once said Credit-Risk Balance Sheet (which bears a certain structural similarity with the classic net asset balance sheet) has been built, and its methodological coherence demonstrated, its properties ¿static and dynamic¿ are studied.Analysis of the temporal evolution of the Credit-Risk Balance Sheet and of its applications will be the object of subsequent works.
Resumo:
The ecdysone-responsive DNA sequence of the Drosophila hsp27 gene promoter contains four direct and inverted repeats reminiscent of those that compose the vertebrate palindromic estrogen response element (ERE) and the thyroid hormone/retinoic acid response element (TRE/RRE). Interestingly, a 3 bp substitution in the wild-type Hsp27 ecdysone response element (EcdRE) increases both its similarity with the vertebrate ERE and TRE/RRE and its capacity to confer ecdysone responsiveness to a heterologous promoter. Remarkably, increasing the spacing between the inverted repeats of this strong EcdRE by two nucleotides converts it into an ERE. Inversely, decreasing the spacing between the two inverted repeats of the vertebrate consensus palindromic ERE, from three to one nucleotide, converts it into a functional EcdRE. Thus, the only difference between an invertebrate EcdRE and a vertebrate palindromic ERE or TRE/RRE is in the spacing between the conserved inverted repeated motifs forming these palindromic HREs. The finding that the sequence motif 5'-GGTCA-3' present in the vertebrate ERE and TRE/RRE is also a functionally important characteristic of an invertebrate HRE, suggests that a common ancestor regulatory DNA sequence gave rise to all HREs known so far. We discuss the possibility that this progenitor motif is the GGTCA sequence.
Molecular analysis of the bacterial diversity in a specialized consortium for diesel oil degradation
Resumo:
Diesel oil is a compound derived from petroleum, consisting primarily of hydrocarbons. Poor conditions in transportation and storage of this product can contribute significantly to accidental spills causing serious ecological problems in soil and water and affecting the diversity of the microbial environment. The cloning and sequencing of the 16S rRNA gene is one of the molecular techniques that allows estimation and comparison of the microbial diversity in different environmental samples. The aim of this work was to estimate the diversity of microorganisms from the Bacteria domain in a consortium specialized in diesel oil degradation through partial sequencing of the 16S rRNA gene. After the extraction of DNA metagenomics, the material was amplified by PCR reaction using specific oligonucleotide primers for the 16S rRNA gene. The PCR products were cloned into a pGEM-T-Easy vector (Promega), and Escherichia coli was used as the host cell for recombinant DNAs. The partial clone sequencing was obtained using universal oligonucleotide primers from the vector. The genetic library obtained generated 431 clones. All the sequenced clones presented similarity to phylum Proteobacteria, with Gammaproteobacteria the most present group (49.8 % of the clones), followed by Alphaproteobacteira (44.8 %) and Betaproteobacteria (5.4 %). The Pseudomonas genus was the most abundant in the metagenomic library, followed by the Parvibaculum and the Sphingobium genus, respectively. After partial sequencing of the 16S rRNA, the diversity of the bacterial consortium was estimated using DOTUR software. When comparing these sequences to the database from the National Center for Biotechnology Information (NCBI), a strong correlation was found between the data generated by the software used and the data deposited in NCBI.
Resumo:
Core-samples from wells and an outcrop located on the Voronesh Anticline in the southeastern part of the Russian Platform contain Late Cretaceous radiolaria. 83 species are described and illustrated (SEM and transmitted light images) from Santonian-early Campanian deposits, and two assemblages are distinguished. The older assemblage with Alievium gallowayi, Archaeospongoprunum bipartitum, Archaeospongoprunum. cf. A. salumi as well as other less age-diagnostic taxa, is interpreted as Santonian correlative with the Euchitonia santonica-Alievium gallowayi Assemblage Zone of the Moscow Basin (Vishnevskaya 1993). The younger assemblage, of Santonian - early Campanian age, contains Patulibracchium cf. P. davisi, Crucella irwini, Cryptamphorella sphaerica, Praeconocaryomma californiensis, Dictyomitra lamellicostata among other species and is correlative with the Orbiculiforma quadrata-Lithostrobus rostovtsevi Assemblage Zone of the Moscow Basin. In terms of inter-regional faunal comparisons, both of the Voronesh Anticline radiolarian assemblages demonstrate relatively close affinities to coeval rocks from the Volga River region, but less similarity to the assemblages from the Moscow Basin. Only a few of the common endemic species of Siberian assemblages occur in our samples. On an inter-regional level, the radiolarian assemblages described herein have similarities with assemblages reported from Japan and California. Index-species characteristic for the Santonian-Campanian radiolarian biozonations of the Atlantic and Pacific Oceans are not found in our collection. However, the presence of many cosmopolitan species known from the European Platform, Japan and California suggests a marine connection between the Voronesh Anticline region, the western Atlantic and eastern Tethys during Santonian-Early Campanian time.
Resumo:
Molecular shape has long been known to be an important property for the process of molecular recognition. Previous studies postulated the existence of a drug-like shape space that could be used to artificially bias the composition of screening libraries, with the aim to increase the chance of success in Hit Identification. In this work, it was analysed to which extend this assumption holds true. Normalized Principal Moments of Inertia Ratios (NPRs) have been used to describe the molecular shape of small molecules. It was investigated, whether active molecules of diverse targets are located in preferred subspaces of the NPR shape space. Results illustrated a significantly stronger clustering than could be expected by chance, with parts of the space unlikely to be occupied by active compounds. Furthermore, a strong enrichment of elongated, rather flat shapes could be observed, while globular compounds were highly underrepresented. This was confirmed for a wide range of small molecule datasets from different origins. Active compounds exhibited a high overlap in their shape distributions across different targets, making a purely shape based discrimination very difficult. An additional perspective was provided by comparing the shapes of protein binding pockets with those of their respective ligands. Although more globular than their ligands, it was observed that binding sites shapes exhibited a similarly skewed distribution in shape space: spherical shapes were highly underrepresented. This was different for unoccupied binding pockets of smaller size. These were on the contrary identified to possess a more globular shape. The relation between shape complementarity and exhibited bioactivity was analysed; a moderate correlation between bioactivity and parameters including pocket coverage, distance in shape space, and others could be identified, which reflects the importance of shape complementarity. However, this also suggests that other aspects are of relevance for molecular recognition. A subsequent analysis assessed if and how shape and volume information retrieved from pocket or respective reference ligands could be used as a pre-filter in a virtual screening approach. ln Lead Optimization compounds need to get optimized with respect to a variety of pararneters. Here, the availability of past success stories is very valuable, as they can guide medicinal chemists during their analogue synthesis plans. However, although of tremendous interest for the public domain, so far only large corporations had the ability to mine historical knowledge in their proprietary databases. With the aim to provide such information, the SwissBioisostere database was developed and released during this thesis. This database contains information on 21,293,355 performed substructural exchanges, corresponding to 5,586,462 unique replacements that have been measured in 35,039 assays against 1,948 molecular targets representing 30 target classes, and on their impact on bioactivity . A user-friendly interface was developed that provides facile access to these data and is accessible at http//www.swissbioisostere.ch. The ChEMBL database was used as primary data source of bioactivity information. Matched molecular pairs have been identified in the extracted and cleaned data. Success-based scores were developed and integrated into the database to allow re-ranking of proposed replacements by their past outcomes. It was analysed to which degree these scores correlate with chemical similarity of the underlying fragments. An unexpectedly weak relationship was detected and further investigated. Use cases of this database were envisioned, and functionalities implemented accordingly: replacement outcomes are aggregatable at the assay level, and it was shawn that an aggregation at the target or target class level could also be performed, but should be accompanied by a careful case-by-case assessment. It was furthermore observed that replacement success depends on the activity of the starting compound A within a matched molecular pair A-B. With increasing potency the probability to lose bioactivity through any substructural exchange was significantly higher than in low affine binders. A potential existence of a publication bias could be refuted. Furthermore, often performed medicinal chemistry strategies for structure-activity-relationship exploration were analysed using the acquired data. Finally, data originating from pharmaceutical companies were compared with those reported in the literature. It could be seen that industrial medicinal chemistry can access replacement information not available in the public domain. In contrast, a large amount of often-performed replacements within companies could also be identified in literature data. Preferences for particular replacements differed between these two sources. The value of combining different endpoints in an evaluation of molecular replacements was investigated. The performed studies highlighted furthermore that there seem to exist no universal substructural replacement that always retains bioactivity irrespective of the biological environment. A generalization of bioisosteric replacements seems therefore not possible. - La forme tridimensionnelle des molécules a depuis longtemps été reconnue comme une propriété importante pour le processus de reconnaissance moléculaire. Des études antérieures ont postulé que les médicaments occupent préférentiellement un sous-ensemble de l'espace des formes des molécules. Ce sous-ensemble pourrait être utilisé pour biaiser la composition de chimiothèques à cribler, dans le but d'augmenter les chances d'identifier des Hits. L'analyse et la validation de cette assertion fait l'objet de cette première partie. Les Ratios de Moments Principaux d'Inertie Normalisés (RPN) ont été utilisés pour décrire la forme tridimensionnelle de petites molécules de type médicament. Il a été étudié si les molécules actives sur des cibles différentes se co-localisaient dans des sous-espaces privilégiés de l'espace des formes. Les résultats montrent des regroupements de molécules incompatibles avec une répartition aléatoire, avec certaines parties de l'espace peu susceptibles d'être occupées par des composés actifs. Par ailleurs, un fort enrichissement en formes allongées et plutôt plates a pu être observé, tandis que les composés globulaires étaient fortement sous-représentés. Cela a été confirmé pour un large ensemble de compilations de molécules d'origines différentes. Les distributions de forme des molécules actives sur des cibles différentes se recoupent largement, rendant une discrimination fondée uniquement sur la forme très difficile. Une perspective supplémentaire a été ajoutée par la comparaison des formes des ligands avec celles de leurs sites de liaison (poches) dans leurs protéines respectives. Bien que plus globulaires que leurs ligands, il a été observé que les formes des poches présentent une distribution dans l'espace des formes avec le même type d'asymétrie que celle observée pour les ligands: les formes sphériques sont fortement sous représentées. Un résultat différent a été obtenu pour les poches de plus petite taille et cristallisées sans ligand: elles possédaient une forme plus globulaire. La relation entre complémentarité de forme et bioactivité a été également analysée; une corrélation modérée entre bioactivité et des paramètres tels que remplissage de poche, distance dans l'espace des formes, ainsi que d'autres, a pu être identifiée. Ceci reflète l'importance de la complémentarité des formes, mais aussi l'implication d'autres facteurs. Une analyse ultérieure a évalué si et comment la forme et le volume d'une poche ou de ses ligands de référence pouvaient être utilisés comme un pré-filtre dans une approche de criblage virtuel. Durant l'optimisation d'un Lead, de nombreux paramètres doivent être optimisés simultanément. Dans ce contexte, la disponibilité d'exemples d'optimisations réussies est précieuse, car ils peuvent orienter les chimistes médicinaux dans leurs plans de synthèse par analogie. Cependant, bien que d'un extrême intérêt pour les chercheurs dans le domaine public, seules les grandes sociétés pharmaceutiques avaient jusqu'à présent la capacité d'exploiter de telles connaissances au sein de leurs bases de données internes. Dans le but de remédier à cette limitation, la base de données SwissBioisostere a été élaborée et publiée dans le domaine public au cours de cette thèse. Cette base de données contient des informations sur 21 293 355 échanges sous-structuraux observés, correspondant à 5 586 462 remplacements uniques mesurés dans 35 039 tests contre 1948 cibles représentant 30 familles, ainsi que sur leur impact sur la bioactivité. Une interface a été développée pour permettre un accès facile à ces données, accessible à http:/ /www.swissbioisostere.ch. La base de données ChEMBL a été utilisée comme source de données de bioactivité. Une version modifiée de l'algorithme de Hussain et Rea a été implémentée pour identifier les Matched Molecular Pairs (MMP) dans les données préparées au préalable. Des scores de succès ont été développés et intégrés dans la base de données pour permettre un reclassement des remplacements proposés selon leurs résultats précédemment observés. La corrélation entre ces scores et la similarité chimique des fragments correspondants a été étudiée. Une corrélation plus faible qu'attendue a été détectée et analysée. Différents cas d'utilisation de cette base de données ont été envisagés, et les fonctionnalités correspondantes implémentées: l'agrégation des résultats de remplacement est effectuée au niveau de chaque test, et il a été montré qu'elle pourrait également être effectuée au niveau de la cible ou de la classe de cible, sous réserve d'une analyse au cas par cas. Il a en outre été constaté que le succès d'un remplacement dépend de l'activité du composé A au sein d'une paire A-B. Il a été montré que la probabilité de perdre la bioactivité à la suite d'un remplacement moléculaire quelconque est plus importante au sein des molécules les plus actives que chez les molécules de plus faible activité. L'existence potentielle d'un biais lié au processus de publication par articles a pu être réfutée. En outre, les stratégies fréquentes de chimie médicinale pour l'exploration des relations structure-activité ont été analysées à l'aide des données acquises. Enfin, les données provenant des compagnies pharmaceutiques ont été comparées à celles reportées dans la littérature. Il a pu être constaté que les chimistes médicinaux dans l'industrie peuvent accéder à des remplacements qui ne sont pas disponibles dans le domaine public. Par contre, un grand nombre de remplacements fréquemment observés dans les données de l'industrie ont également pu être identifiés dans les données de la littérature. Les préférences pour certains remplacements particuliers diffèrent entre ces deux sources. L'intérêt d'évaluer les remplacements moléculaires simultanément selon plusieurs paramètres (bioactivité et stabilité métabolique par ex.) a aussi été étudié. Les études réalisées ont souligné qu'il semble n'exister aucun remplacement sous-structural universel qui conserve toujours la bioactivité quel que soit le contexte biologique. Une généralisation des remplacements bioisostériques ne semble donc pas possible.