35 resultados para Macroarray, Microarray
em AMS Tesi di Dottorato - Alm@DL - Università di Bologna
Resumo:
The main aim of this Ph.D. dissertation is the study of clustering dependent data by means of copula functions with particular emphasis on microarray data. Copula functions are a popular multivariate modeling tool in each field where the multivariate dependence is of great interest and their use in clustering has not been still investigated. The first part of this work contains the review of the literature of clustering methods, copula functions and microarray experiments. The attention focuses on the K–means (Hartigan, 1975; Hartigan and Wong, 1979), the hierarchical (Everitt, 1974) and the model–based (Fraley and Raftery, 1998, 1999, 2000, 2007) clustering techniques because their performance is compared. Then, the probabilistic interpretation of the Sklar’s theorem (Sklar’s, 1959), the estimation methods for copulas like the Inference for Margins (Joe and Xu, 1996) and the Archimedean and Elliptical copula families are presented. In the end, applications of clustering methods and copulas to the genetic and microarray experiments are highlighted. The second part contains the original contribution proposed. A simulation study is performed in order to evaluate the performance of the K–means and the hierarchical bottom–up clustering methods in identifying clusters according to the dependence structure of the data generating process. Different simulations are performed by varying different conditions (e.g., the kind of margins (distinct, overlapping and nested) and the value of the dependence parameter ) and the results are evaluated by means of different measures of performance. In light of the simulation results and of the limits of the two investigated clustering methods, a new clustering algorithm based on copula functions (‘CoClust’ in brief) is proposed. The basic idea, the iterative procedure of the CoClust and the description of the written R functions with their output are given. The CoClust algorithm is tested on simulated data (by varying the number of clusters, the copula models, the dependence parameter value and the degree of overlap of margins) and is compared with the performance of model–based clustering by using different measures of performance, like the percentage of well–identified number of clusters and the not rejection percentage of H0 on . It is shown that the CoClust algorithm allows to overcome all observed limits of the other investigated clustering techniques and is able to identify clusters according to the dependence structure of the data independently of the degree of overlap of margins and the strength of the dependence. The CoClust uses a criterion based on the maximized log–likelihood function of the copula and can virtually account for any possible dependence relationship between observations. Many peculiar characteristics are shown for the CoClust, e.g. its capability of identifying the true number of clusters and the fact that it does not require a starting classification. Finally, the CoClust algorithm is applied to the real microarray data of Hedenfalk et al. (2001) both to the gene expressions observed in three different cancer samples and to the columns (tumor samples) of the whole data matrix.
Resumo:
In the past decade, the advent of efficient genome sequencing tools and high-throughput experimental biotechnology has lead to enormous progress in the life science. Among the most important innovations is the microarray tecnology. It allows to quantify the expression for thousands of genes simultaneously by measurin the hybridization from a tissue of interest to probes on a small glass or plastic slide. The characteristics of these data include a fair amount of random noise, a predictor dimension in the thousand, and a sample noise in the dozens. One of the most exciting areas to which microarray technology has been applied is the challenge of deciphering complex disease such as cancer. In these studies, samples are taken from two or more groups of individuals with heterogeneous phenotypes, pathologies, or clinical outcomes. these samples are hybridized to microarrays in an effort to find a small number of genes which are strongly correlated with the group of individuals. Eventhough today methods to analyse the data are welle developed and close to reach a standard organization (through the effort of preposed International project like Microarray Gene Expression Data -MGED- Society [1]) it is not unfrequant to stumble in a clinician's question that do not have a compelling statistical method that could permit to answer it.The contribution of this dissertation in deciphering disease regards the development of new approaches aiming at handle open problems posed by clinicians in handle specific experimental designs. In Chapter 1 starting from a biological necessary introduction, we revise the microarray tecnologies and all the important steps that involve an experiment from the production of the array, to the quality controls ending with preprocessing steps that will be used into the data analysis in the rest of the dissertation. While in Chapter 2 a critical review of standard analysis methods are provided stressing most of problems that In Chapter 3 is introduced a method to adress the issue of unbalanced design of miacroarray experiments. In microarray experiments, experimental design is a crucial starting-point for obtaining reasonable results. In a two-class problem, an equal or similar number of samples it should be collected between the two classes. However in some cases, e.g. rare pathologies, the approach to be taken is less evident. We propose to address this issue by applying a modified version of SAM [2]. MultiSAM consists in a reiterated application of a SAM analysis, comparing the less populated class (LPC) with 1,000 random samplings of the same size from the more populated class (MPC) A list of the differentially expressed genes is generated for each SAM application. After 1,000 reiterations, each single probe given a "score" ranging from 0 to 1,000 based on its recurrence in the 1,000 lists as differentially expressed. The performance of MultiSAM was compared to the performance of SAM and LIMMA [3] over two simulated data sets via beta and exponential distribution. The results of all three algorithms over low- noise data sets seems acceptable However, on a real unbalanced two-channel data set reagardin Chronic Lymphocitic Leukemia, LIMMA finds no significant probe, SAM finds 23 significantly changed probes but cannot separate the two classes, while MultiSAM finds 122 probes with score >300 and separates the data into two clusters by hierarchical clustering. We also report extra-assay validation in terms of differentially expressed genes Although standard algorithms perform well over low-noise simulated data sets, multi-SAM seems to be the only one able to reveal subtle differences in gene expression profiles on real unbalanced data. In Chapter 4 a method to adress similarities evaluation in a three-class prblem by means of Relevance Vector Machine [4] is described. In fact, looking at microarray data in a prognostic and diagnostic clinical framework, not only differences could have a crucial role. In some cases similarities can give useful and, sometimes even more, important information. The goal, given three classes, could be to establish, with a certain level of confidence, if the third one is similar to the first or the second one. In this work we show that Relevance Vector Machine (RVM) [2] could be a possible solutions to the limitation of standard supervised classification. In fact, RVM offers many advantages compared, for example, with his well-known precursor (Support Vector Machine - SVM [3]). Among these advantages, the estimate of posterior probability of class membership represents a key feature to address the similarity issue. This is a highly important, but often overlooked, option of any practical pattern recognition system. We focused on Tumor-Grade-three-class problem, so we have 67 samples of grade I (G1), 54 samples of grade 3 (G3) and 100 samples of grade 2 (G2). The goal is to find a model able to separate G1 from G3, then evaluate the third class G2 as test-set to obtain the probability for samples of G2 to be member of class G1 or class G3. The analysis showed that breast cancer samples of grade II have a molecular profile more similar to breast cancer samples of grade I. Looking at the literature this result have been guessed, but no measure of significance was gived before.
Resumo:
A systematic characterization of the composition and structure of the bacterial cell-surface proteome and its complexes can provide an invaluable tool for its comprehensive understanding. The knowledge of protein complexes composition and structure could offer new, more effective targets for a more specific and consequently effective immune response against a complex instead of a single protein. Large-scale protein-protein interaction screens are the first step towards the identification of complexes and their attribution to specific pathways. Currently, several methods exist for identifying protein interactions and protein microarrays provide the most appealing alternative to existing techniques for a high throughput screening of protein-protein interactions in vitro under reasonably straightforward conditions. In this study approximately 100 proteins of Group A Streptococcus (GAS) predicted to be secreted or surface exposed by genomic and proteomic approaches were purified in a His-tagged form and used to generate protein microarrays on nitrocellulose-coated slides. To identify protein-protein interactions each purified protein was then labeled with biotin, hybridized to the microarray and interactions were detected with Cy3-labelled streptavidin. Only reciprocal interactions, i. e. binding of the same two interactors irrespective of which of the two partners is in solid-phase or in solution, were taken as bona fide protein-protein interactions. Using this approach, we have identified 20 interactors of one of the potent toxins secreted by GAS and known as superantigens. Several of these interactors belong to the molecular chaperone or protein folding catalyst families and presumably are involved in the secretion and folding of the superantigen. In addition, a very interesting interaction was found between the superantigen and the substrate binding subunit of a well characterized ABC transporter. This finding opens a new perspective on the current understanding of how superantigens are modified by the bacterial cell in order to become major players in causing disease.
Resumo:
This study provides a comprehensive genetic overview on the endangered Italian wolf population. In particular, it focuses on two research lines. On one hand, we focalised on melanism in wolf in order to isolate a mutation related with black coat colour in canids. With several reported black individuals (an exception at European level), the Italian wolf population constituted a challenging research field posing many unanswered questions. As found in North American wolf, we reported that melanism in the Italian population is caused by a different melanocortin pathway component, the K locus, in which a beta-defensin protein acts as an alternative ligand for the Mc1r. This research project was conducted in collaboration with Prof. Gregory Barsh, Department of Genetics and Paediatrics, Stanford University. On the other hand, we performed analysis on a high number of SNPs thanks to a customized Canine microarray useful to integrate or substitute the STR markers for genotyping individuals and detecting wolf-dog hybrids. Thanks to DNA microchip technology, we obtained an impressive amount of genetic data which provides a solid base for future functional genomic studies. This study was undertaken in collaboration with Prof. Robert K. Wayne, Department of Ecology and Evolutionary Biology, University of California, Los Angeles (UCLA).
Resumo:
Adhesion, immune evasion and invasion are key determinants during bacterial pathogenesis. Pathogenic bacteria possess a wide variety of surface exposed and secreted proteins which allow them to adhere to tissues, escape the immune system and spread throughout the human body. Therefore, extensive contacts between the human and the bacterial extracellular proteomes take place at the host-pathogen interface at the protein level. Recent researches emphasized the importance of a global and deeper understanding of the molecular mechanisms which underlie bacterial immune evasion and pathogenesis. Through the use of a large-scale, unbiased, protein microarray-based approach and of wide libraries of human and bacterial purified proteins, novel host-pathogen interactions were identified. This approach was first applied to Staphylococcus aureus, cause of a wide variety of diseases ranging from skin infections to endocarditis and sepsis. The screening led to the identification of several novel interactions between the human and the S. aureus extracellular proteomes. The interaction between the S. aureus immune evasion protein FLIPr (formyl-peptide receptor like-1 inhibitory protein) and the human complement component C1q, key players of the offense-defense fighting, was characterized using label-free techniques and functional assays. The same approach was also applied to Neisseria meningitidis, major cause of bacterial meningitis and fulminant sepsis worldwide. The screening led to the identification of several potential human receptors for the neisserial adhesin A (NadA), an important adhesion protein and key determinant of meningococcal interactions with the human host at various stages. The interaction between NadA and human LOX-1 (low-density oxidized lipoprotein receptor) was confirmed using label-free technologies and cell binding experiments in vitro. Taken together, these two examples provided concrete insights into S. aureus and N. meningitidis pathogenesis, and identified protein microarray coupled with appropriate validation methodologies as a powerful large scale tool for host-pathogen interactions studies.
Resumo:
Cardiac morphogenesis is a complex process governed by evolutionarily conserved transcription factors and signaling molecules. The Drosophila cardiac tube is linear, made of 52 pairs of cardiomyocytes (CMs), which express specific transcription factor genes that have human homologues implicated in Congenital Heart Diseases (CHDs) (NKX2-5, GATA4 and TBX5). The Drosophila cardiac tube is linear and composed of a rostral portion named aorta and a caudal one called heart, distinguished by morphological and functional differences controlled by Hox genes, key regulators of axial patterning. Overexpression and inactivation of the Hox gene abdominal-A (abd-A), which is expressed exclusively in the heart, revealed that abd-A controls heart identity. The aim of our work is to isolate the heart-specific cisregulatory sequences of abd-A direct target genes, the realizator genes granting heart identity. In each segment of the heart, four pairs of cardiomyocytes (CMs) express tinman (tin), homologous to NKX2-5, and acquire strong contractile and automatic rhythmic activities. By tyramide amplified FISH, we found that seven genes, encoding ion channels, pumps or transporters, are specifically expressed in the Tin-CMs of the heart. We initially used online available tools to identify their heart-specific cisregutatory modules by looking for Conserved Non-coding Sequences containing clusters of binding sites for various cardiac transcription factors, including Hox proteins. Based on these data we generated several reporter gene constructs and transgenic embryos, but none of them showed reporter gene expression in the heart. In order to identify additional abd-A target genes, we performed microarray experiments comparing the transcriptomes of aorta versus heart and identified 144 genes overexpressed in the heart. In order to find the heart-specific cis-regulatory regions of these target genes we developed a new bioinformatic approach where prediction is based on pattern matching and ordered statistics. We first retrieved Conserved Noncoding Sequences from the alignment between the D.melanogaster and D.pseudobscura genomes. We scored for combinations of conserved occurrences of ABD-A, ABD-B, TIN, PNR, dMEF2, MADS box, T-box and E-box sites and we ranked these results based on two independent strategies. On one hand we ranked the putative cis-regulatory sequences according to best scored ABD-A biding sites, on the other hand we scored according to conservation of binding sites. We integrated and ranked again the two lists obtained independently to produce a final rank. We generated nGFP reporter construct flies for in vivo validation. We identified three 1kblong heart-specific enhancers. By in vivo and in vitro experiments we are determining whether they are direct abd-A targets, demonstrating the role of a Hox gene in the realization of heart identity. The identified abd-A direct target genes may be targets also of the NKX2-5, GATA4 and/or TBX5 homologues tin, pannier and Doc genes, respectively. The identification of sequences coregulated by a Hox protein and the homologues of transcription factors causing CHDs, will provide a mean to test whether these factors function as Hox cofactors granting cardiac specificity to Hox proteins, increasing our knowledge on the molecular mechanisms underlying CHDs. Finally, it may be investigated whether these Hox targets are involved in CHDs.
Resumo:
In a global and increasingly competitive fresh produce market, more attention is being given to fruit quality traits and consumer satisfaction. Kiwifruit occupies a niche position in the worldwide market, when compared to apples, oranges or bananas. It is a fruit with extraordinarily good nutritional traits, and its benefits to human health have been widely described. Until recently, international trade in kiwifruit was restricted to a single cultivar, but different types of kiwifruit are now becoming available in the market. Effective programmes of kiwifruit improvement start by considering the requirements of consumers, and recent surveys indicate that sweeter fruit with better flavour are generally preferred. There is a strong correlation between at-harvest dry matter and starch content, and soluble solid concentration and flavour when fruit are eating ripe. This suggests that carbon accumulation strongly influences the development of kiwifruit taste. The overall aim of the present study was to determine what factors affect carbon accumulation during Actinidia deliciosa berry development. One way of doing this is by comparing kiwifruit genotypes that differ greatly in their ability to accumulate dry matter in their fruit. Starch is the major component of dry matter content. It was hypothesized that genotypes were different in sink strength. Sink strength, by definition, is the effect of sink size and sink activity. Chapter 1 reviews fruit growth, kiwifruit growth and development and carbon metabolism. Chapter 2 describes the materials and methods used. Chapter 3, 4, 5 and 6 describes different types of experimental work. Chapter 7 contains the final discussions and the conclusions Three Actinidia deliciosa breeding populations were analysed in detail to confirm that observed differences in dry matter content were genetically determined. Fruit of the different genotypes differed in dry matter content mainly because of differences in starch concentrations and dry weight accumulation rates, irrespective of fruit size. More detailed experiments were therefore carried out on genotypes which varied most in fruit starch concentrations to determine why sink strengths were so different. The kiwifruit berry comprises three tissues which differ in dry matter content. It was initially hypothesised that observed differences in starch content could be due to a larger proportion of one or other of these tissues, for example, of the central core which is highest in dry matter content. The study results showed that this was not the case. Sink size, intended as cell number or cell size, was then investigated. The outer pericarp makes up about 60% of berry weight in ‘Hayward’ kiwifruit. The outer pericarp contains two types of parenchyma cells: large cells with low starch concentration, and small cells with high starch concentration. Large cell, small cell and total cell densities in the outer pericarp were shown to be not correlated with either dry matter content or fruit size but further investigation of volume proportion among cell types seemed justified. It was then shown that genotypes with fruit having higher dry matter contents also had a higher proportion of small cells. However, the higher proportion of small cell volume could only explain half of the observed differences in starch content. So, sink activity, intended as sucrose to starch metabolism, was investigated. In transiently starch storing sinks, such as tomato fruit and potato tubers, a pivotal role in carbon metabolism has been attributed to sucrose cleaving enzymes (mainly sucrose synthase and cell wall invertase) and to ADP-glucose pyrophosphorylase (the committed step in starch synthesis). Studies on tomato and potato genotypes differing in starch content or in final fruit soluble solid concentrations have demonstrated a strong link with either sucrose synthase or ADP-glucose pyrophosphorylase, at both enzyme activity and gene expression levels, depending on the case. Little is known about sucrose cleaving enzyme and ADP-glucose pyrophosphorylase isoforms. The HortResearch Actinidia EST database was then screened to identify sequences putatively encoding for sucrose synthase, invertase and ADP-glucose pyrophosphorylase isoforms and specific primers were designed. Sucrose synthase, invertase and ADP-glucose pyrophosphorylase isoform transcript levels were anlayzed throughout fruit development of a selection of four genotypes (two high dry matter and two low dry matter). High dry matter genotypes showed higher amounts of sucrose synthase transcripts (SUS1, SUS2 or both) and higher ADP-glucose pyrophosphorylase (AGPL4, large subunit 4) gene expression, mainly early in fruit development. SUS1- like gene expression has been linked with starch biosynthesis in several crop (tomato, potato and maize). An enhancement of its transcript level early in fruit development of high dry matter genotypes means that more activated glucose (UDP-glucose) is available for starch synthesis. This can be then correlated to the higher starch observed since soon after the onset of net starch accumulation. The higher expression level of AGPL4 observed in high dry matter genotypes suggests an involvement of this subunit in drive carbon flux into starch. Changes in both enzymes (SUSY and AGPse) are then responsible of higher starch concentrations. Low dry matter genotypes showed generally higher vacuolar invertase gene expression (and also enzyme activity), early in fruit development. This alternative cleavage strategy can possibly contribute to energy loss, in that invertases’ products are not adenylated, and further reactions and transport are needed to convert carbon into starch. Although these elements match well with observed differences in starch contents, other factors could be involved in carbon metabolism control. From the microarray experiment, in fact, several kinases and transcription factors have been found to be differentially expressed. Sink strength is known to be modified by application of regulators. In ‘Hayward’ kiwifruit, the synthetic cytokinin CPPU (N-(2-Chloro-4-Pyridyl)-N-Phenylurea) promotes a dramatic increase in fruit size, whereas dry matter content decreases. The behaviour of CPPU-treated ‘Hayward’ kiwifruit was similar to that of fruit from low dry matter genotypes: dry matter and starch concentrations were lower. However, the CPPU effect was strongly source limited, whereas in genotype variation it was not. Moreover, CPPU-treated fruit gene expression (at sucrose cleavage and AGPase levels) was similar to that in high dry matter genotypes. It was therefore concluded that CPPU promotes both sink size and sink activity, but at different “speeds” and this ends in the observed decrease in dry matter content and starch concentration. The lower “speed” in sink activity is probably due to a differential partitioning of activated glucose between starch storage and cell wall synthesis to sustain cell expansion. Starch is the main carbohydrate accumulated in growing Actinidia deliciosa fruit. Results obtained in the present study suggest that sucrose synthase and AGPase enzymes contribute to sucrose to starch conversion, and differences in their gene expression levels, mainly early in fruit development, strongly affect the rate at which starch is therefore accumulated. This results are interesting in that starch and Actinidia deliciosa fruit quality are tightly connected.
Resumo:
Theory of aging postulates that aging is a remodeling process where the body of survivors progressively adapts to internal and external damaging agents they are exposed to during several decades. Thus , stress response and adaptation mechanisms play a fundamental role in the aging process where the capability of adaptating effects, certainly, also is related the lifespan of each individual. A key gene linking aging to stress response is indeed p21, an induction of cyclin-dependent kinase inhibitor which triggers cell growth arrest associated with senescence and damage response and notably is involved in the up-regulation of multiple genes that have been associated with senescence or implicated in age-related . This PhD thesis project that has been performed in collaboration with the Roninson Lab at Ordway Research Institute in Albany, NY had two main aims: -the testing the hypothesis that p21 polymorphisms are involved in longevity -Evaluating age-associated differences in gene expression and transcriptional response to p21 and DNA damage In the first project, trough PCR-sequencing and Sequenom strategies, we we found out that there are about 30 polymorphic variants in the p21 gene. In addition, we found an haplotpype located in -5kb region of the p21 promoter whose frequency is ~ 2 fold higher in centenarians than in the general population (Large-scale analysis of haplotype frequencies is currently in progress). Functional studies I carried out on the promoter highilighted that the ―centenarian‖ haplotype doesn’t affect the basal p21 promoter activity or its response to p53. However, there are many other possible physiological conditions in which the centenarian allele of the p21 promoter may potentially show a different response (IL6, IFN,progesterone, vitamin E, Vitamin D etc). In the second part, project #2, trough Microarrays we seeked to evaluate the differences in gene expression between centenarians, elderly, young in dermal fibroblast cultures and their response to p21 and DNA damage. Microarray analysis of gene expression in dermal fibroblast cultures of individuals of different ages yielded a tentative "centenarian signature". A subset of genes that were up- or downregulated in centenarians showed the same response to ectopic expression of p21, yielding a putative "p21-centenarian" signature. Trough RQ-PCR (as well Microarrays studies whose analysis is in progress) we tested the DNA damage response of the p21-centenarian signature genes showing a correlation stress/aging in additional sets of young and old samples treated with p21-inducing drug doxorubicin thus finding for a subset of of them , a response to stress age-related.
Resumo:
Brown rot caused by Monilinia laxa and Monilinia fructigena is considered one of the most important diseases affecting Prunus species. Although some losses can result from the rotten fruits in the orchard, most of the damage is caused to fruits during the post-harvest phase. Several studies reported that brown rot incidence during fruit development highly varies; it was found that at a period corresponding to the the pit hardening stage, fruit susceptibility drastically decreases, to be quickly restored afterwards. However the molecular basis of this phenomenon is still not well understood. Furthermore, no difference in the rot incidence was found between wound and un-wound fruits, suggesting that resistance associated more to a specifc biochemical response of the fruit, rather than to a higher mechanical resistance. So far, the interaction Monilinia-peach was analyzed through chemical approaches. In this study, a bio-molecular approach was undertaken in order to reveal alteration in gene expression associated to the variation of susceptibility. In this thesis three different methods for gene expression analysis were used to analyze the alterations in gene expression occurring in peach fruits during the pit hardening stage, in a period encompassing the temporary change in Monilinia susceptibility: real time PCR, microarray and cDNA AFLP techniques. In 2005, peach fruits (cv.K2) were weekly harvested during a 19-week long-period, starting from the fourth week after full bloom, until full maturity. At each sampling time, three replicates of 5 fruits each were dipped in the M.laxa conidial suspension or in distilled water, as negative control. The fruits were maintained at room temperature for 3 hours; afterwards, they were peeled with a scalpel; the peel was immediately frozen in liquid nitrogen and transferred to -80 °C until use. The degree of susceptibility of peach fruit to the pathogen was determined on 3 replicates of 20 fruits each, as percentage of infected fruits, after one week at 20 °C. Real time PCR analysis was performed to study the variation in expression of those genes encoding for the enzymes of the phenylpropanoid pathway (phenylalanine ammonia lyase (PAL), chalcone synthase (CHS), cinnamate 4-hydroxylase (C4H), leucoanthocyanidine reductase (LAR), hydroxycinnamoyl CoA quinate hydroxycinnamoyl transferase (HQT) and of the jasmonate pathway, such as lipoxygenase (LOX), both involved in the production of important defense compounds. Alteration in gene expression was monitored on fruit samples of a period encompassing the pit hardening stage and the corresponding temporary resistance to M.laxa infections, weekly, from the 6thto the 12th week after full bloom (AFB) inoculated with M. laxa or mock-inoculated. The data suggest a critical change in the expression level of the phenylpropanoid pathway from the 7th to the 8th week AFB; such change could be directly physiologically associated to the peach growth and it could indirectly determine the decrease of susceptibility of peach fruit to Monilinia rot during the subsequent weeks. To investigate on the transcriptome variation underneath the temporary loss of susceptibility of peach fruits to Monilinia rot, the microarray and the cDNA AFLP techniques were used. The samples harvested on the 8th week AFB (named S, for susceptible ones) and on the 12th week AFB (named R, for resistant ones) were compared, both inoculated or mock-inoculated. The microarray experiments were carried out at the University of Padua (Dept. of Environmental Agronomy and Crop Science), using the μPEACH1.0 microarray together with the suited protocols. The analysis showed that 30 genes (corresponding to the 0.6% of the total sequences (4806) contained in the μPeach1.0 microarray) were found up-regulated and 31 ( 0.6%) down regulated in RH vs. SH fruits. On the other hand, 20 genes (0.4%) were shown to be up-regulated and 13 (0.3%) down-regulated in the RI vs. SI fruit. No genes were found differentially expressed in the mock-inoculated resistant fruits (RH) vs. the inoculated resistant ones (RI). Among the up-regulated genes an ATP sulfurylase, an heat shock protein 70, the major allergen Pru P1, an harpin inducing protein and S-adenosylmethionine decarboxylase were found, conversely among the down-regulated ones, cinnamyl alcohol dehydrogenase, an histidine- containing phosphotransfer protein and the ferritin were found. The microarray experimental results and the data indirectly derived, were tested by Real Time PCR analysis. cDNA AFLP analysis was also performed on the same samples. 339 transcript derived fragments considered significant for Monilinia resistance, were selected, sequenced and classified. Genes potentially involved in cell rescue and defence were well represented (8%); several genes (12.1%) involved in the protein folding, post-transductional modification and genes (9.2%) involved in cellular transport were also found. A further 10.3% of genes were classified as involved in the metabolism of aminoacid, carbohydrate and fatty acid. On the other hand, genes involved in the protein synthesis (5.7%) and in signal transduction and communication (5.7%) were found. Among the most interesting genes found differentially expressed between susceptible and resistant fruits, genes encoding for pathogenesis related (PR) proteins were found. To investigate on the association of Monilinia resistance and PR biological function, the major allergen Pru P1 (GenBank accession AM493970) and its isoform (here named Pru P2), were expressed in heterologous system and in vitro assayed for their anti-microbial activity. The ribonuclease activity of the recombinant Pru P1 and Pru P2 proteins was assayed against peach total RNA. As the other PR10 proteins, they showed a ribonucleolytic activity, that could be important to contrast pathogen penetration. Moreover Pru P1 and Pru P2 recombinant proteins were checked for direct antimicrobial activity. No inhibitory effect of Pru P1 or Pru P2 was detected against the selected fungi.
Resumo:
The present PhD project was focused on the development of new tools and methods for luminescence-based techniques. In particular, the ultimate goal was to present substantial improvements to the currently available technologies for both research and diagnostic in the fields of biology, proteomics and genomics. Different aspects and problems were investigated, requiring different strategies and approaches. The whole work was thus divided into separate chapters, each based on the study of one specific aspect of luminescence: Chemiluminescence, Fluorescence and Electrochemiluminescence. CHAPTER 1, Chemiluminescence The work on luminol-enhancer solution lead to a new luminol solution formulation with 1 order of magnitude lower detection limit for HRP. This technology was patented with Cyanagen brand and is now sold worldwide for Western Blot and ELISA applications. CHAPTER 2, Fluorescescence The work on dyed-doped silica nanoparticles is marking a new milestone in the development of nanotechnologies for biological applications. While the project is still in progress, preliminary studies on model structures are leading to very promising results. The improved brightness of these nano-sized objects, their simple synthesis and handling, their low toxicity will soon turn them, we strongly believe, into a new generation of fluorescent labels for many applications. CHAPTER 3, Electrochemiluminescence The work on electrochemiluminescence produced interesting results that can potentially turn into great improvements from an analytical point of view. Ru(bpy)3 derivatives were employed both for on-chip microarray (Chapter 3.1) and for microscopic imaging applications (Chapter 3.2). The development of these new techniques is still under investigation, but the obtained results confirm the possibility to achieve the final goal. Furthermore the development of new ECL-active species (Chapter 3.3, 3.4, 3.5) and their use in these applications can significantly improve overall performances, thus helping to spread ECL as powerful analytical tool for routinary techniques. To conclude, the results obtained are of strong value to largely increase the sensitivity of luminescence techniques, thus fulfilling the expectation we had at the beginning of this research work.
Resumo:
In this work we aim to propose a new approach for preliminary epidemiological studies on Standardized Mortality Ratios (SMR) collected in many spatial regions. A preliminary study on SMRs aims to formulate hypotheses to be investigated via individual epidemiological studies that avoid bias carried on by aggregated analyses. Starting from collecting disease counts and calculating expected disease counts by means of reference population disease rates, in each area an SMR is derived as the MLE under the Poisson assumption on each observation. Such estimators have high standard errors in small areas, i.e. where the expected count is low either because of the low population underlying the area or the rarity of the disease under study. Disease mapping models and other techniques for screening disease rates among the map aiming to detect anomalies and possible high-risk areas have been proposed in literature according to the classic and the Bayesian paradigm. Our proposal is approaching this issue by a decision-oriented method, which focus on multiple testing control, without however leaving the preliminary study perspective that an analysis on SMR indicators is asked to. We implement the control of the FDR, a quantity largely used to address multiple comparisons problems in the eld of microarray data analysis but which is not usually employed in disease mapping. Controlling the FDR means providing an estimate of the FDR for a set of rejected null hypotheses. The small areas issue arises diculties in applying traditional methods for FDR estimation, that are usually based only on the p-values knowledge (Benjamini and Hochberg, 1995; Storey, 2003). Tests evaluated by a traditional p-value provide weak power in small areas, where the expected number of disease cases is small. Moreover tests cannot be assumed as independent when spatial correlation between SMRs is expected, neither they are identical distributed when population underlying the map is heterogeneous. The Bayesian paradigm oers a way to overcome the inappropriateness of p-values based methods. Another peculiarity of the present work is to propose a hierarchical full Bayesian model for FDR estimation in testing many null hypothesis of absence of risk.We will use concepts of Bayesian models for disease mapping, referring in particular to the Besag York and Mollié model (1991) often used in practice for its exible prior assumption on the risks distribution across regions. The borrowing of strength between prior and likelihood typical of a hierarchical Bayesian model takes the advantage of evaluating a singular test (i.e. a test in a singular area) by means of all observations in the map under study, rather than just by means of the singular observation. This allows to improve the power test in small areas and addressing more appropriately the spatial correlation issue that suggests that relative risks are closer in spatially contiguous regions. The proposed model aims to estimate the FDR by means of the MCMC estimated posterior probabilities b i's of the null hypothesis (absence of risk) for each area. An estimate of the expected FDR conditional on data (\FDR) can be calculated in any set of b i's relative to areas declared at high-risk (where thenull hypothesis is rejected) by averaging the b i's themselves. The\FDR can be used to provide an easy decision rule for selecting high-risk areas, i.e. selecting as many as possible areas such that the\FDR is non-lower than a prexed value; we call them\FDR based decision (or selection) rules. The sensitivity and specicity of such rule depend on the accuracy of the FDR estimate, the over-estimation of FDR causing a loss of power and the under-estimation of FDR producing a loss of specicity. Moreover, our model has the interesting feature of still being able to provide an estimate of relative risk values as in the Besag York and Mollié model (1991). A simulation study to evaluate the model performance in FDR estimation accuracy, sensitivity and specificity of the decision rule, and goodness of estimation of relative risks, was set up. We chose a real map from which we generated several spatial scenarios whose counts of disease vary according to the spatial correlation degree, the size areas, the number of areas where the null hypothesis is true and the risk level in the latter areas. In summarizing simulation results we will always consider the FDR estimation in sets constituted by all b i's selected lower than a threshold t. We will show graphs of the\FDR and the true FDR (known by simulation) plotted against a threshold t to assess the FDR estimation. Varying the threshold we can learn which FDR values can be accurately estimated by the practitioner willing to apply the model (by the closeness between\FDR and true FDR). By plotting the calculated sensitivity and specicity (both known by simulation) vs the\FDR we can check the sensitivity and specicity of the corresponding\FDR based decision rules. For investigating the over-smoothing level of relative risk estimates we will compare box-plots of such estimates in high-risk areas (known by simulation), obtained by both our model and the classic Besag York Mollié model. All the summary tools are worked out for all simulated scenarios (in total 54 scenarios). Results show that FDR is well estimated (in the worst case we get an overestimation, hence a conservative FDR control) in small areas, low risk levels and spatially correlated risks scenarios, that are our primary aims. In such scenarios we have good estimates of the FDR for all values less or equal than 0.10. The sensitivity of\FDR based decision rules is generally low but specicity is high. In such scenario the use of\FDR = 0:05 or\FDR = 0:10 based selection rule can be suggested. In cases where the number of true alternative hypotheses (number of true high-risk areas) is small, also FDR = 0:15 values are well estimated, and \FDR = 0:15 based decision rules gains power maintaining an high specicity. On the other hand, in non-small areas and non-small risk level scenarios the FDR is under-estimated unless for very small values of it (much lower than 0.05); this resulting in a loss of specicity of a\FDR = 0:05 based decision rule. In such scenario\FDR = 0:05 or, even worse,\FDR = 0:1 based decision rules cannot be suggested because the true FDR is actually much higher. As regards the relative risk estimation, our model achieves almost the same results of the classic Besag York Molliè model. For this reason, our model is interesting for its ability to perform both the estimation of relative risk values and the FDR control, except for non-small areas and large risk level scenarios. A case of study is nally presented to show how the method can be used in epidemiology.