950 resultados para Microarray-based genomic hybridization
Resumo:
Dissertação para obtenção do Grau de Doutor em Engenharia Química e Bioquímica
Resumo:
The MAP-i Doctoral Program of the Universities of Minho, Aveiro and Porto
Resumo:
DNA microarrays are one of the most used technologies for gene expression measurement. However, there are several distinct microarray platforms, from different manufacturers, each with its own measurement protocol, resulting in data that can hardly be compared or directly integrated. Data integration from multiple sources aims to improve the assertiveness of statistical tests, reducing the data dimensionality problem. The integration of heterogeneous DNA microarray platforms comprehends a set of tasks that range from the re-annotation of the features used on gene expression, to data normalization and batch effect elimination. In this work, a complete methodology for gene expression data integration and application is proposed, which comprehends a transcript-based re-annotation process and several methods for batch effect attenuation. The integrated data will be used to select the best feature set and learning algorithm for a brain tumor classification case study. The integration will consider data from heterogeneous Agilent and Affymetrix platforms, collected from public gene expression databases, such as The Cancer Genome Atlas and Gene Expression Omnibus.
Resumo:
Bacteriophage-host interaction studies in biofilm structures are still challenging due to the technical limitations of traditional methods. The aim of this study was to provide a direct fluorescence in situ hybridization (FISH) method based on locked nucleic acid (LNA) probes, which targets the phage replication phase, allowing the study of population dynamics during infection. Bacteriophages specific for two biofilm-forming bacteria, Pseudomonas aeruginosa and Acinetobacter, were selected. Four LNA probes were designed and optimized for phage-specific detection and for bacterial counterstaining. To validate the method, LNA-FISH counts were compared with the traditional plaque forming unit (PFU) technique. To visualize the progression of phage infection within a biofilm, colony-biofilms were formed and infected with bacteriophages. A good correlation (r=0.707) was observed between LNA-FISH and PFU techniques. In biofilm structures, LNA-FISH provided a good discrimination of the infected cells and also allowed the assessment of the spatial distribution of infected and non-infected populations.
Resumo:
The limited ability of common variants to account for the genetic contribution to complex disease has prompted searches for rare variants of large effect, to partly explain the 'missing heritability'. Analyses of genome-wide genotyping data have identified genomic structural variants (GSVs) as a source of such rare causal variants. Recent studies have reported multiple GSV loci associated with risk of obesity. We attempted to replicate these associations by similar analysis of two familial-obesity case-control cohorts and a population cohort, and detected GSVs at 11 out of 18 loci, at frequencies similar to those previously reported. Based on their reported frequencies and effect sizes (OR≥25), we had sufficient statistical power to detect the large majority (80%) of genuine associations at these loci. However, only one obesity association was replicated. Deletion of a 220 kb region on chromosome 16p11.2 has a carrier population frequency of 2×10(-4) (95% confidence interval [9.6×10(-5)-3.1×10(-4)]); accounts overall for 0.5% [0.19%-0.82%] of severe childhood obesity cases (P = 3.8×10(-10); odds ratio = 25.0 [9.9-60.6]); and results in a mean body mass index (BMI) increase of 5.8 kg.m(-2) [1.8-10.3] in adults from the general population. We also attempted replication using BMI as a quantitative trait in our population cohort; associations with BMI at or near nominal significance were detected at two further loci near KIF2B and within FOXP2, but these did not survive correction for multiple testing. These findings emphasise several issues of importance when conducting rare GSV association, including the need for careful cohort selection and replication strategy, accurate GSV identification, and appropriate correction for multiple testing and/or control of false discovery rate. Moreover, they highlight the potential difficulty in replicating rare CNV associations across different populations. Nevertheless, we show that such studies are potentially valuable for the identification of variants making an appreciable contribution to complex disease.
Resumo:
Projecte de recerca elaborat a partir d’una estada al Department for Feed and Food Hygiene del National Veterinary Institute, Noruega, entre novembre i desembre del 2006. Els grans de cereal poden estar contaminats amb diferents espècies de Fusarium capaces de produir metabolits secundaris altament tòxics com trichotecenes, fumonisines o moniliformines. La correcta identificació d’aquestes espècies és de gran importància per l’assegurament del risc en l’àmbit de la salut humana i animal. La identificació de Fusarium en base a la seva morfologia requereix coneixements taxonòmics i temps; la majoria dels mètodes moleculars permeten la identificació d’una única espècie diana. Per contra, la tecnologia de microarray ofereix l’anàlisi paral•lel d’un alt nombre de DNA dianes. En aquest treball, s’ha desenvolupat un array per a la identificació de les principals espècies de Fusarium toxigèniques del Nord i Sud d’Europa. S’ha ampliat un array ja existent, per a la detecció de les espècies de Fusarium productores de trichothecene i moniliformina (predominants al Nord d’Europa), amb l’addició de 18 sondes de DNA que permeten identificar les espècies toxigèniques més abundants al Sud d’Europa, les qual produeixen majoritàriament fumonisines. Les sondes de captura han estat dissenyades en base al factor d’elongació translació- 1 alpha (TEF-1alpha). L’anàlisi de les mostres es realitza mitjançant una única PCR que permet amplificar part del TEF-1alpha seguida de la hibridació al xip de Fusarium. Els resultats es visualitzen mitjançant un mètode de detecció colorimètric. El xip de Fusarium desenvolupat pot esdevenir una eina útil i de gran interès per a l’anàlisi de cereals presents en la cadena alimentària.
Resumo:
CD8 T cells play a key role in mediating protective immunity against selected pathogens after vaccination. Understanding the mechanism of this protection is dependent upon definition of the heterogeneity and complexity of cellular immune responses generated by different vaccines. Here, we identify previously unrecognized subsets of CD8 T cells based upon analysis of gene-expression patterns within single cells and show that they are differentially induced by different vaccines. Three prime-boost vector combinations encoding HIV Env stimulated antigen-specific CD8 T-cell populations of similar magnitude, phenotype, and functionality. Remarkably, however, analysis of single-cell gene-expression profiles enabled discrimination of a majority of central memory (CM) and effector memory (EM) CD8 T cells elicited by the three vaccines. Subsets of T cells could be defined based on their expression of Eomes, Cxcr3, and Ccr7, or Klrk1, Klrg1, and Ccr5 in CM and EM cells, respectively. Of CM cells elicited by DNA prime-recombinant adenoviral (rAd) boost vectors, 67% were Eomes(-) Ccr7(+) Cxcr3(-), in contrast to only 7% and 2% stimulated by rAd5-rAd5 or rAd-LCMV, respectively. Of EM cells elicited by DNA-rAd, 74% were Klrk1(-) Klrg1(-)Ccr5(-) compared with only 26% and 20% for rAd5-rAd5 or rAd5-LCMV. Definition by single-cell gene profiling of specific CM and EM CD8 T-cell subsets that are differentially induced by different gene-based vaccines will facilitate the design and evaluation of vaccines, as well as enable our understanding of mechanisms of protective immunity.
Resumo:
Introduction: As part of the MicroArray Quality Control (MAQC)-II project, this analysis examines how the choice of univariate feature-selection methods and classification algorithms may influence the performance of genomic predictors under varying degrees of prediction difficulty represented by three clinically relevant endpoints. Methods: We used gene-expression data from 230 breast cancers (grouped into training and independent validation sets), and we examined 40 predictors (five univariate feature-selection methods combined with eight different classifiers) for each of the three endpoints. Their classification performance was estimated on the training set by using two different resampling methods and compared with the accuracy observed in the independent validation set. Results: A ranking of the three classification problems was obtained, and the performance of 120 models was estimated and assessed on an independent validation set. The bootstrapping estimates were closer to the validation performance than were the cross-validation estimates. The required sample size for each endpoint was estimated, and both gene-level and pathway-level analyses were performed on the obtained models. Conclusions: We showed that genomic predictor accuracy is determined largely by an interplay between sample size and classification difficulty. Variations on univariate feature-selection methods and choice of classification algorithm have only a modest impact on predictor performance, and several statistically equally good predictors can be developed for any given classification problem.
Resumo:
Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics.
Resumo:
Microsatellite instability (MSI) occurs in 10-20% of colorectal tumours and is associated with good prognosis. Here we describe the development and validation of a genomic signature that identifies colorectal cancer patients with MSI caused by DNA mismatch repair deficiency with high accuracy. Microsatellite status for 276 stage II and III colorectal tumours has been determined. Full-genome expression data was used to identify genes that correlate with MSI status. A subset of these samples (n = 73) had sequencing data for 615 genes available. An MSI gene signature of 64 genes was developed and validated in two independent validation sets: the first consisting of frozen samples from 132 stage II patients; and the second consisting of FFPE samples from the PETACC-3 trial (n = 625). The 64-gene MSI signature identified MSI patients in the first validation set with a sensitivity of 90.3% and an overall accuracy of 84.8%, with an AUC of 0.942 (95% CI, 0.888-0.975). In the second validation, the signature also showed excellent performance, with a sensitivity 94.3% and an overall accuracy of 90.6%, with an AUC of 0.965 (95% CI, 0.943-0.988). Besides correct identification of MSI patients, the gene signature identified a group of MSI-like patients that were MSS by standard assessment but MSI by signature assessment. The MSI-signature could be linked to a deficient MMR phenotype, as both MSI and MSI-like patients showed a high mutation frequency (8.2% and 6.4% of 615 genes assayed, respectively) as compared to patients classified as MSS (1.6% mutation frequency). The MSI signature showed prognostic power in stage II patients (n = 215) with a hazard ratio of 0.252 (p = 0.0145). Patients with an MSI-like phenotype had also an improved survival when compared to MSS patients. The MSI signature was translated to a diagnostic microarray and technically and clinically validated in FFPE and frozen samples.
Resumo:
MOTIVATION: Microarray results accumulated in public repositories are widely reused in meta-analytical studies and secondary databases. The quality of the data obtained with this technology varies from experiment to experiment, and an efficient method for quality assessment is necessary to ensure their reliability. RESULTS: The lack of a good benchmark has hampered evaluation of existing methods for quality control. In this study, we propose a new independent quality metric that is based on evolutionary conservation of expression profiles. We show, using 11 large organ-specific datasets, that IQRray, a new quality metrics developed by us, exhibits the highest correlation with this reference metric, among 14 metrics tested. IQRray outperforms other methods in identification of poor quality arrays in datasets composed of arrays from many independent experiments. In contrast, the performance of methods designed for detecting outliers in a single experiment like Normalized Unscaled Standard Error and Relative Log Expression was low because of the inability of these methods to detect datasets containing only low-quality arrays and because the scores cannot be directly compared between experiments. AVAILABILITY AND IMPLEMENTATION: The R implementation of IQRray is available at: ftp://lausanne.isb-sib.ch/pub/databases/Bgee/general/IQRray.R. CONTACT: Marta.Rosikiewicz@unil.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Resumo:
PURPOSE: The aim of this study was to determine whether tumor location proximal or distal to the splenic flexure is associated with distinct molecular patterns and can predict clinical outcome in a homogeneous group of patients with Dukes B (T3-T4, N0, M0) colorectal cancer. It has been hypothesized that proximal and distal colorectal cancer may arise through different pathogenetic mechanisms. Although p53 and Ki-ras gene mutations occur frequently in distal tumors, another form of genomic instability associated with defective DNA mismatch repair has been predominantly identified in the proximal colon. To date, however, the clinical usefulness of these molecular characteristics remains unproven. METHODS: A total of 126 patients with a lymph node-negative sporadic colon or rectum adenocarcinoma were prospectively assessed with the endpoint of death by cancer. No patient received either radiotherapy or chemotherapy. p53 protein was studied by immunohistochemistry using DO-7 monoclonal antibody, and p53 and Ki-ras gene mutations were detected by single strand conformation polymorphism assay. RESULTS: During a mean follow-up of 67 months, the overall five-year survival was 70 percent. Nuclear p53 staining was found in 57 tumors (47 percent), and was more frequent in distal than in proximal tumors (55 vs. 21 percent; chi-squared test, P < 0.001). For the whole group, p53 protein expression correlated with poor survival in univariate and multivariate analysis (log-rank test, P = 0.01; hazard ratio = 2.16; 95 percent confidence interval = 1.12-4.11, P = 0.02). Distal colon tumors and rectal tumors exhibited similar molecular patterns and showed no difference in clinical outcome. In comparison with distal colorectal cancer, proximal tumors were found to be statistically significantly different on the following factors: mucinous content (P = 0.008), degree of histologic differentiation (P = 0.012), p53 protein expression, and gene mutation (P = 0.001 and 0.01 respectively). Finally, patients with proximal tumors had a marginally better survival than those with distal colon or rectal cancers (log-rank test, P = 0.045). CONCLUSION: In this series of Dukes B colorectal cancers, p53 protein expression was an independent factor for survival, which also correlated with tumor location. Eighty-six percent of p53-positive tumors were located in the distal colon and rectum. Distal colon and rectum tumors had similar molecular and clinical characteristics. In contrast, proximal neoplasms seem to represent a distinct entity, with specific histopathologic characteristics, molecular patterns, and clinical outcome. Location of the neoplasm in reference to the splenic flexure should be considered before group stratification in future trials of adjuvant chemotherapy in patients with Dukes B tumors.
Resumo:
The minimum chromosome number of Glomus intraradices was assessed through cloning and sequencing of the highly divergent telomere-associated sequences (TAS) and by pulsed field gel electrophoresis (PFGE). The telomere of G. intraradices, as in other filamentous fungi, consists of TTAGGG repeats, this was confirmed using Bal31 nuclease time course reactions. Telomere length was estimated to be roughly 0.9 kb by Southern blots on genomic DNA and a telomere probe. We have identified six classes of cloned chromosomal termini based on the TAS. An unusually high genetic variation was observed within two of the six TAS classes. To further assess the total number of chromosome termini, we used telomere fingerprinting. Surprisingly, all hybridization patterns showed smears, which demonstrate that TAS are remarkably variable in the G. intraradices genome. These analyses predict the presence of at least three chromosomes in G. intraradices while PFGE showed a pattern of four bands ranging from 1.2 to 1.5 Mb. Taken together, our results indicate that there are at least four chromosomes in G. intraradices but there are probably more. The information on TAS and telomeres in the G. intradicies will be essential for making a physical map of the G. intraradices genome and could provide molecular markers for future studies of genetic variation among nuclei in these multigenomic fungi.
Resumo:
To better undesrtand the distribution of Culex pipiens and Cx. quinquefasciatus in Argentina, samples were collected from six localities situated in a North-South line from Castelli (Chaco Province) to Puerto Madryn (Chubut Province). Identification was based on the morphology of male genitalia. Only Cx. quinquefasciatus was found in Castelli and Esperanza, while in Rosario, 95.3% belonged to this species and 4.7% represented hybrid forms. Southern samples included only Cx. pipiens. With the purpose of verfying if Cx. pipiens and Cx. quinquefasciatus hybridize, different crosses between the two species were perfomed. All crosses produced viable egg rafts. Hatching ranged from 70 to 100%, except in one cross, female Cx. pipiens x male Cx. quinquefasciatus, where a high incompatibility was observed (11.1%hatch). The F1 hybrids obtained all crosses were fertile. The finding of hybrid forms in nature can be interpreted as evidence for subspecific status of Cx. pipiens and Cx. quinquefasciatus in Argentina.
Resumo:
The epidemiologic typing of bacterial pathogens can be applied to answer a number of different questions: in case of outbreak, what is the extent and mode of transmission of epidemic clone(s )? In case of long-term surveillance, what is the prevalence over time and the geographic spread of epidemic and endemic clones in the population? A number of molecular typing methods can be used to classify bacteria based on genomic diversity into groups of closely-related isolates (presumed to arise from a common ancestor in the same chain of transmission) and divergent, epidemiologically-unrelated isolates (arising from independent sources of infection). Ribotyping, IS-RFLP fingerprinting, macrorestriction analysis of chromosomal DNA and PCR-fingerprinting using arbitrary sequence or repeat element primers are useful methods for outbreak investigations and regional surveillance. Library typing systems based on multilocus sequence-based analysis and strain-specific probe hybridization schemes are in development for the international surveillance of major pathogens like Mycobacterium tuberculosis. Accurate epidemiological interpretation of data obtained with molecular typing systems still requires additional research on the evolution rate of polymorphic loci in bacterial pathogens.