939 resultados para Multiple-trait analysis
Resumo:
PhD thesis in Biomedical Engineering
Resumo:
Recently, there has been a growing interest in the field of metabolomics, materialized by a remarkable growth in experimental techniques, available data and related biological applications. Indeed, techniques as Nuclear Magnetic Resonance, Gas or Liquid Chromatography, Mass Spectrometry, Infrared and UV-visible spectroscopies have provided extensive datasets that can help in tasks as biological and biomedical discovery, biotechnology and drug development. However, as it happens with other omics data, the analysis of metabolomics datasets provides multiple challenges, both in terms of methodologies and in the development of appropriate computational tools. Indeed, from the available software tools, none addresses the multiplicity of existing techniques and data analysis tasks. In this work, we make available a novel R package, named specmine, which provides a set of methods for metabolomics data analysis, including data loading in different formats, pre-processing, metabolite identification, univariate and multivariate data analysis, machine learning, and feature selection. Importantly, the implemented methods provide adequate support for the analysis of data from diverse experimental techniques, integrating a large set of functions from several R packages in a powerful, yet simple to use environment. The package, already available in CRAN, is accompanied by a web site where users can deposit datasets, scripts and analysis reports to be shared with the community, promoting the efficient sharing of metabolomics data analysis pipelines.
Resumo:
The limited ability of common variants to account for the genetic contribution to complex disease has prompted searches for rare variants of large effect, to partly explain the 'missing heritability'. Analyses of genome-wide genotyping data have identified genomic structural variants (GSVs) as a source of such rare causal variants. Recent studies have reported multiple GSV loci associated with risk of obesity. We attempted to replicate these associations by similar analysis of two familial-obesity case-control cohorts and a population cohort, and detected GSVs at 11 out of 18 loci, at frequencies similar to those previously reported. Based on their reported frequencies and effect sizes (OR≥25), we had sufficient statistical power to detect the large majority (80%) of genuine associations at these loci. However, only one obesity association was replicated. Deletion of a 220 kb region on chromosome 16p11.2 has a carrier population frequency of 2×10(-4) (95% confidence interval [9.6×10(-5)-3.1×10(-4)]); accounts overall for 0.5% [0.19%-0.82%] of severe childhood obesity cases (P = 3.8×10(-10); odds ratio = 25.0 [9.9-60.6]); and results in a mean body mass index (BMI) increase of 5.8 kg.m(-2) [1.8-10.3] in adults from the general population. We also attempted replication using BMI as a quantitative trait in our population cohort; associations with BMI at or near nominal significance were detected at two further loci near KIF2B and within FOXP2, but these did not survive correction for multiple testing. These findings emphasise several issues of importance when conducting rare GSV association, including the need for careful cohort selection and replication strategy, accurate GSV identification, and appropriate correction for multiple testing and/or control of false discovery rate. Moreover, they highlight the potential difficulty in replicating rare CNV associations across different populations. Nevertheless, we show that such studies are potentially valuable for the identification of variants making an appreciable contribution to complex disease.
Resumo:
Un reto al ejecutar las aplicaciones en un cluster es lograr mejorar las prestaciones utilizando los recursos de manera eficiente, y este reto es mayor al utilizar un ambiente distribuido. Teniendo en cuenta este reto, se proponen un conjunto de reglas para realizar el cómputo en cada uno de los nodos, basado en el análisis de cómputo y comunicaciones de las aplicaciones, se analiza un esquema de mapping de celdas y un método para planificar el orden de ejecución, tomando en consideración la ejecución por prioridad, donde las celdas de fronteras tienen una mayor prioridad con respecto a las celdas internas. En la experimentación se muestra el solapamiento del computo interno con las comunicaciones de las celdas fronteras, obteniendo resultados donde el Speedup aumenta y los niveles de eficiencia se mantienen por encima de un 85%, finalmente se obtiene ganancias de los tiempos de ejecución, concluyendo que si se puede diseñar un esquemas de solapamiento que permita que la ejecución de las aplicaciones SPMD en un cluster se hagan de forma eficiente.
Resumo:
Résumé Le transfert du phosphate des racines vers les feuilles s'effectue par la voie du xylème. Il a été précédemment démontré que la protéine AtPHO1 était indispensable au transfert du phosphate dans les vaisseaux du xylème des racines chez la plante modèle Arabidopsis thaliana. Le séquençage et l'annotation du génome d'Arabidopsis ont permis d'identifier dix séquences présentant un niveau de similarité significatif avec le gène AtPHO1 et constituant une nouvelle famille de gène appelé la famille de AtPHO1. Basée sur une étude moléculaire et génétique, cette thèse apporte des éléments de réponse pour déterminer le rôle des membres de ia famille de AtPHO1 chez Arabidopsis, inconnue à ce jour. Dans un premier temps, une analyse bioinformatique des séquences protéiques des membres de la famille de AtPHO1 a révélé la présence dans leur région N-terminale d'un domaine nommé SPX. Ce dernier est conservé parmi de nombreuses protéines impliquées dans l'homéostasie du phosphate chez la levure, renforçant ainsi l'hypothèse que les membres de la famille de AtPHO1 auraient comme AtPHO1 un rôle dans l'équilibre du phosphate dans la plante. En parallèle, la localisation tissulaire de l'expression des gènes AtPHO dans Arabidopsis a été identifiée par l'analyse de plantes transgéniques exprimant le gène rapporteur uidA sous le contrôle des promoteurs respectifs des gènes AtPHO. Un profil d'expression de chaque gène AtPHO au cours du développement de la plante a été obtenu. Une expression prédominante au niveau des tissus vasculaires des racines, des feuilles, des tiges et des fleurs a été observée, suggérant que les gènes AtPHO pourraient avoir des fonctions redondantes au niveau du transfert de phosphate dans le cylindre vasculaire de ces différents organes. Toutefois, plusieurs régions promotrices des gènes AtPHO contrôlent également un profil d'expression GUS non-vasculaire, indiquant un rôle putatif des gènes AtPHO dans l'acquisition ou le recyclage de phosphate dans la plante. Dans un deuxième temps, l'analyse de l'expression des gènes AtPHO durant une carence en phosphate a établi que seule l'expression des gènes AtPHO1, AtPHO1; H1 et AtPHO1; H10 est régulée par cette carence. Une étude approfondie de leur expression en réponse à des traitements affectant l'homéostasie du phosphate dans la plante a ensuite démontré leur régulation par différentes voies de signalisation. Ensuite, une analyse détaillée de la régulation de l'expression du gène AtPHO1; H1O dans des feuilles d'Arabidopsis blessées ou déshydratées a révélé que ce gène constitue le premìer gène marqueur d'une nouvelle voie de signalisation induite par l'OPDA, pas par le JA et dépendante de la protéine COI1. Ces résultats démontrent pour la première fois que l'OPDA et le JA peuvent activer différents gènes via des voies de signalisation dépendantes de COI1. Enfin, cette thèse révèle l'identification d'un nouveau rôle de la protéine AtPHO1 dans la régulation de l'action de l'ABA au cours des processus de fermeture stomatique et de germination des graines chez Arabidopsis. Bien que les fonctions exactes des protéines AtPHO restent à être déterminées, ce travail de thèse suggère leur implication dans la propagation de différents signaux dans la plante via la modulation du potentiel membranaire et/ou l'affectation de la composition en ions des cellules comme le font de nombreux transporteurs ou régulateur du transport d'ions. Summary Phosphate is transferred from the roots to the shoot via the xylem. The requirement for AtPHO1 protein to transfer phosphate to the xylem vessels of the root has been previously demonstrated in Arabidopsis thaliana. The sequencing and the annotation of the Arabidopsis genome had allowed the identification of ten sequences that show a significant level of similarity with the AtPHO1 gene. These 10 genes, of unknown functions, constitute a new gene family called the AtPHO1 gene family. Based on a molecular and genetics study, this thesis reveals some information needed to understand the role of the AtPHO1 family members in the plant Arabidopsis. First, a bioinformatics study revealed that the AtPHO sequences contained, in the N-terminal hydrophilic region, a motif called SPX and conserved among multiple proteins involved in phosphate homeostasis in yeast. This finding reinforces the hypothesis that all AtPHO1 family members have, as AtPHO1, a role in phosphate homeostasis. In parallel, we identified the pattern of expression of AtPHO genes in Arabidopsis via analysis of transgenic plants expressing the uidA reporter gene under the control of respective AtPHO promoter regions. The results exhibit a predominant expression of AtPHO genes in vascular tissues of all organs of the plant, implying that these AtPHO genes could have redundant functions in the transfer of phosphate to the vascular cylinder of various organs. The GUS expression pattern for several AtPHO promoter regions was also detected in non-vascular tissue indicating a broad role of AtPHO genes in the acquisition or in the recycling of phosphate in the plant. In a second step, the analysis of the expression of AtPHO genes during phosphate starvation established that only the expression of the AtPHO1, AtPHO1; H1 and AtPHO1; H10 genes were regulated by Pi starvation. Interestingly, different signalling pathways appeared to regulate these three genes during various treatments affecting Pi homeostasis in the plant. The third chapter presents a detailed analysis of the signalling pathways regulating the expression of the AtPHO1; H10 gene in Arabidopsis leaves during wound and dehydrated stresses. Surprisingly, the expression of AtPHO1; H10 was found to be regulated by OPDA (the precursor of JA) but not by JA itself and via the COI1 protein (the central regulator of the JA signalling pathway). These results demonstrated for the first time that OPDA and JA could activate distinct genes via COI1-dependent pathways. Finally, this thesis presents the identification of a novel role of the AtPHO1 protein in the regulation of ABA action in Arabidopsis guard cells and during seed germination. Although the exact role and function of AtPHO1 still need to be determined, these last findings suggest that AtPHO1 and by extension other AtPHO proteins could mediate the propagation of various signals in the plant by modulating the membrane potential and/or by affecting cellular ion composition, as it is the case for many ion transporters or regulators of ion transport.
Resumo:
Background: Plasmodium falciparum(P. falciparum) merozoite surfaceprotein 2 (MSP-2) is one of bloodstage proteins that are associated withprotection from malaria. MSP-2 consistsof a highly polymorphic centralrepeat region flanked by a dimorphicregion that defines the two allelicfamilies, 3D7 and FC27; N- and Cterminalregions are conserved domains.Long synthetic peptides (LSP)representing the two allelic familiesof MSP-2 and constant regions arerecognized by sera from donors livingin endemic areas; and specific antibodies(Abs) are associated with protectionand active in antibody dependentcellular inhibition (ADCI) in vitro.However, the fine specificity ofAb response to the two allelic familiesof MSP-2 is unknown. Methods: Peptidesrepresenting dimorphic regionof 3D7 and FC27 families and theirC-terminal (common fragment to thetwo families) termed 3D7-D (88 aa),FC27-D (48 aa) and C (40 aa) respectivelywere synthesized. Overlapping20 mer peptides covering dimorphicand constant regions of two familieswere also synthesized for epitopemapping. Human sera were obtainedfrom donors living in malaria endemicareas. SpecificDand CregionsAbs were purified from single or poolhuman sera. Sera from mice were obtainedafter immunization with thetwo families LSP mixture in three differentadjuvants: alhydrogel (Alum),Glucopyranosyl Lipid Adjuvant-Stableoil-in-water Emulsion (GLA-SE)and Virosome. For ADCI, P. falciparum(strain 3D7) parasite wasmaintained in culture at 0.5% parasitemiaand 4% hematocrit in air tightbox at love oxygen (2%) and 37 ºC.Results: We identified several epitopesfrom the dimorphic and constantregions of both families of MSP-2, inmice and humans (adults and children).In human, most recognizedepitopes were the same in differentendemic regions for each domain ofthe two families of MSP-2. In mice,the differential recognition of epitopewas depending on the strain of mouseand interestingly on the adjuvantused. GLA-SE and alum as adjuvantswere more often associated with therecognition of multiple epitopes thanvirosomes. Epitope-specific Abs recognizednative merozoites of P.falciparum and were active in ADCIto block development of parasite.Conclusion: The delineation of a limitednumber of epitopes could be exploitedto develop MSP-2 vaccinesactive on both allelic families ofMSP-2.
Resumo:
PURPOSE: Several studies observed a female advantage in the prognosis of cutaneous melanoma, for which behavioral factors or an underlying biologic mechanism might be responsible. Using complete and reliable follow-up data from four phase III trials of the European Organisation for Research and Treatment of Cancer (EORTC) Melanoma Group, we explored the female advantage across multiple end points and in relation to other important prognostic indicators. PATIENTS AND METHODS: Patients diagnosed with localized melanoma were included in EORTC adjuvant treatment trials 18832, 18871, 18952, and 18961 and randomly assigned during the period of 1984 to 2005. Cox proportional hazard models were used to calculate hazard ratios (HRs) and 95% CIs for women compared with men, adjusted for age, Breslow thickness, body site, ulceration, performed lymph node dissection, and treatment. RESULTS: A total of 2,672 patients with stage I/II melanoma were included. Women had a highly consistent and independent advantage in overall survival (adjusted HR, 0.70; 95% CI, 0.59 to 0.83), disease-specific survival (adjusted HR, 0.74; 95% CI, 0.62 to 0.88), time to lymph node metastasis (adjusted HR, 0.70; 95% CI, 0.51 to 0.96), and time to distant metastasis (adjusted HR, 0.69; 95% CI, 0.59 to 0.81). Subgroup analysis showed that the female advantage was consistent across all prognostic subgroups (with the possible exception of head and neck melanomas) and in pre- and postmenopausal age groups. CONCLUSION: Women have a consistent and independent relative advantage in all aspects of the progression of localized melanoma of approximately 30%, most likely caused by an underlying biologic sex difference.
Resumo:
PURPOSE: Statins have beneficial effects in patients after myocardial infarction and at least part of the benefit results from mobilization of marrow endothelial progenitors to repopulate damaged myocardial tissues. This study examines if statins may have the same effect in mobilizing marrow progenitors to be harvested and subsequently used in high-dose chemotherapy with progenitor cell rescue in multiple myeloma. METHODS: From 2006 to 2012, 86 patients with multiple myeloma were mobilized with the use of G-CSF and were retrospectively analyzed. Patients with other malignancies or mobilized with the use of chemotherapy or with plerixafor were excluded. RESULTS: The median age of the patients was 60 years. 72 patients had received one line of chemotherapy and 14 patients two or more lines of chemotherapy. Twenty patients were taking statins at the time of the harvest while 66 patients were not. In the group of patients taking statins the success rate of first leukapheresis (obtaining the target number of 4 × 10(6) CD34+ cells/kg) was 85 % while in the group not taking statins this rate was 63.6 %. Despite the comparatively small number of patients this difference approached statistical significance (χ (2) = 0.07). CONCLUSION: This retrospective analysis of 86 patients shows for the first time a possible benefit of statins for peripheral blood progenitor cells mobilization in patients with multiple myeloma. Larger studies would be required to clarify the issue. If their effectiveness is confirmed, statins could be a safe and cheaper addition to chemotherapy and plerixafor for peripheral hematopoietic stem cell mobilization.
Resumo:
Context: Understanding the process through which adolescents and young adults are trying legal and illegal substances is a crucial point for the development of tailored prevention and treatment programs. However, patterns of substance first use can be very complex when multiple substances are considered, requiring reduction into a few meaningful number of categories. Data: We used data from a survey on adolescent and young adult health conducted in 2002 in Switzerland. Answers from 2212 subjects aged 19 and 20 were included. The first consumption ever of 10 substances (tobacco, cannabis, medicine to get high, sniff (volatile substances, and inhalants), ecstasy, GHB, LSD, cocaine, methadone, and heroin) was considered for a grand total of 516 different patterns. Methods: In a first step, automatic clustering was used to decrease the number of patterns to 50. Then, two groups of substance use experts, three social field workers, and three toxicologists and health professionals, were asked to reduce them into a maximum of 10 meaningful categories. Results: Classifications obtained through our methodology are of practical interest by revealing associations invisible to purely automatic algorithms. The article includes a detailed analysis of both final classifications, and a discussion on the advantages and limitations of our approach.
Resumo:
Fatty acid degradation in most organisms occurs primarily via the beta-oxidation cycle. In mammals, beta-oxidation occurs in both mitochondria and peroxisomes, whereas plants and most fungi harbor the beta-oxidation cycle only in the peroxisomes. Although several of the enzymes participating in this pathway in both organelles are similar, some distinct physiological roles have been uncovered. Recent advances in the structural elucidation of numerous mammalian and yeast enzymes involved in beta-oxidation have shed light on the basis of the substrate specificity for several of them. Of particular interest is the structural organization and function of the type 1 and 2 multifunctional enzyme (MFE-1 and MFE-2), two enzymes evolutionarily distant yet catalyzing the same overall enzymatic reactions but via opposite stereochemistry. New data on the physiological roles of the various enzymes participating in beta-oxidation have been gathered through the analysis of knockout mutants in plants, yeast and animals, as well as by the use of polyhydroxyalkanoate synthesis from beta-oxidation intermediates as a tool to study carbon flux through the pathway. In plants, both forward and reverse genetics performed on the model plant Arabidopsis thaliana have revealed novel roles for beta-oxidation in the germination process that is independent of the generation of carbohydrates for growth, as well as in embryo and flower development, and the generation of the phytohormone indole-3-acetic acid and the signal molecule jasmonic acid.
Resumo:
Besides CYP2B6, other polymorphic enzymes contribute to efavirenz (EFV) interindividual variability. This study was aimed at quantifying the impact of multiple alleles on EFV disposition. Plasma samples from 169 human immunodeficiency virus (HIV) patients characterized for CYP2B6, CYP2A6, and CYP3A4/5 allelic diversity were used to build up a population pharmacokinetic model using NONMEM (non-linear mixed effects modeling), the aim being to seek a general approach combining genetic and demographic covariates. Average clearance (CL) was 11.3 l/h with a 65% interindividual variability that was explained largely by CYP2B6 genetic variation (31%). CYP2A6 and CYP3A4 had a prominent influence on CL, mostly when CYP2B6 was impaired. Pharmacogenetics fully accounted for ethnicity, leaving body weight as the only significant demographic factor influencing CL. Square roots of the numbers of functional alleles best described the influence of each gene, without interaction. Functional genetic variations in both principal and accessory metabolic pathways demonstrate a joint impact on EFV disposition. Therefore, dosage adjustment in accordance with the type of polymorphism (CYP2B6, CYP2A6, or CYP3A4) is required in order to maintain EFV within the therapeutic target levels.
Resumo:
This paper proposes a new class of stratification indices that measure interdistributional inequality between multiple groups. The class is based on a conceptualisation of stratification as a process that results in a hierarchical ordering of groups and therefore seeks to capture not only the extent to which groups form well-defined strata in the income distribution but also the scale of the resultant differences in income standards between them, where these two factors play the same role as identification and alienation respectively in the measurement of polarisation. The properties of the class as a whole are investigated as well as those of selected members of it: zeroth and first power indices may be interpreted as measuring the overall incidence and depth of stratification respectively, while higher power indices members are directly sensitive to the severity of stratification between groups. An illustrative application provides an empirical analysis of global income stratification by regions in 1993.
Resumo:
AIM: The first pathogenetic step in multiple myeloma is the emergence of a limited number of clonal plasma cells, clinically known as monoclonal gammopathy of undetermined significance (MGUS). Patients with MGUS do not have symptoms or end-organ damage but they do have a 1% annual risk of progression to multiple myeloma or related malignant disorders. With progression of MGUS to multiple myeloma, complex genetic events occur in the neoplastic plasma cell. Karyotyping and fluorescence in-situ hybridization (FISH) were shown to be of prognostic value in patients with multiple myeloma. Tc-sestamibi imaging reflects myeloma disease activity in bone marrow with very high sensitivity and specificity predicting disease evolution. This study was undertaken to evaluate the role of Tc-sestamibi imaging and cytogenetic analysis in prognosis prediction of MGUS and multiple myeloma. METHODS: We enrolled 30 consecutive patients with a confirmed diagnosis of multiple myeloma or MGUS. Bone marrow biopsy and biochemical staging according to the International Staging System (ISS) were performed in all cases. Karyotype analysis and FISH were performed in 11 of 12 patients with MGUS and in 17 of 18 patients with multiple myeloma having adequate metaphases. RESULTS: The karyotype was abnormal in four of 11 MGUS and in six of 17 multiple myeloma. Abnormalities of chromosome 13 were present in one case of MGUS and in six cases of multiple myeloma whereas the involvement of immunoglobulin was observed in one case of multiple myeloma. An abnormal FISH panel was found in four MGUS and nine multiple myeloma patients. All patients with MGUS showed a normal MIBI scan (score 0). Among patients with multiple myeloma only three, all with ISS stage I, showed a normal scan while a positive scan was obtained in others (score range, 1-7). The MIBI uptake was strongly related to the bone marrow plasma cell infiltration and to cytogenetic abnormalities. Particularly, a MIBI uptake score above 5 identified patients with poor prognosis encompassing all stage III multiple myeloma and three of seven stage II multiple myeloma. On the other hand all stage I and II patients having a MIBI score less than 5 showed a good prognosis. CONCLUSION: Both cytogenetic analysis and a MIBI scan add no relevant prognostic information to the ISS in patients with stage I and III multiple myeloma. The MIBI scan was of prognostic value in stage II multiple myeloma patients. Additionally, MIBI imaging may be useful to guide bone marrow biopsy in order to obtain adequate samples for cytogenetic analysis.
Resumo:
Although melanin is the most common pigment in animal integuments, the adaptive function of variation in melanin-based coloration remains poorly understood. The individual fitness returns associated with melanin pigments can be variable across species as these pigments can have physical and biological protective properties and genes involved in melanogenesis may vary in the intensity of pleiotropic effects. Moreover, dark and pale coloration can also enhance camouflage in alternative habitats and melanin-based coloration can be involved in social interactions. We investigated whether darker or paler individuals achieve a higher fitness in birds, a taxon wherein associations between melanin-based coloration and fitness parameters have been studied in a large number of species. A meta-analysis showed that the degree of melanin-based coloration was not significantly associated with laying date, clutch size, brood size, and survival across 26 species. Similar results were found when restricting the analyses to non-sexually dimorphic birds, colour polymorphic and monomorphic species, in passerines and non-passerines and in species for which inter-individual variation in melanism is due to colour intensity. However, eumelanic coloration was positively associated with clutch and brood size in sexually dimorphic species and those that vary in the size of black patches, respectively. Given that greater extent of melanin-based coloration was positively associated with reproductive parameters and survival in some species but negatively in other species, we conclude that in birds the sign and magnitude of selection exerted on melanin-based coloration is species- or trait-specific.
Resumo:
SUMMARY : Eukaryotic DNA interacts with the nuclear proteins using non-covalent ionic interactions. Proteins can recognize specific nucleotide sequences based on the sterical interactions with the DNA and these specific protein-DNA interactions are the basis for many nuclear processes, e.g. gene transcription, chromosomal replication, and recombination. New technology termed ChIP-Seq has been recently developed for the analysis of protein-DNA interactions on a whole genome scale and it is based on immunoprecipitation of chromatin and high-throughput DNA sequencing procedure. ChIP-Seq is a novel technique with a great potential to replace older techniques for mapping of protein-DNA interactions. In this thesis, we bring some new insights into the ChIP-Seq data analysis. First, we point out to some common and so far unknown artifacts of the method. Sequence tag distribution in the genome does not follow uniform distribution and we have found extreme hot-spots of tag accumulation over specific loci in the human and mouse genomes. These artifactual sequence tags accumulations will create false peaks in every ChIP-Seq dataset and we propose different filtering methods to reduce the number of false positives. Next, we propose random sampling as a powerful analytical tool in the ChIP-Seq data analysis that could be used to infer biological knowledge from the massive ChIP-Seq datasets. We created unbiased random sampling algorithm and we used this methodology to reveal some of the important biological properties of Nuclear Factor I DNA binding proteins. Finally, by analyzing the ChIP-Seq data in detail, we revealed that Nuclear Factor I transcription factors mainly act as activators of transcription, and that they are associated with specific chromatin modifications that are markers of open chromatin. We speculate that NFI factors only interact with the DNA wrapped around the nucleosome. We also found multiple loci that indicate possible chromatin barrier activity of NFI proteins, which could suggest the use of NFI binding sequences as chromatin insulators in biotechnology applications. RESUME : L'ADN des eucaryotes interagit avec les protéines nucléaires par des interactions noncovalentes ioniques. Les protéines peuvent reconnaître les séquences nucléotidiques spécifiques basées sur l'interaction stérique avec l'ADN, et des interactions spécifiques contrôlent de nombreux processus nucléaire, p.ex. transcription du gène, la réplication chromosomique, et la recombinaison. Une nouvelle technologie appelée ChIP-Seq a été récemment développée pour l'analyse des interactions protéine-ADN à l'échelle du génome entier et cette approche est basée sur l'immuno-précipitation de la chromatine et sur la procédure de séquençage de l'ADN à haut débit. La nouvelle approche ChIP-Seq a donc un fort potentiel pour remplacer les anciennes techniques de cartographie des interactions protéine-ADN. Dans cette thèse, nous apportons de nouvelles perspectives dans l'analyse des données ChIP-Seq. Tout d'abord, nous avons identifié des artefacts très communs associés à cette méthode qui étaient jusqu'à présent insoupçonnés. La distribution des séquences dans le génome ne suit pas une distribution uniforme et nous avons constaté des positions extrêmes d'accumulation de séquence à des régions spécifiques, des génomes humains et de la souris. Ces accumulations des séquences artéfactuelles créera de faux pics dans toutes les données ChIP-Seq, et nous proposons différentes méthodes de filtrage pour réduire le nombre de faux positifs. Ensuite, nous proposons un nouvel échantillonnage aléatoire comme un outil puissant d'analyse des données ChIP-Seq, ce qui pourraient augmenter l'acquisition de connaissances biologiques à partir des données ChIP-Seq. Nous avons créé un algorithme d'échantillonnage aléatoire et nous avons utilisé cette méthode pour révéler certaines des propriétés biologiques importantes de protéines liant à l'ADN nommés Facteur Nucléaire I (NFI). Enfin, en analysant en détail les données de ChIP-Seq pour la famille de facteurs de transcription nommés Facteur Nucléaire I, nous avons révélé que ces protéines agissent principalement comme des activateurs de transcription, et qu'elles sont associées à des modifications de la chromatine spécifiques qui sont des marqueurs de la chromatine ouverte. Nous pensons que lés facteurs NFI interagir uniquement avec l'ADN enroulé autour du nucléosome. Nous avons également constaté plusieurs régions génomiques qui indiquent une éventuelle activité de barrière chromatinienne des protéines NFI, ce qui pourrait suggérer l'utilisation de séquences de liaison NFI comme séquences isolatrices dans des applications de la biotechnologie.