916 resultados para Hierarchical
Resumo:
Software engineering is criticized as not being engineering or 'well-developed' science at all. Software engineers seem not to know exactly how long their projects will last, what they will cost, and will the software work properly after release. Measurements have to be taken in software projects to improve this situation. It is of limited use to only collect metrics afterwards. The values of the relevant metrics have to be predicted, too. The predictions (i.e. estimates) form the basis for proper project management. One of the most painful problems in software projects is effort estimation. It has a clear and central effect on other project attributes like cost and schedule, and to product attributes like size and quality. Effort estimation can be used for several purposes. In this thesis only the effort estimation in software projects for project management purposes is discussed. There is a short introduction to the measurement issues, and some metrics relevantin estimation context are presented. Effort estimation methods are covered quite broadly. The main new contribution in this thesis is the new estimation model that has been created. It takes use of the basic concepts of Function Point Analysis, but avoids the problems and pitfalls found in the method. It is relativelyeasy to use and learn. Effort estimation accuracy has significantly improved after taking this model into use. A major innovation related to the new estimationmodel is the identified need for hierarchical software size measurement. The author of this thesis has developed a three level solution for the estimation model. All currently used size metrics are static in nature, but this new proposed metric is dynamic. It takes use of the increased understanding of the nature of the work as specification and design work proceeds. It thus 'grows up' along with software projects. The effort estimation model development is not possible without gathering and analyzing history data. However, there are many problems with data in software engineering. A major roadblock is the amount and quality of data available. This thesis shows some useful techniques that have been successful in gathering and analyzing the data needed. An estimation process is needed to ensure that methods are used in a proper way, estimates are stored, reported and analyzed properly, and they are used for project management activities. A higher mechanism called measurement framework is also introduced shortly. The purpose of the framework is to define and maintain a measurement or estimationprocess. Without a proper framework, the estimation capability of an organization declines. It requires effort even to maintain an achieved level of estimationaccuracy. Estimation results in several successive releases are analyzed. It isclearly seen that the new estimation model works and the estimation improvementactions have been successful. The calibration of the hierarchical model is a critical activity. An example is shown to shed more light on the calibration and the model itself. There are also remarks about the sensitivity of the model. Finally, an example of usage is shown.
Resumo:
The present study applies a micro-level perspective on how within-individual differences in motivational and social-cognitive factors affect the weekly fluctuations of engagement in proactive career behaviors among a group of 67 German university students. Career self-efficacy beliefs, perceived career barriers, experienced social career support, positive and negative emotions, and career engagement were assessed weekly for 13 consecutive weeks. Hierarchical linear regression analyses showed that above-average levels of career engagement within individuals were predicted by higher than average perceived social support and positive emotions during a given week. Conversely, within-individual differences in self-efficacy, barriers, and negative emotions had no effect. The results suggest that career interventions should provide boosts in social support and positive emotions.
Resumo:
La présente thèse s'intitule "Développent et Application des Méthodologies Computationnelles pour la Modélisation Qualitative". Elle comprend tous les différents projets que j'ai entrepris en tant que doctorante. Plutôt qu'une mise en oeuvre systématique d'un cadre défini a priori, cette thèse devrait être considérée comme une exploration des méthodes qui peuvent nous aider à déduire le plan de processus regulatoires et de signalisation. Cette exploration a été mue par des questions biologiques concrètes, plutôt que par des investigations théoriques. Bien que tous les projets aient inclus des systèmes divergents (réseaux régulateurs de gènes du cycle cellulaire, réseaux de signalisation de cellules pulmonaires) ainsi que des organismes (levure à fission, levure bourgeonnante, rat, humain), nos objectifs étaient complémentaires et cohérents. Le projet principal de la thèse est la modélisation du réseau de l'initiation de septation (SIN) du S.pombe. La cytokinèse dans la levure à fission est contrôlée par le SIN, un réseau signalant de protéines kinases qui utilise le corps à pôle-fuseau comme échafaudage. Afin de décrire le comportement qualitatif du système et prédire des comportements mutants inconnus, nous avons décidé d'adopter l'approche de la modélisation booléenne. Dans cette thèse, nous présentons la construction d'un modèle booléen étendu du SIN, comprenant la plupart des composantes et des régulateurs du SIN en tant que noeuds individuels et testable expérimentalement. Ce modèle utilise des niveaux d'activité du CDK comme noeuds de contrôle pour la simulation d'évènements du SIN à différents stades du cycle cellulaire. Ce modèle a été optimisé en utilisant des expériences d'un seul "knock-out" avec des effets phénotypiques connus comme set d'entraînement. Il a permis de prédire correctement un set d'évaluation de "knock-out" doubles. De plus, le modèle a fait des prédictions in silico qui ont été validées in vivo, permettant d'obtenir de nouvelles idées de la régulation et l'organisation hiérarchique du SIN. Un autre projet concernant le cycle cellulaire qui fait partie de cette thèse a été la construction d'un modèle qualitatif et minimal de la réciprocité des cyclines dans la S.cerevisiae. Les protéines Clb dans la levure bourgeonnante présentent une activation et une dégradation caractéristique et séquentielle durant le cycle cellulaire, qu'on appelle communément les vagues des Clbs. Cet évènement est coordonné avec la courbe d'activation inverse du Sic1, qui a un rôle inhibitoire dans le système. Pour l'identification des modèles qualitatifs minimaux qui peuvent expliquer ce phénomène, nous avons sélectionné des expériences bien définies et construit tous les modèles minimaux possibles qui, une fois simulés, reproduisent les résultats attendus. Les modèles ont été filtrés en utilisant des simulations ODE qualitatives et standardisées; seules celles qui reproduisaient le phénotype des vagues ont été gardées. L'ensemble des modèles minimaux peut être utilisé pour suggérer des relations regulatoires entre les molécules participant qui peuvent ensuite être testées expérimentalement. Enfin, durant mon doctorat, j'ai participé au SBV Improver Challenge. Le but était de déduire des réseaux spécifiques à des espèces (humain et rat) en utilisant des données de phosphoprotéines, d'expressions des gènes et des cytokines, ainsi qu'un réseau de référence, qui était mis à disposition comme donnée préalable. Notre solution pour ce concours a pris la troisième place. L'approche utilisée est expliquée en détail dans le dernier chapitre de la thèse. -- The present dissertation is entitled "Development and Application of Computational Methodologies in Qualitative Modeling". It encompasses the diverse projects that were undertaken during my time as a PhD student. Instead of a systematic implementation of a framework defined a priori, this thesis should be considered as an exploration of the methods that can help us infer the blueprint of regulatory and signaling processes. This exploration was driven by concrete biological questions, rather than theoretical investigation. Even though the projects involved divergent systems (gene regulatory networks of cell cycle, signaling networks in lung cells), as well as organisms (fission yeast, budding yeast, rat, human), our goals were complementary and coherent. The main project of the thesis is the modeling of the Septation Initiation Network (SIN) in S.pombe. Cytokinesis in fission yeast is controlled by the SIN, a protein kinase signaling network that uses the spindle pole body as scaffold. In order to describe the qualitative behavior of the system and predict unknown mutant behaviors we decided to adopt a Boolean modeling approach. In this thesis, we report the construction of an extended, Boolean model of the SIN, comprising most SIN components and regulators as individual, experimentally testable nodes. The model uses CDK activity levels as control nodes for the simulation of SIN related events in different stages of the cell cycle. The model was optimized using single knock-out experiments of known phenotypic effect as a training set, and was able to correctly predict a double knock-out test set. Moreover, the model has made in silico predictions that have been validated in vivo, providing new insights into the regulation and hierarchical organization of the SIN. Another cell cycle related project that is part of this thesis was to create a qualitative, minimal model of cyclin interplay in S.cerevisiae. CLB proteins in budding yeast present a characteristic, sequential activation and decay during the cell cycle, commonly referred to as Clb waves. This event is coordinated with the inverse activation curve of Sic1, which has an inhibitory role in the system. To generate minimal qualitative models that can explain this phenomenon, we selected well-defined experiments and constructed all possible minimal models that, when simulated, reproduce the expected results. The models were filtered using standardized qualitative ODE simulations; only the ones reproducing the wave-like phenotype were kept. The set of minimal models can be used to suggest regulatory relations among the participating molecules, which will subsequently be tested experimentally. Finally, during my PhD I participated in the SBV Improver Challenge. The goal was to infer species-specific (human and rat) networks, using phosphoprotein, gene expression and cytokine data and a reference network provided as prior knowledge. Our solution to the challenge was selected as in the final chapter of the thesis.
Resumo:
BACKGROUND: Pneumonia is the biggest cause of deaths in young children in developing countries, but early diagnosis and intervention can effectively reduce mortality. We aimed to assess the diagnostic value of clinical signs and symptoms to identify radiological pneumonia in children younger than 5 years and to review the accuracy of WHO criteria for diagnosis of clinical pneumonia. METHODS: We searched Medline (PubMed), Embase (Ovid), the Cochrane Database of Systematic Reviews, and reference lists of relevant studies, without date restrictions, to identify articles assessing clinical predictors of radiological pneumonia in children. Selection was based on: design (diagnostic accuracy studies), target disease (pneumonia), participants (children aged <5 years), setting (ambulatory or hospital care), index test (clinical features), and reference standard (chest radiography). Quality assessment was based on the 2011 Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) criteria. For each index test, we calculated sensitivity and specificity and, when the tests were assessed in four or more studies, calculated pooled estimates with use of bivariate model and hierarchical summary receiver operation characteristics plots for meta-analysis. FINDINGS: We included 18 articles in our analysis. WHO-approved signs age-related fast breathing (six studies; pooled sensitivity 0·62, 95% CI 0·26-0·89; specificity 0·59, 0·29-0·84) and lower chest wall indrawing (four studies; 0·48, 0·16-0·82; 0·72, 0·47-0·89) showed poor diagnostic performance in the meta-analysis. Features with the highest pooled positive likelihood ratios were respiratory rate higher than 50 breaths per min (1·90, 1·45-2·48), grunting (1·78, 1·10-2·88), chest indrawing (1·76, 0·86-3·58), and nasal flaring (1·75, 1·20-2·56). Features with the lowest pooled negative likelihood ratio were cough (0·30, 0·09-0·96), history of fever (0·53, 0·41-0·69), and respiratory rate higher than 40 breaths per min (0·43, 0·23-0·83). INTERPRETATION: Not one clinical feature was sufficient to diagnose pneumonia definitively. Combination of clinical features in a decision tree might improve diagnostic performance, but the addition of new point-of-care tests for diagnosis of bacterial pneumonia would help to attain an acceptable level of accuracy. FUNDING: Swiss National Science Foundation.
Resumo:
A new issue, once again a bouquet of attractive papers. First of all the paper by Droit-Dupré et al. (10.1007/s00428-015-1724-9). The group studied colonic adenocarcinomas, not otherwise specified, by immunohistochemistry for the expression of markers of intestinal epithelial cell differentiation. Hierarchical clustering analysis identified a major cluster of two thirds of the case series, expressing cytokeratin 20, CDX2 and MUC2 and invariably mismatch repair competent, which they called crypt-like. In stage III colon cancer, the crypt-like cluster had a better prognosis. The paper is a relatively simple example of what is happening in cancer classification beyond morphology: multiparameter differentiation and (epi)genomic markers defining new subtypes of cancer with potential clinical significance in clinical decision making.
Resumo:
The current challenge in a context of major environmental changes is to anticipate the responses of species to future landscape and climate scenarios. In the Mediterranean basin, climate change is one the most powerful driving forces of fire dynamics, with fire frequency and impact having markedly increased in recent years. Species distribution modelling plays a fundamental role in this challenge, but better integration of available ecological knowledge is needed to adequately guide conservation efforts. Here, we quantified changes in habitat suitability of an early-succession bird in Catalonia, the Dartford Warbler (Sylvia undata) ― globally evaluated as Near Threatened in the IUCN Red List. We assessed potential changes in species distributions between 2000 and 2050 under different fire management and climate change scenarios and described landscape dynamics using a spatially-explicit fire-succession model that simulates fire impacts in the landscape and post-fire regeneration (MEDFIRE model). Dartford Warbler occurrence data were acquired at two different spatial scales from: 1) the Atlas of European Breeding Birds (EBCC) and 2) Catalan Breeding Bird Atlas (CBBA). Habitat suitability was modelled using five widely-used modelling techniques in an ensemble forecasting framework. Our results indicated considerable habitat suitability losses (ranging between 47% and 57% in baseline scenarios), which were modulated to a large extent by fire regime changes derived from fire management policies and climate changes. Such result highlighted the need for taking the spatial interaction between climate changes, fire-mediated landscape dynamics and fire management policies into account for coherently anticipating habitat suitability changes of early succession bird species. We conclude that fire management programs need to be integrated into conservation plans to effectively preserve sparsely forested and early succession habitats and their associated species in the face of global environmental change.
Resumo:
The main aim of this study was to replicate and extend previous results on subtypes of adolescents with substance use disorders (SUD), according to their Minnesota Multiphasic Personality Inventory for adolescents (MMPI-A) profiles. Sixty patients with SUD and psychiatric comorbidity (41.7% male, mean age = 15.9 years old) completed the MMPI-A, the Teen Addiction Severity Index (T-ASI), the Child Behaviour Checklist (CBCL), and were interviewed in order to determine DSMIV diagnoses and level of substance use. Mean MMPI-A personality profile showed moderate peaks in Psychopathic Deviate, Depression and Hysteria scales. Hierarchical cluster analysis revealed four profiles (acting-out, 35% of the sample; disorganized-conflictive, 15%; normative-impulsive, 15%; and deceptive-concealed, 35%). External correlates were found between cluster 1, CBCL externalizing symptoms at a clinical level and conduct disorders, and between cluster 2 and mixed CBCL internalized/externalized symptoms at a clinical level. Discriminant analysis showed that Depression, Psychopathic Deviate and Psychasthenia MMPI-A scales correctly classified 90% of the patients into the clusters obtained.
Resumo:
In traffic accidents involving motorcycles, paint traces can be transferred from the rider's helmet or smeared onto its surface. These traces are usually in the form of chips or smears and are frequently collected for comparison purposes. This research investigates the physical and chemical characteristics of the coatings found on motorcycles helmets. An evaluation of the similarities between helmet and automotive coating systems was also performed.Twenty-seven helmet coatings from 15 different brands and 22 models were considered. One sample per helmet was collected and observed using optical microscopy. FTIR spectroscopy was then used and seven replicate measurements per layer were carried out to study the variability of each coating system (intravariability). Principal Component Analysis (PCA) and Hierarchical Cluster Analysis (HCA) were also performed on the infrared spectra of the clearcoats and basecoats of the data set. The most common systems were composed of two or three layers, consistently involving a clearcoat and basecoat. The coating systems of helmets with composite shells systematically contained a minimum of three layers. FTIR spectroscopy results showed that acrylic urethane and alkyd urethane were the most frequent binders used for clearcoats and basecoats. A high proportion of the coatings were differentiated (more than 95%) based on microscopic examinations. The chemical and physical characteristics of the coatings allowed the differentiation of all but one pair of helmets of the same brand, model and color. Chemometrics (PCA and HCA) corroborated classification based on visual comparisons of the spectra and allowed the study of the whole data set at once (i.e., all spectra of the same layer). Thus, the intravariability of each helmet and its proximity to the others (intervariability) could be more readily assessed. It was also possible to determine the most discriminative chemical variables based on the study of the PCA loadings. Chemometrics could therefore be used as a complementary decision-making tool when many spectra and replicates have to be taken into account. Similarities between automotive and helmet coating systems were highlighted, in particular with regard to automotive coating systems on plastic substrates (microscopy and FTIR). However, the primer layer of helmet coatings was shown to differ from the automotive primer. If the paint trace contains this layer, the risk of misclassification (i.e., helmet versus vehicle) is reduced. Nevertheless, a paint examiner should pay close attention to these similarities when analyzing paint traces, especially regarding smears or paint chips presenting an incomplete layer system.
Resumo:
Advances in flow cytometry and other single-cell technologies have enabled high-dimensional, high-throughput measurements of individual cells as well as the interrogation of cell population heterogeneity. However, in many instances, computational tools to analyze the wealth of data generated by these technologies are lacking. Here, we present a computational framework for unbiased combinatorial polyfunctionality analysis of antigen-specific T-cell subsets (COMPASS). COMPASS uses a Bayesian hierarchical framework to model all observed cell subsets and select those most likely to have antigen-specific responses. Cell-subset responses are quantified by posterior probabilities, and human subject-level responses are quantified by two summary statistics that describe the quality of an individual's polyfunctional response and can be correlated directly with clinical outcome. Using three clinical data sets of cytokine production, we demonstrate how COMPASS improves characterization of antigen-specific T cells and reveals cellular 'correlates of protection/immunity' in the RV144 HIV vaccine efficacy trial that are missed by other methods. COMPASS is available as open-source software.
Resumo:
Probabilistic inversion methods based on Markov chain Monte Carlo (MCMC) simulation are well suited to quantify parameter and model uncertainty of nonlinear inverse problems. Yet, application of such methods to CPU-intensive forward models can be a daunting task, particularly if the parameter space is high dimensional. Here, we present a 2-D pixel-based MCMC inversion of plane-wave electromagnetic (EM) data. Using synthetic data, we investigate how model parameter uncertainty depends on model structure constraints using different norms of the likelihood function and the model constraints, and study the added benefits of joint inversion of EM and electrical resistivity tomography (ERT) data. Our results demonstrate that model structure constraints are necessary to stabilize the MCMC inversion results of a highly discretized model. These constraints decrease model parameter uncertainty and facilitate model interpretation. A drawback is that these constraints may lead to posterior distributions that do not fully include the true underlying model, because some of its features exhibit a low sensitivity to the EM data, and hence are difficult to resolve. This problem can be partly mitigated if the plane-wave EM data is augmented with ERT observations. The hierarchical Bayesian inverse formulation introduced and used herein is able to successfully recover the probabilistic properties of the measurement data errors and a model regularization weight. Application of the proposed inversion methodology to field data from an aquifer demonstrates that the posterior mean model realization is very similar to that derived from a deterministic inversion with similar model constraints.
Resumo:
Tutkimuksen tavoitteena oli etsiä kohdeorganisaation taustalla olevia tekijöitä, jotka joko edesauttavat tai estävät nykyisen johtamisjärjestelmän soveltamista, tiedon käyttöä ja hyödyntämistä organisaation työpisteissä. Kohdeorganisaatio on Varenso Oy, Tekniset tuotantopalvelut. Teoriaosiossa käsitellään tietojohtamiseen liittyvää käsitteistöä sekä tiedon luomiseen, johtamiseen ja hyödyntämiseen liittyviä tekijöitä. Johtamista lähestytään myös perustehtävän, strategian ja muutosvalmiuden, valta- ja organisaatiorakenteiden sekä informaatio- ohjauksen näkökulmasta. Lopuksi käsitellään suorituskykyä, tavoitteiden asettamista, mittaamista funktionaalisissa- ja prosessijohdetuissa organisaatioissa. Empiirisessä osiossa tehtiin kyselytutkimus. Tulokset analysoitiin monimuuttujamenetelmiä soveltaen ja löydettiin faktorit, joiden avulla on mahdollista vaikuttaa kohdeorganisaation toimintaan. Kyselytutkimuksen avulla tulkittiin organisaation tämän hetkistä suorituskykyä ja asemaa suhteessa tavoitteisiin. Tuloksena syntyi myös toimenpideehdotuksia.
Resumo:
Background: Nursing terminologies are designed to support nursing practice but, as with any other clinical tool, they should be evaluated. Cross-mapping is a formal method for examining the validity of the existing controlled vocabularies. Objectives: The study aims to assess the inclusiveness and expressiveness of the nursing diagnosis axis of a newly implemented interface terminology by cross-mapping with the NANDA-I taxonomy. Design/Methods: The study applied a descriptive design, using a cross-sectional, bidirectional mapping strategy. The sample included 728 concepts from both vocabularies. Concept cross-mapping was carried out to identify one-to-one, negative, and hierarchical connections. The analysis was conducted using descriptive statistics. Results: Agreement of the raters" mapping achieved 97%. More than 60% of the nursing diagnosis concepts in the NANDA-I taxonomy were mapped to concepts in the diagnosis axis of the new interface terminology; 71.1% were reversely mapped. Conclusions: Main results for outcome measures suggest that the diagnosis axis of this interface terminology meets the validity criterion of cross-mapping when mapped from and to the NANDA-I taxonomy.
Resumo:
Long-term assessment of the effects of psychotherapy for personality disorders (PDs) in a natural environment is an important task. Such research contributes to enlarge the practice-based evidence, embedded in broad collaborations between clinicians and researchers in psychotherapy for PDs. The present pilot study used rigorous assessment procedures and incorporated feedback loops of outcome information to the therapists in demonstrating the effects of psychotherapy for PD in a natural setting. The number of Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV), criteria for any PD was the primary outcome (along with psychological distress, depression, impulsiveness, and quality of life as secondary measures), assessed at intake, 6, 12, 18, and 24 months of psychotherapy for N = 13 patients with PD. Data were analyzed using hierarchical linear modeling. Results demonstrated a large pre-post effect (d = 2.22) for the observer-rated measure (primary outcome), and small to medium effects for the secondary outcomes; these results were corroborated by a steady decrease of symptoms over all five time points, which was significant for several outcomes. These results add a piece to the literature by demonstrating the effects of long-term psychotherapy for PDs in increasingly diverse contexts and suggest that practice-oriented research can be carried out in a collaborative and systematic manner.
Resumo:
Tutkielman tarkoituksena oli yleisesti ymmärtää sisäistä yhteistyökyvykkyyttä ja sen roolia yrityksen menestystekijänä. Empiirisesti sisäistä yhteistyökyvykkyyttä tarkasteltiin case-organisaation, Metso Paper Jyväskylän, avulla. Tavoitteena oli selvittää, mistä sisäinen yhteistyökyvykkyys koostuu sekä siihen vaikuttavia kriittisiä tekijöitä. Myös käytännön keinoja ja sovelluksia etsittiin edistämään sisäistä yhteistyötä. Tutkielmassa käytettiin kvalitatiivista tutkimusotetta. Tiedon hankinnan strategiana oli tapaustutkimus ja tutkimusaineisto kerättiin teemahaastattelujen avulla. Saatu aineisto analysoitiin teemoittelun avulla. Teemoja etsittiin tekstistä teoriasta johdettujen oletusten ja empiriasta löydettyjen tekijöiden perusteella. Tutkimuksen tuloksena oli, että yhteistyökyvykkyys koostuu sekä yksilöön että organisaatioon liittyvistä tekijöistä. Yhteistyökyvykkyyden kriittisiä ja sitä estäviä tekijöitä olivat toiminnan painottuminen ulkoisiin suhteisiin, muodollinen ja monitasoinen organisaatiorakenne, organisaatiorakenteiden nopea muutostahti, toimintamallin yksisuuntaisuus, organisaatio- ja johtamiskulttuuri, joka ei tue yhteistyötä, ylimmän johdon ja operatiivisten toimintojen välinen kuilu, tiedon painottuminen tietojärjestelmiin, dialogin puute, kommunikoinnin yksisuuntaisuus ja erilaisista työkulttuureista aiheutuvat kommunikaatioesteet, yhteistyösuhteiden puute ja transaktiosuhteiden painottuminen sekä ryhmien sisäisen yhtenäisyyden vahvuus. Näiden tekijöiden esiin tuominen tuo uusia näkökulmia organisaation sisäisen yhteistyökyvykkyyden ymmärtämiseen ja kehittämiseen.
Resumo:
Background: Differences in the distribution of genotypes between individuals of the same ethnicity are an important confounder factor commonly undervalued in typical association studies conducted in radiogenomics. Objective: To evaluate the genotypic distribution of SNPs in a wide set of Spanish prostate cancer patients for determine the homogeneity of the population and to disclose potential bias. Design, Setting, and Participants: A total of 601 prostate cancer patients from Andalusia, Basque Country, Canary and Catalonia were genotyped for 10 SNPs located in 6 different genes associated to DNA repair: XRCC1 (rs25487, rs25489, rs1799782), ERCC2 (rs13181), ERCC1 (rs11615), LIG4 (rs1805388, rs1805386), ATM (rs17503908, rs1800057) and P53 (rs1042522). The SNP genotyping was made in a Biotrove OpenArrayH NT Cycler. Outcome Measurements and Statistical Analysis: Comparisons of genotypic and allelic frequencies among populations, as well as haplotype analyses were determined using the web-based environment SNPator. Principal component analysis was made using the SnpMatrix and XSnpMatrix classes and methods implemented as an R package. Non-supervised hierarchical cluster of SNP was made using MultiExperiment Viewer. Results and Limitations: We observed that genotype distribution of 4 out 10 SNPs was statistically different among the studied populations, showing the greatest differences between Andalusia and Catalonia. These observations were confirmed in cluster analysis, principal component analysis and in the differential distribution of haplotypes among the populations. Because tumor characteristics have not been taken into account, it is possible that some polymorphisms may influence tumor characteristics in the same way that it may pose a risk factor for other disease characteristics. Conclusion: Differences in distribution of genotypes within different populations of the same ethnicity could be an important confounding factor responsible for the lack of validation of SNPs associated with radiation-induced toxicity, especially when extensive meta-analysis with subjects from different countries are carried out.