953 resultados para Discrete Variable Representation
Resumo:
The present study discusses retention criteria for principal components analysis (PCA) applied to Likert scale items typical in psychological questionnaires. The main aim is to recommend applied researchers to restrain from relying only on the eigenvalue-than-one criterion; alternative procedures are suggested for adjusting for sampling error. An additional objective is to add evidence on the consequences of applying this rule when PCA is used with discrete variables. The experimental conditions were studied by means of Monte Carlo sampling including several sample sizes, different number of variables and answer alternatives, and four non-normal distributions. The results suggest that even when all the items and thus the underlying dimensions are independent, eigenvalues greater than one are frequent and they can explain up to 80% of the variance in data, meeting the empirical criterion. The consequences of using Kaiser"s rule are illustrated with a clinical psychology example. The size of the eigenvalues resulted to be a function of the sample size and the number of variables, which is also the case for parallel analysis as previous research shows. To enhance the application of alternative criteria, an R package was developed for deciding the number of principal components to retain by means of confidence intervals constructed about the eigenvalues corresponding to lack of relationship between discrete variables.
Resumo:
We describe an improved multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) scheme for genotyping Staphylococcus aureus. We compare its performance to those of multilocus sequence typing (MLST) and spa typing in a survey of 309 strains. This collection includes 87 epidemic methicillin-resistant S. aureus (MRSA) strains of the Harmony collection, 75 clinical strains representing the major MLST clonal complexes (CCs) (50 methicillin-sensitive S. aureus [MSSA] and 25 MRSA), 135 nasal carriage strains (133 MSSA and 2 MRSA), and 13 published S. aureus genome sequences. The results show excellent concordance between the techniques' results and demonstrate that the discriminatory power of MLVA is higher than those of both MLST and spa typing. Two hundred forty-two genotypes are discriminated with 14 VNTR loci (diversity index, 0.9965; 95% confidence interval, 0.9947 to 0.9984). Using a cutoff value of 45%, 21 clusters are observed, corresponding to the CCs previously defined by MLST. The variability of the different tandem repeats allows epidemiological studies, as well as follow-up of the evolution of CCs and the identification of potential ancestors. The 14 loci can conveniently be analyzed in two steps, based upon a first-line simplified assay comprising a subset of 10 loci (panel 1) and a second subset of 4 loci (panel 2) that provides higher resolution when needed. In conclusion, the MLVA scheme proposed here, in combination with available on-line genotyping databases (including http://mlva.u-psud.fr/), multiplexing, and automatic sizing, can provide a basis for almost-real-time large-scale population monitoring of S. aureus.
National minorities and their representation in social surveys : which practices make a difference ?
Resumo:
Finding an adequate paraphrase representation formalism is a challenging issue in Natural Language Processing. In this paper, we analyse the performance of Tree Edit Distance as a paraphrase representation baseline. Our experiments using Edit Distance Textual Entailment Suite show that, as Tree Edit Distance consists of a purely syntactic approach, paraphrase alternations not based on structural reorganizations do not find an adequate representation. They also show that there is much scope for better modelling of the way trees are aligned.
Resumo:
The choice network revenue management (RM) model incorporates customer purchase behavioras customers purchasing products with certain probabilities that are a function of the offeredassortment of products, and is the appropriate model for airline and hotel network revenuemanagement, dynamic sales of bundles, and dynamic assortment optimization. The underlyingstochastic dynamic program is intractable and even its certainty-equivalence approximation, inthe form of a linear program called Choice Deterministic Linear Program (CDLP) is difficultto solve in most cases. The separation problem for CDLP is NP-complete for MNL with justtwo segments when their consideration sets overlap; the affine approximation of the dynamicprogram is NP-complete for even a single-segment MNL. This is in contrast to the independentclass(perfect-segmentation) case where even the piecewise-linear approximation has been shownto be tractable. In this paper we investigate the piecewise-linear approximation for network RMunder a general discrete-choice model of demand. We show that the gap between the CDLP andthe piecewise-linear bounds is within a factor of at most 2. We then show that the piecewiselinearapproximation is polynomially-time solvable for a fixed consideration set size, bringing itinto the realm of tractability for small consideration sets; small consideration sets are a reasonablemodeling tradeoff in many practical applications. Our solution relies on showing that forany discrete-choice model the separation problem for the linear program of the piecewise-linearapproximation can be solved exactly by a Lagrangian relaxation. We give modeling extensionsand show by numerical experiments the improvements from using piecewise-linear approximationfunctions.
Resumo:
In humans, NK receptors are expressed by natural killer cells and some T cells, the latter of which are preferentially alphabetaTCR+ CD8+ cytolytic T lymphocytes (CTL). In this study we analyzed the expression of nine NK receptors (p58.1, p58.2, p70, p140, ILT2, NKRP1A, ZIN176, CD94 and CD94/NKG2A) in PBL from both healthy donors and melanoma patients. The percentages of NK receptor-positive T cells (NKT cells) varied strongly, and this variation was more important between individual patients than between individual healthy donors. In all the individuals, the NKT cells were preferentially CD28-, and a significant correlation was found between the percentage of CD28- T cells and the percentage of NK receptor+ T cells. Based on these data and the known activated phenotype of CD28- T cells, we propose that the CD28- CD8+ T cell pool represents or contains the currently active CTL population, and that the frequent expression of NK receptors reflects regulatory mechanisms modulating the extent of CTL effector function. Preliminary results indicate that some tumor antigen-specific T cells may indeed be CD28- and express NK receptors in vivo.
Resumo:
L’objecte del present treball és la realització d’una aplicació que permeti portar a terme el control estadístic multivariable en línia d’una planta SBR.Aquesta eina ha de permetre realitzar un anàlisi estadístic multivariable complet del lot en procés, de l’últim lot finalitzat i de la resta de lots processats a la planta.L’aplicació s’ha de realitzar en l’entorn LabVIEW. L’elecció d’aquest programa vecondicionada per l’actualització del mòdul de monitorització de la planta que s’estàdesenvolupant en aquest mateix entorn
Resumo:
Individual learning (e.g., trial-and-error) and social learning (e.g., imitation) are alternative ways of acquiring and expressing the appropriate phenotype in an environment. The optimal choice between using individual learning and/or social learning may be dictated by the life-stage or age of an organism. Of special interest is a learning schedule in which social learning precedes individual learning, because such a schedule is apparently a necessary condition for cumulative culture. Assuming two obligatory learning stages per discrete generation, we obtain the evolutionarily stable learning schedules for the three situations where the environment is constant, fluctuates between generations, or fluctuates within generations. During each learning stage, we assume that an organism may target the optimal phenotype in the current environment by individual learning, and/or the mature phenotype of the previous generation by oblique social learning. In the absence of exogenous costs to learning, the evolutionarily stable learning schedules are predicted to be either pure social learning followed by pure individual learning ("bang-bang" control) or pure individual learning at both stages ("flat" control). Moreover, we find for each situation that the evolutionarily stable learning schedule is also the one that optimizes the learned phenotype at equilibrium.
Resumo:
Abstract: Asthma prevalence in children and adolescents in Spain is 10-17%. It is the most common chronic illness during childhood. Prevalence has been increasing over the last 40 years and there is considerable evidence that, among other factors, continued exposure to cigarette smoke results in asthma in children. No statistical or simulation model exist to forecast the evolution of childhood asthma in Europe. Such a model needs to incorporate the main risk factors that can be managed by medical authorities, such as tobacco (OR = 1.44), to establish how they affect the present generation of children. A simulation model using conditional probability and discrete event simulation for childhood asthma was developed and validated by simulating realistic scenario. The parameters used for the model (input data) were those found in the bibliography, especially those related to the incidence of smoking in Spain. We also used data from a panel of experts from the Hospital del Mar (Barcelona) related to actual evolution and asthma phenotypes. The results obtained from the simulation established a threshold of a 15-20% smoking population for a reduction in the prevalence of asthma. This is still far from the current level in Spain, where 24% of people smoke. We conclude that more effort must be made to combat smoking and other childhood asthma risk factors, in order to significantly reduce the number of cases. Once completed, this simulation methodology can realistically be used to forecast the evolution of childhood asthma as a function of variation in different risk factors.