978 resultados para Categorical data
Resumo:
Problématique : Bien que le tabac et l’alcool soient les facteurs causaux principaux des cancers épidermoïdes de l’oropharynx, le virus du papillome humain (VPH) serait responsable de l’augmentation récente de l’incidence de ces cancers, particulièrement chez les patients jeunes et/ou non-fumeurs. La prévalence du VPH à haut risque, essentiellement de type 16, est passée de 20% à plus de 60% au cours des vingt dernières années. Certaines études indiquent que les cancers VPH-positifs ont un meilleur pronostic que les VPH- négatifs, mais des données prospectives à cet égard sont rares dans la littérature, surtout pour les études de phase III avec stratification basée sur les risques. Hypothèses et objectifs : Il est présumé que la présence du VPH est un facteur de bon pronostic. L’étude vise à documenter la prévalence du VPH dans les cancers de l’oropharynx, et à établir son impact sur le pronostic, chez des patients traités avec un schéma thérapeutique incluant la chimio-radiothérapie. Méthodologie : Les tumeurs proviennent de cas traités au CHUM pour des cancers épidermoïdes de la sphère ORL à un stade localement avancé (III, IVA et IVB). Elles sont conservées dans une banque tumorale, et les données cliniques sur l’efficacité du traitement et les effets secondaires, recueillies prospectivement. La présence du VPH est établie par biologie moléculaire déterminant la présence du génome VPH et son génotype. Résultats: 255 spécimens ont été soumis au test de génotypage Linear Array HPV. Après amplification par PCR, de l’ADN viral a été détecté dans 175 (68.6%) échantillons tumoraux ; le VPH de type 16 était impliqué dans 133 cas (52.25 %). Conclusion: Une proportion grandissante de cancers ORL est liée au VPH. Notre étude confirme que la présence du VPH est fortement associée à une amélioration du pronostic chez les patients atteints de cancers ORL traités par chimio-radiothérapie, et devrait être un facteur de stratification dans les essais cliniques comprenant des cas de cancers ORL.
Resumo:
Decision trees are very powerful tools for classification in data mining tasks that involves different types of attributes. When coming to handling numeric data sets, usually they are converted first to categorical types and then classified using information gain concepts. Information gain is a very popular and useful concept which tells you, whether any benefit occurs after splitting with a given attribute as far as information content is concerned. But this process is computationally intensive for large data sets. Also popular decision tree algorithms like ID3 cannot handle numeric data sets. This paper proposes statistical variance as an alternative to information gain as well as statistical mean to split attributes in completely numerical data sets. The new algorithm has been proved to be competent with respect to its information gain counterpart C4.5 and competent with many existing decision tree algorithms against the standard UCI benchmarking datasets using the ANOVA test in statistics. The specific advantages of this proposed new algorithm are that it avoids the computational overhead of information gain computation for large data sets with many attributes, as well as it avoids the conversion to categorical data from huge numeric data sets which also is a time consuming task. So as a summary, huge numeric datasets can be directly submitted to this algorithm without any attribute mappings or information gain computations. It also blends the two closely related fields statistics and data mining
Resumo:
By using suitable parameters, we present a uni¯ed aproach for describing four methods for representing categorical data in a contingency table. These methods include: correspondence analysis (CA), the alternative approach using Hellinger distance (HD), the log-ratio (LR) alternative, which is appropriate for compositional data, and the so-called non-symmetrical correspondence analysis (NSCA). We then make an appropriate comparison among these four methods and some illustrative examples are given. Some approaches based on cumulative frequencies are also linked and studied using matrices. Key words: Correspondence analysis, Hellinger distance, Non-symmetrical correspondence analysis, log-ratio analysis, Taguchi inertia
Resumo:
This paper addresses the application of a PCA analysis on categorical data prior to diagnose a patients data set using a Case-Based Reasoning (CBR) system. The particularity is that the standard PCA techniques are designed to deal with numerical attributes, but our medical data set contains many categorical data and alternative methods as RS-PCA are required. Thus, we propose to hybridize RS-PCA (Regular Simplex PCA) and a simple CBR. Results show how the hybrid system produces similar results when diagnosing a medical data set, that the ones obtained when using the original attributes. These results are quite promising since they allow to diagnose with less computation effort and memory storage
Resumo:
Los solventes orgánicos son sustancias químicas que por sus propiedades físico-químicas son fácilmente inhalados o absorbidos por la piel, pueden causar daños de diversa índole en la salud. En Colombia existen normas que contemplan las medidas de protección, sin embargo persiste la informalidad en el sector de pintores de autos, por lo cual los trabajadores expuestos, a largo plazo pueden ver afectada su salud. En este estudio se analizó la relación entre individuos expuestos laboralmente a los solventes orgánicos versus no expuestos con respecto a la longitud de sus telómeros y formación de fragilidades. Se emplearon muestras de sangre extraídas por venopunción, recolectada en dos tubos: uno con Heparina, destinado al cultivo de linfocitos, para obtener cromosomas metafásicos y evaluar en ellos la presencia de fragilidades; el otro tubo con EDTA, fue empleado para la extracción de ADN y se utilizó para obtener los valores de longitud telomérica mediante la técnica de PCR cuantitativa. Los análisis estadísticos se realizaron aplicando la prueba de rangos de Wilcoxon, en el caso de la presencia de fragilidades se analizó la razón No.Fragilidades/No.Metafases, aplicando el método de Wilcoxon se encontró que existe diferencia estadísticamente significativa entre expuestos y no expuestos (p = 0,036), en donde los expuestos presentan mayor frecuencia de fragilidades. Por otra parte el valor relativo de longitud telomérica del grupo de expuestos fue mayor que el observado en el grupo de no expuestos, esta diferencia fue estadísticamente significativa (Wilcoxon, p = 0.002).
Resumo:
Objective To undertake a process evaluation of pharmacists' recommendations arising in the context of a complex IT-enabled pharmacist-delivered randomised controlled trial (PINCER trial) to reduce the risk of hazardous medicines management in general practices. Methods PINCER pharmacists manually recorded patients’ demographics, details of interventions recommended, actions undertaken by practice staff and time taken to manage individual cases of hazardous medicines management. Data were coded and double entered into SPSS v15, and then summarised using percentages for categorical data (with 95% CI) and, as appropriate, means (SD) or medians (IQR) for continuous data. Key findings Pharmacists spent a median of 20 minutes (IQR 10, 30) reviewing medical records, recommending interventions and completing actions in each case of hazardous medicines management. Pharmacists judged 72% (95%CI 70, 74) (1463/2026) of cases of hazardous medicines management to be clinically relevant. Pharmacists recommended 2105 interventions in 74% (95%CI 73, 76) (1516/2038) of cases and 1685 actions were taken in 61% (95%CI 59, 63) (1246/2038) of cases; 66% (95%CI 64, 68) (1383/2105) of interventions recommended by pharmacists were completed and 5% (95%CI 4, 6) (104/2105) of recommendations were accepted by general practitioners (GPs), but not completed at the end of the pharmacists’ placement; the remaining recommendations were rejected or considered not relevant by GPs. Conclusions The outcome measures were used to target pharmacist activity in general practice towards patients at risk from hazardous medicines management. Recommendations from trained PINCER pharmacists were found to be broadly acceptable to GPs and led to ameliorative action in the majority of cases. It seems likely that the approach used by the PINCER pharmacists could be employed by other practice pharmacists following appropriate training.
Resumo:
This article presents important properties of standard discrete distributions and its conjugate densities. The Bernoulli and Poisson processes are described as generators of such discrete models. A characterization of distributions by mixtures is also introduced. This article adopts a novel singular notation and representation. Singular representations are unusual in statistical texts. Nevertheless, the singular notation makes it simpler to extend and generalize theoretical results and greatly facilitates numerical and computational implementation.
Análise genética de escores de avaliação visual de bovinos com modelos bayesianos de limiar e linear
Resumo:
O objetivo deste trabalho foi comparar as estimativas de parâmetros genéticos obtidas em análises bayesianas uni-característica e bi-característica, em modelo animal linear e de limiar, considerando-se as características categóricas morfológicas de bovinos da raça Nelore. Os dados de musculosidade, estrutura física e conformação foram obtidos entre 2000 e 2005, em 3.864 animais de 13 fazendas participantes do Programa Nelore Brasil. Foram realizadas análises bayesianas uni e bi-características, em modelos de limiar e linear. de modo geral, os modelos de limiar e linear foram eficientes na estimação dos parâmetros genéticos para escores visuais em análises bayesianas uni-características. Nas análises bi-características, observou-se que: com utilização de dados contínuos e categóricos, o modelo de limiar proporcionou estimativas de correlação genética de maior magnitude do que aquelas do modelo linear; e com o uso de dados categóricos, as estimativas de herdabilidade foram semelhantes. A vantagem do modelo linear foi o menor tempo gasto no processamento das análises. Na avaliação genética de animais para escores visuais, o uso do modelo de limiar ou linear não influenciou a classificação dos animais, quanto aos valores genéticos preditos, o que indica que ambos os modelos podem ser utilizados em programas de melhoramento genético.
Resumo:
OBJETIVO: Analisar o efeito de 12 semanas de intervenção envolvendo prática de atividade física, orientações alimentar e psicológica sobre fatores de risco para o desenvolvimento da síndrome metabólica em crianças e adolescentes obesos. MÉTODOS: Estudo longitudinal com 23 crianças e adolescentes obesos, com idade entre seis e 16 anos (12,0±3,2 anos). Foram mensurados: gordura corporal total e de tronco, glicemia, colesterol total e triglicérides, pressão arterial sistólica e diastólica. Os jovens foram submetidos a três sessões semanais de 60 minutos de exercício físico (atividades esportivas recreativas, ginástica, circuitos e caminhadas), durante 12 semanas. O teste do qui-quadrado foi usado para comparar dados categóricos daqueles que apresentaram valores acima das recomendações para cada fator de risco. O teste t para dados pareados foi aplicado para comparar os dois momentos do estudo. RESULTADOS: em indivíduos com alterações metabólicas no início do estudo, observou-se, após a intervenção, a diminuição de 11,6% na glicemia (105 para 93mg/dL; p=0,046) e de 24,9% no triglicérides (217 para 163mg/dL; p=0,013); porém, não houve diferenças na pressão arterial e no colesterol total. CONCLUSÕES: O programa de exercício físico aplicado nas crianças e adolescentes foi eficiente para melhorar os valores de glicemia e triglicérides.
Resumo:
Introduction: Hypoestrogenism is the main characteristic of female aging. It promotes significant changes in body composition, both in fat mass as in lean body mass, leading to a decrease in muscle strength and physical performance. Objective: The aim of this study was to test whether menopausal status and hormone levels are associated with muscular strength and physical performance in middle-aged women. Methods: In a cross-sectional study it was collected sociodemographic data, gynecological history, anthropometric and biochemical measures in women aged 40 to 65 years in Parnamirim-RN. The menopause status (pre, peri and post menopause) was determined by menstrual history. All women underwent three dimensions of physical performance assessment: handgrip dynamometry, gait speed and chair stands test - Short Physical Performance Battery (SPPB). Categorical data were presented as absolute and relative frequencies. Quantitative data were showed as mean and standard deviation and the normality of distribution was verified with Kolmogorov-Smirnov (KS) test. Biochemical measures of estradiol and follicle-stimulating hormone (FSH) were transformed to log10. ANOVA with Tukey post-test for comparison of variables between the groups pre, peri and post-menopausal was performed and then multiple linear regression analyzes. Results: Two hundred and seventy eight women aged 50.2 (±5.58) years composed this study, being 50 women in premenopausal status (18%), 122 in perimenopausal (43.9%), and 106 postmenopausal stage (38.1%). The groups were different in age (p=0.001), marital relationship duration (p <0.001), number of pregnancies (p=0.001) and parity (p=0.001). Differences in biochemical measures were observed among the groups: estradiol (p<0.001), FSH (p<0.001), total cholesterol (p=0.001). There were no differences in gait velocity between menopausal status. Values in mean of grip strength decreased by postmenopausal women to perimenopausal and premenopausal ones (24.5 ± 5.1, 25.6 ± 5.4, 26.9 ± 4.9 for post-stage, pre and peri menopausas, respectively, p = 0.02) and the performance of chair stands test was better in premenopausal women compared with that in peri and postmenopausal status (p = 0.02). In multiple linear regression for muscle strength, the variables that remained were: age, estradiol and somatic symptoms measured by Menopause Rating Scale-MRS (R2=0.15). While for the xiv chair-stands test the predictors were number of births and FSH values (R2=0.04). Conclusion: There is a relationship between the stages of menopause and muscle performance in measures of grip strength and sit-up test and these are influenced by the fall of estrogens levels. Data suggest that the decrease in muscle strength and physical performance already appear in the transition to menopause stage, pointing to the need for more research in this area and appropriate preventive interventions
Resumo:
The aim of this study was to investigate the social representation of technological education teachers at the Federal Technological Education Network. The survey was conducted from 2007 to 2010, and the respondents were 275 teachers, 135 of the Federal Center for Technological Education (CEFET in portuguese) in the state of Amazonas, in Manaus unit headquarters; 140 of the CEFET in the state of Rio Grande do Norte, a unit based in Natal. We adopt the concept of technological education as the top level of professional education, that is to say, the undergraduate programs of short duration called technological courses. The Federal Technological Education Network gathers hundreds of related institutions, coordinated and supervised by the Office of Vocational and Technological Education of the Ministry of Education. Although many of these institutions offer courses in technology education, no research addressing this subject from the perspective of Social Representations Theory (SRT) was found in the literature. We seek to unravel the social representation of technological education of the teachers by adopting the procedural approach of SRT. This is a qualitative approach, focusing on significant aspects of the representative activity and the formation mechanisms of the representation. Therefore, we search the socio-genesis of the representation in the articulations between discourses, social institutions and practices. We initiated the research through applying critical reading and an analytical perspective on the historical and regulatory documents of technological education in Brazil, from the early twentieth century to the present day. We adopt the Procedure for Multiple Classifications (PMC) from the Free Words Association Technique (FWAT) to access the elements of representational content. For the analysis of the data obtained with FWAT and selection of major words / phrases pertinent to the semantic field of education technology, we used Hamlet II software. For the data analysis of PMC and Free Classification (FC) we used the SPSS ® (Statistical Package for the Social Sciences) version 17.0 and used the method of multidimensional scaling - Multidimensional scaling - (MDS). The output from the central MDS takes the form of a set of scatterplots - "perceptual maps" - of which the points are the elements of the representational content. For the FC data analysis we used the Scalogram Multidimensional Analysis (SMA) - which makes use of the original data in its raw form and allows categorical data to be interpreted in the map as measures of (di)similarity. In order to help with the understanding of the settings of the perceptual maps of FC, we used the Content Analysis of the discourse fragments of the teachers interviewed. The results confirm our initial hypothesis regarding the presence of a single plot among the socio-cognitive study subjects, which is the basis for a social representation of technological education in line with the historic assumption of the dichotomy between mental and manual labor. In spite of the three merging representational elements of the representational content, the perceptual maps compiled from the MSA statistics corroborates the dichotomy, with the exception of the map relating to the subgroup of teachers belonging to the humanities
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Pós-graduação em Ciências da Motricidade - IBRC
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Pós-graduação em Ginecologia, Obstetrícia e Mastologia - FMB