869 results for height partition clustering
Abstract:
The aim of this study was to describe the distribution of waist circumference (WC) and WC to height (WCTH) values among Kaingang indigenous adolescents in order to estimate the prevalence of high WCTH values and evaluate the correlation between WC and WCTH and body mass index (BMI)-for-age. A total of 1,803 indigenous adolescents were evaluated using a school-based cross-sectional study. WCTH values > 0.5 were considered high. Higher mean WC and WCTH values were observed for girls in all age categories. WCTH values > 0.5 were observed in 25.68% of the overall sample of adolescents. Mean WC and WCTH values were significantly higher for adolescents with BMI/age z-scores > 2 than for those with normal z-scores. The correlation coefficients of WC and WCTH for BMI/age were r = 0.68 and 0.76, respectively, for boys, and r = 0.79 and 0.80, respectively, for girls. This study highlights elevated mean WC and WCTH values and high prevalence of abdominal obesity among Kaingang indigenous adolescents.
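The WCTH screening rule described above can be sketched in a few lines. This is only an illustration of the stated cut-off (WCTH > 0.5); the function names and the sample measurements are hypothetical and do not come from the study.

```python
def waist_to_height_ratio(waist_cm: float, height_cm: float) -> float:
    """Return waist circumference divided by height (both in the same unit)."""
    if height_cm <= 0:
        raise ValueError("height must be positive")
    return waist_cm / height_cm


def is_high_wcth(waist_cm: float, height_cm: float, cutoff: float = 0.5) -> bool:
    """Flag a ratio strictly above the cut-off, as in the abstract (> 0.5)."""
    return waist_to_height_ratio(waist_cm, height_cm) > cutoff


def prevalence_of_high_wcth(measurements, cutoff: float = 0.5) -> float:
    """Percentage of (waist, height) pairs whose ratio exceeds the cut-off."""
    flags = [is_high_wcth(w, h, cutoff) for w, h in measurements]
    return 100.0 * sum(flags) / len(flags)


# Hypothetical (waist_cm, height_cm) pairs, for illustration only.
sample = [(70.0, 150.0), (82.0, 155.0), (75.0, 150.0), (90.0, 160.0)]
print(prevalence_of_high_wcth(sample))
```

A ratio of exactly 0.5 is not flagged, because the abstract defines high values with a strict inequality.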
Abstract:
In developed countries, children with intrauterine growth restriction (IUGR) or born preterm (PT) tend to achieve catch-up growth. There is little information about height catch-up in developing countries and about height catch-down in both developed and developing countries. We studied the effect of IUGR and PT birth on height catch-up and catch-down growth of children from two cohorts of liveborn singletons. Data from 1,463 children were collected at birth and at school age in Ribeirao Preto (RP), a more developed city, and in Sao Luis (SL), a less developed city. A change in z-score between schoolchild height z-score and birth length z-score >= 0.67 was considered catch-up; a change in z-score <= -0.67 indicated catch-down growth. The explanatory variables were: appropriate weight for gestational age/PT birth in four categories: term children without IUGR (normal), IUGR only (term with IUGR), PT only (preterm without IUGR) and preterm with IUGR; infant's sex; maternal parity, age, schooling and marital status; occupation of family head; family income and neonatal ponderal index (PI). The risk ratio for catch-up and catch-down was estimated by multinomial logistic regression for each city. In RP, preterms without IUGR (RR = 4.13) and thin children (PI < 10th percentile, RR = 14.39) had a higher risk of catch-down; catch-up was higher among terms with IUGR (RR = 5.53), preterms with IUGR (RR = 5.36) and children born to primiparous mothers (RR = 1.83). In SL, catch-down was higher among preterms without IUGR (RR = 5.19), girls (RR = 1.52) and children from low-income families (RR = 2.74); the lowest risk of catch-down (RR = 0.27) and the highest risk of catch-up (RR = 3.77) were observed among terms with IUGR. In both cities, terms with IUGR presented height catch-up growth, whereas preterms with IUGR only had height catch-up growth in the more affluent setting. Preterms without IUGR presented height catch-down growth, suggesting that a better socioeconomic situation facilitates height catch-up and prevents height catch-down growth.
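The catch-up and catch-down definition used above (a z-score change of at least 0.67 in either direction) can be written as a small classifier. This is a minimal sketch: the function name and the label "stable" for the in-between case are assumptions, not terms from the study.

```python
def classify_height_change(birth_length_z: float,
                           school_height_z: float,
                           threshold: float = 0.67) -> str:
    """Classify growth from the change between two z-scores.

    delta >= threshold   -> "catch-up"
    delta <= -threshold  -> "catch-down"
    otherwise            -> "stable" (hypothetical label for the remainder)
    """
    delta = school_height_z - birth_length_z
    if delta >= threshold:
        return "catch-up"
    if delta <= -threshold:
        return "catch-down"
    return "stable"


# Illustrative z-scores only, not data from the cohorts.
print(classify_height_change(-1.5, -0.5))  # change of +1.0
print(classify_height_change(0.5, -0.5))   # change of -1.0
print(classify_height_change(0.0, 0.3))    # change of +0.3
```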
Abstract:
Background and Aim: The identification of gastric carcinomas (GC) has traditionally been based on histomorphology. Recently, DNA microarrays have successfully been used to identify tumors through clustering of the expression profiles. Random forest clustering is widely used for tissue microarrays and other immunohistochemical data, because it handles highly skewed tumor marker expressions well, and weighs the contribution of each marker according to its relatedness with other tumor markers. In the present study, we identified biologically and clinically meaningful groups of GC by hierarchical clustering analysis of immunohistochemical protein expression. Methods: We selected 28 proteins (p16, p27, p21, cyclin D1, cyclin A, cyclin B1, pRb, p53, c-met, c-erbB-2, vascular endothelial growth factor, transforming growth factor [TGF]-beta I, TGF-beta II, MutS homolog-2, bcl-2, bax, bak, bcl-x, adenomatous polyposis coli, clathrin, E-cadherin, beta-catenin, mucin (MUC) 1, MUC2, MUC5AC, MUC6, matrix metalloproteinase [MMP]-2, and MMP-9) to be investigated by immunohistochemistry in 482 GC. The data were analyzed using a random forest clustering method. Results: Proteins related to cell cycle, growth factor, cell motility, cell adhesion, apoptosis, and matrix remodeling were highly expressed in GC. We identified protein expressions associated with poor survival in diffuse-type GC. Conclusions: Based on the expression analysis of 28 proteins, we identified two groups of GC that could not be explained by any clinicopathological variables, and a subgroup of long-surviving diffuse-type GC patients with a distinct molecular profile. These results provide not only a new molecular basis for understanding the biological properties of GC, but also better prediction of survival than the classic pathological grouping.
Abstract:
This paper addresses the m-machine no-wait flow shop problem where the set-up time of a job is separated from its processing time. The performance measure considered is the total flowtime. A new hybrid metaheuristic Genetic Algorithm-Cluster Search is proposed to solve the scheduling problem. The performance of the proposed method is evaluated and the results are compared with the best method reported in the literature. Experimental tests show superiority of the new method for the test problems set, regarding the solution quality. (c) 2012 Elsevier Ltd. All rights reserved.
Abstract:
Background: Transcript enumeration methods such as SAGE, MPSS, and sequencing-by-synthesis EST "digital northern" are important high-throughput techniques for digital gene expression measurement. Like other counting or voting processes, these measurements constitute compositional data exhibiting properties particular to the simplex space, where the summation of the components is constrained. These properties are not present in regular Euclidean spaces, on which hybridization-based microarray data are often modeled. Therefore, pattern recognition methods commonly used for microarray data analysis may be non-informative for the data generated by transcript enumeration techniques, since they ignore certain fundamental properties of this space. Results: Here we present a software tool, Simcluster, designed to perform clustering analysis for data on the simplex space. We present Simcluster as a stand-alone command-line C package and as a user-friendly on-line tool. Both versions are available at: http://xerad.systemsbiology.net/simcluster. Conclusion: Simcluster is designed in accordance with a well-established mathematical framework for compositional data analysis, which provides principled procedures for dealing with the simplex space, and is thus applicable in a number of contexts, including enumeration-based gene expression data.
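To make the simplex constraint concrete, the sketch below computes the Aitchison distance, a standard distance from compositional data analysis, via the centered log-ratio (clr) transform. It is not taken from Simcluster and makes no claim about which distance that tool uses; the function names are hypothetical, and all counts are assumed to be strictly positive (zero counts would need separate handling).

```python
import math


def closure(counts):
    """Rescale positive counts so that they sum to 1 (a point on the simplex)."""
    total = sum(counts)
    return [c / total for c in counts]


def clr(composition):
    """Centered log-ratio transform: log of each part minus the mean log."""
    logs = [math.log(p) for p in composition]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]


def aitchison_distance(a, b):
    """Euclidean distance between the clr transforms of two compositions."""
    ca, cb = clr(closure(a)), clr(closure(b))
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(ca, cb)))


# Two hypothetical libraries with the same proportions but different depth,
# and a third with different proportions.
shallow = [10, 20, 70]
deep = [100, 200, 700]
other = [40, 40, 20]
print(aitchison_distance(shallow, deep))
print(aitchison_distance(shallow, other))
```

The first distance is zero up to rounding, because only the proportions matter; a plain Euclidean distance on the raw counts would instead treat the two libraries as far apart.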
Abstract:
Background: Childhood obesity is a public health problem worldwide. Visceral obesity, particularly associated with cardio-metabolic risk, has been assessed by body mass index (BMI) and waist circumference, but both methods use sex- and age-specific percentile tables and are influenced by sexual maturity. Waist-to-height ratio (WHtR) is easier to obtain, does not involve tables and can be used to diagnose visceral obesity, even in normal-weight individuals. This study aims to compare the WHtR to the 2007 World Health Organization (WHO) reference for BMI in screening for the presence of cardio-metabolic and inflammatory risk factors in 6–10-year-old children. Methods: A cross-sectional study was undertaken with 175 subjects selected from the Reference Center for the Treatment of Children and Adolescents in Campos, Rio de Janeiro, Brazil. The subjects were classified according to the 2007 WHO standard as normal-weight (BMI z score > −1 and < 1) or overweight/obese (BMI z score ≥ 1). Systolic blood pressure (SBP), diastolic blood pressure (DBP), fasting glycemia, low-density lipoprotein (LDL), high-density lipoprotein (HDL), triglyceride (TG), Homeostatic Model Assessment – Insulin Resistance (HOMA-IR), leukocyte count and ultrasensitive C-reactive protein (CRP) were also analyzed. Results: There were significant correlations between WHtR and BMI z score (r = 0.88, p < 0.0001), SBP (r = 0.51, p < 0.0001), DBP (r = 0.49, p < 0.0001), LDL (r = 0.25, p < 0.0008), HDL (r = −0.28, p < 0.0002), TG (r = 0.26, p < 0.0006), HOMA-IR (r = 0.83, p < 0.0001) and CRP (r = 0.51, p < 0.0001). WHtR and BMI areas under the curve were similar for all the cardio-metabolic parameters. A WHtR cut-off value of > 0.47 was sensitive for screening insulin resistance and any one of the cardio-metabolic parameters. Conclusions: The WHtR was as sensitive as the 2007 WHO BMI in screening for metabolic risk factors in 6–10-year-old children. The public health message “keep your waist to less than half your height” can be effective in reducing cardio-metabolic risk because most of these risk factors are already present at a cut point of WHtR ≥ 0.5. However, as this is the first study to correlate the WHtR with inflammatory markers, we recommend further exploration of the use of WHtR in this age group and other population-based samples.
Abstract:
Background: A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results: In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions: This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them.
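The idea behind a Granger causality test can be sketched for a single pair of series: x is said to Granger-cause y if past values of x improve the prediction of y beyond what past values of y already give. The example below is a simplified illustration, not the authors' procedure, which works between and within sets of series. It assumes zero-mean series, a single lag and no intercept; all function names and the simulated data are hypothetical.

```python
import random


def rss_restricted(y):
    """Residual sum of squares of y[t] = a*y[t-1], fitted by least squares."""
    num = sum(y[t] * y[t - 1] for t in range(1, len(y)))
    den = sum(y[t - 1] ** 2 for t in range(1, len(y)))
    a = num / den
    return sum((y[t] - a * y[t - 1]) ** 2 for t in range(1, len(y)))


def rss_unrestricted(y, x):
    """RSS of y[t] = a*y[t-1] + b*x[t-1], via the 2x2 normal equations."""
    idx = range(1, len(y))
    syy = sum(y[t - 1] ** 2 for t in idx)
    sxx = sum(x[t - 1] ** 2 for t in idx)
    sxy = sum(x[t - 1] * y[t - 1] for t in idx)
    ry = sum(y[t] * y[t - 1] for t in idx)
    rx = sum(y[t] * x[t - 1] for t in idx)
    det = syy * sxx - sxy ** 2
    a = (ry * sxx - rx * sxy) / det
    b = (rx * syy - ry * sxy) / det
    return sum((y[t] - a * y[t - 1] - b * x[t - 1]) ** 2 for t in idx)


def granger_f(effect, cause):
    """F statistic for adding one lag of `cause` to an AR(1) model of `effect`."""
    m = len(effect) - 1                      # number of usable observations
    rss_r = rss_restricted(effect)
    rss_u = rss_unrestricted(effect, cause)
    return (rss_r - rss_u) / (rss_u / (m - 2))


def simulate(seed=0, n=200):
    """Simulated pair in which x drives y with a one-step delay."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [0.0]
    for t in range(1, n):
        y.append(0.5 * y[t - 1] + 0.8 * x[t - 1] + 0.1 * rng.gauss(0, 1))
    return x, y


x, y = simulate()
print("x -> y:", granger_f(y, x))
print("y -> x:", granger_f(x, y))
```

The statistic for x -> y should come out far larger than the one for y -> x, reflecting the direction built into the simulation. Such pairwise statistics could then serve as edge weights when grouping genes, which is one way to read the topological proximity mentioned in the abstract.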
Abstract:
The watermelon is traditionally cultivated horizontally on the ground. Small-fruit cultivars (1 to 3 kg), which reach better market prices, are also being grown in greenhouses, where the plants are trained upward on vertical supports, with branch pruning and fruit thinning. These practices make it possible to increase plant density, fruit quality and yield compared to the traditional growing system. The aim of this experiment was to evaluate the influence of three training heights (1.7, 2.2 and 2.7 m) and two planting densities (3.17 and 4.76 plants m⁻²) on the productive and qualitative characteristics of the mini watermelon "Smile" grown in a greenhouse. Pruning was done at 43, 55 and 66 days after transplanting (DAT), when the plant height reached 1.7, 2.2 and 2.7 m, respectively. The dry mass of branches, petioles and leaves and the total dry mass were affected by the training height, with the highest values obtained by the plants pruned at 2.2 and 2.7 m. Leaf area, specific leaf area and leaf area index were not affected by the height of the plants. The training height of 2.7 m raised the total yield; however, marketable yield, average fruit mass and all the quality characteristics did not differ significantly from those obtained at the training height of 2.2 m. Regarding plant density, the best option was 4.76 plants m⁻², because it increased marketable yield by 37.4% without reducing the average fruit weight.
Abstract:
The present work proposes a method based on CLV (Clustering around Latent Variables) for identifying groups of consumers in L-shape data. This kind of data structure is very common in consumer studies where a panel of consumers is asked to assess the global liking of a certain number of products and the preference scores are then arranged in a two-way table Y. External information on both products (physicochemical description or sensory attributes) and consumers (socio-demographic background, purchase behaviours or consumption habits) may be available in a row descriptor matrix X and in a column descriptor matrix Z, respectively. The aim of this method is to automatically provide a consumer segmentation where all three matrices play an active role in the classification, yielding groups that are homogeneous from all points of view: preference, products and consumer characteristics. The proposed clustering method is illustrated on data from preference studies on food products: juices based on berry fruits and traditional cheeses from Trentino. The hedonic ratings given by the consumer panel on the products under study were explained with respect to the product chemical compounds, sensory evaluation and consumer socio-demographic information, purchase behaviour and consumption habits.
Abstract:
This thesis tackles the problem of the automated detection of the atmospheric boundary layer (BL) height, h, from aerosol lidar/ceilometer observations. A new method, the Bayesian Selective Method (BSM), is presented. It implements a Bayesian statistical inference procedure which combines different sources of information in a statistically optimal way. First, atmospheric stratification boundaries are located from discontinuities in the ceilometer back-scattered signal. The BSM then identifies the discontinuity edge that has the highest probability of effectively marking the BL height. Information from contemporaneous physical boundary layer model simulations and a climatological dataset of BL height evolution are combined in the assimilation framework to assist this choice. The BSM algorithm has been tested on four months of continuous ceilometer measurements collected during the BASE:ALFA project and is shown to realistically diagnose the BL depth evolution in many different weather conditions. The BASE:ALFA dataset is then used to investigate the boundary layer structure in stable conditions. Functions from the Obukhov similarity theory are used as regression curves to fit observed velocity and temperature profiles in the lower half of the stable boundary layer. Surface fluxes of heat and momentum are the best-fitting parameters in this exercise and are compared with those measured by a sonic anemometer. The comparison shows remarkable discrepancies, more evident in cases for which the bulk Richardson number turns out to be quite large. This analysis supports earlier results that surface turbulent fluxes are not the appropriate scaling parameters for profiles of mean quantities in very stable conditions. One of the practical consequences is that boundary layer height diagnostic formulations which mainly rely on surface fluxes disagree with what is obtained by inspecting co-located radiosounding profiles.
Abstract:
Data mining aims at the automatic extraction of meaningful patterns from large amounts of data. One example of such patterns is meaningful groupings of the data, in which case we speak of clustering. Traditional clustering algorithms show severe limitations on high-dimensional datasets, that is, datasets composed of objects described by a large number of attributes. For datasets of this kind, a different analysis methodology is needed: subspace clustering. Subspace clustering consists of visiting the lattice of all possible subspaces in search of meaningful groups (clusters). A search of this kind is computationally very expensive. Several optimizations have been proposed to make subspace clustering algorithms more efficient. This thesis approaches the problem from a different angle: using parallelization to reduce the computational cost of a subspace clustering algorithm.
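A short sketch shows why visiting the subspace lattice is costly: a dataset with d attributes has 2^d − 1 non-empty subspaces. The enumeration below is only an illustration of that lattice, not the algorithm studied in the thesis; the function name and attribute labels are hypothetical.

```python
from itertools import combinations


def subspace_lattice(attributes):
    """Yield every non-empty subset of attributes, lowest dimension first."""
    for size in range(1, len(attributes) + 1):
        for subset in combinations(attributes, size):
            yield subset


# Four hypothetical attributes give 2**4 - 1 = 15 candidate subspaces.
attrs = ["a", "b", "c", "d"]
subspaces = list(subspace_lattice(attrs))
print(len(subspaces))
print(subspaces[:5])
```

Because the subspaces within one level of the lattice can be examined independently of each other, a level is a natural unit of work to distribute across workers, though whether the thesis partitions the search this way is not stated in the abstract.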