17 resultados para NIRS. Plum. Multivariate calibration. Variables selection
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
A set of predictor variables is said to be intrinsically multivariate predictive (IMP) for a target variable if all properly contained subsets of the predictor set are poor predictors of the. target but the full set predicts the target with great accuracy. In a previous article, the main properties of IMP Boolean variables have been analytically described, including the introduction of the IMP score, a metric based on the coefficient of determination (CoD) as a measure of predictiveness with respect to the target variable. It was shown that the IMP score depends on four main properties: logic of connection, predictive power, covariance between predictors and marginal predictor probabilities (biases). This paper extends that work to a broader context, in an attempt to characterize properties of discrete Bayesian networks that contribute to the presence of variables (network nodes) with high IMP scores. We have found that there is a relationship between the IMP score of a node and its territory size, i.e., its position along a pathway with one source: nodes far from the source display larger IMP scores than those closer to the source, and longer pathways display larger maximum IMP scores. This appears to be a consequence of the fact that nodes with small territory have larger probability of having highly covariate predictors, which leads to smaller IMP scores. In addition, a larger number of XOR and NXOR predictive logic relationships has positive influence over the maximum IMP score found in the pathway. This work presents analytical results based on a simple structure network and an analysis involving random networks constructed by computational simulations. Finally, results from a real Bayesian network application are provided. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
Current methods for quality control of sugar cane are performed in extracted juice using several methodologies, often requiring appreciable time and chemicals (eventually toxic), making the methods not green and expensive. The present study proposes the use of X-ray spectrometry together with chemometric methods as an innovative and alternative technique for determining sugar cane quality parameters, specifically sucrose concentration, POL, and fiber content. Measurements in stem, leaf, and juice were performed, and those applied directly in stem provided the best results. Prediction models for sugar cane stem determinations with a single 60 s irradiation using portable X-ray fluorescence equipment allows estimating the % sucrose, % fiber, and POL simultaneously. Average relative deviations in the prediction step of around 8% are acceptable if considering that field measurements were done. These results may indicate the best period to cut a particular crop as well as for evaluating the quality of sugar cane for the sugar and alcohol industries.
Resumo:
Petroleum contamination impact on macrobenthic communities in the northeast portion of Todos os Santos Bay was assessed combining in multivariate analyses, chemical parameters such as aliphatic and polycyclic aromatic hydrocarbon indices and concentration ratios with benthic ecological parameters. Sediment samples were taken in August 2000 with a 0.05 m(2) van Veen grab at 28 sampling locations. The predominance of n-alkanes with more than 24 carbons, together with CPI values close to one, and the fact that most of the stations showed UCM/resolved aliphatic hydrocarbons ratios (UCM:R) higher than two, indicated a high degree of anthropogenic contribution, the presence of terrestrial plant detritus, petroleum products and evidence of chronic oil pollution. The indices used to determine the origin of PAH indicated the occurrence of a petrogenic contribution. A pyrolytic contribution constituted mainly by fossil fuel combustion derived PAH was also observed. The results of the stepwise multiple regression analysis performed with chemical data and benthic ecological descriptors demonstrated that not only total PAH concentrations but also specific concentration ratios or indices such as >= C24:< C24, An/178 and Fl/Fl + Py, are determining the structure of benthic communities within the study area. According to the BIO-ENV results petroleum related variables seemed to have a main influence on macrofauna community structure. The PCA ordination performed with the chemical data resulted in the formation of three groups of stations. The decrease in macrofauna density, number of species and diversity from groups III to I seemed to be related to the occurrence of high aliphatic hydrocarbon and PAH concentrations associated with fine sediments. Our results showed that macrobenthic communities in the northeast portion of Todos os Santos Bay are subjected to the impact of chronic oil pollution as was reflected by the reduction in the number of species and diversity. These results emphasise the importance to combine in multivariate approaches not only total hydrocarbon concentrations but also indices, isomer pair ratios and specific compound concentrations with biological data to improve the assessment of anthropogenic impact on marine ecosystems. (c) 2008 Elsevier Ltd. All rights reserved.
Resumo:
Data visualization techniques are powerful in the handling and analysis of multivariate systems. One such technique known as parallel coordinates was used to support the diagnosis of an event, detected by a neural network-based monitoring system, in a boiler at a Brazilian Kraft pulp mill. Its attractiveness is the possibility of the visualization of several variables simultaneously. The diagnostic procedure was carried out step-by-step going through exploratory, explanatory, confirmatory, and communicative goals. This tool allowed the visualization of the boiler dynamics in an easier way, compared to commonly used univariate trend plots. In addition it facilitated analysis of other aspects, namely relationships among process variables, distinct modes of operation and discrepant data. The whole analysis revealed firstly that the period involving the detected event was associated with a transition between two distinct normal modes of operation, and secondly the presence of unusual changes in process variables at this time.
Resumo:
Concentrations of 39 organic compounds were determined in three fractions (head, heart and tail) obtained from the pot still distillation of fermented sugarcane juice. The results were evaluated using analysis of variance (ANOVA), Tukey's test, principal component analysis (PCA), hierarchical cluster analysis (HCA) and linear discriminant analysis (LDA). According to PCA and HCA, the experimental data lead to the formation of three clusters. The head fractions give rise to a more defined group. The heart and tail fractions showed some overlap consistent with its acid composition. The predictive ability of calibration and validation of the model generated by LDA for the three fractions classification were 90.5 and 100%, respectively. This model recognized as the heart twelve of the thirteen commercial cachacas (92.3%) with good sensory characteristics, thus showing potential for guiding the process of cuts.
Resumo:
The objective of this study was to compare the BLUP selection method with different selection strategies in F-2:4 and assess the efficiency of this method on the early choice of the best common bean (Phaseolus vulgaris) lines. Fifty-one F-2:4 progenies were produced from a cross between the CVIII8511 x RP-26 lines. A randomized block design was used with 20 replications and one-plant field plots. Character data on plant architecture and grain yield were obtained and then the sum of the standardized variables was estimated for simultaneous selection of both traits. Analysis was carried out by mixed models (BLUP) and the least squares method to compare different selection strategies, like mass selection, stratified mass selection and between and within progeny selection. The progenies selected by BLUP were assessed in advanced generations, always selecting the greatest and smallest sum of the standardized variables. Analyses by the least squares method and BLUP procedure ranked the progenies in the same way. The coincidence of the individuals identified by BLUP and between and within progeny selection was high and of the greatest magnitude when BLUP was compared with mass selection. Although BLUP is the best estimator of genotypic value, its efficiency in the response to long term selection is not different from any of the other methods, because it is also unable to predict the future effect of the progenies x environments interaction. It was inferred that selection success will always depend on the most accurate possible progeny assessment and using alternatives to reduce the progenies x environments interaction effect.
Resumo:
Hematopoietic cell transplantation (HCT) is an emerging therapy for patients with severe autoimmune diseases (AID). We report data on 368 patients with AID who underwent HCT in 64 North and South American transplantation centers reported to the Center for International Blood and Marrow Transplant Research between 1996 and 2009. Most of the HCTs involved autologous grafts (n = 339); allogeneic HCT (n = 29) was done mostly in children. The most common indications for HCT were multiple sclerosis, systemic sclerosis, and systemic lupus erythematosus. The median age at transplantation was 38 years for autologous HCT and 25 years for allogeneic HCT. The corresponding times from diagnosis to HCT were 35 months and 24 months. Three-year overall survival after autologous HCT was 86% (95% confidence interval [CI], 81%-91%). Median follow-up of survivors was 31 months (range, 1-144 months). The most common causes of death were AID progression, infections, and organ failure. On multivariate analysis, the risk of death was higher in patients at centers that performed fewer than 5 autologous HCTs (relative risk, 3.5; 95% CI, 1.1-11.1; P = .03) and those that performed 5 to 15 autologous HCTs for AID during the study period (relative risk, 4.2; 95% CI, 1.5-11.7; P = .006) compared with patients at centers that performed more than 15 autologous HCTs for AID during the study period. AID is an emerging indication for HCT in the region. Collaboration of hematologists and other disease specialists with an outcomes database is important to promote optimal patient selection, analysis of the impact of prognostic variables and long-term outcomes, and development of clinical trials. Biol Blood Marrow Transplant 18: 1471-1478 (2012) (C) 2012 Published by Elsevier Inc. on behalf of American Society for Blood and Marrow Transplantation
Resumo:
Managers know more about the performance of the organization than investors, which makes the disclosure of information a possible strategy for competitive differentiation, minimizing adverse selection. This paper's main goal is to analyze whether or not an entity's level of diclosure may affect the risk perception of individuals and the process of evaluating their shares. The survey was carried out in an experimental study with 456 subjects. In a stock market simulation, we investigated the pricing of the stocks of two companies with different levels of information disclosure at four separate stages. The results showed that, when other variables are constant, the level of disclosure of an entity can affect the expectations of individuals and the process of evaluating their shares. A higher level of disclosure by an entity affected the value of its share and the other company's.
Resumo:
Background: Adjuvant chemoradiotherapy is part of a multimodality treatment approach in order to improve survival outcomes after surgery for gastric cancer. The aims of this study are to describe the results of gastrectomy and adjuvant chemoradiotherapy in patients treated in a single institution, and to identify prognostic factors that could determine which individuals would benefit from this treatment. Methods: This retrospective study included patients with pathologically confirmed gastric adenocarcinoma who underwent surgical treatment with curative intent in a single cancer center in Brazil, between 1998 and 2008. Among 327 patients treated in this period, 142 were selected. Exclusion criteria were distant metastatic disease (M1), T1N0 tumors, different multimodality treatments and tumors of the gastric stump. Another 10 individuals were lost to follow-up and there were 3 postoperative deaths. The role of several clinical and pathological variables as prognostic factors was determined. Results: D2-lymphadenectomy was performed in 90.8% of the patients, who had 5-year overall and disease-free survival of 58.9% and 55.7%. The interaction of N-category and N-ratio, extended resection and perineural invasion were independent prognostic factors for overall and disease-free survival. Adjuvant chemoradiotherapy was not associated with a significant improvement in survival. Patients with node-positive disease had improved survival with adjuvant chemoradiotherapy, especially when we grouped patients with N1 and N2 tumors and a higher N-ratio. These individuals had worse disease-free (30.3% vs. 48.9%) and overall survival (30.9% vs. 71.4%). Conclusion: N-category and N-ratio interaction, perineural invasion and extended resections were prognostic factors for survival in gastric cancer patients treated with D2-lymphadenectomy, but adjuvant chemoradiotherapy was not. There may be some benefit with this treatment in patients with node-positive disease and higher N-ratio.
Resumo:
This study performed an exploratory analysis of the anthropometrical and morphological muscle variables related to the one-repetition maximum (1RM) performance. In addition, the capacity of these variables to predict the force production was analyzed. 50 active males were submitted to the experimental procedures: vastus lateralis muscle biopsy, quadriceps magnetic resonance imaging, body mass assessment and 1RM test in the leg-press exercise. K-means cluster analysis was performed after obtaining the body mass, sum of the left and right quadriceps muscle cross-sectional area (Sigma CSA), percentage of the type II fibers and the 1RM performance. The number of clusters was defined a priori and then were labeled as high strength performance (HSP1RM) group and low strength performance (LSP1RM) group. Stepwise multiple regressions were performed by means of body mass, Sigma CSA, percentage of the type II fibers and clusters as predictors' variables and 1RM performance as response variable. The clusters mean +/- SD were: 292.8 +/- 52.1 kg, 84.7 +/- 17.9 kg, 19249.7 +/- 1645.5 mm(2) and 50.8 +/- 7.2% for the HSP1RM and 254.0 +/- 51.1 kg, 69.2 +/- 8.1 kg, 15483.1 +/- 1 104.8 mm(2) and 51.7 +/- 6.2 %, for the LSP1RM in the 1RM, body mass, Sigma CSA and muscle fiber type II percentage, respectively. The most important variable in the clusters division was the Sigma CSA. In addition, the Sigma CSA and muscle fiber type II percentage explained the variance in the 1RM performance (Adj R-2 = 0.35, p = 0.0001) for all participants and for the LSP1RM (Adj R-2 = 0.25, p = 0.002). For the HSP1RM, only the Sigma CSA was entered in the model and showed the highest capacity to explain the variance in the 1RM performance (Adj R-2 = 0.38, p = 0.01). As a conclusion, the muscle CSA was the most relevant variable to predict force production in individuals with no strength training background.
Resumo:
We investigated dietary intake patterns (DIP) in adolescents (14-18 year-olds) and the association with demographic and socioeconomic characteristics and lifestyle variables. This school-based survey was carried out among high school students from the city of Maringa in the state of Parana (PR), Brazil (2007). The sample included 991 students (54.5% girls) from high schools. DIPs were investigated by the frequency of weekly consumption of each food group: vegetables, fruit, rice, beans, fried food, sweet food, milk, soda, meat, eggs, alcoholic drinks. Independent variables were: demographic and socioeconomic characteristics and lifestyle variables. DIPS were identified using principal component analysis with orthogonal rotation (varimax). Three components were extracted. Component 1 (fried foods, sweets and soft drinks) was positively associated with not having breakfast for girls and dinner for boys. Moreover, component 2 (consumption of fruit and vegetables) was positively associated with having breakfast at home for boys and number of meals for girls. Component 3 (beans, eggs and meat) was positively associated with having lunch, employment and sedentary behavior level for girls. However, it was negatively associated with having lunch and dinner for boys. Adolescents who have healthier eating patterns also had other healthier behaviors regardless of gender. However, factors associated with dietary patterns differ between boys and girls. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Context. The ESO public survey VISTA variables in the Via Lactea (VVV) started in 2010. VVV targets 562 sq. deg in the Galactic bulge and an adjacent plane region and is expected to run for about five years. Aims. We describe the progress of the survey observations in the first observing season, the observing strategy, and quality of the data obtained. Methods. The observations are carried out on the 4-m VISTA telescope in the ZYJHK(s) filters. In addition to the multi-band imaging the variability monitoring campaign in the K-s filter has started. Data reduction is carried out using the pipeline at the Cambridge Astronomical Survey Unit. The photometric and astrometric calibration is performed via the numerous 2MASS sources observed in each pointing. Results. The first data release contains the aperture photometry and astrometric catalogues for 348 individual pointings in the ZYJHK(s) filters taken in the 2010 observing season. The typical image quality is similar to 0 ''.9-1 ''.0. The stringent photometric and image quality requirements of the survey are satisfied in 100% of the JHK(s) images in the disk area and 90% of the JHK(s) images in the bulge area. The completeness in the Z and Y images is 84% in the disk, and 40% in the bulge. The first season catalogues contain 1.28 x 10(8) stellar sources in the bulge and 1.68 x 10(8) in the disk area detected in at least one of the photometric bands. The combined, multi-band catalogues contain more than 1.63 x 10(8) stellar sources. About 10% of these are double detections because of overlapping adjacent pointings. These overlapping multiple detections are used to characterise the quality of the data. The images in the JHK(s) bands extend typically similar to 4 mag deeper than 2MASS. The magnitude limit and photometric quality depend strongly on crowding in the inner Galactic regions. The astrometry for K-s = 15-18 mag has rms similar to 35-175 mas. Conclusions. The VVV Survey data products offer a unique dataset to map the stellar populations in the Galactic bulge and the adjacent plane and provide an exciting new tool for the study of the structure, content, and star-formation history of our Galaxy, as well as for investigations of the newly discovered star clusters, star-forming regions in the disk, high proper motion stars, asteroids, planetary nebulae, and other interesting objects.
Resumo:
Objectives: to identify factors associated with maternal intrapartum transfer from a freestanding birth centre to hospital. Design: case-control study with retrospective data collection. Participants and settings: cases included all 111 women transferred from a freestanding birth centre in Sao Paulo to the referral hospital, from March 2002 to December 2009. The controls were 456 women who gave birth in the birth centre during the same period who were not transferred, randomly selected with four controls for each case. Methods: data were obtained from maternal records. Factors associated with maternal intrapartum transfers were initially analysed using a chi(2) test of association. Variables with p < 0.20 were then included in multivariate analyses. A multiple logistic regression model was built using stepwise forward selection; variables which reached statistical significance at p < 0.05 were considered to be independently associated with maternal transfer. Findings: during the study data collection period, 111(4%) of 2,736 women admitted to the centre were transferred intrapartum. Variables identified as independently associated factors for intrapartum transfer included nulliparity (OR 5.1, 95% CI 2.7-9.8), maternal age >= 35 years (OR 5.4, 95% CI 2.1-13.4), not having a partner (OR 2.8, 95% CI 1.5-5.3), cervical dilation <= 3 cm on admission to the birth centre (OR 1.9, 95% CI 1.1-3.2) and between 5 and 12 antenatal appointments at the birth centre (OR 3.8, 95% CI 1.9-7.5). In contrast, a low correlation between fundal height and pregnancy gestation (OR 0.3, 95% CI 0.2-0.6) appeared to be protective against transfer. Conclusions and implications for practice: identifying factors associated with maternal intrapartum transfer could support decision making by women considering options for place of birth, and support the content of appropriate information about criteria for admission to a birth centre. Findings add to the evidence base to support identification of women in early labour who may experience later complications and could support timely implementation of appropriate interventions associated with reducing transfer rates. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Quality of fresh-cut carambola (Averrhoa carambola L) is related to many chemical and biochemical variables especially those involved with softening and browning, both influenced by storage temperature. To study these effects, a multivariate analysis was used to evaluate slices packaged in vacuum-sealed polyolefin bags, and stored at 2.5 degrees C, 5 degrees C and 10 degrees C, for up to 16 d. The quality of slices at each temperature was correlated with the duration of storage, O(2) and CO(2) concentration in the package, physical chemical constituents, and activity of enzymes involved in softening (PG) and browning (PPO) metabolism. Three quality groups were identified by hierarchical cluster analysis, and the classification of the components within each of these groups was obtained from a principal component analysis (PCA). The characterization of samples by PCA clearly distinguished acceptable and non-acceptable slices. According to PCA, acceptable slices presented higher ascorbic acid content, greater hue angles ((o)h) and final lightness (L-5) in the first principal component (PC1). On the other hand, non-acceptable slices presented higher total pectin content. PPO activity in the PC1. Non-acceptable slices also presented higher soluble pectin content, increased pectin solubilisation and higher CO(2) concentration in the second principal component (PC2) whereas acceptable slices showed lower total sugar content. The hierarchical cluster and PCA analyses were useful for discriminating the quality of slices stored at different temperatures. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Although a large amount of data have been published in past years on the taxonomic status of the Anastrepha fraterculus (Wiedemann) species complex, there is still a need to know how many species this complex comprises, the distribution of each one, and their distinguishing features. In this study, we assessed the morphometric variability of 32 populations from the A. fraterculus complex, located in major biogeographical areas from the Neotropics. Multivariate techniques for analysis were applied to the measurements of 21 variables referring to the mesonotum, aculeus, and wing. For the first time, our results identified the presence of seven distinct morphotypes within this species complex. According to the biogeographical areas, populations occurring in the Mesoamerican dominion (Mexico, Guatemala, and Panama) were clustered within a single natural entity labeled as the "Mexican" morphotype; whereas in the northwestern South American dominion, samples fell into three distinct groups: the "Venezuelan" morphotype with a single population from the Caribbean lowlands of Venezuela, the "Andean" morphotype from the highlands of Venezuela and Colombia, and the third group or "Peruvian" morphotype comprised the samples from the Pacific coastal lowlands of Ecuador and Peru. Three additional groups were identified from the Chacoan and Paranaense sub-regions: the morphotype "Brazilian-1" was recognized as including the Argentinean samples with most pertaining to Brazil, and widely distributed in these biogeographical areas; the morphotype "Brazilian-2" was recognized as including two samples from the state of Sao Paulo (Ilha-Bela and Sao Sebastiao); whereas the morphotype "Brazilian-3" included a single population from Botucatu (state of Sao Paulo). Based on data published by previous authors showing genetic and karyotypic differentiation, as well as reproductive isolation, we have concluded that such morphotypes indeed represent natural groups and distinct taxonomic entities.