957 resultados para Multivariate analysis of variance
Resumo:
Methods from statistical physics, such as those involving complex networks, have been increasingly used in the quantitative analysis of linguistic phenomena. In this paper, we represented pieces of text with different levels of simplification in co-occurrence networks and found that topological regularity correlated negatively with textual complexity. Furthermore, in less complex texts the distance between concepts, represented as nodes, tended to decrease. The complex networks metrics were treated with multivariate pattern recognition techniques, which allowed us to distinguish between original texts and their simplified versions. For each original text, two simplified versions were generated manually with increasing number of simplification operations. As expected, distinction was easier for the strongly simplified versions, where the most relevant metrics were node strength, shortest paths and diversity. Also, the discrimination of complex texts was improved with higher hierarchical network metrics, thus pointing to the usefulness of considering wider contexts around the concepts. Though the accuracy rate in the distinction was not as high as in methods using deep linguistic knowledge, the complex network approach is still useful for a rapid screening of texts whenever assessing complexity is essential to guarantee accessibility to readers with limited reading ability. Copyright (c) EPLA, 2012
Resumo:
The objectives of the present study were to determine if variance components of calving intervals varied with age at calving and if considering calving intervals as a longitudinal trait would be a useful approach for fertility analysis of Zebu dairy herds. With these purposes, calving records from females born from 1940 to 2006 in a Guzerat dairy subpopulation in Brazil were analyzed. The fixed effects of contemporary groups, formed by year and farm at birth or at calving, and the regressions of age at calving, equivalent inbreeding coefficient and day of the year on the studied traits were considered in the statistical models. In one approach, calving intervals (Cl) were analyzed as a single trait, by fitting a statistical model on which both animal and permanent environment effects were adjusted for the effect of age at calving by random regression. In a second approach, a four-trait analysis was conducted, including age at first calving (AFC) and three different female categories for the calving intervals: first calving females; young females (less than 80 months old, but not first calving); or mature females (80 months old or more). Finally, a two-trait analysis was performed, also including AFC and Cl, but calving intervals were regarded as a single trait in a repeatability model. Additionally, the ranking of sires was compared among approaches. Calving intervals decreased with age until females were about 80 months old, remaining nearly constant after that age. A quasi-linear increase of 11.5 days on the calving intervals was observed for each 10% increase in the female's equivalent inbreeding coefficient. The heritability of AFC was 0.37. For Cl. the genetic-phenotypic variance ratios ranged from 0.064 to 0.141, depending on the approach and on ages at calving. Differences among genetic variance components for calving intervals were observed along the animal's lifetime. Those differences confirmed the longitudinal aspect of that trait, indicating the importance of such consideration when accessing fertility of Zebu dairy females, especially in situations where the available information relies on their calving intervals. Spearman rank correlations among approaches ranged from 0.90 to 0.95, and changes observed in the ranking of sires suggested that the genetic progress of the population could be affected by the approach chosen for the analysis of calving intervals. (C) 2012 Elsevier ay. All rights reserved.
Resumo:
Melipona scutellaris Latreille has great economic and ecological importance, especially because it is a pollinator of native plant species. Despite the importance of this species, there is little information about the conservation status of their populations. The objective of this study was to assess the diversity in populations of M. scutellaris coming from a Semideciduous Forest Fragment and an Atlantic Forest Fragment in the Northeast Brazil, through geometric morphometric analysis of wings in worker bees. In each area, worker bees were collected from 10 colonies, 10 workers per colony. To assess the diversity on the right wings of worker bees, 15 landmarks were plotted and the measures were used in analysis of variance and multivariate analysis, principal component analysis, discriminant analysis and clustering analysis. There were significant differences in the shape of the wing venation patterns between colonies of two sites (Wilk's lambda = 0.000006; p < 0.000001), which is probably due to the geographical distance between places of origin which impedes the gene flow between them. It indicates that inter and intrapopulation morphometric variability exists (p < 0.000001) in M. scutellaris coming from two different biomes, revealing the existence of diversity in these populations, which is necessary for the conservation of this bee species.
Resumo:
In the present study, the daily relative growth rates (DRGR, in percent per day) of the red macroalga Gracilaria domingensis in synthetic seawater was investigated for the combined influence of five factors, i.e., light (L), temperature (T), nitrate (N), phosphate (P), and molybdate (M), using a statistical design method. The ranges of the experimental cultivation conditions were T, 18-26A degrees C; L, 74-162 mu mol photons m(-2) s(-1); N, 40-80 mu mol L-1; P, 8-16 mu mol L-1; and M, 1-5 nmol L-1. The optimal conditions, which resulted in a maximum growth rate of a parts per thousand yen6.4% d(-1) from 7 to 10 days of cultivation, were determined by analysis of variance (ANOVA) multivariate factorial analysis (with a 2(5) full factorial design) to be L, 74 mu mol photons m(-2) s(-1); T, 26A degrees C; N, 80 mu mol L-1; P, 8 mu mol L-1; and M, 1 nmol L-1. In additional, these growth rate values are close to the growth rate values in natural medium (von Stosch medium), i.e., 6.5-7.0% d(-1). The results analyzed by the ANOVA indicate that the factors N and T are highly significant linear terms, X (L), (alpha = 0.05). On the other hand, the only significant quadratic term (X (Q)) was that for L. Statistically significant interactions between two different factors were found between T vs. L and N vs. T. Finally, a two-way (linear/quadratic interaction) model provided a quite reasonable correlation between the experimental and predicted DRGR values (R (adjusted) (2) = 0.9540).
Resumo:
Our objective was to assess extrinsic influences upon childbirth. In a cohort of 1,826 days containing 17,417 childbirths among them 13,252 spontaneous labor admissions, we studied the influence of environment upon the high incidence of labor (defined by 75th percentile or higher), analyzed by logistic regression. The predictors of high labor admission included increases in outdoor temperature (odds ratio: 1.742, P = 0.045, 95%CI: 1.011 to 3.001), and decreases in atmospheric pressure (odds ratio: 1.269, P = 0.029, 95%CI: 1.055 to 1.483). In contrast, increases in tidal range were associated with a lower probability of high admission (odds ratio: 0.762, P = 0.030, 95%CI: 0.515 to 0.999). Lunar phase was not a predictor of high labor admission (P = 0.339). Using multivariate analysis, increases in temperature and decreases in atmospheric pressure predicted high labor admission, and increases of tidal range, as a measurement of the lunar gravitational force, predicted a lower probability of high admission.
Resumo:
Introduction: Vitamin D is responsible for the regulation of certain genes at the transcription level, via interaction with the vitamin D receptor, and influences host immune responses and aspects of bone development, growth, and homeostasis. Our aim was to investigate the association of TaqI vitamin D receptor gene polymorphism with external apical root resorption during orthodontic treatment. Methods: Our subjects were 377 patients with Class II Division 1 malocclusion, divided into 3 groups: (1) 160 with external apical root resorption <= 1.43 mm, (2) 179 with external apical root resorption >1.43 mm), and (3) 38 untreated subjects. External apical root resorption of the maxillary incisors was evaluated on periapical radiographs taken before and after 6 months of treatment. After DNA collection and purification, vitamin D receptor TaqI polymorphism analysis was performed by polymerase chain reaction-restriction fragment length polymorphism. Univariate and multivariate analyses were performed to verify the association of clinical and genetic variables with external apical root resorption (P <0.05). Results: There was a higher proportion of external apical root resorption in orthodontically treated patients compared with the untreated subjects. In patients orthodontically treated, age higher than 14 years old, initial size of the maxillary incisor root superior to 30 mm, and premolar extraction were associated with increased external apical root resorption. Genotypes containing the C allele were weakly associated with protection against external apical root resorption (CC + CT x TT [odds ratio, 0.29; 95% confidence interval, 0.07-1.23; P = 0.091]) when treated orthodontic patients were compared to untreated individuals. Conclusions: Clinical factors and vitamin D receptor TaqI polymorphism were associated with external apical root resorption in orthodontic patients. (Am J Orthod Dentofacial Orthop 2012; 142: 339-47)
Resumo:
Introduction. Patients with terminal heart failure have increased more than the available organs leading to a high mortality rate on the waiting list. Use of Marginal and expanded criteria donors has increased due to the heart shortage. Objective. We analyzed all heart transplantations (HTx) in Sao Paulo state over 8 years for donor profile and recipient risk factors. Method. This multi-institutional review collected HTx data from all institutions in the state of Sao Paulo, Brazil. From 2002 to 2008 (6 years), only 512 (28.8%) of 1777 available heart donors were accepted for transplantation. All medical records were analyzed retrospectively; none of the used donors was excluded, even those considered to be nonstandard. Results. The hospital mortality rate was 27.9% (n = 143) and the average follow-up time was 29.4 +/- 28.4 months. The survival rate was 55.5% (n = 285) at 6 years after HTx. Univariate analysis showed the following factors to impact survival: age (P = .0004), arterial hypertension (P = .4620), norepinephrine (P = .0450), cardiac arrest (P = .8500), diabetes mellitus (P = .5120), infection (P = .1470), CKMB (creatine kinase MB) (P = .8694), creatinine (P = .7225), and Na+ (P = .3273). On multivariate analysis, only age showed significance; logistic regression showed a significant cut-off at 40 years: organs from donors older than 40 years showed a lower late survival rates (P = .0032). Conclusions. Donor age older than 40 years represents an important risk factor for survival after HTx. Neither donor gender nor norepinephrine use negatively affected early survival.
Resumo:
Oil content and grain yield in maize are negatively correlated, and so far the development of high-oil high-yielding hybrids has not been accomplished. Then a fully understand of the inheritance of the kernel oil content is necessary to implement a breeding program to improve both traits simultaneously. Conventional and molecular marker analyses of the design III were carried out from a reference population developed from two tropical inbred lines divergent for kernel oil content. The results showed that additive variance was quite larger than the dominance variance, and the heritability coefficient was very high. Sixteen QTL were mapped, they were not evenly distributed along the chromosomes, and accounted for 30.91% of the genetic variance. The average level of dominance computed from both conventional and QTL analysis was partial dominance. The overall results indicated that the additive effects were more important than the dominance effects, the latter were not unidirectional and then heterosis could not be exploited in crosses. Most of the favorable alleles of the QTL were in the high-oil parental inbred, which could be transferred to other inbreds via marker-assisted backcross selection. Our results coupled with reported information indicated that the development of high-oil hybrids with acceptable yields could be accomplished by using marker-assisted selection involving oil content, grain yield and its components. Finally, to exploit the xenia effect to increase even more the oil content, these hybrids should be used in the Top Cross((TM)) procedure.
Resumo:
Multivariate analyses of UV-Vis spectral data from cachaca wood extracts provide a simple and robust model to classify aged Brazilian cachacas according to the wood species used in the maturation barrels. The model is based on inspection of 93 extracts of oak and different Brazilian wood species by a non-aged cachaca used as an extraction solvent. Application of PCA (Principal Components Analysis) and HCA (Hierarchical Cluster Analysis) leads to identification of 6 clusters of cachaca wood extracts (amburana, amendoim, balsamo, castanheira, jatoba, and oak). LDA (Linear Discriminant Analysis) affords classification of 10 different wood species used in the cachaca extracts (amburana, amendoim, balsamo, cabreuva-parda, canela-sassafras, castanheira, jatoba, jequitiba-rosa, louro-canela, and oak) with an accuracy ranging from 80% (amendoim and castanheira) to 100% (balsamo and jequitiba-rosa). The methodology provides a low-cost alternative to methods based on liquid chromatography and mass spectrometry to classify cachacas aged in barrels that are composed of different wood species.
Resumo:
The objective of this paper is to model variations in test-day milk yields of first lactations of Holstein cows by RR using B-spline functions and Bayesian inference in order to fit adequate and parsimonious models for the estimation of genetic parameters. They used 152,145 test day milk yield records from 7317 first lactations of Holstein cows. The model established in this study was additive, permanent environmental and residual random effects. In addition, contemporary group and linear and quadratic effects of the age of cow at calving were included as fixed effects. Authors modeled the average lactation curve of the population with a fourth-order orthogonal Legendre polynomial. They concluded that a cubic B-spline with seven random regression coefficients for both the additive genetic and permanent environment effects was to be the best according to residual mean square and residual variance estimates. Moreover they urged a lower order model (quadratic B-spline with seven random regression coefficients for both random effects) could be adopted because it yielded practically the same genetic parameter estimates with parsimony. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Aims: To evaluate the associations of excision repair cross complementing-group 1 (ERCC1) (DNA repair protein) (G19007A) polymorphism, methylation and immunohistochemical expression with epidemiological and clinicopathological factors and with overall survival in head and neck squamous cell carcinoma (HNSCC) patients. Methods and results: The study group comprised 84 patients with HNSCC who underwent surgery and adjuvant radiotherapy without chemotherapy. Bivariate and multivariate analyses were used. The allele A genotype variant was observed in 79.8% of the samples, GG in 20.2%, GA in 28.6% and AA in 51.2%. Individuals aged more than 45 years had a higher prevalence of the allelic A variant and a high (83.3%) immunohistochemical expression of ERCC1 protein [odds ratio (OR) = 4.86, 95% confidence interval (CI): 1.2-19.7, P = 0.027], which was also high in patients with advanced stage (OR= 5.04, 95% CI: 1.07-23.7, P = 0.041). Methylated status was found in 51.2% of the samples, and was higher in patients who did not present distant metastasis (OR = 6.67, 95% CI: 1.40-33.33, P = 0.019) and in patients with advanced stage (OR = 5.04, 95% CI: 1.07-23.7, P = 0.041). At 2 and 5 years, overall survival was 55% and 36%, respectively (median = 30 months). Conclusion: Our findings may reflect a high rate of DNA repair due to frequent tissue injury during the lifetime of these individuals, and also more advanced disease presentation in this population with worse prognosis.
Resumo:
Abstract Background The generalized odds ratio (GOR) was recently suggested as a genetic model-free measure for association studies. However, its properties were not extensively investigated. We used Monte Carlo simulations to investigate type-I error rates, power and bias in both effect size and between-study variance estimates of meta-analyses using the GOR as a summary effect, and compared these results to those obtained by usual approaches of model specification. We further applied the GOR in a real meta-analysis of three genome-wide association studies in Alzheimer's disease. Findings For bi-allelic polymorphisms, the GOR performs virtually identical to a standard multiplicative model of analysis (e.g. per-allele odds ratio) for variants acting multiplicatively, but augments slightly the power to detect variants with a dominant mode of action, while reducing the probability to detect recessive variants. Although there were differences among the GOR and usual approaches in terms of bias and type-I error rates, both simulation- and real data-based results provided little indication that these differences will be substantial in practice for meta-analyses involving bi-allelic polymorphisms. However, the use of the GOR may be slightly more powerful for the synthesis of data from tri-allelic variants, particularly when susceptibility alleles are less common in the populations (≤10%). This gain in power may depend on knowledge of the direction of the effects. Conclusions For the synthesis of data from bi-allelic variants, the GOR may be regarded as a multiplicative-like model of analysis. The use of the GOR may be slightly more powerful in the tri-allelic case, particularly when susceptibility alleles are less common in the populations.
Resumo:
Analysts, politicians and international players from all over the world look at China as one of the most powerful countries on the international scenario, and as a country whose economic development can significantly impact on the economies of the rest of the world. However many aspects of this country have still to be investigated. First the still fundamental role played by Chinese rural areas for the general development of the country from a political, economic and social point of view. In particular, the way in which the rural areas have influenced the social stability of the whole country has been widely discussed due to their strict relationship with the urban areas where most people from the countryside emigrate searching for a job and a better life. In recent years many studies have mostly focused on the urbanization phenomenon with little interest in the living conditions in rural areas and in the deep changes which have occurred in some, mainly agricultural provinces. An analysis of the level of infrastructure is one of the main aspects which highlights the principal differences in terms of living conditions between rural and urban areas. In this thesis, I first carried out the analysis through the multivariate statistics approach (Principal Component Analysis and Cluster Analysis) in order to define the new map of rural areas based on the analysis of living conditions. In the second part I elaborated an index (Living Conditions Index) through the Fuzzy Expert/Inference System. Finally I compared this index (LCI) to the results obtained from the cluster analysis drawing geographic maps. The data source is the second national agricultural census of China carried out in 2006. In particular, I analysed the data refer to villages but aggregated at province level.
Resumo:
The Large Hadron Collider, located at the CERN laboratories in Geneva, is the largest particle accelerator in the world. One of the main research fields at LHC is the study of the Higgs boson, the latest particle discovered at the ATLAS and CMS experiments. Due to the small production cross section for the Higgs boson, only a substantial statistics can offer the chance to study this particle properties. In order to perform these searches it is desirable to avoid the contamination of the signal signature by the number and variety of the background processes produced in pp collisions at LHC. Much account assumes the study of multivariate methods which, compared to the standard cut-based analysis, can enhance the signal selection of a Higgs boson produced in association with a top quark pair through a dileptonic final state (ttH channel). The statistics collected up to 2012 is not sufficient to supply a significant number of ttH events; however, the methods applied in this thesis will provide a powerful tool for the increasing statistics that will be collected during the next LHC data taking.
Resumo:
Purpose: To report an angiographic investigation of midterm atherosclerotic disease progression in below-the-knee (BTK) arteries of claudicants. Methods: Angiograms were performed in 58 consecutive claudicants (35 men; mean age 68.3±8.7 years) with endovascular treatment of femoropopliteal arteries in 58 limbs after a mean follow-up of 3.6±1.2 years. Angiograms were reviewed in consensus by 2 experienced readers blinded to clinical data. Progression of atherosclerosis in 4 BTK arterial segments (tibioperoneal trunk, anterior and posterior tibial arteries, and peroneal artery) was assessed according to the Bollinger score. The composite per calf Bollinger score represented the average of the 4 BTK arterial segment scores. The association of the Bollinger score with cardiovascular risk factors and gender was scrutinized. Results: A statistically significant increase in atherosclerotic burden was observed for the mean composite per calf Bollinger score (5.7±8.3 increase, 95% CI 3.5 to 7.9, p<0.0001), as well as for each single arterial segment analyzed. In multivariate linear regression analysis, diabetes mellitus was associated with a more pronounced progression of atherosclerotic burden in crural arteries (β: 5.6, p=0.035, 95% CI 0.398 to 10.806). Conclusion: Progression of infrapopliteal atherosclerotic lesions is common in claudicants during midterm follow-up. Presence of diabetes mellitus was confirmed as a major risk factor for more pronounced atherosclerotic BTK disease progression.