12 results for Multivariate statistical methods
in the Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Abstract:
The topology of real-world complex networks, such as those in transportation and communication, is always changing with time. Such changes can arise not only as a natural consequence of their growth, but also due to major modifications in their intrinsic organization. For instance, the network of transportation routes between cities and towns (hence locations) of a given country undergoes a major change with the progressive implementation of commercial air transportation. While the locations could be originally interconnected through highways (paths, giving rise to geographical networks), transportation between those sites progressively shifted to or was complemented by air transportation, with scale-free characteristics. In the present work we introduce the path-star transformation (in its uniform and preferential versions) as a means to model such network transformations where paths give rise to stars of connectivity. It is also shown, through optimal multivariate statistical methods (i.e. canonical projections and maximum likelihood classification), that while the US highways network adheres closely to a geographical network model, its path-star transformation yields a network whose topological properties closely resemble those of the respective airport transportation network.
Abstract:
Deviations from the average can provide valuable insights about the organization of natural systems. The present article extends this important principle to the systematic identification and analysis of singular motifs in complex networks. Six measurements quantifying different and complementary features of the connectivity around each node of a network were calculated, and multivariate statistical methods were applied to identify singular nodes. The potential of the presented concepts and methodology was illustrated with respect to different types of complex real-world networks, namely the US air transportation network, the protein-protein interactions of the yeast Saccharomyces cerevisiae and the Roget thesaurus networks. The obtained singular motifs possessed unique functional roles in the networks. Three classic theoretical network models were also investigated, with the Barabási-Albert model resulting in singular motifs corresponding to hubs, confirming the potential of the approach. Interestingly, the number of different types of singular node motifs as well as the number of their instances were found to be considerably higher in the real-world networks than in any of the benchmark networks. Copyright (C) EPLA, 2009
Abstract:
The Natural History of Human Papillomavirus (HPV) Infection in Men: The HIM Study is a prospective multi-center cohort study that, among other factors, analyzes participants' diet. A parallel cross-sectional study was designed to evaluate the validity and reproducibility of the quantitative food frequency questionnaire (QFFQ) used in the Brazilian center of the HIM Study. For this, a convenience subsample of 98 men aged 18 to 70 years from the HIM Study in Brazil answered three 54-item QFFQs and three 24-hour recall interviews, with 6-month intervals between them (data collection January to September 2007). A Bland-Altman analysis indicated that the difference between instruments was dependent on the magnitude of the intake for energy and most nutrients included in the validity analysis, with the exception of carbohydrates, fiber, polyunsaturated fat, vitamin C, and vitamin E. The correlation between the QFFQ and the 24-hour recall for the deattenuated and energy-adjusted data ranged from 0.05 (total fat) to 0.57 (calcium). For the energy and nutrient consumption included in the validity analysis, 33.5% of participants on average were correctly classified into quartiles, and the average value of 0.26 for weighted kappa shows a reasonable agreement. The intraclass correlation coefficients for all nutrients were greater than 0.40 in the reproducibility analysis. The QFFQ demonstrated good reproducibility and acceptable validity. The results support the use of this instrument in the HIM Study. J Am Diet Assoc. 2011;111:1045-1051.
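Note: the Bland-Altman analysis mentioned in the abstract above can be sketched roughly as follows. This is a minimal illustration and not the study's actual procedure: the function name `bland_altman`, the 1.96 multiplier for the limits of agreement, and the use of a least-squares slope of difference against mean to check dependence on intake magnitude are assumptions, and the intake values are made up.

```python
from statistics import mean, stdev


def bland_altman(a, b):
    """Return (bias, limits_of_agreement, slope) for paired measurements.

    bias   : mean of the pairwise differences a - b
    limits : bias +/- 1.96 sample standard deviations of the differences
    slope  : least-squares slope of difference vs. pairwise mean; a slope
             far from zero suggests the disagreement depends on magnitude
    """
    if len(a) != len(b) or len(a) < 3:
        raise ValueError("need two equal-length samples with at least 3 pairs")
    diffs = [x - y for x, y in zip(a, b)]
    means = [(x + y) / 2 for x, y in zip(a, b)]
    bias = mean(diffs)
    sd = stdev(diffs)
    limits = (bias - 1.96 * sd, bias + 1.96 * sd)
    m_bar = mean(means)
    sxx = sum((m - m_bar) ** 2 for m in means)
    sxy = sum((m - m_bar) * (d - bias) for m, d in zip(means, diffs))
    # The slope is undefined when every pairwise mean is identical.
    slope = sxy / sxx if sxx else float("nan")
    return bias, limits, slope


# Hypothetical energy intakes (kcal/day) from two instruments; not study data.
qffq = [1800, 2100, 2500, 3000, 3600]
recall = [1750, 2000, 2300, 2700, 3100]
bias, limits, slope = bland_altman(qffq, recall)
print(f"bias={bias:.1f}, limits=({limits[0]:.1f}, {limits[1]:.1f}), slope={slope:.3f}")
```

In this made-up sample the difference grows with the mean intake, which is the kind of magnitude dependence the abstract reports for energy and most nutrients.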
Abstract:
In order to improve our understanding of climate change, the aim of this research project was to study the climatology and the time trends of drizzle and fog events in the São Paulo Metropolitan Area, and the possible connections of this variability with the sea surface temperature (SST) of the Atlantic and Pacific Oceans. The climatology of both phenomena presents differences and similarities. Fog shows a marked maximum frequency in winter and a minimum frequency in summer, while the seasonal differences in drizzle occurrence are less pronounced: there is a maximum in spring, whereas the other seasons present smaller and similar numbers of events. Both phenomena present a negative trend from 1933 to 2005, which is stronger for fog events. A multivariate statistical analysis indicates that the South Atlantic SST could increase warm temperature advection to the continent. This could be one of the factors responsible for the negative tendency in the number of both fog and drizzle events.
Abstract:
The correlation between the breaks in the metallicity distribution and the corotation radius of spiral galaxies has already been advocated in the past and is predicted by a chemodynamical model of our Galaxy that effectively introduces the role of spiral arms in the star formation rate. In this work, we present photometric and spectroscopic observations made with the Gemini Telescope for three of the best candidates of spiral galaxies to have the corotation inside the optical disc: IC 0167, NGC 1042 and NGC 6907. We observed the most intense and well-distributed H II regions of these galaxies, deriving reliable galactocentric distances and oxygen abundances by applying different statistical methods. From these results, we confirm the presence of variations in the gradients of metallicity of these galaxies that are possibly correlated with the corotation resonance.
Abstract:
This paper deals with the morphological features of the tracheary elements of the vegetative organs in four Portulaca species (Portulaca hirsutissima Camb., P. halimoides L., P. wedermannii Poelln. and P. mucronata Link.) occurring in Southeast and Northeast Brazil. The vessel elements are small (< 25 µm) and have a simple perforation plate. The pattern of wall thickening varied from bordered pitting (in roots) to scalariform and helicoidal (stem and leaves). Statistical methods show variation in vessel-element diameter in different vegetative organs; wider elements were observed in roots. Tracheids occurring in leaves of P. hirsutissima and P. wedermannii have morphological features that are similar to terminal tracheids or tracheoid idioblasts frequently associated with xerophytes. The paedomorphic features (juvenilism) observed here may be related, in part, to aspects of water transport and storage as described in Cactaceae.
Abstract:
We investigated the evolution of anuran locomotor performance and its morphological correlates as a function of habitat use and lifestyles. We reanalysed a subset of the data reported by Zug (Smithson. Contrib. Zool. 1978; 276: 1-31) employing phylogenetically explicit statistical methods (n = 56 species), and assembled morphological data on the ratio between hind-limb length and snout-vent length (SVL) from the literature and museum specimens for a large subgroup of the species from the original paper (n = 43 species). Analyses using independent contrasts revealed that classifying anurans into terrestrial, semi-aquatic, and arboreal categories cannot distinguish between the effects of phylogeny and ecological diversification in anuran locomotor performance. However, a more refined classification subdividing terrestrial species into 'fossorials' and 'non-fossorials', and arboreal species into 'open canopy', 'low canopy' and 'high canopy', suggests that part of the variation in locomotor performance and in hind-limb morphology can be attributed to ecological diversification. In particular, fossorial species had significantly lower jumping performances and shorter hind limbs than other species after controlling for SVL, illustrating how the trade-off between burrowing efficiency and jumping performance has resulted in morphological specialization in this group.
Abstract:
Due to idiosyncrasies in their syntax, semantics or frequency, Multiword Expressions (MWEs) have received special attention from the NLP community, as the methods and techniques developed for the treatment of simplex words are not necessarily suitable for them. This is certainly the case for the automatic acquisition of MWEs from corpora. A lot of effort has been directed to the task of automatically identifying them, with considerable success. In this paper, we propose an approach for the identification of MWEs in a multilingual context, as a by-product of a word alignment process, that not only deals with the identification of possible MWE candidates, but also associates some multiword expressions with semantics. The results obtained indicate the feasibility and low costs in terms of tools and resources demanded by this approach, which could, for example, facilitate and speed up lexicographic work.
A bivariate regression model for matched paired survival data: local influence and residual analysis
Abstract:
The use of bivariate distributions plays a fundamental role in survival and reliability studies. In this paper, we consider a location scale model for bivariate survival times based on the proposal of a copula to model the dependence of bivariate survival data. For the proposed model, we consider inferential procedures based on maximum likelihood. Gains in efficiency from bivariate models are also examined in the censored data setting. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and compared to the performance of the bivariate regression model for matched paired survival data. Sensitivity analysis methods such as local and total influence are presented and derived under three perturbation schemes. The martingale marginal and the deviance marginal residual measures are used to check the adequacy of the model. Furthermore, we propose a new measure which we call modified deviance component residual. The methodology in the paper is illustrated on a lifetime data set for kidney patients.
Abstract:
A 172 cm-long sediment core was collected from a small pristine lake situated within a centripetal drainage basin in a tropical karst environment (Ribeira River valley, southeastern Brazil) in order to investigate the paleoenvironmental record provided by the lacustrine geochemistry. Sediments derived from erosion of the surrounding cambisols contain quartz, kaolinite, mica, chlorite and goethite. Accelerator mass spectrometry (AMS) ¹⁴C dating provided the geochronological framework. Three major sedimentary units were identified based on the structure and color of the sediments: Unit III from 170 to 140 cm (1030 ± 60 to 730 ± 60 yr BP), Unit II from 140 to 90 cm (730 ± 60 to 360 ± 60 yr BP) and Unit I from 90 to 0 cm (360 ± 60 to 0 yr BP). Results of major and trace element concentrations were analysed through multivariate statistical techniques. Factor analysis provided three factors accounting for 72.4% of the total variance. F1 and F2 have high positive loadings from K, Ba, Cs, Rb, Sr, Sc, Th, light rare earth element (LREE), Fe, Cr, Ti, Zr, Hf and Ta, and high negative loadings from Mg, Co, Cu, Zn, Br and loss on ignition (LOI). F3, with positive loadings from V and non-metals As and Sb, accounts for a low percentage (9.7%) of the total variance, being therefore of little interpretative use. The profile distribution of F1 scores reveals negative values in Units I and III, and positive values in Unit II, meaning that K, Ba, Cs, Rb, Sr, Sc, Th, LREE, Fe, Cr, Ti, Zr, Hf and Ta are relatively more concentrated in Unit II, and Mg, Co, Cu, Zn and Br are relatively more abundant in Units I and III. The observed fluctuations in the geochemical composition of the sediments are consistent with slight variations of the erosion intensity in the catchment area as a possible response to variations of climatic conditions during the last millennium. (c) 2009 Elsevier GmbH. All rights reserved.
Abstract:
The aim of this study was to evaluate the presence of nutrients and toxic elements in coffees cultivated during the process of conversion to organic agriculture in southwest Bahia, Brazil. Levels of the nutrients and toxic elements were determined in samples of soils and coffee tissues from two transitional organic farms by flame atomic absorption spectrometry (FAAS). The metals in soil samples were extracted by the Mehlich-1 and USEPA-3050 procedures. Coffee samples from both farms presented relatively high levels of Cd, Zn and Cu (0.75, 45.4 and 14.9 µg g⁻¹, respectively), but were still below the limits specified by the Brazilian Food Legislation. The application of statistical methods showed that this finding can be attributed to the addition of high amounts of organic matter during the flowering period, which can act on the bioavailability of metal ions in soils. (C) 2009 Elsevier Ltd. All rights reserved.
Abstract:
An abnormality in neurodevelopment is one of the most robust etiologic hypotheses in schizophrenia (SZ). There is also strong evidence that genetic factors may influence abnormal neurodevelopment in the disease. The present study evaluated, in SZ patients whose brain structural data had been obtained with magnetic resonance imaging (MRI), the possible association between structural brain measures and 32 DNA polymorphisms located in 30 genes related to neurogenesis and brain development. DNA was extracted from peripheral blood cells of 25 patients with schizophrenia, genotyping was performed using diverse procedures, and putative associations were evaluated by standard statistical methods (using the software Statistical Package for Social Sciences - SPSS) with a modified Bonferroni adjustment. For reelin (RELN), a protease that guides neurons in the developing brain and underlies neurotransmission and synaptic plasticity in adults, an association was found for a non-synonymous polymorphism (Val997Leu) with left and right ventricular enlargement. A putative association was also found between protocadherin 12 (PCDH12), a cell adhesion molecule involved in axonal guidance and synaptic specificity, and cortical folding (asymmetry coefficient of gyrification index). Although our results are preliminary, due to the small number of individuals analyzed, such an approach could reveal new candidate genes implicated in anomalous neurodevelopment in schizophrenia. (c) 2007 Elsevier Ireland Ltd. All rights reserved.