957 resultados para Multivariate statistical methods


Relevância:

80.00% 80.00%

Publicador:

Resumo:

1. Species distribution modelling is used increasingly in both applied and theoretical research to predict how species are distributed and to understand attributes of species' environmental requirements. In species distribution modelling, various statistical methods are used that combine species occurrence data with environmental spatial data layers to predict the suitability of any site for that species. While the number of data sharing initiatives involving species' occurrences in the scientific community has increased dramatically over the past few years, various data quality and methodological concerns related to using these data for species distribution modelling have not been addressed adequately. 2. We evaluated how uncertainty in georeferences and associated locational error in occurrences influence species distribution modelling using two treatments: (1) a control treatment where models were calibrated with original, accurate data and (2) an error treatment where data were first degraded spatially to simulate locational error. To incorporate error into the coordinates, we moved each coordinate with a random number drawn from the normal distribution with a mean of zero and a standard deviation of 5 km. We evaluated the influence of error on the performance of 10 commonly used distributional modelling techniques applied to 40 species in four distinct geographical regions. 3. Locational error in occurrences reduced model performance in three of these regions; relatively accurate predictions of species distributions were possible for most species, even with degraded occurrences. Two species distribution modelling techniques, boosted regression trees and maximum entropy, were the best performing models in the face of locational errors. The results obtained with boosted regression trees were only slightly degraded by errors in location, and the results obtained with the maximum entropy approach were not affected by such errors. 4. Synthesis and applications. To use the vast array of occurrence data that exists currently for research and management relating to the geographical ranges of species, modellers need to know the influence of locational error on model quality and whether some modelling techniques are particularly robust to error. We show that certain modelling techniques are particularly robust to a moderate level of locational error and that useful predictions of species distributions can be made even when occurrence data include some error.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

OBJECTIVES: The aim of the study was to assess whether prospective follow-up data within the Swiss HIV Cohort Study can be used to predict patients who stop smoking; or among smokers who stop, those who start smoking again. METHODS: We built prediction models first using clinical reasoning ('clinical models') and then by selecting from numerous candidate predictors using advanced statistical methods ('statistical models'). Our clinical models were based on literature that suggests that motivation drives smoking cessation, while dependence drives relapse in those attempting to stop. Our statistical models were based on automatic variable selection using additive logistic regression with component-wise gradient boosting. RESULTS: Of 4833 smokers, 26% stopped smoking, at least temporarily; because among those who stopped, 48% started smoking again. The predictive performance of our clinical and statistical models was modest. A basic clinical model for cessation, with patients classified into three motivational groups, was nearly as discriminatory as a constrained statistical model with just the most important predictors (the ratio of nonsmoking visits to total visits, alcohol or drug dependence, psychiatric comorbidities, recent hospitalization and age). A basic clinical model for relapse, based on the maximum number of cigarettes per day prior to stopping, was not as discriminatory as a constrained statistical model with just the ratio of nonsmoking visits to total visits. CONCLUSIONS: Predicting smoking cessation and relapse is difficult, so that simple models are nearly as discriminatory as complex ones. Patients with a history of attempting to stop and those known to have stopped recently are the best candidates for an intervention.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Durante los cuatro años de disfrute de la beca (2006 – 2009) se ha consolidado una base de datos de medidas osteológicas del esqueleto apendicular de numerosas especies del O. Carnivora. Concretamente, se han medido 364 individuos de 126 especies. Los ejemplares pertenecían a las colecciones del Phyletisches Museum (Jena, Alemania), el Museum für Naturkunde (Berlín, Alemania), el Museu de Ciències Naturals de la Ciutadella (Barcelona, España), el Múseum National d'Histoire Naturelle (París, Francia), y el Museo Nacional de Ciencias Naturales (Madrid, España). Asimismo, con estos datos se han estado preparando tres artículos sobre la morfología de ciertos elementos del esqueleto apendicular en carnívoros, dos de los cuales se encuentran actualmente en estado de revisión para su publicación científica. Dos de ellos, "Scapula, habitat and locomotion in Carnivora" y "Size and shape in the carnivore scapula", relacionan la morfología escapular con factores como el tamaño del animal, el tipo de locomoción que presenta y el hábitat en el que se encuentra; el primero mediante metodología multivariante (análisis funcional) y el segundo bajo las nuevas técnicas de morfometría geométrica. El tercer artículo, "Scaling and mechanics in the carnivore calcaneus: A comparison of natural and artificial selection", evalúa el efecto de diferentes tipos de selección, natural frente a artificial, sobre la morfología del calcáneo y su influencia en la biomecánica de este hueso. Finalmente, también se ha desarrollado un estudio experimental sobre la búsqueda de estabilidad durante la locomoción arbórea, cuyos resultados han dado lugar al artículo "The search for stability on narrow supports: An experimental study in cats and dogs", que también se halla bajo revisión actualmente.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

To compare the epidemiological profile and socioeconomic factors associated to the infection by Schistosoma mansoni in a rural and an urban endemic area a cross-sectional study was performed in Água Branca de Minas (rural area) and Bela Fama (urban area), both situated in the State of Minas Gerais, Brazil. Two hundred and eighty eight individuals were surveyed in the rural area and 787 in the urban area. Water contact and socioeconomic questionnaires were used to identify risk factors for the infection. The prevalences of 38.8% and 9.7% and the geometric mean of eggs per gram of faeces of 117.8 and 62.3 were found in the rural and urban areas, respectively. By multivariate statistical analysis age groups over nine years old and previous specific treatment were associated with the infection in rural area. In urban area age over nine years old, low quality housing, weekly fishing and swimming were associated after adjustment by logistic regression

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper examines the results of spatial (microgeographical) water contact/schistosomiasis studies in two African (Egyptian and Kenyan) and one Brazilian communities. All three studies used traditional cartographic and statistical methods but one of them emploeyd also GIS (geographical information systems) tools. The advantage of GIS and their potential role in schistosomiasis control are briefly described. The three cases revealed considerable variation in the spatial distribution of water contact, transmission parameters and infection levels at the household and individual levels. All studies showed considerable variation in the prevalence and intensity of infection between households. They also show a variable influence of distance on water contact behavior associated with type of activity, age, sex, socioeconomic level, perception of water quality, season and availability of water in the home. Water contact behavior and schistosomiasis were evaluated in the Brazilian village of Nova União within the context of water sharing between household and age/sex groups. Recommendations are made for further spatial studies on the transmission and control of schistosomiasis.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Report for the scientific sojourn at the University of Reading, United Kingdom, from January until May 2008. The main objectives have been firstly to infer population structure and parameters in demographic models using a total of 13 microsatellite loci for genotyping approximately 30 individuals per population in 10 Palinurus elephas populations both from Mediterranean and Atlantic waters. Secondly, developing statistical methods to identify discrepant loci, possibly under selection and implement those methods using the R software environment. It is important to consider that the calculation of the probability distribution of the demographic and mutational parameters for a full genetic data set is numerically difficult for complex demographic history (Stephens 2003). The Approximate Bayesian Computation (ABC), based on summary statistics to infer posterior distributions of variable parameters without explicit likelihood calculations, can surmount this difficulty. This would allow to gather information on different demographic prior values (i.e. effective population sizes, migration rate, microsatellite mutation rate, mutational processes) and assay the sensitivity of inferences to demographic priors by assuming different priors.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The response of Arabidopsis to stress caused by mechanical wounding was chosen as a model to compare the performances of high resolution quadrupole-time-of-flight (Q-TOF) and single stage Orbitrap (Exactive Plus) mass spectrometers in untargeted metabolomics. Both instruments were coupled to ultra-high pressure liquid chromatography (UHPLC) systems set under identical conditions. The experiment was divided in two steps: the first analyses involved sixteen unwounded plants, half of which were spiked with pure standards that are not present in Arabidopsis. The second analyses compared the metabolomes of mechanically wounded plants to unwounded plants. Data from both systems were extracted using the same feature detection software and submitted to unsupervised and supervised multivariate analysis methods. Both mass spectrometers were compared in terms of number and identity of detected features, capacity to discriminate between samples, repeatability and sensitivity. Although analytical variability was lower for the UHPLC-Q-TOF, generally the results for the two detectors were quite similar, both of them proving to be highly efficient at detecting even subtle differences between plant groups. Overall, sensitivity was found to be comparable, although the Exactive Plus Orbitrap provided slightly lower detection limits for specific compounds. Finally, to evaluate the potential of the two mass spectrometers for the identification of unknown markers, mass and spectral accuracies were calculated on selected identified compounds. While both instruments showed excellent mass accuracy (<2.5ppm for all measured compounds), better spectral accuracy was recorded on the Q-TOF. Taken together, our results demonstrate that comparable performances can be obtained at acquisition frequencies compatible with UHPLC on Q-TOF and Exactive Plus MS, which may thus be equivalently used for plant metabolomics.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

La aplicación Log2XML tiene como objeto principal la transformación de archivos log en formato texto con separador de campos a un formato XML estandarizado. Para permitir que la aplicación pueda trabajar con logs de diferentes sistemas o aplicaciones, dispone de un sistema de plantillas (indicación de orden de campos y carácter separador) que permite definir la estructura mínima para poder extraer la información de cualquier tipo de log que se base en separadores de campo. Por último, la aplicación permite el procesamiento de la información extraída para la generación de informes y estadísticas.Por otro lado, en el proyecto se profundiza en la tecnología Grails.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Analyzing the relationship between the baseline value and subsequent change of a continuous variable is a frequent matter of inquiry in cohort studies. These analyses are surprisingly complex, particularly if only two waves of data are available. It is unclear for non-biostatisticians where the complexity of this analysis lies and which statistical method is adequate.With the help of simulated longitudinal data of body mass index in children,we review statistical methods for the analysis of the association between the baseline value and subsequent change, assuming linear growth with time. Key issues in such analyses are mathematical coupling, measurement error, variability of change between individuals, and regression to the mean. Ideally, it is better to rely on multiple repeated measurements at different times and a linear random effects model is a standard approach if more than two waves of data are available. If only two waves of data are available, our simulations show that Blomqvist's method - which consists in adjusting for measurement error variance the estimated regression coefficient of observed change on baseline value - provides accurate estimates. The adequacy of the methods to assess the relationship between the baseline value and subsequent change depends on the number of data waves, the availability of information on measurement error, and the variability of change between individuals.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This study is a comparison AU Press with three other traditional (non-open access) Canadian university presses. The analysis is based on actual physical book sales on Amazon.com and Amazon.ca. Statistical methods include the sampling of the sales ranking of randomly selected books from each press. Results suggest that there is no significant difference in the ranking of printed books sold by AU Press in comparison with traditional university presses. However, AU Press, can demonstrate a significantly larger readership for its books as evidenced by thousands of downloads of the open electronic versions.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This presentation aims to make understandable the use and application context of two Webometrics techniques, the logs analysis and Google Analytics, which currently coexist in the Virtual Library of the UOC. In this sense, first of all it is provided a comprehensive introduction to webometrics and then it is analysed the case of the UOC's Virtual Library focusing on the assimilation of these techniques and the considerations underlying their use, and covering in a holistic way the process of gathering, processing and data exploitation. Finally there are also provided guidelines for the interpretation of the metric variables obtained.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This study examines how structural determinants influence intermediary factors of child health inequities and how they operate through the communities where children live. In particular, we explore individual, family and community level characteristics associated with a composite indicator that quantitatively measures intermediary determinants of early childhood health in Colombia. We use data from the 2010 Colombian Demographic and Health Survey (DHS). Adopting the conceptual framework of the Commission on Social Determinants of Health (CSDH), three dimensions related to child health are represented in the index: behavioural factors, psychosocial factors and health system. In order to generate the weight of the variables and take into account the discrete nature of the data, principal component analysis (PCA) using polychoric correlations are employed in the index construction. Weighted multilevel models are used to examine community effects. The results show that the effect of household’s SES is attenuated when community characteristics are included, indicating the importance that the level of community development may have in mediating individual and family characteristics. The findings indicate that there is a significant variance in intermediary determinants of child health between-community, especially for those determinants linked to the health system, even after controlling for individual, family and community characteristics. These results likely reflect that whilst the community context can exert a greater influence on intermediary factors linked directly to health, in the case of psychosocial factors and the parent’s behaviours, the family context can be more important. This underlines the importance of distinguishing between community and family intervention programmes.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In an earlier investigation (Burger et al., 2000) five sediment cores near the RodriguesTriple Junction in the Indian Ocean were studied applying classical statistical methods(fuzzy c-means clustering, linear mixing model, principal component analysis) for theextraction of endmembers and evaluating the spatial and temporal variation ofgeochemical signals. Three main factors of sedimentation were expected by the marinegeologists: a volcano-genetic, a hydro-hydrothermal and an ultra-basic factor. Thedisplay of fuzzy membership values and/or factor scores versus depth providedconsistent results for two factors only; the ultra-basic component could not beidentified. The reason for this may be that only traditional statistical methods wereapplied, i.e. the untransformed components were used and the cosine-theta coefficient assimilarity measure.During the last decade considerable progress in compositional data analysis was madeand many case studies were published using new tools for exploratory analysis of thesedata. Therefore it makes sense to check if the application of suitable data transformations,reduction of the D-part simplex to two or three factors and visualinterpretation of the factor scores would lead to a revision of earlier results and toanswers to open questions . In this paper we follow the lines of a paper of R. Tolosana-Delgado et al. (2005) starting with a problem-oriented interpretation of the biplotscattergram, extracting compositional factors, ilr-transformation of the components andvisualization of the factor scores in a spatial context: The compositional factors will beplotted versus depth (time) of the core samples in order to facilitate the identification ofthe expected sources of the sedimentary process.Kew words: compositional data analysis, biplot, deep sea sediments

Relevância:

80.00% 80.00%

Publicador:

Resumo:

”compositions” is a new R-package for the analysis of compositional and positive data.It contains four classes corresponding to the four different types of compositional andpositive geometry (including the Aitchison geometry). It provides means for computation,plotting and high-level multivariate statistical analysis in all four geometries.These geometries are treated in an fully analogous way, based on the principle of workingin coordinates, and the object-oriented programming paradigm of R. In this way,called functions automatically select the most appropriate type of analysis as a functionof the geometry. The graphical capabilities include ternary diagrams and tetrahedrons,various compositional plots (boxplots, barplots, piecharts) and extensive graphical toolsfor principal components. Afterwards, ortion and proportion lines, straight lines andellipses in all geometries can be added to plots. The package is accompanied by ahands-on-introduction, documentation for every function, demos of the graphical capabilitiesand plenty of usage examples. It allows direct and parallel computation inall four vector spaces and provides the beginner with a copy-and-paste style of dataanalysis, while letting advanced users keep the functionality and customizability theydemand of R, as well as all necessary tools to add own analysis routines. A completeexample is included in the appendix

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In standard multivariate statistical analysis common hypotheses of interest concern changes in mean vectors and subvectors. In compositional data analysis it is now well established that compositional change is most readily described in terms of the simplicial operation of perturbation and that subcompositions replace the marginal concept of subvectors. To motivate the statistical developments of this paper we present two challenging compositional problems from food production processes.Against this background the relevance of perturbations and subcompositions can beclearly seen. Moreover we can identify a number of hypotheses of interest involvingthe specification of particular perturbations or differences between perturbations and also hypotheses of subcompositional stability. We identify the two problems as being the counterpart of the analysis of paired comparison or split plot experiments and of separate sample comparative experiments in the jargon of standard multivariate analysis. We then develop appropriate estimation and testing procedures for a complete lattice of relevant compositional hypotheses