772 resultados para Random Sample Size
Resumo:
Accurate knowledge of species’ habitat associations is important for conservation planning and policy. Assessing habitat associations is a vital precursor to selecting appropriate indicator species for prioritising sites for conservation or assessing trends in habitat quality. However, much existing knowledge is based on qualitative expert opinion or local scale studies, and may not remain accurate across different spatial scales or geographic locations. Data from biological recording schemes have the potential to provide objective measures of habitat association, with the ability to account for spatial variation. We used data on 50 British butterfly species as a test case to investigate the correspondence of data-derived measures of habitat association with expert opinion, from two different butterfly recording schemes. One scheme collected large quantities of occurrence data (c. 3 million records) and the other, lower quantities of standardised monitoring data (c. 1400 sites). We used general linear mixed effects models to derive scores of association with broad-leaf woodland for both datasets and compared them with scores canvassed from experts. Scores derived from occurrence and abundance data both showed strongly positive correlations with expert opinion. However, only for occurrence data did these fell within the range of correlations between experts. Data-derived scores showed regional spatial variation in the strength of butterfly associations with broad-leaf woodland, with a significant latitudinal trend in 26% of species. Sub-sampling of the data suggested a mean sample size of 5000 occurrence records per species to gain an accurate estimation of habitat association, although habitat specialists are likely to be readily detected using several hundred records. Occurrence data from recording schemes can thus provide easily obtained, objective, quantitative measures of habitat association.
Resumo:
Sweetness is generally a desirable taste, however consumers can be grouped into sweet likers and dislikers according to optimally preferred sucrose concentrations. Understanding the levels of sweetness in products that are acceptable and unacceptable to both consumer groups is important to product development and for influencing dietary habits. The concentrations at which sucrose decreases liking (the rejection threshold; RjT) in liquid and semi-solid matrices were investigated in this study. Thirty six consumers rated their liking of 5 sucrose aqueous solutions; this identified 36% sweet likers (SL) whose liking ratings increased with increasing sucrose and 64% sweet dislikers (SD) whose liking ratings decreased above 6% (w/v) sucrose. We hypothesized that SL and SD would have different RjT for sucrose in products. This was tested by preparing 8 levels of sucrose in orange juice and orange jelly and presenting each against the lowest level in forced choice preference tests. In orange juice, as sucrose increased from 33g/L to 75g/L the proportion of people preferring the sweeter sample increased in both groups. However, at higher sucrose levels, the proportion of consumers preferring the sweet sample decreased. For SD, a RjT was reached at 380 g/L, whereas a significant RjT for SL was not reached. RjT in jelly were not reached as the sweetness in orange jelly was significantly lower than for orange juice (p<0.001). Despite statistically significant differences in rated sweetness between SL and SD (p=0.019), the extent of difference between the two groups was minor. The results implied that sweet liker status was not substantially related to differences in sweetness perception. Self-reported dietary intake of carbohydrate, sugars and sucrose were not significantly affected by sweet liker status. However the failure to find an effect may be due to the small sample size and future studies within a larger, more representative population sample are justifiable from the results of this study.
Resumo:
Child oral health-related quality of life (COHRQoL) has been increasingly assessed; however, few studies appraised the influence of socioeconomic status on COHRQoL in developing countries. This study assessed the relationship of COHRQoL with socioeconomic backgrounds and clinical factors. This study followed a cross-sectional design, with a multistage random sample of 792 schoolchildren aged 12 years, representative of Santa Maria, a southern city in Brazil. Participants completed the Brazilian version of the Child Perceptions Questionnaire (CPQ(11-14)), their parents or guardians answered questions on socioeconomic status, and a dental examination provided information on the prevalence of caries, dental trauma and occlusion. The assessment of association used hierarchically adjusted Poisson regression models. Higher impacts on COHRQoL were observed for children presenting with untreated dental caries (RR 1.20; 95% CI 1.07-1.35) and maxillary overjet (RR 1.19; 95% CI 1.02-1.40). Socioeconomic factors also associated with COHRQoL; poorer scores were reported by children whose mothers have not completed primary education (RR 1.30; 95% CI 1.17-1.44) and those with lower household income (RR 1.13; 95% CI 1.02-1.26). Poor socioeconomic standings and poor dental status have a negative impact on COHRQoL; reducing health inequalities may demand dental programmes and policies targeting deprived population.
Resumo:
Objective: Self-rating provides a simple direct way of capturing perceptions of health. The objective of this study was to estimate the prevalence and associated factors of poor self-rated oral health among elders. Methods: National data from a cross-sectional population-based study with a multistage random sample of 4786 Brazilian older adults (aged 65-74) in 250 towns were analysed. Data collection included oral examinations (WHO 1997) and struct-ured interviews at elderly households. The outcome was measured by a single five-point-response-scale question dichotomized into `poor` (fair/poor/very poor) and `good` (good/very good) self-rated oral health. Data analyses used Poisson regression models stratified by sex. Results: The prevalence of poor self-rated oral health was 46.6% (95% CI: 45.2-48%) in the whole sample, 50.3% (48-52.5) in men and 44.2% (42.4-46) in women. Higher prevalence ratios (PR) were found in elders reporting unfavourable dental appearance (PR = 2.31; 95% CI: 2.02-2.65), poor chewing ability (PR = 1.64; CI: 1.48-1.8) and dental pain (PR = 1.44; CI: 1.04-1.23) in adjusted analysis. Poor self-perception was also associated with being men, black, unfavourable socioeconomic circumstances, unfavourable clinical oral health and with not using or needing a dental prosthesis. Conclusion: Assessment and understanding of self-rated oral health should take into account social factors, subjective and clinical oral symptoms.
Resumo:
The Hyades stream has long been thought to be a dispersed vestige of the Hyades cluster. However, recent analyses of the parallax distribution, of the mass function, and of the action-space distribution of stream stars have shown it to be rather composed of orbits trapped at a resonance of a density disturbance. This resonant scenario should leave a clearly different signature in the element abundances of stream stars than the dispersed cluster scenario, since the Hyades cluster is chemically homogeneous. Here, we study the metallicity as well as the element abundances of Li, Na, Mg, Fe, Zr, Ba, La, Ce, Nd and Eu for a random sample of stars belonging to the Hyades stream, and compare them with those of stars from the Hyades cluster. From this analysis: (i) we independently confirm that the Hyades stream cannot be solely composed of stars originating in the Hyades cluster; (ii) we show that some stars (namely 2/21) from the Hyades stream nevertheless have abundances compatible with an origin in the cluster; (iii) we emphasize that the use of Li as a chemical tag of the cluster origin of main-sequence stars is very efficient in the range 5500 K <= T(eff) <= 6200 K, since the Li sequence in the Hyades cluster is very tight, while at the same time spanning a large abundance range; (iv) we show that, while this evaporated population has a metallicity excess of similar to 0.2 dex with respect to the local thin-disc population, identical to that of the Hyades cluster, the remainder of the Hyades stream population has still a metallicity excess of similar to 0.06-0.15 dex, consistent with an origin in the inner Galaxy and (v) we show that the Hyades stream can be interpreted as an inner 4:1 resonance of the spiral pattern: this then also reproduces an orbital family compatible with the Sirius stream, and places the origin of the Hyades stream up to 1 kpc inwards from the solar radius, which might explain the observed metallicity excess of the stream population.
Resumo:
Peponapis bees are considered specialized pollinators of Cucurbita flowers, a genus that presents several species of economic value (squashes and pumpkins). Both genera originated in the Americas, and their diversity dispersion center is in Mexico. Ten species of Peponapis and ten species of Cucurbita (only non-domesticated species) were analyzed considering the similarity of their ecological niche characteristics with respect to climatic conditions of their occurrence areas (abiotic variables) and interactions between species (biotic variables). The similarity of climatic conditions (temperature and precipitation) was estimated through cluster analyses. The areas of potential occurrence of the most similar species were obtained through ecological niche modeling and summed with geographic information system tools. Three main clusters were obtained: one with species that shared potential occurrence areas mainly in deserts (P. pruinosa, P. timberlakei, C. digitata, C. palmata, C. foetidissima), another in moist forests (P. limitaris, P. atrata, C. lundelliana, C. o. martinezii) and a third mainly in dry forests (C. a. sororia, C. radicans, C. pedatifolia, P. azteca, P. smithi, P. crassidentata, P. utahensis). Some species with similar ecological niche presented potential shared areas that are also similar to their geographical distribution, like those occurring predominantly on deserts. However, some clustered species presented larger geographical areas, such as P. pruinosa and C. foetidissima suggesting other drivers than climatic conditions to shape their distributions. The domestication of Cucurbita and also the natural history of both genera were considered also as important factors. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
This is a reply to Ortega-Baes` et al. (2010) survey of 25 Argentinean species of cacti evaluated for vivipary. We argue that the sample size and geographic area of the species investigated is insufficient to totally exclude the putative commonness of this condition in the Cactaceae. We indicate possible reasons why they did not find viviparous fruits in their survey. Failure to detect vivipary in cacti of NW Argentina may be correlated with limited taxonomic sampling and geographic region in addition to intrinsic and extrinsic plant factors, including different stages of fruit and seed development and genetic, ecological, and edaphic aspects, which, individually or in concert, control precocious germination. We uphold that viviparity is putatively frequent in this family and list 16 new cases for a total of 53 viviparous cacti, which make up ca. 4% incidence of viviparism in the Cactaceae, a substantially higher percentage than most angiosperm families exhibiting this condition. The Cactaceae ranks fourth in frequency of viviparity after the aquatic families of mangroves and seagrasses. We suggest the re-evaluation of cactus vivipary, primarily as a reproductive adaptation to changing environments and physiological stress with a secondary role as a reproductive strategy with limited offspring dispersal/survival and fitness advantages. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
There is an increasing interest in the application of Evolutionary Algorithms (EAs) to induce classification rules. This hybrid approach can benefit areas where classical methods for rule induction have not been very successful. One example is the induction of classification rules in imbalanced domains. Imbalanced data occur when one or more classes heavily outnumber other classes. Frequently, classical machine learning (ML) classifiers are not able to learn in the presence of imbalanced data sets, inducing classification models that always predict the most numerous classes. In this work, we propose a novel hybrid approach to deal with this problem. We create several balanced data sets with all minority class cases and a random sample of majority class cases. These balanced data sets are fed to classical ML systems that produce rule sets. The rule sets are combined creating a pool of rules and an EA is used to build a classifier from this pool of rules. This hybrid approach has some advantages over undersampling, since it reduces the amount of discarded information, and some advantages over oversampling, since it avoids overfitting. The proposed approach was experimentally analysed and the experimental results show an improvement in the classification performance measured as the area under the receiver operating characteristics (ROC) curve.
Resumo:
In this paper, we proposed a new two-parameter lifetime distribution with increasing failure rate, the complementary exponential geometric distribution, which is complementary to the exponential geometric model proposed by Adamidis and Loukas (1998). The new distribution arises on a latent complementary risks scenario, in which the lifetime associated with a particular risk is not observable; rather, we observe only the maximum lifetime value among all risks. The properties of the proposed distribution are discussed, including a formal proof of its probability density function and explicit algebraic formulas for its reliability and failure rate functions, moments, including the mean and variance, variation coefficient, and modal value. The parameter estimation is based on the usual maximum likelihood approach. We report the results of a misspecification simulation study performed in order to assess the extent of misspecification errors when testing the exponential geometric distribution against our complementary one in the presence of different sample size and censoring percentage. The methodology is illustrated on four real datasets; we also make a comparison between both modeling approaches. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
This paper presents the use of a multiprocessor architecture for the performance improvement of tomographic image reconstruction. Image reconstruction in computed tomography (CT) is an intensive task for single-processor systems. We investigate the filtered image reconstruction suitability based on DSPs organized for parallel processing and its comparison with the Message Passing Interface (MPI) library. The experimental results show that the speedups observed for both platforms were increased in the same direction of the image resolution. In addition, the execution time to communication time ratios (Rt/Rc) as a function of the sample size have shown a narrow variation for the DSP platform in comparison with the MPI platform, which indicates its better performance for parallel image reconstruction.
Resumo:
The class of symmetric linear regression models has the normal linear regression model as a special case and includes several models that assume that the errors follow a symmetric distribution with longer-than-normal tails. An important member of this class is the t linear regression model, which is commonly used as an alternative to the usual normal regression model when the data contain extreme or outlying observations. In this article, we develop second-order asymptotic theory for score tests in this class of models. We obtain Bartlett-corrected score statistics for testing hypotheses on the regression and the dispersion parameters. The corrected statistics have chi-squared distributions with errors of order O(n(-3/2)), n being the sample size. The corrections represent an improvement over the corresponding original Rao`s score statistics, which are chi-squared distributed up to errors of order O(n(-1)). Simulation results show that the corrected score tests perform much better than their uncorrected counterparts in samples of small or moderate size.
Resumo:
P>In the context of either Bayesian or classical sensitivity analyses of over-parametrized models for incomplete categorical data, it is well known that prior-dependence on posterior inferences of nonidentifiable parameters or that too parsimonious over-parametrized models may lead to erroneous conclusions. Nevertheless, some authors either pay no attention to which parameters are nonidentifiable or do not appropriately account for possible prior-dependence. We review the literature on this topic and consider simple examples to emphasize that in both inferential frameworks, the subjective components can influence results in nontrivial ways, irrespectively of the sample size. Specifically, we show that prior distributions commonly regarded as slightly informative or noninformative may actually be too informative for nonidentifiable parameters, and that the choice of over-parametrized models may drastically impact the results, suggesting that a careful examination of their effects should be considered before drawing conclusions.Resume Que ce soit dans un cadre Bayesien ou classique, il est bien connu que la surparametrisation, dans les modeles pour donnees categorielles incompletes, peut conduire a des conclusions erronees. Cependant, certains auteurs persistent a negliger les problemes lies a la presence de parametres non identifies. Nous passons en revue la litterature dans ce domaine, et considerons quelques exemples surparametres simples dans lesquels les elements subjectifs influencent de facon non negligeable les resultats, independamment de la taille des echantillons. Plus precisement, nous montrons comment des a priori consideres comme peu ou non-informatifs peuvent se reveler extremement informatifs en ce qui concerne les parametres non identifies, et que le recours a des modeles surparametres peut avoir sur les conclusions finales un impact considerable. Ceci suggere un examen tres attentif de l`impact potentiel des a priori.
Resumo:
This paper considers an extension to the skew-normal model through the inclusion of an additional parameter which can lead to both uni- and bi-modal distributions. The paper presents various basic properties of this family of distributions and provides a stochastic representation which is useful for obtaining theoretical properties and to simulate from the distribution. Moreover, the singularity of the Fisher information matrix is investigated and maximum likelihood estimation for a random sample with no covariates is considered. The main motivation is thus to avoid using mixtures in fitting bimodal data as these are well known to be complicated to deal with, particularly because of identifiability problems. Data-based illustrations show that such model can be useful. Copyright (C) 2009 John Wiley & Sons, Ltd.
Resumo:
The Birnbaum-Saunders regression model is commonly used in reliability studies. We address the issue of performing inference in this class of models when the number of observations is small. Our simulation results suggest that the likelihood ratio test tends to be liberal when the sample size is small. We obtain a correction factor which reduces the size distortion of the test. Also, we consider a parametric bootstrap scheme to obtain improved critical values and improved p-values for the likelihood ratio test. The numerical results show that the modified tests are more reliable in finite samples than the usual likelihood ratio test. We also present an empirical application. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In this article, we give an asymptotic formula of order n(-1/2), where n is the sample size, for the skewness of the distributions of the maximum likelihood estimates of the parameters in exponencial family nonlinear models. We generalize the result by Cordeiro and Cordeiro ( 2001). The formula is given in matrix notation and is very suitable for computer implementation and to obtain closed form expressions for a great variety of models. Some special cases and two applications are discussed.