958 resultados para multivariate binary data
Resumo:
For species that form multi-generational and territorial family groups, resource-rich areas are predicted to support family dynasties in which one genetic lineage continuously occupies an area and may even expand to occupy surrounding areas. Data from a long-term study of Tasmanian native hens (Gallinula mortierii) support this prediction. The reproductive success and dispersal patterns of 18 hen lineages were monitored for seven breeding seasons and over several generations. The founder group with the highest average territory quality produced the highest total number of fledged young and the highest number of fledged linear descendants, accounting for 24% of the combined reproductive output of these 18 lineages. In the space of 6 years, this single genetic lineage expanded from one territory to occupy 12 of the 47 territories present in the population. This rate of expansion was over four times the population average for the same period. A multivariate analysis revealed that the success of a genetic lineage depended only on the number of high-quality territories surrounding the founder group. These results further demonstrate the resource-dependent nature of reproductive success in this species, and also highlight the potential importance of family dynasties in other cooperative species with complex social dynamics and dispersal patterns.
Resumo:
Objective: We examined the relationship between self-reported calcium (Cal intake and bone mineral content (BMC) in children and adolescents. We hypothesized that an expression of Ca adjusted for energy intake (El), i.e., Ca density, would be a better predictor of BMC than unadjusted Ca because of underreporting of EI. Methods: Data were obtained on dietary intakes (repeated 24-hour recalls) and BMC (by DEXA) in a cross-section of 227 children aged 8 to 17 years. Bivariate and multivariate analyses were used to examine die relationship between Ca, Ca density, and the dependent variables total body BMC and lumbar spine BMC. Covariates included were height, weight, bone area, maturity age, activity score and El. Results: Reported El compared to estimated basal metabolic rate suggested underreporting of El. Total body and lumbar spine BMC were significantly associated with El, but not Ca or Ca density, in bivariate analyses. After controlling for size and maturity, multiple linear regression analysis revealed unadjusted Ca to be a predictor of BMC in males in the total body (p = 0.08) and lumbar spine (p = 0.01). Unadjusted Ca was not a predictor of BMC at either site in females. Ca density was not a better predictor of BMC at either site in males or females. Conclusions: The relationship observed in male adolescents in this study between Ca intake and BMC is similar to that seen in clinical trials. Ca density did not enable us to see a relationship between Ca intake and BMC in females, which may reflect systematic reporting errors or that diet is not a limiting factor in this group of healthy adolescents.
Resumo:
The 16S rRNA gene (16S rDNA) is currently the most widely used gene for estimating the evolutionary history of prokaryotes, To date, there are more than 30 000 16S rDNA sequences available from the core databases, GenBank, EMBL and DDBJ, This great number may cause a dilemma when composing datasets for phylogenetic analysis, since the choice and number of reference organisms are known to affect the resulting tree topology. A group of sequences appearing monophyletic in one dataset may not be so in another. This can be especially problematic when establishing the relationships of distantly related sequences at the division (phylum) level. In this study, a multiple-outgroup approach to resolving division-level phylogenetic relationships is suggested using 16S rDNA data. The approach is illustrated by two case studies concerning the monophyly of two recently proposed bacterial divisions, OP9 and OP10.
Resumo:
Recent research in Australian sociology and political science has debated the extent to which postmaterialist values and economic self-interest shape voting in federal elections. Some researchers have argued that postmaterialist values have partly displaced materialist concerns with physical security and economic well-being in Australian public life. This displacement, coupled with the adoption by major political parties of postmaterialist 'quality of life' issues such as the environment, has meant that voting in Australia has come to be more dependent on postmaterialist values than on perceptions of economic interest. Other research, however, has found no relationship between postmaterialist values and voting behaviour, while economic evaluations remain a strong determinant of voting behaviour. Part of the disagreement reflects methodological differences in the research. But different methodological problems compromise each of the previous studies. In this paper we use data from the 1990, 1993, 1996 and 1998 Australian Election Studies to investigate postmaterialist and economic voting in the Commonwealth House of Representatives and the Senate. Using various statistical methods, we first explore bivariate relationships between key variables and then use multivariate models of postmaterialist and economic voting to adjudicate between the contending positions.
Resumo:
Qualitative data analysis (QDA) is often a time-consuming and laborious process usually involving the management of large quantities of textual data. Recently developed computer programs offer great advances in the efficiency of the processes of QDA. In this paper we report on an innovative use of a combination of extant computer software technologies to further enhance and simplify QDA. Used in appropriate circumstances, we believe that this innovation greatly enhances the speed with which theoretical and descriptive ideas can be abstracted from rich, complex, and chaotic qualitative data. © 2001 Human Sciences Press, Inc.
Resumo:
Much progress has been made on inferring population history from molecular data. However, complex demographic scenarios have been considered rarely or have proved intractable. The serial introduction of the South-Central American cane Load Bufo marinas in various Caribbean and Pacific islands involves four major phases: a possible genetic admixture during the first introduction, a bottleneck associated with founding, a transitory, population boom, and finally, a demographic stabilization. A large amount of historical and demographic information is available for those introductions and can be combined profitably with molecular data. We used a Bayesian approach to combine this information With microsatellite (10 loci) and enzyme (22 loci) data and used a rejection algorithm to simultaneously estimate the demographic parameters describing the four major phases of the introduction history,. The general historical trends supported by microsatellites and enzymes were similar. However, there was a stronger support for a larger bottleneck at introductions for microsatellites than enzymes and for a more balanced genetic admixture for enzymes than for microsatellites. Verb, little information was obtained from either marker about the transitory population boom observed after each introduction. Possible explanations for differences in resolution of demographic events and discrepancies between results obtained with microsatellites and enzymes were explored. Limits Of Our model and method for the analysis of nonequilibrium populations were discussed.
Resumo:
Objectives: This study examines human scalp electroencephalographic (EEG) data for evidence of non-linear interdependence between posterior channels. The spectral and phase properties of those epochs of EEG exhibiting non-linear interdependence are studied. Methods: Scalp EEG data was collected from 40 healthy subjects. A technique for the detection of non-linear interdependence was applied to 2.048 s segments of posterior bipolar electrode data. Amplitude-adjusted phase-randomized surrogate data was used to statistically determine which EEG epochs exhibited non-linear interdependence. Results: Statistically significant evidence of non-linear interactions were evident in 2.9% (eyes open) to 4.8% (eyes closed) of the epochs. In the eyes-open recordings, these epochs exhibited a peak in the spectral and cross-spectral density functions at about 10 Hz. Two types of EEG epochs are evident in the eyes-closed recordings; one type exhibits a peak in the spectral density and cross-spectrum at 8 Hz. The other type has increased spectral and cross-spectral power across faster frequencies. Epochs identified as exhibiting non-linear interdependence display a tendency towards phase interdependencies across and between a broad range of frequencies. Conclusions: Non-linear interdependence is detectable in a small number of multichannel EEG epochs, and makes a contribution to the alpha rhythm. Non-linear interdependence produces spatially distributed activity that exhibits phase synchronization between oscillations present at different frequencies. The possible physiological significance of these findings are discussed with reference to the dynamical properties of neural systems and the role of synchronous activity in the neocortex. (C) 2002 Elsevier Science Ireland Ltd. All rights reserved.
Resumo:
In many occupational safety interventions, the objective is to reduce the injury incidence as well as the mean claims cost once injury has occurred. The claims cost data within a period typically contain a large proportion of zero observations (no claim). The distribution thus comprises a point mass at 0 mixed with a non-degenerate parametric component. Essentially, the likelihood function can be factorized into two orthogonal components. These two components relate respectively to the effect of covariates on the incidence of claims and the magnitude of claims, given that claims are made. Furthermore, the longitudinal nature of the intervention inherently imposes some correlation among the observations. This paper introduces a zero-augmented gamma random effects model for analysing longitudinal data with many zeros. Adopting the generalized linear mixed model (GLMM) approach reduces the original problem to the fitting of two independent GLMMs. The method is applied to evaluate the effectiveness of a workplace risk assessment teams program, trialled within the cleaning services of a Western Australian public hospital.
Resumo:
Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets.
Resumo:
The incidence of melanoma increases markedly in the second decade of life but almost nothing is known of the causes of melanoma in this age group. We report on the first population-based case-control study of risk factors for melanoma in adolescents (15-19 years). Data were collected through personal interviews with cases, controls and parents. A single examiner conducted full-body nevus counts and blood samples were collected from cases for analysis of the CDKN2A melanoma predisposition gene. A total of 201 (80%) of the 250 adolescents with melanoma diagnosed between 1987 and 1994 and registered with the Queensland Cancer Registry and 205 (79%) of 258 age-, gender- and location-matched controls who were contacted agreed to participate. The strongest risk factor associated with melanoma in adolescents in a multivariate model was the presence of more than 100 nevi 2 mm or more in diameter (odds ratio [OR] = 46.5, 95% confidence interval [Cl] = 11.4-190.8). Other risk factors were red hair (OR = 5.4, 95%Cl = 1.0-28.4); blue eyes (OR = 4.5, 95%Cl = 1.5- 13.6); inability to tan after prolonged sun exposure (OR = 4.7, 95%Cl = 0.9-24.6); heavy facial freckling (OR = 3.2, 95% Cl = 0.9-12.3); and family history of melanoma (OR = 4.0, 95%Cl = 0.8-18.9). Only 2 of 147 cases tested had germline variants or mutations in CDKN2A. There was no association with sunscreen use overall, however, never/rare use of sunscreen at home under the age of 5 years was associated with increased risk (OR = 2.2, 95%Cl = 0.7-7.1). There was no difference between cases and controls in cumulative sun exposure in this high-exposure environment. Factors indicating genetic susceptibility to melanoma, in particular, the propensity to develop nevi and freckles, red hair, blue eyes, inability to tan and a family history of the disease are the primary determinants of melanoma among adolescents in this high solar radiation environment. Lack of association with reported sun exposure is consistent with the high genetic susceptibility in this group. (C) 2002 Wiley-Liss, Inc.