923 results for LARGE-SAMPLE


Relevance: 100.00%

Abstract:

Background: Intermediate phenotypes are often measured as a proxy for asthma. It is largely unclear to what extent the same set of environmental or genetic factors regulate these traits. Objective: Estimate the environmental and genetic correlations between self-reported and clinical asthma traits. Methods: A total of 3073 subjects from 802 families were ascertained through a twin proband. Traits measured included self-reported asthma, airway histamine responsiveness (AHR), skin prick response to common allergens including house dust mite (Dermatophagoides pteronyssinus [D. pter]), baseline lung function, total serum immunoglobulin E (IgE) and eosinophilia. Bivariate and multivariate analyses of eight traits were performed with adjustment for ascertainment and significant covariates. Results: Overall 2716 participants completed an asthma questionnaire and 2087 were clinically tested, including 1289 self-reported asthmatics (92% previously diagnosed by a doctor). Asthma, AHR, markers of allergic sensitization and eosinophilia had significant environmental correlations with each other (range: 0.23-0.89). Baseline forced expiratory volume in 1 s (FEV1) showed low environmental correlations with most traits. Fewer genetic correlations were significantly different from zero. Phenotypes with greatest genetic similarity were asthma and atopy (0.46), IgE and eosinophilia (0.44), AHR and D. pter (0.43) and AHR and airway obstruction (-0.43). Traits with greatest genetic dissimilarity were FEV1 and atopy (0.05), airway obstruction and IgE (0.07) and FEV1 and D. pter (0.11). Conclusion: These results suggest that the same set of environmental factors regulates the variation of many asthma traits. In addition, although most traits are regulated to a great extent by specific genetic factors, there is still some degree of genetic overlap that could be exploited by multivariate linkage approaches.

Relevance: 100.00%

Abstract:

The assessment of adolescent drinking behavior is a complex task, complicated by variability in drinking patterns, the transitory and developmental nature of the behavior and the reliance (for large-scale studies) on self-report questionnaires. The Adolescent Alcohol Involvement Scale (Mayer & Filstead, 1979) is a 14-item screening tool designed to help identify alcohol misusers or more problematic drinkers. The present study utilized a large sample (n = 4066) of adolescents from Northern Ireland. Results of confirmatory factor analyses and reliability estimates revealed that the 14 items share sufficient common variance that scores can be considered reliable and that the items can be scored to provide a composite alcohol use score.

Relevance: 70.00%

Abstract:

Adolescent drinking is a significant issue yet valid psychometric tools designed for this group are scarce. The Drinking Refusal Self-Efficacy Questionnaire—Revised Adolescent Version (DRSEQ-RA) is designed to assess an individual's belief in their ability to resist drinking alcohol. The original DRSEQ-R consists of three factors reflecting social pressure refusal self-efficacy, opportunistic refusal self-efficacy and emotional relief refusal self-efficacy. A large sample of 2020 adolescents aged between 12 and 19 years completed the DRSEQ and measures of alcohol consumption in small groups. Using confirmatory factor analysis, the three factor structure was confirmed. All three factors were negatively correlated with both frequency and volume of alcohol consumption. Drinkers reported lower drinking refusal self-efficacy than non-drinkers. Taken together, these results suggest that the adolescent version of the Drinking Refusal Self-Efficacy Questionnaire (DRSEQ-RA) is a reliable and valid measure of drinking refusal self-efficacy.

Relevance: 70.00%

Abstract:

This study reports on the utilisation of the Manchester Driver Behaviour Questionnaire (DBQ) to examine the self-reported driving behaviours of a large sample of Australian fleet drivers (N = 3414). Surveys were completed by employees before they commenced a one-day safety workshop intervention. Factor analysis techniques identified a three-factor solution similar to previous research, comprising: (a) errors, (b) highway-code violations and (c) aggressive driving violations. Two items traditionally related to highway-code violations were found to be associated with aggressive driving behaviours among the current sample. Multivariate analyses revealed that exposure to the road, errors and self-reported offences predicted crashes at work in the last 12 months, while gender, highway violations and crashes predicted offences incurred while at work. Importantly, those who received more fines at work were at an increased risk of crashing the work vehicle. However, overall, the DBQ demonstrated limited efficacy at predicting these two outcomes. This paper outlines the major findings of the study with regard to identifying and predicting aberrant driving behaviours and also highlights implications regarding the future utilisation of the DBQ within fleet settings.

Relevance: 70.00%

Abstract:

Instead of regarding a particular type of gambling activity (for example, electronic gambling machines, table games) as an isolated factor for problem gambling, recent research suggests that gambling involvement (for example, as measured by the number of different types of gambling activities played) should also be considered. Using a large sample of the Victorian adult population, this study found that the strength of association between problem gambling and the type of gambling reduced after adjusting for gambling involvement. This finding supports recent research that gambling involvement is an important factor in assessing the risk of problem gambling. The study also provides insights into the measurements of gambling involvement and provides alternative statistical modelling to analyse problem gambling.

Relevance: 70.00%

Abstract:

Mixed-species flocks of foraging birds have been documented from terrestrial habitats all over the world and are thought to form for either improved feeding efficiency or better protection from predators. Two kinds of flock participants are recognized: those that join other species ('followers') and are therefore likely to be the recipients of the benefits of flock participation and those that are joined ('leaders'). Through comparative analyses, using a large sample of flocks from around the world, we show that (1) 'followers' tend to be smaller, more insectivorous, and feed in higher strata than matched species that participate in flocks to a lesser extent and (2) 'leaders' tend to be cooperative breeders more often than matched species that are not known to lead flocks. Furthermore, meta-analyses of published results from across the world showed that bird species in terrestrial mixed-species flocks increase foraging rates and reduce vigilance compared to when they are solitary or in conspecific groups. Moreover, the increase in foraging rates is seen only with flock followers and not flock leaders. These findings suggest a role for predation in the evolution of mixed-species flocking. Species that are vulnerable to predation follow species whose vigilance they can exploit. By doing so, they are able to reduce their own vigilance and forage at higher rates. (C) 2009 The Association for the Study of Animal Behaviour. Published by Elsevier Ltd. All rights reserved.

Relevance: 70.00%

Abstract:

Statistical tests in vector autoregressive (VAR) models are typically based on large-sample approximations, involving the use of asymptotic distributions or bootstrap techniques. After documenting that such methods can be very misleading even with fairly large samples, especially when the number of lags or the number of equations is not small, we propose a general simulation-based technique that allows one to control completely the level of tests in parametric VAR models. In particular, we show that maximized Monte Carlo tests [Dufour (2002)] can provide provably exact tests for such models, whether they are stationary or integrated. Applications to order selection and causality testing are considered as special cases. The technique developed is applied to quarterly and monthly VAR models of the U.S. economy, comprising income, money, interest rates and prices, over the period 1965-1996.
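As a rough illustration of the simulation-based idea in the abstract above, the following sketch computes a Monte Carlo p-value and then maximizes it over a set of nuisance-parameter values. It is a minimal sketch under stated assumptions, not the authors' procedure: the names `mc_pvalue`, `mmc_pvalue`, `simulate_stat` and `nuisance_grid` are hypothetical, and maximizing over a finite grid only approximates the supremum over the full nuisance-parameter space that an exact test would require.

```python
import random
from typing import Callable, Iterable, Sequence


def mc_pvalue(observed: float, simulated: Sequence[float]) -> float:
    """Monte Carlo p-value: (1 + #{simulated >= observed}) / (N + 1)."""
    exceed = sum(1 for s in simulated if s >= observed)
    return (exceed + 1) / (len(simulated) + 1)


def mmc_pvalue(
    observed: float,
    simulate_stat: Callable[[float, random.Random], float],  # hypothetical simulator
    nuisance_grid: Iterable[float],                          # hypothetical grid
    n_rep: int = 99,
    seed: int = 0,
) -> float:
    """Largest Monte Carlo p-value over the nuisance-parameter grid."""
    best = 0.0
    for theta in nuisance_grid:
        # Re-seeding gives every grid point the same random draws.
        rng = random.Random(seed)
        sims = [simulate_stat(theta, rng) for _ in range(n_rep)]
        best = max(best, mc_pvalue(observed, sims))
    return best
```

The null hypothesis would be rejected at level alpha only if the maximized p-value is at most alpha, which is what makes the procedure conservative with respect to the unknown nuisance parameters.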

Relevance: 70.00%

Abstract:

This paper presents a simple Bayesian approach to sample size determination in clinical trials. It is required that the trial should be large enough to ensure that the data collected will provide convincing evidence either that an experimental treatment is better than a control or that it fails to improve upon control by some clinically relevant difference. The method resembles standard frequentist formulations of the problem, and indeed in certain circumstances involving 'non-informative' prior information it leads to identical answers. In particular, unlike many Bayesian approaches to sample size determination, use is made of an alternative hypothesis that an experimental treatment is better than a control treatment by some specified magnitude. The approach is introduced in the context of testing whether a single stream of binary observations is consistent with a given success rate p(0). Next the case of comparing two independent streams of normally distributed responses is considered, first under the assumption that their common variance is known and then for unknown variance. Finally, the more general situation in which a large sample is to be collected and analysed according to the asymptotic properties of the score statistic is explored. Copyright (C) 2007 John Wiley & Sons, Ltd.

Relevance: 70.00%

Abstract:

Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the Genetic Investigation of Anthropometric Traits (GIANT) consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10⁻⁷. Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.
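The hypergeometric enrichment test mentioned in the abstract above can be sketched as an upper-tail probability: given a total number of genes, a number of candidate genes, and the number of genes falling in the top P-value quantile, how likely is it to see at least the observed number of candidates in that quantile by chance? The function name `hypergeom_upper_tail` and its argument names are hypothetical, the example numbers are purely illustrative and are not taken from the study, and the authors' exact gene-level definition may differ.

```python
from math import comb


def hypergeom_upper_tail(total: int, candidates: int, drawn: int, hits: int) -> float:
    """P(X >= hits) when `drawn` genes are sampled without replacement
    from `total` genes, of which `candidates` are candidate genes."""
    upper = min(candidates, drawn)
    numerator = sum(
        comb(candidates, k) * comb(total - candidates, drawn - k)
        for k in range(hits, upper + 1)
    )
    return numerator / comb(total, drawn)


# Illustrative toy numbers only: 10 genes, 4 candidates,
# 5 genes in the top quantile, 3 of them candidates.
p_value = hypergeom_upper_tail(total=10, candidates=4, drawn=5, hits=3)
```

A small upper-tail probability would indicate that candidate genes are over-represented among the most strongly associated genes.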

Relevance: 70.00%

Abstract:

Aim: Tracking the dispersal patterns and habitat use of migratory species is necessary to delineate optimal areas for protection, with large sample sizes being more representative of the population. Here, we examine the dispersal patterns of a key Mediterranean loggerhead turtle (Caretta caretta) breeding population to identify priority foraging sites for protection. Location: Zakynthos Island, Greece and the wider Mediterranean. Method: We examined the dispersal patterns and foraging sites of 75 adult loggerheads (n = 38 males and 37 females) tracked from the breeding area of Zakynthos Island (Greece) from 2004 to 2011. We then combined our data with published sea turtle literature to identify key foraging sites for protection. Results: While both males and females exhibited similar dispersal patterns, about 25% of males remained within 100 km of Zakynthos, whereas all females (except one) migrated > 200 km. Integration of our data with the wider literature isolated 10 core sites in proximity to existing protected areas, which could potentially protect 64% of the Zakynthos population, while five sites support individuals from at least 10 other loggerhead breeding populations. Main conclusions: Due to the widespread availability of neritic foraging grounds across the Mediterranean, sea turtles from Zakynthos exhibit disparate dispersal patterns. However, protecting only a few objectively defined important sites can encompass a large proportion of the foraging areas used and hence have considerable conservation benefit.

Relevance: 70.00%

Abstract:

This study aimed to investigate the phenomenology of obsessive compulsive disorder (OCD), addressing specific questions about the nature of obsessions and compulsions, and to contribute to the World Health Organization's (WHO) revision of OCD diagnostic guidelines. Data from 1001 patients from the Brazilian Research Consortium on Obsessive Compulsive Spectrum Disorders were used. Patients were evaluated by trained clinicians using validated instruments, including the Dimensional Yale Brown Obsessive Compulsive Scale, the University of Sao Paulo Sensory Phenomena Scale, and the Brown Assessment of Beliefs Scale. The aims were to compare the types of sensory phenomena (SP, subjective experiences that precede or accompany compulsions) in OCD patients with and without tic disorders and to determine the frequency of mental compulsions, the co-occurrence of obsessions and compulsions, and the range of insight. SP were common in the whole sample, but patients with tic disorders were more likely to have physical sensations and urges only. Mental compulsions occurred in the majority of OCD patients. It was extremely rare for OCD patients to have obsessions without compulsions. A wide range of insight into OCD beliefs was observed, with a small subset presenting no insight. The data generated from this large sample will help practicing clinicians appreciate the full range of OCD symptoms and confirm the findings of prior studies in smaller samples on the degree to which insight varies. These findings also support specific revisions to the WHO's diagnostic guidelines for OCD, such as describing sensory phenomena, mental compulsions and level of insight, so that the world-wide recognition of this disabling disorder is increased. (C) 2014 Elsevier Ltd. All rights reserved.

Relevance: 70.00%

Abstract:

The 3PL model is a flexible and widely used tool in assessment. However, it suffers from limitations due to its need for large sample sizes. This study introduces and evaluates the efficacy of a new sample size augmentation technique called Duplicate, Erase, and Replace (DupER) Augmentation through a simulation study. Data are augmented using several variations of DupER Augmentation (based on different imputation methodologies, deletion rates, and duplication rates), analyzed in BILOG-MG 3, and results are compared to those obtained from analyzing the raw data. Additional manipulated variables include test length and sample size. Estimates are compared using seven different evaluative criteria. Results are mixed and inconclusive. DupER augmented data tend to result in larger root mean squared errors (RMSEs) and lower correlations between estimates and parameters for both item and ability parameters. However, some DupER variations produce estimates that are much less biased than those obtained from the raw data alone. For one DupER variation, it was found that DupER produced better results for low-ability simulees and worse results for those with high abilities. Findings, limitations, and recommendations for future studies are discussed. Specific recommendations for future studies include (1) applying DupER Augmentation to empirical data, (2) applying it with additional IRT models, and (3) analyzing the efficacy of the procedure for different item and ability parameter distributions.

Relevance: 70.00%

Abstract:

Simulation-based assessment is a popular and frequently necessary approach to evaluation of statistical procedures. Sometimes overlooked is the ability to take advantage of underlying mathematical relations and we focus on this aspect. We show how to take advantage of large-sample theory when conducting a simulation using the analysis of genomic data as a motivating example. The approach uses convergence results to provide an approximation to smaller-sample results, results that are available only by simulation. We consider evaluating and comparing a variety of ranking-based methods for identifying the most highly associated SNPs in a genome-wide association study, derive integral equation representations of the pre-posterior distribution of percentiles produced by three ranking methods, and provide examples comparing performance. These results are of interest in their own right and set the framework for a more extensive set of comparisons.

Relevance: 70.00%

Abstract:

Fitting statistical models is computationally challenging when the sample size or the dimension of the dataset is huge. An attractive approach for down-scaling the problem size is to first partition the dataset into subsets and then fit using distributed algorithms. The dataset can be partitioned either horizontally (in the sample space) or vertically (in the feature space), and the challenge arises in defining an algorithm with low communication, theoretical guarantees and excellent practical performance in general settings. For sample space partitioning, I propose a MEdian Selection Subset AGgregation Estimator (message) algorithm for solving these issues. The algorithm applies feature selection in parallel for each subset using regularized regression or a Bayesian variable selection method, calculates the 'median' feature inclusion index, estimates coefficients for the selected features in parallel for each subset, and then averages these estimates. The algorithm is simple, involves minimal communication, scales efficiently in sample size, and has theoretical guarantees. I provide extensive experiments to show excellent performance in feature selection, estimation, prediction, and computation time relative to usual competitors.

While sample space partitioning is useful in handling datasets with large sample size, feature space partitioning is more effective when the data dimension is high. Existing methods for partitioning features, however, are either vulnerable to high correlations or inefficient in reducing the model dimension. In the thesis, I propose a new embarrassingly parallel framework named DECO for distributed variable selection and parameter estimation. In DECO, variables are first partitioned and allocated to m distributed workers. The decorrelated subset data within each worker are then fitted via any algorithm designed for high-dimensional problems. We show that by incorporating the decorrelation step, DECO can achieve consistent variable selection and parameter estimation on each subset with (almost) no assumptions. In addition, the convergence rate is nearly minimax optimal for both sparse and weakly sparse models and does NOT depend on the partition number m. Extensive numerical experiments are provided to illustrate the performance of the new framework.

For datasets with both large sample sizes and high dimensionality, I propose a new "divide-and-conquer" framework DEME (DECO-message) by leveraging both the DECO and the message algorithms. The new framework first partitions the dataset in the sample space into row cubes using message and then partitions the feature space of the cubes using DECO. This procedure is equivalent to partitioning the original data matrix into multiple small blocks, each with a feasible size that can be stored and fitted in a computer in parallel. The results are then synthesized via the DECO and message algorithms in reverse order to produce the final output. The whole framework is extremely scalable.
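The four steps of the message algorithm described in this abstract (per-subset feature selection, median inclusion, per-subset refitting, averaging) can be sketched as follows. This is a minimal sketch rather than the author's implementation: the `select` and `fit` callbacks are hypothetical placeholders for whatever selection method (for example a lasso) and estimator are used, the loops run sequentially where the abstract describes parallel execution, and the tie rule (a feature is kept only when its median inclusion exceeds 0.5) is an assumption.

```python
from statistics import median
from typing import Callable, List, Sequence, Tuple, TypeVar

Subset = TypeVar("Subset")


def message(
    subsets: Sequence[Subset],
    select: Callable[[Subset], Sequence[int]],             # hypothetical: 0/1 flag per feature
    fit: Callable[[Subset, List[int]], Sequence[float]],   # hypothetical: coefficients for kept features
) -> Tuple[List[int], List[float]]:
    # Step 1: feature selection on every subset (parallel in practice).
    votes = [select(s) for s in subsets]
    n_features = len(votes[0])

    # Step 2: median inclusion index across subsets (assumed tie rule: > 0.5).
    keep = [j for j in range(n_features) if median(v[j] for v in votes) > 0.5]

    # Step 3: re-estimate coefficients for the kept features on every subset.
    coefs = [fit(s, keep) for s in subsets]

    # Step 4: average the per-subset estimates.
    averaged = [sum(c[i] for c in coefs) / len(coefs) for i in range(len(keep))]
    return keep, averaged
```

Only the 0/1 inclusion vectors and the per-subset coefficient vectors need to be exchanged between workers, which is consistent with the low-communication property the abstract claims.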

Relevance: 70.00%

Abstract:

Aims. The large and small-scale (pc) structure of the Galactic interstellar medium can be investigated by utilising spectra of early-type stellar probes of known distances in the same region of the sky. This paper determines the variation in line strength of Ca ii at 3933.661 Å as a function of probe separation for a large sample of stars, including a number of sightlines in the Magellanic Clouds. 

Methods. FLAMES-GIRAFFE data taken with the Very Large Telescope towards early-type stars in 3 Galactic and 4 Magellanic open clusters in Ca ii are used to obtain the velocity, equivalent width, column density, and line width of interstellar Galactic calcium for a total of 657 stars, of which 443 are Magellanic Cloud sightlines. In each cluster there are between 43 and 111 stars observed. Additionally, FEROS and UVES Ca ii K and Na i D spectra of 21 Galactic and 154 Magellanic early-type stars are presented and combined with data from the literature to study the calcium column density - parallax relationship. 

Results. For the four Magellanic clusters studied with FLAMES, the strength of the Galactic interstellar Ca ii K equivalent width on transverse scales from ∼0.05-9 pc is found to vary by factors of ∼1.8-3.0, corresponding to column density variations of ∼0.3-0.5 dex in the optically-thin approximation. Using FLAMES, FEROS, and UVES archive spectra, the minimum and maximum reduced equivalent widths for Milky Way gas are found to lie in the range ∼35-125 mÅ and ∼30-160 mÅ for Ca ii K and Na i D, respectively. The range is consistent with a previously published simple model of the interstellar medium consisting of spherical cloudlets of filling factor ∼0.3, although other geometries are not ruled out. Finally, the derived functional form for parallax (π) and Ca ii column density (N_CaII) is found to be π (mas) = 1 / (2.39 × 10⁻¹³ × N_CaII (cm⁻²) + 0.11). Our derived parallax is ∼25 per cent lower than predicted by Megier et al. (2009, A&A, 507, 833) at a distance of ∼100 pc and ∼15 per cent lower at a distance of ∼200 pc, reflecting inhomogeneity in the Ca ii distribution in the different sightlines studied.
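The parallax relation quoted in the results can be evaluated directly. The short sketch below assumes the exponents are 10⁻¹³ and cm⁻², as the flattened superscripts in the source suggest, and converts parallax to distance with the standard relation d (pc) = 1000 / π (mas). The function names are hypothetical, and the example column density is illustrative rather than a value from the paper.

```python
def parallax_mas(n_caii_cm2: float) -> float:
    """Parallax in milliarcseconds for a Ca II column density in cm^-2,
    using pi = 1 / (2.39e-13 * N_CaII + 0.11)."""
    return 1.0 / (2.39e-13 * n_caii_cm2 + 0.11)


def distance_pc(n_caii_cm2: float) -> float:
    """Distance in parsecs, from d (pc) = 1000 / parallax (mas)."""
    return 1000.0 / parallax_mas(n_caii_cm2)


# Illustrative column density of 1e12 cm^-2 (not a value from the paper).
example_distance = distance_pc(1.0e12)  # about 349 pc
```

Written as a distance, the relation is linear in column density, d (pc) = 2.39 × 10⁻¹⁰ × N_CaII + 110, so the constant term corresponds to an offset of about 110 pc at zero column density.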