849 resultados para large sample distributions
Resumo:
Satellite measurement validations, climate models, atmospheric radiative transfer models and cloud models, all depend on accurate measurements of cloud particle size distributions, number densities, spatial distributions, and other parameters relevant to cloud microphysical processes. And many airborne instruments designed to measure size distributions and concentrations of cloud particles have large uncertainties in measuring number densities and size distributions of small ice crystals. HOLODEC (Holographic Detector for Clouds) is a new instrument that does not have many of these uncertainties and makes possible measurements that other probes have never made. The advantages of HOLODEC are inherent to the holographic method. In this dissertation, I describe HOLODEC, its in-situ measurements of cloud particles, and the results of its test flights. I present a hologram reconstruction algorithm that has a sample spacing that does not vary with reconstruction distance. This reconstruction algorithm accurately reconstructs the field to all distances inside a typical holographic measurement volume as proven by comparison with analytical solutions to the Huygens-Fresnel diffraction integral. It is fast to compute, and has diffraction limited resolution. Further, described herein is an algorithm that can find the position along the optical axis of small particles as well as large complex-shaped particles. I explain an implementation of these algorithms that is an efficient, robust, automated program that allows us to process holograms on a computer cluster in a reasonable time. I show size distributions and number densities of cloud particles, and show that they are within the uncertainty of independent measurements made with another measurement method. The feasibility of another cloud particle instrument that has advantages over new standard instruments is proven. These advantages include a unique ability to detect shattered particles using three-dimensional positions, and a sample volume size that does not vary with particle size or airspeed. It also is able to yield two-dimensional particle profiles using the same measurements.
Resumo:
This dissertation has three separate parts: the first part deals with the general pedigree association testing incorporating continuous covariates; the second part deals with the association tests under population stratification using the conditional likelihood tests; the third part deals with the genome-wide association studies based on the real rheumatoid arthritis (RA) disease data sets from Genetic Analysis Workshop 16 (GAW16) problem 1. Many statistical tests are developed to test the linkage and association using either case-control status or phenotype covariates for family data structure, separately. Those univariate analyses might not use all the information coming from the family members in practical studies. On the other hand, the human complex disease do not have a clear inheritance pattern, there might exist the gene interactions or act independently. In part I, the new proposed approach MPDT is focused on how to use both the case control information as well as the phenotype covariates. This approach can be applied to detect multiple marker effects. Based on the two existing popular statistics in family studies for case-control and quantitative traits respectively, the new approach could be used in the simple family structure data set as well as general pedigree structure. The combined statistics are calculated using the two statistics; A permutation procedure is applied for assessing the p-value with adjustment from the Bonferroni for the multiple markers. We use simulation studies to evaluate the type I error rates and the powers of the proposed approach. Our results show that the combined test using both case-control information and phenotype covariates not only has the correct type I error rates but also is more powerful than the other existing methods. For multiple marker interactions, our proposed method is also very powerful. Selective genotyping is an economical strategy in detecting and mapping quantitative trait loci in the genetic dissection of complex disease. When the samples arise from different ethnic groups or an admixture population, all the existing selective genotyping methods may result in spurious association due to different ancestry distributions. The problem can be more serious when the sample size is large, a general requirement to obtain sufficient power to detect modest genetic effects for most complex traits. In part II, I describe a useful strategy in selective genotyping while population stratification is present. Our procedure used a principal component based approach to eliminate any effect of population stratification. The paper evaluates the performance of our procedure using both simulated data from an early study data sets and also the HapMap data sets in a variety of population admixture models generated from empirical data. There are one binary trait and two continuous traits in the rheumatoid arthritis dataset of Problem 1 in the Genetic Analysis Workshop 16 (GAW16): RA status, AntiCCP and IgM. To allow multiple traits, we suggest a set of SNP-level F statistics by the concept of multiple-correlation to measure the genetic association between multiple trait values and SNP-specific genotypic scores and obtain their null distributions. Hereby, we perform 6 genome-wide association analyses using the novel one- and two-stage approaches which are based on single, double and triple traits. Incorporating all these 6 analyses, we successfully validate the SNPs which have been identified to be responsible for rheumatoid arthritis in the literature and detect more disease susceptibility SNPs for follow-up studies in the future. Except for chromosome 13 and 18, each of the others is found to harbour susceptible genetic regions for rheumatoid arthritis or related diseases, i.e., lupus erythematosus. This topic is discussed in part III.
Resumo:
This study aimed to identify the microbial contamination of water from dental chair units (DCUs) using the prevalence of Pseudomonas aeruginosa, Legionella species and heterotrophic bacteria as a marker of pollution in water in the area of St. Gallen, Switzerland. Water (250 ml) from 76 DCUs was collected twice (early on a morning before using all the instruments and after using the DCUs for at least two hours) either from the high-speed handpiece tube, the 3 in 1 syringe or the micromotor for water quality testing. An increased bacterial count (>300 CFU/ml) was found in 46 (61%) samples taken before use of the DCU, but only in 29 (38%) samples taken two hours after use. Pseudomonas aeruginosa was found in both water samples in 6/76 (8%) of the DCUs. Legionella were found in both samples in 15 (20%) of the DCUs tested. Legionella anisa was identified in seven samples and Legionella pneumophila was found in eight. DCUs which were less than five years old were contaminated less often than older units (25% und 77%, p<0.001). This difference remained significant (0=0.0004) when adjusted for manufacturer and sampling location in a multivariable logistic regression. A large proportion of the DCUs tested did not comply with the Swiss drinking water standards nor with the recommendations of the American Centers for Disease Control and Prevention (CDC).
Resumo:
BACKGROUND The coping resources questionnaire for back pain (FBR) uses 12 items to measure the perceived helpfulness of different coping resources (CRs, social emotional support, practical help, knowledge, movement and relaxation, leisure and pleasure, spirituality and cognitive strategies). The aim of the study was to evaluate the instrument in a clinical patient sample assessed in a primary care setting. SAMPLE AND METHODS The study was a secondary evaluation of empirical data from a large cohort study in general practices. The 58 participating primary care practices recruited patients who reported chronic back pain in the consultation. Besides the FBR and a pain sketch, the patients completed scales measuring depression, anxiety, resilience, sociodemographic factors and pain characteristics. To allow computing of retested parameters the FBR was sent to some of the original participants again after 6 months (90% response rate). We calculated consistency and retest reliability coefficients as well as correlations between the FBR subscales and depression, anxiety and resilience scores to account for validity. By means of a cluster analysis groups with different resource profiles were formed. Results. RESULTS For the study 609 complete FBR baseline data sets could be used for statistical analysis. The internal consistency scores ranged fromα=0.58 to α=0.78 and retest reliability scores were between rTT=0.41 and rTT=0.63. Correlation with depression, fear and resilience ranged from r=-0.38 to r=0.42. The cluster analysis resulted in four groups with relatively homogenous intragroup profiles (high CRs, low spirituality, medium CRs, low CRs). The four groups differed significantly in fear and depression (the more inefficient the resources the higher the difference) as well as in resilience (the more inefficient the lower the difference). The group with low CRs also reported permanent pain with no relief. The groups did not otherwise differ. CONCLUSIONS The FBR is an economic instrument that is suitable for practical use e.g. in primary care practices to identify strengths and deficits in the CRs of chronic pain patients that can then be specified in face to face consultation. However, due to the rather low reliability, the use of subscales for profile differentiation and follow-up measurement in individual diagnoses is limited.
Resumo:
The distribution of the number of heterozygous loci in two randomly chosen gametes or in a random diploid zygote provides information regarding the nonrandom association of alleles among different genetic loci. Two alternative statistics may be employed for detection of nonrandom association of genes of different loci when observations are made on these distributions: observed variance of the number of heterozygous loci (s2k) and a goodness-of-fit criterion (X2) to contrast the observed distribution with that expected under the hypothesis of random association of genes. It is shown, by simulation, that s2k is statistically more efficient than X2 to detect a given extent of nonrandom association. Asymptotic normality of s2k is justified, and X2 is shown to follow a chi-square (chi 2) distribution with partial loss of degrees of freedom arising because of estimation of parameters from the marginal gene frequency data. Whenever direct evaluations of linkage disequilibrium values are possible, tests based on maximum likelihood estimators of linkage disequilibria require a smaller sample size (number of zygotes or gametes) to detect a given level of nonrandom association in comparison with that required if such tests are conducted on the basis of s2k. Summarization of multilocus genotype (or haplotype) data, into the different number of heterozygous loci classes, thus, amounts to appreciable loss of information.
Resumo:
Clays and claystones are used as backfill and barrier materials in the design of waste repositories, because they act as hydraulic barriers and retain contaminants. Transport through such barriers occurs mainly by molecular diffusion. There is thus an interest to relate the diffusion properties of clays to their structural properties. In previous work, we have developed a concept for up-scaling pore-scale molecular diffusion coefficients using a grid-based model for the sample pore structure. Here we present an operational algorithm which can generate such model pore structures of polymineral materials. The obtained pore maps match the rock’s mineralogical components and its macroscopic properties such as porosity, grain and pore size distributions. Representative ensembles of grains in 2D or 3D are created by a lattice Monte Carlo (MC) method, which minimizes the interfacial energy of grains starting from an initial grain distribution. Pores are generated at grain boundaries and/or within grains. The method is general and allows to generate anisotropic structures with grains of approximately predetermined shapes, or with mixtures of different grain types. A specific focus of this study was on the simulation of clay-like materials. The generated clay pore maps were then used to derive upscaled effective diffusion coefficients for non-sorbing tracers using a homogenization technique. The large number of generated maps allowed to check the relations between micro-structural features of clays and their effective transport parameters, as is required to explain and extrapolate experimental diffusion results. As examples, we present a set of 2D and 3D simulations and investigated the effects of nanopores within particles (interlayer pores) and micropores between particles. Archie’s simple power law is followed in systems with only micropores. When nanopores are present, additional parameters are required; the data reveal that effective diffusion coefficients could be described by a sum of two power functions, related to the micro- and nanoporosity. We further used the model to investigate the relationships between particle orientation and effective transport properties of the sample.
Resumo:
Environmental data sets of pollutant concentrations in air, water, and soil frequently include unquantified sample values reported only as being below the analytical method detection limit. These values, referred to as censored values, should be considered in the estimation of distribution parameters as each represents some value of pollutant concentration between zero and the detection limit. Most of the currently accepted methods for estimating the population parameters of environmental data sets containing censored values rely upon the assumption of an underlying normal (or transformed normal) distribution. This assumption can result in unacceptable levels of error in parameter estimation due to the unbounded left tail of the normal distribution. With the beta distribution, which is bounded by the same range of a distribution of concentrations, $\rm\lbrack0\le x\le1\rbrack,$ parameter estimation errors resulting from improper distribution bounds are avoided. This work developed a method that uses the beta distribution to estimate population parameters from censored environmental data sets and evaluated its performance in comparison to currently accepted methods that rely upon an underlying normal (or transformed normal) distribution. Data sets were generated assuming typical values encountered in environmental pollutant evaluation for mean, standard deviation, and number of variates. For each set of model values, data sets were generated assuming that the data was distributed either normally, lognormally, or according to a beta distribution. For varying levels of censoring, two established methods of parameter estimation, regression on normal ordered statistics, and regression on lognormal ordered statistics, were used to estimate the known mean and standard deviation of each data set. The method developed for this study, employing a beta distribution assumption, was also used to estimate parameters and the relative accuracy of all three methods were compared. For data sets of all three distribution types, and for censoring levels up to 50%, the performance of the new method equaled, if not exceeded, the performance of the two established methods. Because of its robustness in parameter estimation regardless of distribution type or censoring level, the method employing the beta distribution should be considered for full development in estimating parameters for censored environmental data sets. ^
Resumo:
Nuclear morphometry (NM) uses image analysis to measure features of the cell nucleus which are classified as: bulk properties, shape or form, and DNA distribution. Studies have used these measurements as diagnostic and prognostic indicators of disease with inconclusive results. The distributional properties of these variables have not been systematically investigated although much of the medical data exhibit nonnormal distributions. Measurements are done on several hundred cells per patient so summary measurements reflecting the underlying distribution are needed.^ Distributional characteristics of 34 NM variables from prostate cancer cells were investigated using graphical and analytical techniques. Cells per sample ranged from 52 to 458. A small sample of patients with benign prostatic hyperplasia (BPH), representing non-cancer cells, was used for general comparison with the cancer cells.^ Data transformations such as log, square root and 1/x did not yield normality as measured by the Shapiro-Wilks test for normality. A modulus transformation, used for distributions having abnormal kurtosis values, also did not produce normality.^ Kernel density histograms of the 34 variables exhibited non-normality and 18 variables also exhibited bimodality. A bimodality coefficient was calculated and 3 variables: DNA concentration, shape and elongation, showed the strongest evidence of bimodality and were studied further.^ Two analytical approaches were used to obtain a summary measure for each variable for each patient: cluster analysis to determine significant clusters and a mixture model analysis using a two component model having a Gaussian distribution with equal variances. The mixture component parameters were used to bootstrap the log likelihood ratio to determine the significant number of components, 1 or 2. These summary measures were used as predictors of disease severity in several proportional odds logistic regression models. The disease severity scale had 5 levels and was constructed of 3 components: extracapsulary penetration (ECP), lymph node involvement (LN+) and seminal vesicle involvement (SV+) which represent surrogate measures of prognosis. The summary measures were not strong predictors of disease severity. There was some indication from the mixture model results that there were changes in mean levels and proportions of the components in the lower severity levels. ^
Resumo:
The light scattering properties of oceanic particles have been suggested as an alternative index of phytoplankton biomass than chlorophyll-a concentration (chl-a), with the benefit of being less sensitive to physiological forcings (e.g., light and nutrients) that alter the intracellular pigment concentrations. The drawback of particulate scattering is that it is not unique to phytoplankton. Nevertheless, field studies have demonstrated that, to first order, the particulate beam-attenuation coefficient (c(p)) can track phytoplankton biomass. The relationship between c(p) and the particulate backscattering coefficient (b(bp)), a property retrievable from space, has not been fully evaluated, largely due to a lack of open-ocean field observations. Here, we present extensive data on inherent optical properties from the Equatorial Pacific surface waters and demonstrate a remarkable coherence in b(bp) and c(p). Coincident measurements of particle size distributions (PSDs) and optical properties of size-fractionated samples indicate that this covariance is due to both the conserved nature of the PSD and a greater contribution of phytoplankton-sized particles to b(bp) than theoretically predicted. These findings suggest that satellite-derived b(bp)could provide similar information on phytoplankton biomass in the open ocean as c(p).
Resumo:
A 272-ha grove of dominant Microberlinia bisulcata (Caesalpinioideae) adult trees greater than or equal to 50 cm stem diameter was mapped in its entirety in the southern part of Korup National Park, Cameroon. The approach used an earlier-established 82.5-ha permanent plot with a new surrounding 50-m grid of transect lines. Tree diameters were available from the plot but trees on the grid were recorded as being greater than or equal to 50 cm. The grove consisted of 1028 trees in 2000. Other species occurred within the grove. including the associated subdominants Tetraberlinia bifoliolata and T. korupensis. Microberlinia bisulcata becomes adult at a stein diameter of c. 50 cm and at an estimated age of 50 y. Three oval-shaped subgroves with dimensions c. 8 50 in x 13 50 in (90 ha) were defined. For two of them (within the plot) tree diameters were available. Subgroves differed in their scales and intensities of spatial tree patterns, and in their size frequency distributions, these suggesting differing past dynamics. The modal scale of clumping was 40-50 m. Seed dispersal by pod ejection (to c. 50 in) was evident from the semi-circles of trees at the grove's edge and from the many internal circles (100-200 m diameter). The grove has the capacity. therefore, to increase at c. 100 m per century. To form its present extent and structure. it is inferred that it expanded and infilled from a possibly smaller area of lower adult-tree density. This possibly happened in three waves of recruitment, each one determined by a period of several intense disturbances. Climate records for Africa show that 1740-50 and 1820-30 were periods of drought, and that 1870-1895 was also regionally very dry. Canopy openings allow the light-demanding and fast-growing ectomycorrhizal M. bisulcata to establish, but successive releases are thought to be required to achieve effective recruitment. Nevertheless, in the last 50 y there were no major events and recruitment in the grove was very poor. This present study leads to a new hypothesis of the role of periods of multiple extreme events being the driving factor for the population dynamics of many large African tree species such as M. bisulcata.
Resumo:
A search for nonresonant new phenomena, originating from either contact interactions or large extra spatial dimensions, has been carried out using events with two isolated electrons or muons. These events, produced at the LHC in proton-proton collisions at root s = 7 TeV, were recorded by the ATLAS detector. The data sample, collected throughout 2011, corresponds to an integrated luminosity of 4.9 and 5.0 fb(-1) in the e(+)e(-) and mu(+)mu(-) channels, respectively. No significant deviations from the Standard Model expectation are observed. Using a Bayesian approach, 95% confidence level lower limits ranging from 9.0 to 13.9 TeV are placed on the energy scale of llqq contact interactions in the left-left isoscalar model. Lower limits ranging from 2.4 to 3.9 TeV are also set on the string scale in large extra dimension models. After combining these limits with results from a similar search in the diphoton channel, slightly more stringent limits are obtained.
Resumo:
Measurements are presented of differential cross-sections for top quark pair production in pp collisions at root s = 7 TeV relative to the total inclusive top quark pair production cross-section. A data sample of 2.05 fb(-1) recorded by the ATLAS detector at the Large Hadron Collider is used. Relative differential cross-sections are derived as a function of the invariant mass, the transverse momentum and the rapidity of the top quark pair system. Events are selected in the lepton (electron or muon) + jets channel. The background-subtracted differential distributions are corrected for detector effects, normalized to the total inclusive top quark pair production cross-section and compared to theoretical predictions. The measurement uncertainties range typically between 10 % and 20 % and are generally dominated by systematic effects. No significant deviations from the Standard Model expectations are observed.
Resumo:
The comparison of radiotherapy techniques regarding secondary cancer risk has yielded contradictory results possibly stemming from the many different approaches used to estimate risk. The purpose of this study was to make a comprehensive evaluation of different available risk models applied to detailed whole-body dose distributions computed by Monte Carlo for various breast radiotherapy techniques including conventional open tangents, 3D conformal wedged tangents and hybrid intensity modulated radiation therapy (IMRT). First, organ-specific linear risk models developed by the International Commission on Radiological Protection (ICRP) and the Biological Effects of Ionizing Radiation (BEIR) VII committee were applied to mean doses for remote organs only and all solid organs. Then, different general non-linear risk models were applied to the whole body dose distribution. Finally, organ-specific non-linear risk models for the lung and breast were used to assess the secondary cancer risk for these two specific organs. A total of 32 different calculated absolute risks resulted in a broad range of values (between 0.1% and 48.5%) underlying the large uncertainties in absolute risk calculation. The ratio of risk between two techniques has often been proposed as a more robust assessment of risk than the absolute risk. We found that the ratio of risk between two techniques could also vary substantially considering the different approaches to risk estimation. Sometimes the ratio of risk between two techniques would range between values smaller and larger than one, which then translates into inconsistent results on the potential higher risk of one technique compared to another. We found however that the hybrid IMRT technique resulted in a systematic reduction of risk compared to the other techniques investigated even though the magnitude of this reduction varied substantially with the different approaches investigated. Based on the epidemiological data available, a reasonable approach to risk estimation would be to use organ-specific non-linear risk models applied to the dose distributions of organs within or near the treatment fields (lungs and contralateral breast in the case of breast radiotherapy) as the majority of radiation-induced secondary cancers are found in the beam-bordering regions.
Resumo:
This paper analyses local geographical contexts targeted by transnational large-scale land acquisitions (>200 ha per deal) in order to understand how emerging patterns of socio-ecological characteristics can be related to processes of large-scale foreign investment in land. Using a sample of 139 land deals georeferenced with high spatial accuracy, we first analyse their target contexts in terms of land cover, population density, accessibility, and indicators for agricultural potential. Three distinct patterns emerge from the analysis: densely populated and easily accessible croplands (35% of land deals); remote forestlands with lower population densities (34% of land deals); and moderately populated and moderately accessible shrub- or grasslands (26% of land deals). These patterns are consistent with processes described in the relevant case study literature, and they each involve distinct types of stakeholders and associated competition over land. We then repeat the often-cited analysis that postulates a link between land investments and target countries with abundant so-called “idle” or “marginal” lands as measured by yield gap and available suitable but uncultivated land; our methods differ from the earlier approach, however, in that we examine local context (10-km radius) rather than countries as a whole. The results show that earlier findings are disputable in terms of concepts, methods, and contents. Further, we reflect on methodologies for exploring linkages between socioecological patterns and land investment processes. Improving and enhancing large datasets of georeferenced land deals is an important next step; at the same time, careful choice of the spatial scale of analysis is crucial for ensuring compatibility between the spatial accuracy of land deal locations and the resolution of available geospatial data layers. Finally, we argue that new approaches and methods must be developed to empirically link socio-ecological patterns in target contexts to key determinants of land investment processes. This would help to improve the validity and the reach of our findings as an input for evidence-informed policy debates.
Resumo:
A measurement of charged-particle distributions sensitive to the properties of the underlying event is presented for an inclusive sample of events containing a Z-boson, decaying to an electron or muon pair. The measurement is based on data collected using the ATLAS detector at the LHC in proton–proton collisions at a centre-of-mass energy of 7 TeV with an integrated luminosity of 4.6fb−1. Distributions of the charged particle multiplicity and of the charged particle transverse momentum are measured in regions of azimuthal angle defined with respect to the Z-boson direction. The measured distributions are compared to similar distributions measured in jet events, and to the predictions of various Monte Carlo generators implementing different underlying event models.