158 resultados para Data-stream balancing
Resumo:
The effect of number of samples and selection of data for analysis on the calculation of surface motor unit potential (SMUP) size in the statistical method of motor unit number estimates (MUNE) was determined in 10 normal subjects and 10 with amyotrophic lateral sclerosis (ALS). We recorded 500 sequential compound muscle action potentials (CMAPs) at three different stable stimulus intensities (10–50% of maximal CMAP). Estimated mean SMUP sizes were calculated using Poisson statistical assumptions from the variance of 500 sequential CMAP obtained at each stimulus intensity. The results with the 500 data points were compared with smaller subsets from the same data set. The results using a range of 50–80% of the 500 data points were compared with the full 500. The effect of restricting analysis to data between 5–20% of the CMAP and to standard deviation limits was also assessed. No differences in mean SMUP size were found with stimulus intensity or use of different ranges of data. Consistency was improved with a greater sample number. Data within 5% of CMAP size gave both increased consistency and reduced mean SMUP size in many subjects, but excluded valid responses present at that stimulus intensity. These changes were more prominent in ALS patients in whom the presence of isolated SMUP responses was a striking difference from normal subjects. Noise, spurious data, and large SMUP limited the Poisson assumptions. When these factors are considered, consistent statistical MUNE can be calculated from a continuous sequence of data points. A 2 to 2.5 SD or 10% window are reasonable methods of limiting data for analysis. Muscle Nerve 27: 320–331, 2003
Resumo:
In microarray studies, the application of clustering techniques is often used to derive meaningful insights into the data. In the past, hierarchical methods have been the primary clustering tool employed to perform this task. The hierarchical algorithms have been mainly applied heuristically to these cluster analysis problems. Further, a major limitation of these methods is their inability to determine the number of clusters. Thus there is a need for a model-based approach to these. clustering problems. To this end, McLachlan et al. [7] developed a mixture model-based algorithm (EMMIX-GENE) for the clustering of tissue samples. To further investigate the EMMIX-GENE procedure as a model-based -approach, we present a case study involving the application of EMMIX-GENE to the breast cancer data as studied recently in van 't Veer et al. [10]. Our analysis considers the problem of clustering the tissue samples on the basis of the genes which is a non-standard problem because the number of genes greatly exceed the number of tissue samples. We demonstrate how EMMIX-GENE can be useful in reducing the initial set of genes down to a more computationally manageable size. The results from this analysis also emphasise the difficulty associated with the task of separating two tissue groups on the basis of a particular subset of genes. These results also shed light on why supervised methods have such a high misallocation error rate for the breast cancer data.
Resumo:
For zygosity diagnosis in the absence of genotypic data, or in the recruitment phase of a twin study where only single twins from same-sex pairs are being screened, or to provide a test for sample duplication leading to the false identification of a dizygotic pair as monozygotic, the appropriate analysis of respondents' answers to questions about zygosity is critical. Using data from a young adult Australian twin cohort (N = 2094 complete pairs and 519 singleton twins from same-sex pairs with complete responses to all zygosity items), we show that application of latent class analysis (LCA), fitting a 2-class model, yields results that show good concordance with traditional methods of zygosity diagnosis, but with certain important advantages. These include the ability, in many cases, to assign zygosity with specified probability on the basis of responses of a single informant (advantageous when one zygosity type is being oversampled); and the ability to quantify the probability of misassignment of zygosity, allowing prioritization of cases for genotyping as well as identification of cases of probable laboratory error. Out of 242 twins (from 121 like-sex pairs) where genotypic data were available for zygosity confirmation, only a single case was identified of incorrect zygosity assignment by the latent class algorithm. Zygosity assignment for that single case was identified by the LCA as uncertain (probability of being a monozygotic twin only 76%), and the co-twin's responses clearly identified the pair as dizygotic (probability of being dizygotic 100%). In the absence of genotypic data, or as a safeguard against sample duplication, application of LCA for zygosity assignment or confirmation is strongly recommended.
Resumo:
Acetohydroxyacid synthase (AHAS, EC 4.1.3.18) catalyses the first step in branched-chain amino acid biosynthesis and is the target for sulfonylurea and imidazolinone herbicides, which act as potent and specific inhibitors. Mutants of the enzyme have been identified that are resistant to particular herbicides. However, the selectivity of these mutants towards various sulfonylureas and imidazolinones has not been determined systematically. Now that the structure of the yeast enzyme is known, both in the absence and presence of a bound herbicide, a detailed understanding of the molecular interactions between the enzyme and its inhibitors becomes possible. Here we construct 10 active mutants of yeast AHAS, purify the enzymes and determine their sensitivity to six sulfonylureas and three imidazolinones. An additional three active mutants were constructed with a view to increasing imidazolinone sensitivity. These three variants were purified and tested for their sensitivity to the imidazolinones only. Substantial differences are observed in the sensitivity of the 13 mutants to the various inhibitors and these differences are interpreted in terms of the structure of the herbicide-binding site on the enzyme.
Resumo:
The Caridina indistincta complex is a group of closely related atyid shrimps that inhabit coastal freshwater streams throughout north-eastern Australia. Using mitochondrial DNA sequence data (cytochrome oxidase 1, CO1), we (1) inferred the timing of speciation in the C. indistincta group and (2) examined the intraspecific phylogeographic patterns within the group. Assuming a shrimp-specific rate of CO1 evolution, the level of sequence divergence among species suggests that speciation took place during the Miocene epoch. Within one widespread mainland species, phylogeographic patterns suggest strong geographic 'regionalisation' of mtDNA lineages that are most likely of Pleistocene origin. By contrast, another species comprises two highly divergent mtDNA lineages that occur in sympatry. We suggest that although Pleistocene sea-level regressions appear important in generating population-level phylogeographic patterns, these events were largely unimportant in the formation of species in this group.
Resumo:
The design of randomized controlled trials entails decisions that have economic as well as statistical implications. In particular, the choice of an individual or cluster randomization design may affect the cost of achieving the desired level of power, other things being equal. Furthermore, if cluster randomization is chosen, the researcher must decide how to balance the number of clusters, or sites, and the size of each site. This article investigates these interrelated statistical and economic issues. Its principal purpose is to elucidate the statistical and economic trade-offs to assist researchers to employ randomized controlled trials that have desired economic, as well as statistical, properties. (C) 2003 Elsevier Inc. All rights reserved.
Resumo:
A bituminous coal was pyrolyzed in a nitrogen stream in an entrained flow reactor at various temperatures from 700 to 1475 degreesC. Char samples were collected at different positions along the reactor. Each collected sample was oxidized nonisothermally in a TGA for reactivity determination. The reactivity of the coal char was found to decrease rapidly with residence time until 0.5 s, after which it decreased only slightly. On the bases of the reactivity data at various temperatures, a new approach was utilized to obtaining the true activation energy distribution function for thermal annealing without the assumption of any distribution function form or a constant preexponential factor. It appears that the true activation energy distribution function consists of two separate parts corresponding to different temperature ranges, suggesting different mechanisms in different temperature ranges. Partially burnt coal chars were also collected along the reactor when the coal was oxidized in air at various temperatures from 700 to 1475 degreesC. The collected samples were analyzed for the residual carbon content and the specific reaction rate was estimated. The characteristic time of thermal deactivation was compared with that of oxidation under realistic conditions. The characteristic times were found to be close to each other, indicating the importance of thermal deactivation during combustion of the coal studied.