33 resultados para categorical and mix datasets

em University of Queensland eSpace - Australia


Relevância:

100.00% 100.00%

Publicador:

Resumo:

There is substantial disagreement among published epidemiological studies regarding environmental risk factors for Parkinson’s disease (PD). Differences in the quality of measurement of environmental exposures may contribute to this variation. The current study examined the test–retest repeatability of self-report data on risk factors for PD obtained from a series of 32 PD cases recruited from neurology clinics and 29 healthy sex-, age-and residential suburb-matched controls. Exposure data were collected in face-to-face interviews using a structured questionnaire derived from previous epidemiological studies. High repeatability was demonstrated for ‘lifestyle’ exposures, such as smoking and coffee/tea consumption (kappas 0.70–1.00). Environmental exposures that involved some action by the person, such as pesticide application and use of solvents and metals, also showed high repeatability (kappas>0.78). Lower repeatability was seen for rural residency and bore water consumption (kappa 0.39–0.74). In general, we found that case and control participants provided similar rates of incongruent and missing responses for categorical and continuous occupational, domestic, lifestyle and medical exposures.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The notorious "dimensionality curse" is a well-known phenomenon for any multi-dimensional indexes attempting to scale up to high dimensions. One well-known approach to overcome degradation in performance with respect to increasing dimensions is to reduce the dimensionality of the original dataset before constructing the index. However, identifying the correlation among the dimensions and effectively reducing them are challenging tasks. In this paper, we present an adaptive Multi-level Mahalanobis-based Dimensionality Reduction (MMDR) technique for high-dimensional indexing. Our MMDR technique has four notable features compared to existing methods. First, it discovers elliptical clusters for more effective dimensionality reduction by using only the low-dimensional subspaces. Second, data points in the different axis systems are indexed using a single B+-tree. Third, our technique is highly scalable in terms of data size and dimension. Finally, it is also dynamic and adaptive to insertions. An extensive performance study was conducted using both real and synthetic datasets, and the results show that our technique not only achieves higher precision, but also enables queries to be processed efficiently. Copyright Springer-Verlag 2005

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We have undertaken two-dimensional gel electrophoresis proteomic profiling on a series of cell lines with different recombinant antibody production rates. Due to the nature of gel-based experiments not all protein spots are detected across all samples in an experiment, and hence datasets are invariably incomplete. New approaches are therefore required for the analysis of such graduated datasets. We approached this problem in two ways. Firstly, we applied a missing value imputation technique to calculate missing data points. Secondly, we combined a singular value decomposition based hierarchical clustering with the expression variability test to identify protein spots whose expression correlates with increased antibody production. The results have shown that while imputation of missing data was a useful method to improve the statistical analysis of such data sets, this was of limited use in differentiating between the samples investigated, and highlighted a small number of candidate proteins for further investigation. (c) 2006 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Australian Soil Resources Information System (ASRIS) database compiles the best publicly available information available across Commonwealth, State, and Territory agencies into a national database of soil profile data, digital soil and land resources maps, and climate, terrain, and lithology datasets. These datasets are described in detail in this paper. Most datasets are thematic grids that cover the intensively used agricultural zones in Australia.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Large chemical libraries can be synthesized on solid-support beads by the combinatorial split-and-mix method. A major challenge associated with this type of library synthesis is distinguishing between the beads and their attached compounds. A new method of encoding these solid-support beads, 'colloidal bar-coding', involves attaching fluorescent silica colloids ('reporters') to the beads as they pass through the compound synthesis, thereby creating a fluorescent bar code on each bead. In order to obtain sufficient reporter varieties to bar code extremely large libraries, many of the reporters must contain multiple fluorescent dyes. We describe here the synthesis and spectroscopic analysis of various mono- and multi-fluorescent silica particles for this purpose. It was found that by increasing the amount of a single dye introduced into the particle reaction mixture, mono- fluorescent silica particles of increasing intensities could be prepared. This increase was highly reproducible and was observed for six different fluorescent dyes. Multi-fluorescent silica particles containing up to six fluorescent dyes were also prepared. The resultant emission intensity of each dye in the multi-fluorescent particles was found to be dependent upon a number of factors; the hydrolysis rate of each silane-dye conjugate, the magnitude of the inherent emission intensity of each dye within the silica matrix, and energy transfer effects between dyes. We show that by varying the relative concentration of each silane-dye conjugate in the synthesis of multi-fluorescent particles, it is possible to change and optimize the resultant emission intensity of each dye to enable viewing in a fluorescence detection instrument.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

It is generally accepted that two major gene pools exist in cultivated common bean (Phaseolus vulgaris L.), a Middle American and an Andean one. Some evidence, based on unique phaseolin morphotypes and AFLP analysis, suggests that at least one more gene pool exists in cultivated common bean. To investigate this hypothesis, 1072 accessions from a common bean core collection from the primary centres of origin, held at CIAT, were investigated. Various agronomic and morphological attributes (14 categorical and 11 quantitative) were measured. Multivariate analyses, consisting of homogeneity analysis and clustering for categorical data, clustering and ordination techniques for quantitative data and nonlinear principal component analysis for mixed data, were undertaken. The results of most analyses supported the existence of the two major gene pools. However, the analysis of categorical data of protein types showed an additional minor gene pool. The minor gene pool is designated North Andean and includes phaseolin types CH, S and T; lectin types 312, Pr, B and K; and mostly A5, A6 and A4 types alpha-amylase inhibitor. Analysis of the combined categorical data of protein types and some plant categorical data also suggested that some other germplasm with C type phaseolin are distinguished from the major gene pools.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The ability to generate enormous random libraries of DNA probes via split-and-mix synthesis on solid supports is an important biotechnological application of colloids that has not been fully utilized to date. To discriminate between colloid-based DNA probes each colloidal particle must be 'encoded' so it is distinguishable from all other particles. To this end, we have used novel particle synthesis strategies to produce large numbers of optically encoded particle suitable for DNA library synthesis. Multifluorescent particles with unique and reproducible optical signatures (i.e., fluorescence and light-scattering attributes) suitable for high-throughput flow cytometry have been produced. In the spectroscopic study presented here, we investigated the optical characteristics of multi-fluorescent particles that were synthesized by coating silica 'core' particles with up to six different fluorescent dye shells alternated with non-fluorescent silica 'spacer' shells. It was observed that the diameter of the particles increased by up to 20% as a result of the addition of twelve concentric shells and that there was a significant reduction in fluorescence emission intensities from inner shells as an increasing number of shells were deposited.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: It is generally assumed that fascial defects in the rectovaginal septum are the result of childbirth. However, rectoceles do occur in women who have never delivered vaginally. Aims: To determine the incidence of rectocele in a cohort of asymptomatic, young nulliparous women. Methods: Observational cohort study on 178 nulliparous caucasian women (aged 18-24) recruited for a twin study of pelvic floor dysfunction. All women were interviewed and examined by translabial ultrasound, supine and after voiding. In 52 women, 3D imaging was obtained and 171 datasets were complete and available for analysis. Ultrasound findings were reviewed for rectovaginal septal integrity by an assessor blinded against interview and demographic data for rectovaginal septal integrity. Results: A discontinuity of the anterior rectal wall with extrusion of rectal mucosa or contents (depth of ! 10 mm) was observed in 21/171 (12%). The depth of this herniation ranged from 10 to 25 mm and was filled with stool (n = 10) or rectal mucosa (n = 11). Defects were associated with a higher BMI (P = 0.049), with the complaint of constipation (P = 0.049) and non-significantly with straining at stool (P = 0.09). Descent of the ampulla to beyond the level of the symphysis pubis without fascial defect, that is, significant perineal relaxation, was observed in 23/171 (13%). Conclusions: Twelve percent of 171 young nulligravid caucasian women showed a defect of the rectovaginal septum. Associations were observed with higher body mass index and a history of constipation. It is hypothesised that in some women defects of the rectovaginal septum and perineal hypermobility may be congenital in nature.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Categorical models dominate the eating disorder field, but the tandem use of categorical and dimensional models has been proposed. A transdiagnostic dimensional model, number of lifetime eating disorder behaviors (LEDB), was examined with respect to (1) its relationship to a variety of indicators of the individual's functioning, (2) the degree to which it was influenced by genetic and environmental risk factors, and (3) exposure to specific environmental risk factors. Data from self-report and interview from 1002 female twins (mean age = 34.91 years, SD = 2.09) were examined. While 15.4% women met criteria for a lifetime eating disorder, 29% had at least one LEDB. The dimensional measure provided an indicator of associated functioning, and was influenced primarily by the nonshared environment. The number of LEDB was associated with the degree of impaired functioning. This impairment was associated with conflict between parents and criticism from parents when growing up.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Crossbred ewes, weighing 30-40 kg, were assigned to three groups of six animals. One group of sheep was fed chopped oat hay (control), the second group was fed the control diet plus 30 g per head per day spray dried residue from the fermentation of molasses and the third group was fed the control diet plus 30 g per head per day of a non-protein nitrogen/mineral mix. Voluntary feed intake, digestibility of DM, OM and nitrogen, nitrogen balance and microbial nitrogen flow to the intestines were significantly increased by supplementation but efficiency of microbial protein production was not affected. (C) 2001 Elsevier Science BN. All rights reserved,

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We compare Bayesian methodology utilizing free-ware BUGS (Bayesian Inference Using Gibbs Sampling) with the traditional structural equation modelling approach based on another free-ware package, Mx. Dichotomous and ordinal (three category) twin data were simulated according to different additive genetic and common environment models for phenotypic variation. Practical issues are discussed in using Gibbs sampling as implemented by BUGS to fit subject-specific Bayesian generalized linear models, where the components of variation may be estimated directly. The simulation study (based on 2000 twin pairs) indicated that there is a consistent advantage in using the Bayesian method to detect a correct model under certain specifications of additive genetics and common environmental effects. For binary data, both methods had difficulty in detecting the correct model when the additive genetic effect was low (between 10 and 20%) or of moderate range (between 20 and 40%). Furthermore, neither method could adequately detect a correct model that included a modest common environmental effect (20%) even when the additive genetic effect was large (50%). Power was significantly improved with ordinal data for most scenarios, except for the case of low heritability under a true ACE model. We illustrate and compare both methods using data from 1239 twin pairs over the age of 50 years, who were registered with the Australian National Health and Medical Research Council Twin Registry (ATR) and presented symptoms associated with osteoarthritis occurring in joints of the hand.