999 resultados para Data amalgamation


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Modern approaches to biomedical research and diagnostics targeted towards precision medicine are generating ‘big data’ across a range of high-throughput experimental and analytical platforms. Integrative analysis of this rich clinical, pathological, molecular and imaging data represents one of the greatest bottlenecks in biomarker discovery research in cancer and other diseases. Following on from the publication of our successful framework for multimodal data amalgamation and integrative analysis, Pathology Integromics in Cancer (PICan), this article will explore the essential elements of assembling an integromics framework from a more detailed perspective. PICan, built around a relational database storing curated multimodal data, is the research tool sitting at the heart of our interdisciplinary efforts to streamline biomarker discovery and validation. While recognizing that every institution has a unique set of priorities and challenges, we will use our experiences with PICan as a case study and starting point, rationalizing the design choices we made within the context of our local infrastructure and specific needs, but also highlighting alternative approaches that may better suit other programmes of research and discovery. Along the way, we stress that integromics is not just a set of tools, but rather a cohesive paradigm for how modern bioinformatics can be enhanced. Successful implementation of an integromics framework is a collaborative team effort that is built with an eye to the future and greatly accelerates the processes of biomarker discovery, validation and translation into clinical practice.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Soil aggregation is an index of soil structure measured by mean weight diameter (MWD) or scaling factors often interpreted as fragmentation fractal dimensions (D-f). However, the MWD provides a biased estimate of soil aggregation due to spurious correlations among aggregate-size fractions and scale-dependency. The scale-invariant D-f is based on weak assumptions to allow particle counts and sensitive to the selection of the fractal domain, and may frequently exceed a value of 3, implying that D-f is a biased estimate of aggregation. Aggregation indices based on mass may be computed without bias using compositional analysis techniques. Our objective was to elaborate compositional indices of soil aggregation and to compare them to MWD and D-f using a published dataset describing the effect of 7 cropping systems on aggregation. Six aggregate-size fractions were arranged into a sequence of D-1 balances of building blocks that portray the process of soil aggregation. Isometric log-ratios (ilrs) are scale-invariant and orthogonal log contrasts or balances that possess the Euclidean geometry necessary to compute a distance between any two aggregation states, known as the Aitchison distance (A(x,y)). Close correlations (r>0.98) were observed between MWD, D-f, and the ilr when contrasting large and small aggregate sizes. Several unbiased embedded ilrs can characterize the heterogeneous nature of soil aggregates and be related to soil properties or functions. Soil bulk density and penetrater resistance were closely related to A(x,y) with reference to bare fallow. The A(x,y) is easy to implement as unbiased index of soil aggregation using standard sieving methods and may allow comparisons between studies. (C) 2012 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

One of the tantalising remaining problems in compositional data analysis lies in how to deal with data sets in which there are components which are essential zeros. By an essential zero we mean a component which is truly zero, not something recorded as zero simply because the experimental design or the measuring instrument has not been sufficiently sensitive to detect a trace of the part. Such essential zeros occur in many compositional situations, such as household budget patterns, time budgets, palaeontological zonation studies, ecological abundance studies. Devices such as nonzero replacement and amalgamation are almost invariably ad hoc and unsuccessful in such situations. From consideration of such examples it seems sensible to build up a model in two stages, the first determining where the zeros will occur and the second how the unit available is distributed among the non-zero parts. In this paper we suggest two such models, an independent binomial conditional logistic normal model and a hierarchical dependent binomial conditional logistic normal model. The compositional data in such modelling consist of an incidence matrix and a conditional compositional matrix. Interesting statistical problems arise, such as the question of estimability of parameters, the nature of the computational process for the estimation of both the incidence and compositional parameters caused by the complexity of the subcompositional structure, the formation of meaningful hypotheses, and the devising of suitable testing methodology within a lattice of such essential zero-compositional hypotheses. The methodology is illustrated by application to both simulated and real compositional data

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Simpson's paradox, also known as amalgamation or aggregation paradox, appears when dealing with proportions. Proportions are by construction parts of a whole, which can be interpreted as compositions assuming they only carry relative information. The Aitchison inner product space structure of the simplex, the sample space of compositions, explains the appearance of the paradox, given that amalgamation is a nonlinear operation within that structure. Here we propose to use balances, which are specific elements of this structure, to analyse situations where the paradox might appear. With the proposed approach we obtain that the centre of the tables analysed is a natural way to compare them, which avoids by construction the possibility of a paradox. Key words: Aitchison geometry, geometric mean, orthogonal projection

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The studied sector of the central Ribeira Fold Belt (SE Brazil) comprises metatexites, diatexites, charnockites and blastomylonites. This study integrates petrological and thermochronological data in order to constrain the thermotectonic and geodynamic evolution of this Neoproterozoic-Ordovician mobile belt during Western Gondwana amalgamation. New data indicate that after an earlier collision stage at similar to 610 Ma (zircon, U-Pb age), peak metamorphism and lower crust partial melting, coeval with the main regional high grade D(1) thrust deformation, occurred at 572-562 Ma (zircon, U-Pb ages). The overall average cooling rate was low (<5 degrees C/Ma) from 750 to 250 degrees C (at similar to 455 Ma; biotite-WR Rb-Sr age), but disparate cooling paths indicate differential uplift between distinct lithotypes: (a) metatexites and blastomylonites show a overall stable 3-5 degrees C/Ma cooling rate; (b) charnockites and associated rocks remained at T>650 degrees C during sub-horizontal D(2) shearing until similar to 510-470 Ma (garnet-WR Sm-Nd ages) (1-2 degrees C/Ma), being then rapidly exhumed/cooled (8-30 degrees C/Ma) during post-orogenic D(3) deformation with late granite emplacement at similar to 490 Ma (zircon, U-Pb age). Cooling rates based on garnet-biotite Fe-Mg diffusion are broadly consistent with the geochronological cooling rates: (a) metatexites were cooled faster at high temperatures (6 degrees C/Ma) and slowly at low temperatures (0.1 degrees C/Ma), decreasing cooling rates with time; (b) charnockites show low cooling rates (2 degrees C/Ma) near metamorphic peak conditions and high cooling rates (120 degrees C/Ma) at lower temperatures, increasing cooling rates during retrogression. The charnockite thermal evolution and the extensive production of granitoid melts in the area imply that high geothermal gradients were sustained fora long period of time (50-90 Ma). This thermal anomaly most likely reflects upwelling of asthenospheric mantle and magma underplating coupled with long-term generation of high HPE (heat producing elements) granitoids. These factors must have sustained elevated crustal geotherms for similar to 100 Ma, promoting widespread charnockite generation at middle to lower crustal levels. (C) 2010 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Ibituruna quartz-syenite was emplaced as a sill in the Ribeira-Aracuai Neoproterozoic belt (Southeastern Brazil) during the last stages of the Gondwana supercontinent amalgamation. We have measured the Anisotropy of Magnetic Susceptibility (AMS) in samples from the Ibituruna sill to unravel its magnetic fabric that is regarded as a proxy for its magmatic fabric. A large magnetic anisotropy, dominantly due to magnetite, and a consistent magnetic fabric have been determined over the entire Ibituruna massif. The magmatic foliation and lineation are strikingly parallel to the solid-state mylonitic foliation and lineation measured in the country-rock. Altogether, these observations suggest that the Ibituruna sill was emplaced during the high temperature (similar to 750 degrees C) regional deformation and was deformed before full solidification coherently with its country-rock. Unexpectedly, geochronological data suggest a rather different conclusion. LA-ICP-MS and SHRIMP ages of zircons from the Ibituruna quartz-syenite are in the range 530-535 Ma and LA-ICP-MS ages of zircons and monazites from synkinematic leucocratic veins in the country-rocks suggest a crystallization at similar to 570-580 Ma, i.e., an HT deformation >35My older than the emplacement of the Ibituruna quartz-syenite. Conclusions from the structural and the geochronological studies are therefore conflicting. A possible explanation arises from (40)Ar-(39)Ar thermochronology. We have dated amphiboles from the quartz-syenite, and amphiboles and biotites from the country-rock. Together with the ages of monazites and zircons in the country-rock, (40)Ar-(39)Ar mineral ages suggest a very low cooling rate: <3 degrees C/My between 570 and similar to 500 Ma and similar to 5 degrees C/My between 500 and 460 Ma. Assuming a protracted regional deformation consistent over tens of My, under such stable thermal conditions the fabric and microstructure of deformed rocks may remain almost unchanged even if they underwent and recorded strain pulses separated by long periods of time. This may be a characteristic of slow cooling ""hot orogens"" that rocks deformed at significantly different periods during the orogeny, but under roughly unchanged temperature conditions, may display almost indiscernible microstructure and fabric. (C) 2009 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Recent data indicate that levels of overweight and obesity are increasing at an alarming rate throughout the world. At a population level (and commonly to assess individual health risk), the prevalence of overweight and obesity is calculated using cut-offs of the Body Mass Index (BMI) derived from height and weight. Similarly, the BMI is also used to classify individuals and to provide a notional indication of potential health risk. It is likely that epidemiologic surveys that are reliant on BMI as a measure of adiposity will overestimate the number of individuals in the overweight (and slightly obese) categories. This tendency to misclassify individuals may be more pronounced in athletic populations or groups in which the proportion of more active individuals is higher. This differential is most pronounced in sports where it is advantageous to have a high BMI (but not necessarily high fatness). To illustrate this point we calculated the BMIs of international professional rugby players from the four teams involved in the semi-finals of the 2003 Rugby Union World Cup. According to the World Health Organisation (WHO) cut-offs for BMI, approximately 65% of the players were classified as overweight and approximately 25% as obese. These findings demonstrate that a high BMI is commonplace (and a potentially desirable attribute for sport performance) in professional rugby players. An unanswered question is what proportion of the wider population, classified as overweight (or obese) according to the BMI, is misclassified according to both fatness and health risk? It is evident that being overweight should not be an obstacle to a physically active lifestyle. Similarly, a reliance on BMI alone may misclassify a number of individuals who might otherwise have been automatically considered fat and/or unfit.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, a singularly perturbed ordinary differential equation with non-smooth data is considered. The numerical method is generated by means of a Petrov-Galerkin finite element method with the piecewise-exponential test function and the piecewise-linear trial function. At the discontinuous point of the coefficient, a special technique is used. The method is shown to be first-order accurate and singular perturbation parameter uniform convergence. Finally, numerical results are presented, which are in agreement with theoretical results.