Biblioteca Digital

990 results for imbalanced data

Epidemiology of pulmonary hypertension: new data from the Swiss registry.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Since 1999, data from pulmonary hypertension (PH) patients from all PH centres in Switzerland have been prospectively collected. We analyse the epidemiological aspects of these data. PH was defined as a mean pulmonary artery pressure of >25 mm Hg at rest or >30 mm Hg during exercise. Patients with pulmonary arterial hypertension (PAH), PH associated with lung diseases, PH due to chronic thrombotic and/or embolic disease (CTEPH), or PH due to miscellaneous disorders were registered. Data from adult patients included between January 1999 and December 2004 were analysed. 250 patients were registered (age 58 +/- 16 years, 104 (41%) males). 152 patients (61%) had PAH, 73 (29%) had CTEPH and 18 (7%) had PH associated with lung disease. Patients <50 years (32%) were more likely to have PAH than patients >50 years (76% vs. 53%, p <0.005). Twenty-four patients (10%) were lost to follow-up, 58 patients (26%) died and 150 (66%) survived without transplantation or thrombendarterectomy. Survivors differed from patients who died in the baseline six-minute walking distance (400 m [300-459] vs. 273 m [174-415]), the functional impairment (NYHA class III/IV 86% vs. 98%), mixed venous saturation (63% [57-68] vs. 56% [50-61]) and right atrial pressure (7 mm Hg [4-11] vs. 11 mm Hg [4-18]). PH is a disease affecting adults of all ages. The management of these patients in specialised centres guarantees a high quality of care. Analysis of the registry data could be an instrument for quality control and might help identify weak points in the assessment and treatment of these patients.

Variation of the Chlorophyll a Related to Sea Surface Temperature, Wind and Geostrophic Currents in the Cape Verde Region Using Satellite Data

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We present a comparative analysis of satellite-derived climatologies in the Cape Verde region (CV). To establish chlorophyll a variability in relation to other oceanographic phenomena, a set of relatively long (five to eight years) time series of chlorophyll a, sea surface temperature, wind and geostrophic currents was assembled for the Eastern Central Atlantic (ECA). We studied the seasonal and inter-annual variability of phytoplankton concentration in relation to the other variables, with a special focus on CV. We compared the situation within the archipelago with those of the surrounding marine environments, such as the North West African Upwelling (NWAU), North Atlantic Subtropical Gyre (NASTG), North Equatorial Counter Current (NECC) and Guinea Dome (GD). At the seasonal scale, the CV region behaves partly like the surrounding areas; nevertheless, some autochthonous features were also found. The maximum peak of the pigment, which has a positive correlation with temperature, is found at the end of the year for all the points in the archipelago; a less remarkable rise with negative correlation is also detected in February for points CV2 and CV4. None of the surrounding environments showed this behavior. This enrichment was found to be preceded by a drastic drop in wind intensity (SW Monsoon) during the summer months. The inter-annual analysis shows a decreasing trend in chlorophyll a concentration.

Some last thoughts on compositional data analysis

Relevance:

20.00% 20.00%

Publisher:

Abstract:

One of the disadvantages of old age is that there is more past than future: this, however, may be turned into an advantage if the wealth of experience and, hopefully, wisdom gained in the past can be reflected upon and throw some light on possible future trends. To an extent, then, this talk is necessarily personal, certainly nostalgic, but also self-critical and inquisitive about our understanding of the discipline of statistics. A number of almost philosophical themes will run through the talk: search for appropriate modelling in relation to the real problem envisaged, emphasis on sensible balances between simplicity and complexity, the relative roles of theory and practice, the nature of communication of inferential ideas to the statistical layman, the inter-related roles of teaching, consultation and research. A list of keywords might be: identification of sample space and its mathematical structure, choices between transform and stay, the role of parametric modelling, the role of a sample space metric, the underused hypothesis lattice, the nature of compositional change, particularly in relation to the modelling of processes. While the main theme will be relevance to compositional data analysis we shall point to substantial implications for general multivariate analysis arising from experience of the development of compositional data analysis…

Compositional Data in Biomedical Research

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Modern methods of compositional data analysis are not well known in biomedical research. Moreover, there appear to be few mathematical and statistical researchers working on compositional biomedical problems. Like the earth and environmental sciences, biomedicine has many problems in which the relevant scientific information is encoded in the relative abundance of key species or categories. I introduce three problems in cancer research in which analysis of compositions plays an important role. The problems involve 1) the classification of serum proteomic profiles for early detection of lung cancer, 2) inference of the relative amounts of different tissue types in a diagnostic tumor biopsy, and 3) the subcellular localization of the BRCA1 protein, and its role in breast cancer patient prognosis. For each of these problems I outline a partial solution. However, none of these problems is "solved". I attempt to identify areas in which additional statistical development is needed with the hope of encouraging more compositional data analysts to become involved in biomedical research.

Compositions in life science data

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The aim of this talk is to convince the reader that there are a lot of interesting statistical problems in present-day life science data analysis which seem ultimately connected with compositional statistics. Key words: SAGE, cDNA microarrays, (1D-)NMR, virus quasispecies

High-resolution 3D whole-heart coronary MRA: a study on the combination of data acquisition in multiple breath-holds and 1D residual respiratory motion compensation.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

OBJECT: To study a scan protocol for coronary magnetic resonance angiography based on multiple breath-holds featuring 1D motion compensation and to compare the resulting image quality to a navigator-gated free-breathing acquisition. Image reconstruction was performed using L1 regularized iterative SENSE. MATERIALS AND METHODS: The effects of respiratory motion on the Cartesian sampling scheme were minimized by performing data acquisition in multiple breath-holds. During the scan, repetitive readouts through a k-space center were used to detect and correct the respiratory displacement of the heart by exploiting the self-navigation principle in image reconstruction. In vivo experiments were performed in nine healthy volunteers and the resulting image quality was compared to a navigator-gated reference in terms of vessel length and sharpness. RESULTS: Acquisition in breath-hold is an effective method to reduce the scan time by more than 30 % compared to the navigator-gated reference. Although an equivalent mean image quality with respect to the reference was achieved with the proposed method, the 1D motion compensation did not work equally well in all cases. CONCLUSION: In general, the image quality scaled with the robustness of the motion compensation. Nevertheless, the featured setup provides a positive basis for future extension with more advanced motion compensation methods.

Interpretation of data in psychology : a false problem

Relevance:

20.00% 20.00%

Publisher:

Secondary Accident Data Fusion for Assessing Long-Term Performance of Transportation Systems, 2007

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Secondary accident statistics can be useful for studying the impact of traffic incident management strategies. An easy-to-implement methodology is presented for classifying secondary accidents using data fusion of a police accident database with intranet incident reports. A current method for classifying secondary accidents uses a static threshold that represents the spatial and temporal region of influence of the primary accident, such as two miles and one hour. An accident is considered secondary if it occurs upstream from the primary accident and is within the duration and queue of the primary accident. However, using the static threshold may result in both false positives and negatives because accident queues are constantly varying. The methodology presented in this report seeks to improve upon this existing method by making the threshold dynamic. An incident progression curve is used to mark the end of the queue throughout the entire incident. Four steps in the development of incident progression curves are described. Step one is the processing of intranet incident reports. Step two is the filling in of incomplete incident reports. Step three is the nonlinear regression of incident progression curves. Step four is the merging of individual incident progression curves into one master curve. To illustrate this methodology, 5,514 accidents from Missouri freeways were analyzed. The results show that secondary accidents identified by dynamic versus static thresholds can differ by more than 30%.
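The static-threshold rule described in this abstract can be sketched in a few lines. This is only an illustration of the baseline rule (two miles, one hour), not the report's dynamic method; the `Accident` record, its field names, and the convention that mileposts increase in the direction of travel are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Accident:
    # Hypothetical record layout, not taken from the report.
    time_h: float    # time of the accident, in hours from an arbitrary origin
    milepost: float  # position; assumed to increase in the direction of travel


def is_secondary_static(primary: Accident, candidate: Accident,
                        max_miles: float = 2.0, max_hours: float = 1.0) -> bool:
    """Static-threshold rule: the candidate is secondary if it happens after
    the primary accident, within max_hours, and upstream within max_miles."""
    elapsed = candidate.time_h - primary.time_h
    # Upstream means traffic has not yet reached the primary accident,
    # i.e. a smaller milepost under the assumed convention.
    upstream_distance = primary.milepost - candidate.milepost
    return 0 < elapsed <= max_hours and 0 < upstream_distance <= max_miles


primary = Accident(time_h=0.0, milepost=10.0)
inside = is_secondary_static(primary, Accident(0.5, 9.0))
too_late = is_secondary_static(primary, Accident(1.5, 9.0))
downstream = is_secondary_static(primary, Accident(0.5, 11.0))
too_far = is_secondary_static(primary, Accident(0.5, 7.5))
```

Because the fixed window ignores how the queue actually grows and dissipates, it can misclassify in both directions, which is the motivation the abstract gives for replacing it with an incident progression curve.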

Rivers Regional Patterns in Historic Data

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A report produced by the Department of Natural Resources on the historical pattern the rivers take.

Mutation identification by direct comparison of whole-genome sequencing data from mutant and wild-type individuals using k-mers.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Genes underlying mutant phenotypes can be isolated by combining marker discovery, genetic mapping and resequencing, but a more straightforward strategy for mapping mutations would be the direct comparison of mutant and wild-type genomes. Applying such an approach, however, is hampered by the need for reference sequences and by mutational loads that confound the unambiguous identification of causal mutations. Here we introduce NIKS (needle in the k-stack), a reference-free algorithm based on comparing k-mers in whole-genome sequencing data for precise discovery of homozygous mutations. We applied NIKS to eight mutants induced in nonreference rice cultivars and to two mutants of the nonmodel species Arabis alpina. In both species, comparing pooled F2 individuals selected for mutant phenotypes revealed small sets of mutations including the causal changes. Moreover, comparing M3 seedlings of two allelic mutants unambiguously identified the causal gene. Thus, for any species amenable to mutagenesis, NIKS enables forward genetics without requiring segregating populations, genetic maps and reference sequences.
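The basic idea of comparing two samples by their k-mers, without a reference sequence, can be illustrated with a toy sketch. This is not the NIKS algorithm: it ignores sequencing errors, k-mer counts, reverse complements, and pooling, and the function names and example sequences are invented for the illustration.

```python
def kmers(seq: str, k: int) -> set[str]:
    """Return the set of all substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def sample_specific_kmers(reads_a: list[str], reads_b: list[str], k: int) -> set[str]:
    """K-mers present in the reads of sample A but absent from sample B."""
    in_a = set().union(*(kmers(r, k) for r in reads_a))
    in_b = set().union(*(kmers(r, k) for r in reads_b))
    return in_a - in_b


# Toy data: the mutant differs from the wild type by one substitution (A -> T).
wild_type = ["ACGTACGTGA"]
mutant = ["ACGTTCGTGA"]

mutant_only = sample_specific_kmers(mutant, wild_type, k=4)
wild_only = sample_specific_kmers(wild_type, mutant, k=4)
```

In this toy case every mutant-specific k-mer overlaps the substituted base, so a single substitution yields at most k sample-specific k-mers on each side. That is why such k-mers can point to the mutation without aligning to a reference.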

Iowa STORET Data Retrieval, 2002

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Part of Iowa's Water Ambient monitoring Program, produced by the Iowa Department of Natural Resources.

Preliminary data on the geographical distribution of Drosophila species within morphoclimatic domains of Brazil. III. The cardini group

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A reanalysis, based on museum specimens, of our previously published data on the geographical distribution of the species of Drosophila belonging to the cardini group in Brazil is presented and discussed. As previously recorded in several papers, including ours, the following four species were recognized: D. cardini, D. cardinoides, D. neocardini, and D. polymorpha. However, it was realized that most of the flies we have previously identified as Drosophila cardinoides belong in fact to Drosophila cardini. To facilitate the proper identification of these four near-sibling species, their holotypes were analyzed and their terminalia were described and illustrated. A key to the four species is also provided.

Stepwise multiple testing as formalized data snooping

Relevance:

20.00% 20.00%

Publisher:

Abstract:

It is common in econometric applications that several hypothesis tests are carried out at the same time. The problem then becomes how to decide which hypotheses to reject, accounting for the multitude of tests. In this paper, we suggest a stepwise multiple testing procedure which asymptotically controls the familywise error rate at a desired level. Compared to related single-step methods, our procedure is more powerful in the sense that it often will reject more false hypotheses. In addition, we advocate the use of studentization when it is feasible. Unlike some stepwise methods, our method implicitly captures the joint dependence structure of the test statistics, which results in increased ability to detect alternative hypotheses. We prove our method asymptotically controls the familywise error rate under minimal assumptions. We present our methodology in the context of comparing several strategies to a common benchmark and deciding which strategies actually beat the benchmark. However, our ideas can easily be extended and/or modified to other contexts, such as making inference for the individual regression coefficients in a multiple regression framework. Some simulation studies show the improvements of our methods over previous proposals. We also provide an application to a set of real data.
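To make the contrast between single-step and stepwise procedures concrete, here is a sketch of the classical Holm step-down procedure next to the single-step Bonferroni rule. Holm is used here only as a familiar stand-in: it is not the procedure proposed in this paper, which additionally exploits the dependence structure of the test statistics. The function names and the p-values are made up for the illustration.

```python
def bonferroni_reject(p_values: list[float], alpha: float = 0.05) -> list[bool]:
    """Single-step rule: reject H_i when p_i <= alpha / m."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def holm_reject(p_values: list[float], alpha: float = 0.05) -> list[bool]:
    """Holm step-down rule: test p-values from smallest to largest against
    alpha / m, alpha / (m - 1), ..., and stop at the first non-rejection."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, i in enumerate(order):
        if p_values[i] <= alpha / (m - rank):
            reject[i] = True
        else:
            break  # all remaining (larger) p-values are retained as well
    return reject


# Illustrative p-values, not taken from any study.
p = [0.001, 0.015, 0.02, 0.04]
single_step = bonferroni_reject(p)
stepwise = holm_reject(p)
```

With these p-values the single-step rule rejects only the first hypothesis, while the step-down rule rejects all four. Both control the familywise error rate at the same level, which illustrates the sense in which stepwise methods can be more powerful.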

Fusion of data sets in multivariate linear regression with errors-in-variables

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We consider the application of normal theory methods to the estimation and testing of a general type of multivariate regression models with errors-in-variables, in the case where various data sets are merged into a single analysis and the observable variables deviate possibly from normality. The various samples to be merged can differ on the set of observable variables available. We show that there is a convenient way to parameterize the model so that, despite the possible non-normality of the data, normal-theory methods yield correct inferences for the parameters of interest and for the goodness-of-fit test. The theory described encompasses both the functional and structural model cases, and can be implemented using standard software for structural equation models, such as LISREL, EQS, LISCOMP, among others. An illustration with Monte Carlo data is presented.
