976 resultados para Nonparametric statistical analysis


Relevância:

100.00% 100.00%

Publicador:

Resumo:

A Világbank az 1990-es évek végén egy nagyszabású, nemzetközi teljesítmény-értékelési programot indított a víz- és szennyvíz-szolgáltató vállalatok körében. Az International Benchmarking Network for Water and Sanitation Utilities (IBNET) elnevezésű kezdeményezés keretében a szolgáltatók egy szabványosított kérdőíven információt adnak meg működésükről. Az egyedi, cégszintű adatokból egy adatbázis készül, mely lehetővé teszi a vállalatok működésének összehasonlító elemzését, amit teljesítmény-értékelésnek (benchmarking) is szokás nevezni. A programról és eddigi eredményeiről angol nyelvű információ a www.ib-net.org honlapon található. A felmérést eddig több, mint 70 országban végezték el, köztük Magyarországon is. Itthon a REKK kapott megbízást a feladat végrehajtására. Az adatgyűjtésen túl az adatbázisra alapozva Közép és Kelet-Európa országainak víziközmű cégeiről statisztikai elemzést végeztünk az alap adottságok és a költségek összefüggésének feltárására.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The article investigates the division between member states of the European Union considering the aspect of their level of information and communication technology (ICT) development focusing on e-learning. With the help of discriminant analysis the countries are categorized into groups based on their ICT maturity and e-learning literacy level of development. Making a comparison with a benchmarking tool, the ITU (International Telecommunication Union)’s ICT Development Index (IDI) the results are confirmed partly correct. The article tries to find economical explanations for the re-grouping of the countries ranking. Finally the author examines the reliability of Hungary’s ranking results and the factors which may affect this divergence from the real picture.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This dissertation develops a new figure of merit to measure the similarity (or dissimilarity) of Gaussian distributions through a novel concept that relates the Fisher distance to the percentage of data overlap. The derivations are expanded to provide a generalized mathematical platform for determining an optimal separating boundary of Gaussian distributions in multiple dimensions. Real-world data used for implementation and in carrying out feasibility studies were provided by Beckman-Coulter. It is noted that although the data used is flow cytometric in nature, the mathematics are general in their derivation to include other types of data as long as their statistical behavior approximate Gaussian distributions. ^ Because this new figure of merit is heavily based on the statistical nature of the data, a new filtering technique is introduced to accommodate for the accumulation process involved with histogram data. When data is accumulated into a frequency histogram, the data is inherently smoothed in a linear fashion, since an averaging effect is taking place as the histogram is generated. This new filtering scheme addresses data that is accumulated in the uneven resolution of the channels of the frequency histogram. ^ The qualitative interpretation of flow cytometric data is currently a time consuming and imprecise method for evaluating histogram data. This method offers a broader spectrum of capabilities in the analysis of histograms, since the figure of merit derived in this dissertation integrates within its mathematics both a measure of similarity and the percentage of overlap between the distributions under analysis. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The microarray technology provides a high-throughput technique to study gene expression. Microarrays can help us diagnose different types of cancers, understand biological processes, assess host responses to drugs and pathogens, find markers for specific diseases, and much more. Microarray experiments generate large amounts of data. Thus, effective data processing and analysis are critical for making reliable inferences from the data. ^ The first part of dissertation addresses the problem of finding an optimal set of genes (biomarkers) to classify a set of samples as diseased or normal. Three statistical gene selection methods (GS, GS-NR, and GS-PCA) were developed to identify a set of genes that best differentiate between samples. A comparative study on different classification tools was performed and the best combinations of gene selection and classifiers for multi-class cancer classification were identified. For most of the benchmarking cancer data sets, the gene selection method proposed in this dissertation, GS, outperformed other gene selection methods. The classifiers based on Random Forests, neural network ensembles, and K-nearest neighbor (KNN) showed consistently god performance. A striking commonality among these classifiers is that they all use a committee-based approach, suggesting that ensemble classification methods are superior. ^ The same biological problem may be studied at different research labs and/or performed using different lab protocols or samples. In such situations, it is important to combine results from these efforts. The second part of the dissertation addresses the problem of pooling the results from different independent experiments to obtain improved results. Four statistical pooling techniques (Fisher inverse chi-square method, Logit method. Stouffer's Z transform method, and Liptak-Stouffer weighted Z-method) were investigated in this dissertation. These pooling techniques were applied to the problem of identifying cell cycle-regulated genes in two different yeast species. As a result, improved sets of cell cycle-regulated genes were identified. The last part of dissertation explores the effectiveness of wavelet data transforms for the task of clustering. Discrete wavelet transforms, with an appropriate choice of wavelet bases, were shown to be effective in producing clusters that were biologically more meaningful. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Multivariate statistical analysis on the kaolinite/chlorite ratios from 20 South Atlantic sediment cores allowed for the extraction of two processes controlling the fluctuations of the kaolinite/chlorite ratio during the last 130,000 yrs, (1) the relative strength of North Atlantic Deep Water (NADW) inflow into the South Atlantic Ocean and (2) the influx of aeolian sediments from the south African continent. The NADW fluctuation can be traced in the entire deep South Atlantic while the dust signal is restricted to the vicinity of South Africa. Our data indicate that NADW formation underwent significant changes in response to glacial/interglacial climate changes with enhanced export to the Southern Hemisphere during interglacials. The most pronounced phases with Enhanced South African Dust Export (ESADE) occurred during cold Marine Isotope Stage (MIS) 5d and across the Late Glacial/Holocene transition from 16 ka to 4 ka (MIS 2 to 1). This particular pattern is attributed to the interaction of Antarctic Sea Ice extent, the position of the westerlies and the South African monsoon system.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

With the growing pressure of eutrophication in tropical regions, the Mauritian shelf provides a natural situation to understand the variability in mesotrophic assemblages. Site-specific dynamics occur throughout the 1200 m depth gradient. The shallow assemblages divide into three types of warm-water mesotrophic foraminiferal assemblages, which is not only a consequence of high primary productivity restricting light to the benthos but due to low pore water oxygenation, shelf geomorphology, and sediment partitioning. In the intermediate depth (approx. 500 m), the increase in foraminiferal diversity is due to the cold-water coral habitat providing a greater range of micro niches. Planktonic species characterise the lower bathyal zone, which emphasizes the reduced benthic carbonate production at depth. Although, due to the strong hydrodynamics within the Golf, planktonic species occur in notable abundances through out the whole depth gradient. Overall, this study can easily be compared to other tropical marine settings investigating the long-term effects of tropical eutrophication and the biogeographic distribution of carbonate producing organisms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Studies have shown that large geographical spreading can reduce the wind power variability and smooth production. It is frequently assumed that storage and interconnection can manage wind power variability and are totally flexible. However, constraints do exist. In the future more and more electricity will be provided by renewable energy sources and more electricity interconnectors will be built between European Union (EU) countries, as outlines in many of the Projects of Common Interests. It is essential to understand the correlation of wind generation throughout Europe considering power system constraints. In this study the spatial and temporal correlation of wind power production across several countries is examined in order to understand how “the wind ‘travels’ across Europe”. Three years of historical hourly wind power generation from ten EU countries is analysed to investigate the geographic diversity and time scales influence on correlation of wind power variations. Results are then compared with two other studies and show similar general characteristics of correlation between EU country pairs to identify opportunities for storage optimisation, power system operations, and trading.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The sea surface temperature (SST) and chlorophyll-a concentration (CHL-a) were analysed in the Gulf of Tadjourah from two set of 8-day composite satellite data, respectively from 2008 to 2012 and from 2005 to 2011. A singular spectrum analysis (SSA) shows that the annual cycle of SST is strong (74.3% of variance) and consists of warming (April-October) and cooling (November-March) of about 2.5C than the long-term average. The semi-annual cycle captures only 14.6% of temperature variance and emphasises the drop of SST during July-August. Similarly, the annual cycle of CHL-a (29.7% of variance) depicts high CHL-a from June to October and low concentration from November to May. In addition, the first spatial empirical orthogonal function (EOF) of SST (93% of variance) shows that the seasonal warming/cooling is in phase across the whole study area but the southeastern part always remaining warmer or cooler. In contrast to the SST, the first EOF of CHL-a (54.1% of variance) indicates the continental shelf in phase opposition with the offshore area in winter during which the CHL-a remains sequestrated in the coastal area particularly in the south-east and in the Ghoubet Al-Kharab Bay. Inversely during summer, higher CHL-a quantities appear in the offshore waters. In order to investigate processes generating these patterns, a multichannel spectrum analysis was applied to a set of oceanic (SST, CHL-a) and atmospheric parameters (wind speed, air temperature and air specific humidity). This analysis shows that the SST is well correlated to the atmospheric parameters at an annual scale. The windowed cross correlation indicates that this correlation is significant only from October to May. During this period, the warming was related to the solar heating of the surface water when the wind is low (April-May and October) while the cooling (November-March) was linked to the strong and cold North-East winds and to convective mixing. The summer drop in SST followed by a peak of CHL-a, seems strongly correlated to the upwelling. The second EOF modes of SST and CHL-a explain respectively 1.3% and 5% of the variance and show an east-west gradient during winter that is reversed during summer. This work showed that the seasonal signals have a wide spatial influence and dominate the variability of the SST and CHL-a while the east-west gradient are specific for the Gulf of Tadjourah and seem induced by the local wind modulated by the topography.