61 resultados para Data Streams Distribution


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Monitor a distribution network implies working with a huge amount of data coining from the different elements that interact in the network. This paper presents a visualization tool that simplifies the task of searching the database for useful information applicable to fault management or preventive maintenance of the network

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A key issue in the implementation of the Water Framework Directive is the classification of streams and rivers using biological quality parameters and type-specific reference conditions. Four groups of stream types were defined in NE Spain on the basis of 152 diatom samples by means of detrended correspondence analysis and classification techniques. Diatom analysis was restricted to epilithic taxa, and the sites included gradients ranging from near-natural streams to sites with poor ecological quality. The main gradient shows a clear separation of sites in relation to the degree of human influence: polluted streams (mainly located in the lowlands) differ from streams in mountainous areas and in the Pyrenees. A second gradient is related to physiographical features. Headwater streams can be distinguished by their catchment geology. The type-specific diatom taxa for the stream types studied were determined by using indicator species analysis (IndVal). The type-specific taxa from near-natural streams are coincident with the indicator taxa for high ecological status. Human impact reduced the typological heterogeneity of the diatom community composition. Overall, the diatom communities in NE Spain exhibit a regional distribution pattern that closely corresponds with that observed in river systems elsewhere. Physiographical differences are only evident in undisturbed sites, while nutrient enrichment and other human disturbances may mask the regional differences in the distribution of diatom communities

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a ‘coding statistic’ is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data and application is made to the yeast chromosome III sequence and to C.elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the past, sensors networks in cities have been limited to fixed sensors, embedded in particular locations, under centralised control. Today, new applications can leverage wireless devices and use them as sensors to create aggregated information. In this paper, we show that the emerging patterns unveiled through the analysis of large sets of aggregated digital footprints can provide novel insights into how people experience the city and into some of the drivers behind these emerging patterns. We particularly explore the capacity to quantify the evolution of the attractiveness of urban space with a case study of in the area of the New York City Waterfalls, a public art project of four man-made waterfalls rising from the New York Harbor. Methods to study the impact of an event of this nature are traditionally based on the collection of static information such as surveys and ticket-based people counts, which allow to generate estimates about visitors’ presence in specific areas over time. In contrast, our contribution makes use of the dynamic data that visitors generate, such as the density and distribution of aggregate phone calls and photos taken in different areas of interest and over time. Our analysis provides novel ways to quantify the impact of a public event on the distribution of visitors and on the evolution of the attractiveness of the points of interest in proximity. This information has potential uses for local authorities, researchers, as well as service providers such as mobile network operators.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Automatic classification of makams from symbolic data is a rarely studied topic. In this paper, first a review of an n-gram based approach is presented using various representations of the symbolic data. While a high degree of precision can be obtained, confusion happens mainly for makams using (almost) the same scale and pitch hierarchy but differ in overall melodic progression, seyir. To further improve the system, first n-gram based classification is tested for various sections of the piece to take into account a feature of the seyir that melodic progression starts in a certain region of the scale. In a second test, a hierarchical classification structure is designed which uses n-grams and seyir features in different levels to further improve the system.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we propose a subsampling estimator for the distribution ofstatistics diverging at either known rates when the underlying timeseries in strictly stationary abd strong mixing. Based on our results weprovide a detailed discussion how to estimate extreme order statisticswith dependent data and present two applications to assessing financialmarket risk. Our method performs well in estimating Value at Risk andprovides a superior alternative to Hill's estimator in operationalizingSafety First portofolio selection.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Do the contests with the largest prizes attract the most able contestants? Towhat extent do contestants avoid competition? In this paper, we show, theoreticallyand empirically, that the distribution of abilities plays a crucial role in determiningcontest choice. Sorting exists only when the proportion of high-ability contestantsis sufficiently small. As this proportion increases, contestants shy away from competitionand sorting decreases, such that, reverse sorting becomes a possibility. Wetest our theoretical predictions using a large panel data set containing contest choiceover three decades. We use exogenous variation in the participation of highly-ablecompetitors to provide empirical evidence for the relationship among prizes, competition,and sorting.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the mid-1980s, many European countries introduced fixed-term contracts.Since then their labor markets have become more dynamic. This paper studiesthe implications of such reforms for the duration distribution ofunemployment, with particular emphasis on the changes in the durationdependence. I estimate a parametric duration model using cross-sectionaldata drawn from the Spanish Labor Force Survey from 1980 to 1994 to analyzethe chances of leaving unemployment before and after the introduction offixed-term contracts. I find that duration dependence has increased sincesuch reform. Semi-parametric estimation of the model also shows that forlong spells, the probability of leaving unemployment has decreased sincesuch reform.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We propose a new econometric estimation method for analyzing the probabilityof leaving unemployment using uncompleted spells from repeated cross-sectiondata, which can be especially useful when panel data are not available. Theproposed method-of-moments-based estimator has two important features:(1) it estimates the exit probability at the individual level and(2) it does not rely on the stationarity assumption of the inflowcomposition. We illustrate and gauge the performance of the proposedestimator using the Spanish Labor Force Survey data, and analyze the changesin distribution of unemployment between the 1980s and 1990s during a periodof labor market reform. We find that the relative probability of leavingunemployment of the short-term unemployed versus the long-term unemployedbecomes significantly higher in the 1990s.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We study the statistical properties of three estimation methods for a model of learning that is often fitted to experimental data: quadratic deviation measures without unobserved heterogeneity, and maximum likelihood withand without unobserved heterogeneity. After discussing identification issues, we show that the estimators are consistent and provide their asymptotic distribution. Using Monte Carlo simulations, we show that ignoring unobserved heterogeneity can lead to seriously biased estimations in samples which have the typical length of actual experiments. Better small sample properties areobtained if unobserved heterogeneity is introduced. That is, rather than estimating the parameters for each individual, the individual parameters are considered random variables, and the distribution of those random variables is estimated.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The absolute K magnitudes and kinematic parameters of about 350 oxygen-rich Long-Period Variable stars are calibrated, by means of an up-to-date maximum-likelihood method, using HIPPARCOS parallaxes and proper motions together with radial velocities and, as additional data, periods and V-K colour indices. Four groups, differing by their kinematics and mean magnitudes, are found. For each of them, we also obtain the distributions of magnitude, period and de-reddened colour of the base population, as well as de-biased period-luminosity-colour relations and their two-dimensional projections. The SRa semiregulars do not seem to constitute a separate class of LPVs. The SRb appear to belong to two populations of different ages. In a PL diagram, they constitute two evolutionary sequences towards the Mira stage. The Miras of the disk appear to pulsate on a lower-order mode. The slopes of their de-biased PL and PC relations are found to be very different from the ones of the Oxygen Miras of the LMC. This suggests that a significant number of so-called Miras of the LMC are misclassified. This also suggests that the Miras of the LMC do not constitute a homogeneous group, but include a significant proportion of metal-deficient stars, suggesting a relatively smooth star formation history. As a consequence, one may not trivially transpose the LMC period-luminosity relation from one galaxy to the other.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The distribution of Sn4+ cations within the five crystallographic sites of the magnetoplumbite (M) ‐like compound BaFe12−2xCoxSnxO19 has been analyzed using single‐crystal x‐ray‐diffraction data. The species Fe3+ and Co2+ cannot be distinguished using x rays because of their very similar atomic numbers; however, the calculation of the apparent valencies for the different sites allows an insight into the Co2+ cation segregation. The use of previous data from neutron powder diffraction allows a precise picture of the cation distribution, which indicates a pronounced site selectivity for both Sn4+ and Co2+ cations. The Sn4+ cations prefer the 4f2 sites and to a much lower extent the 12k sites, while they do not enter the octahedral 2a sites at all. Co2+ cations are distributed among tetrahedral and octahedral sites displaying a clear preference for the tetrahedral 4f1 sites. Magnetic measurements indicate that the compound still exhibits uniaxial anisotropy with the easy direction parallel to the c axis. Nevertheless, the magnetic structure shows a considerable degree of noncolinearity. A strong reduction of the magnetic anisotropy regarding that of the undoped compound is also detected.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Several unit root tests in panel data have recently been proposed. The test developed by Harris and Tzavalis (1999 JoE) performs particularly well when the time dimension is moderate in relation to the cross-section dimension. However, in common with the traditional tests designed for the unidimensional case, it was found to perform poorly when there is a structural break in the time series under the alternative. Here we derive the asymptotic distribution of the test allowing for a shift in the mean, and assess the small sample performance. We apply this new test to show how the hypothesis of (perfect) hysteresis in Spanish unemployment is rejected in favour of the alternative of the natural unemployment rate, when the possibility of a change in the latter is considered.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Several unit root tests in panel data have recently been proposed. The test developed by Harris and Tzavalis (1999 JoE) performs particularly well when the time dimension is moderate in relation to the cross-section dimension. However, in common with the traditional tests designed for the unidimensional case, it was found to perform poorly when there is a structural break in the time series under the alternative. Here we derive the asymptotic distribution of the test allowing for a shift in the mean, and assess the small sample performance. We apply this new test to show how the hypothesis of (perfect) hysteresis in Spanish unemployment is rejected in favour of the alternative of the natural unemployment rate, when the possibility of a change in the latter is considered.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we deal with the identification of dependencies between time series of equity returns. Marginal distribution functions are assumed to be known, and a bivariate chi-square test of fit is applied in a fully parametric copula approach. Several families of copulas are fitted and compared with Spanish stock market data. The results show that the t-copula generally outperforms other dependence structures, and highlight the difficulty in adjusting a significant number of bivariate data series