Biblioteca Digital

55 resultados para non-trivial data structures

Towards expressive rule induction on IP network event streams

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In order to gain insights into events and issues that may cause errors and outages in parts of IP networks, intelligent methods that capture and express causal relationships online (in real-time) are needed. Whereas generalised rule induction has been explored for non-streaming data applications, its application and adaptation on streaming data is mostly undeveloped or based on periodic and ad-hoc training with batch algorithms. Some association rule mining approaches for streaming data do exist, however, they can only express binary causal relationships. This paper presents the ongoing work on Online Generalised Rule Induction (OGRI) in order to create expressive and adaptive rule sets real-time that can be applied to a broad range of applications, including network telemetry data streams.

Convectively coupled equatorial waves: A new methodology for identifying wave structures in observational data

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Convectively coupled equatorial waves are fundamental components of the interaction between the physics and dynamics of the tropical atmosphere. A new methodology, which isolates individual equatorial wave modes, has been developed and applied to observational data. The methodology assumes that the horizontal structures given by equatorial wave theory can be used to project upper- and lower-tropospheric data onto equatorial wave modes. The dynamical fields are first separated into eastward- and westward-moving components with a specified domain of frequency–zonal wavenumber. Each of the components for each field is then projected onto the different equatorial modes using the y structures of these modes given by the theory. The latitudinal scale yo of the modes is predetermined by data to fit the equatorial trapping in a suitable latitude belt y = ±Y. The extent to which the different dynamical fields are consistent with one another in their depiction of each equatorial wave structure determines the confidence in the reality of that structure. Comparison of the analyzed modes with the eastward- and westward-moving components in the convection field enables the identification of the dynamical structure and nature of convectively coupled equatorial waves. In a case study, the methodology is applied to two independent data sources, ECMWF Reanalysis and satellite-observed window brightness temperature (Tb) data for the summer of 1992. Various convectively coupled equatorial Kelvin, mixed Rossby–gravity, and Rossby waves have been detected. The results indicate a robust consistency between the two independent data sources. Different vertical structures for different wave modes and a significant Doppler shifting effect of the background zonal winds on wave structures are found and discussed. It is found that in addition to low-level convergence, anomalous fluxes induced by strong equatorial zonal winds associated with equatorial waves are important for inducing equatorial convection. There is evidence that equatorial convection associated with Rossby waves leads to a change in structure involving a horizontal structure similar to that of a Kelvin wave moving westward with it. The vertical structure may also be radically changed. The analysis method should make a very powerful diagnostic tool for investigating convectively coupled equatorial waves and the interaction of equatorial dynamics and physics in the real atmosphere. The results from application of the analysis method for a reanalysis dataset should provide a benchmark against which model studies can be compared.

Can wavelets improve the representation of forecast error covariances in variational data assimilation?

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Two wavelet-based control variable transform schemes are described and are used to model some important features of forecast error statistics for use in variational data assimilation. The first is a conventional wavelet scheme and the other is an approximation of it. Their ability to capture the position and scale-dependent aspects of covariance structures is tested in a two-dimensional latitude-height context. This is done by comparing the covariance structures implied by the wavelet schemes with those found from the explicit forecast error covariance matrix, and with a non-wavelet- based covariance scheme used currently in an operational assimilation scheme. Qualitatively, the wavelet-based schemes show potential at modeling forecast error statistics well without giving preference to either position or scale-dependent aspects. The degree of spectral representation can be controlled by changing the number of spectral bands in the schemes, and the least number of bands that achieves adequate results is found for the model domain used. Evidence is found of a trade-off between the localization of features in positional and spectral spaces when the number of bands is changed. By examining implied covariance diagnostics, the wavelet-based schemes are found, on the whole, to give results that are closer to diagnostics found from the explicit matrix than from the nonwavelet scheme. Even though the nature of the covariances has the right qualities in spectral space, variances are found to be too low at some wavenumbers and vertical correlation length scales are found to be too long at most scales. The wavelet schemes are found to be good at resolving variations in position and scale-dependent horizontal length scales, although the length scales reproduced are usually too short. The second of the wavelet-based schemes is often found to be better than the first in some important respects, but, unlike the first, it has no exact inverse transform.

Dynamic load balancing for the distributed mining of molecular structures

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In molecular biology, it is often desirable to find common properties in large numbers of drug candidates. One family of methods stems from the data mining community, where algorithms to find frequent graphs have received increasing attention over the past years. However, the computational complexity of the underlying problem and the large amount of data to be explored essentially render sequential algorithms useless. In this paper, we present a distributed approach to the frequent subgraph mining problem to discover interesting patterns in molecular compounds. This problem is characterized by a highly irregular search tree, whereby no reliable workload prediction is available. We describe the three main aspects of the proposed distributed algorithm, namely, a dynamic partitioning of the search space, a distribution process based on a peer-to-peer communication framework, and a novel receiverinitiated load balancing algorithm. The effectiveness of the distributed method has been evaluated on the well-known National Cancer Institute’s HIV-screening data set, where we were able to show close-to linear speedup in a network of workstations. The proposed approach also allows for dynamic resource aggregation in a non dedicated computational environment. These features make it suitable for large-scale, multi-domain, heterogeneous environments, such as computational grids.

In search of simple structures in climate data: simplifying EOFs

Relevância:

40.00% 40.00%

Publicador:

A score test for binary data with patient non-compliance

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A score test is developed for binary clinical trial data, which incorporates patient non-compliance while respecting randomization. It is assumed in this paper that compliance is all-or-nothing, in the sense that a patient either accepts all of the treatment assigned as specified in the protocol, or none of it. Direct analytic comparisons of the adjusted test statistic for both the score test and the likelihood ratio test are made with the corresponding test statistics that adhere to the intention-to-treat principle. It is shown that no gain in power is possible over the intention-to-treat analysis, by adjusting for patient non-compliance. Sample size formulae are derived and simulation studies are used to demonstrate that the sample size approximation holds. Copyright © 2003 John Wiley & Sons, Ltd.

Active-control trials with binary data: a comparison of methods for testing superiority or non-inferiority using the odds ratio

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper considers methods for testing for superiority or non-inferiority in active-control trials with binary data, when the relative treatment effect is expressed as an odds ratio. Three asymptotic tests for the log-odds ratio based on the unconditional binary likelihood are presented, namely the likelihood ratio, Wald and score tests. All three tests can be implemented straightforwardly in standard statistical software packages, as can the corresponding confidence intervals. Simulations indicate that the three alternatives are similar in terms of the Type I error, with values close to the nominal level. However, when the non-inferiority margin becomes large, the score test slightly exceeds the nominal level. In general, the highest power is obtained from the score test, although all three tests are similar and the observed differences in power are not of practical importance. Copyright (C) 2007 John Wiley & Sons, Ltd.

Overlapping double turn conformations adopted by tetrapeptides containing non-coded alpha-Amino Isobutyric Acid (AIB) and formation of tape-like structures through supramolecular helix mediated self-assembly

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Single crystal X-ray diffraction studies and solvent dependent H-1 NMR titrations reveal that a set of four tetrapeptides with general formula Boc-Xx(1)-Aib(2)-Yy(3)-Zz(4)-OMe, where Xx, Yy and Zz are coded L- amino acids, adopt equivalent conformations that can be described as overlapping double turn conformations stabilized by two 4 -> 1 intramolecular hydrogen bonds between Yy(3)-NH and Boc C=O and Zz(4)-NH and Xx(1)C=O. In the crystalline state, the double turn structures are packed in head-to-tail fashion through intermolecular hydrogen bonds to create supramolecular helical structures. Field emission scanning electron microscopic (FE-SEM) images of the tetrapeptides in the solid state reveal that they can form flat tape-like structures. The results establish that synthetic Aib containing supramolecular helices can form highly ordered self-aggregated amyloid plaque like human amylin.

Design of supramolecular beta-sheet forming beta-turns containing aromatic rings and non-coded alpha-aminoisobutyric acid and the formation of flat fibrillar structures through self-assembly

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Single crystal X-ray diffraction studies show that the three designed tripeptides Boc-Leu-Aib-m-NA-NO2 (I), Boc-Phe-Aib-m-NA-NO2 (II) and Boc-Pro-Aib-m-ABA-OMe (III) (Aib, -aminoisobutyric acid; m-NA, m-nitroaniline; m-ABA, m-aminobenzoic acid; Boc, t-butyloxycarbonyl) containing aromatic rings in the backbones adopt -turn structures that are self-assembled through intermolecular hydrogen bonds and van der Waals interactions to create layers of -sheets. Solvent-dependent NMR titration and CD studies show that the -turn structures of the peptides also exist in the solution phase. The field emission scanning electron microscopic and transmission electron microscopic images of the peptides in the solid state reveal fibrillar structures of flat morphology that are formed through -sheet mediated self-assembly of the preorganised -turn building blocks.

Accurate molecular structures of chlorothiazide and hydrochlorothiazide by joint refinement against powder neutron and X-ray diffraction data

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The compounds chlorothiazide and hydrochlorothiazide (crystalline form II) have been studied in their fully hydrogenous forms by powder neutron diffraction on the GEM diffractometer. The results of joint Rietveld refinement of the structures against multi-bank neutron and single-bank X-ray powder data are reported and show that accurate and precise structural information can be obtained from polycrystalline molecular organic materials by this route.

Gabor wavelets and Gaussian models to separate ground and non-ground for airborne scanned LIDAR data

Relevância:

40.00% 40.00%

Publicador:

Sex allocation and local mate competition in Old World non-pollinating fig wasps

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The populations of many species are structured such that mating is not random and occurs between members of local patches. When patches are founded by a single female and all matings occur between siblings, brothers may compete with each other for matings with their sisters. This local mate competition (LMC) selects for a female-biased sex ratio, especially in species where females have control over offspring sex, as in the parasitic Hymenoptera. Two factors are predicted to decrease the degree of female bias: (1) an increase in the number of foundress females in the patch and (2) an increase in the fraction of individuals mating after dispersal from the natal patch. Pollinating fig wasps are well known as classic examples of species where all matings occur in the local patch. We studied non-pollinating fig wasps, which are more diverse than the pollinating fig wasps and also provide natural experimental groups of species with different male morphologies that are linked to different mating structures. In this group of wasps, species with wingless males mate in the local patch (i.e. the fig fruit) while winged male species mate after dispersal. Species with both kinds of male have a mixture of local and non-local mating. Data from 44 species show that sex ratios (defined as the proportion of males) are in accordance with theoretical predictions: wingless male species < wing-dimorphic male species < winged male species. These results are also supported by a formal comparative analysis that controls for phylogeny. The foundress number is difficult to estimate directly for non-pollinating fig wasps but a robust indirect method leads to the prediction that foundress number, and hence sex ratio, should increase with the proportion of patches occupied in a crop. This result is supported strongly across 19 species with wingless males, but not across 8 species with winged males. The mean sex ratios for species with winged males are not significantly different from 0.5, and the absence of the correlation observed across species with wingless males may reflect weak selection to adjust the sex ratio in species whose population mating structure tends not to be subdivided. The same relationship is also predicted to occur within species if individual females adjust their sex ratios facultatively. This final prediction was not supported by data from a wingless male species, a male wing-dimorphic species or a winged male species.

Solving surface structures from normal incidence X-ray standing wave data

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A program is provided to determine structural parameters of atoms in or adsorbed on surfaces by refinement of atomistic models towards experimentally determined data generated by the normal incidence X-ray standing wave (NIXSW) technique. The method employs a combination of Differential Evolution Genetic Algorithms and Steepest Descent Line Minimisations to provide a fast, reliable and user friendly tool for experimentalists to interpret complex multidimensional NIXSW data sets.

A data framework for measuring the energy consumption of the non-domestic building stock

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The transition to a low-carbon economy urgently demands better information on the drivers of energy consumption. UK government policy has prioritized energy efficiency in the built stock as a means of carbon reduction, but the sector is historically information poor, particularly the non-domestic building stock. This paper presents the results of a pilot study that investigated whether and how property and energy consumption data might be combined for non-domestic energy analysis. These data were combined in a ‘Non-Domestic Energy Efficiency Database’ to describe the location and physical attributes of each property and its energy consumption. The aim was to support the generation of a range of energy-efficiency statistics for the industrial, commercial and institutional sectors of the non-domestic building stock, and to provide robust evidence for national energy-efficiency and carbon-reduction policy development and monitoring. The work has brought together non-domestic energy data, property data and mapping in a ‘data framework’ for the first time. The results show what is possible when these data are integrated and the associated difficulties. A data framework offers the potential to inform energy-efficiency policy formation and to support its monitoring at a level of detail not previously possible.

Performance analysis of data transmissions in MPLS and non-MPLS networks

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Basic Network transactions specifies that datagram from source to destination is routed through numerous routers and paths depending on the available free and uncongested paths which results in the transmission route being too long, thus incurring greater delay, jitter, congestion and reduced throughput. One of the major problems of packet switched networks is the cell delay variation or jitter. This cell delay variation is due to the queuing delay depending on the applied loading conditions. The effect of delay, jitter accumulation due to the number of nodes along transmission routes and dropped packets adds further complexity to multimedia traffic because there is no guarantee that each traffic stream will be delivered according to its own jitter constraints therefore there is the need to analyze the effects of jitter. IP routers enable a single path for the transmission of all packets. On the other hand, Multi-Protocol Label Switching (MPLS) allows separation of packet forwarding and routing characteristics to enable packets to use the appropriate routes and also optimize and control the behavior of transmission paths. Thus correcting some of the shortfalls associated with IP routing. Therefore MPLS has been utilized in the analysis for effective transmission through the various networks. This paper analyzes the effect of delay, congestion, interference, jitter and packet loss in the transmission of signals from source to destination. In effect the impact of link failures, repair paths in the various physical topologies namely bus, star, mesh and hybrid topologies are all analyzed based on standard network conditions.

«
1
2
3
4
»