835 results for Cluster Analysis. Information Theory. Entropy. Cross Information Potential. Complex Data
Abstract:
In this paper an architecture for an estimator of short-term wind farm power is proposed. The estimator is made up of a Linear Machine classifier and a set of k Multilayer Perceptrons, each trained for a specific subspace of the input space. The input dataset is split into the k clusters using a k-means technique, and the equivalent Linear Machine classifier is obtained from the cluster centroids...
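As a hedged illustration of the last step described above: for Euclidean k-means, assigning a point to its nearest centroid gives the same answer as picking the largest of k linear discriminants g_i(x) = c_i . x - 0.5 * ||c_i||^2. The sketch below is not the authors' implementation; the function names and the toy data are hypothetical, and the k Multilayer Perceptrons are left out.

```python
import random

def nearest_centroid(x, centroids):
    """Index of the centroid with the smallest squared Euclidean distance to x."""
    d2 = [sum((a - b) ** 2 for a, b in zip(x, c)) for c in centroids]
    return d2.index(min(d2))

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means on lists of floats; returns k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest_centroid(p, centroids)].append(p)
        for i, g in enumerate(groups):
            if g:  # keep the old centroid if a cluster becomes empty
                centroids[i] = [sum(col) / len(g) for col in zip(*g)]
    return centroids

def linear_machine(centroids):
    """One (weights, bias) pair per cluster: w_i = c_i, b_i = -0.5 * ||c_i||^2."""
    return [(list(c), -0.5 * sum(v * v for v in c)) for c in centroids]

def classify(x, machine):
    """Index of the linear discriminant with the largest score."""
    scores = [sum(wj * xj for wj, xj in zip(w, x)) + b for w, b in machine]
    return scores.index(max(scores))

# Toy data (hypothetical): three well-separated 2-D blobs.
rng = random.Random(1)
data = [[rng.gauss(cx, 0.3), rng.gauss(cy, 0.3)]
        for cx, cy in [(0, 0), (3, 0), (0, 3)] for _ in range(40)]
centroids = kmeans(data, k=3)
machine = linear_machine(centroids)
agree = all(classify(p, machine) == nearest_centroid(p, centroids) for p in data)
```

In an architecture like the one described, `classify` would route each input to the perceptron trained for that cluster; here it only shows that the linear scores and the distance rule select the same cluster.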
Abstract:
Crosswell data sets contain a range of angles limited only by the geometry of the source and receiver configuration, the separation of the boreholes and the depth to the target. However, the wide-angle reflections present in crosswell imaging result in amplitude-versus-angle (AVA) features not usually observed in surface data. These features include reflections from angles that are near critical and beyond critical for many of the interfaces; some of these reflections are visible only for a small range of angles, presumably near their critical angle. High-resolution crosswell seismic surveys were conducted over a Silurian (Niagaran) reef at two fields in northern Michigan, Springdale and Coldspring. The Springdale wells extended to much greater depths than the reef, and imaging was conducted from above and from beneath the reef. Combining the results from images obtained from above with those from beneath provides additional information, first, by exhibiting ranges of angles that are different for the two images, especially for reflectors at shallow depths, and second, by providing additional constraints on the solutions for the Zoeppritz equations. Inversion of seismic data for impedance has become a standard part of the workflow for quantitative reservoir characterization. Inversion of crosswell data using either deterministic or geostatistical methods can lead to poor results with phase change beyond the critical angle; the simultaneous pre-stack inversion of partial angle stacks may therefore be best conducted with restrictions to angles less than critical. Deterministic inversion is designed to yield only a single (best-fit) model of elastic properties, while geostatistical inversion produces multiple models (realizations) of elastic properties, lithology and reservoir properties. Geostatistical inversion produces results with far more detail than deterministic inversion.
The difference in detail between the two types of inversion becomes increasingly pronounced for thinner reservoirs, particularly those beyond the vertical resolution of the seismic data. For any interface imaged from above and from beneath, the resulting AVA characters must result from identical contrasts in elastic properties in the two sets of images, albeit in reverse order. An inversion approach to handle both datasets simultaneously, at pre-critical angles, is demonstrated in this work. The main exploration problem for carbonate reefs is determining the porosity distribution. Images of elastic properties, obtained from deterministic and geostatistical simultaneous inversion of a high-resolution crosswell seismic survey, were used to obtain the internal structure and reservoir properties (porosity) of a Niagaran reef in Michigan. The images obtained are the best of any Niagaran pinnacle reef to date.
Abstract:
2016
Abstract:
The correlation dimension D2 and correlation entropy K2 are both important quantifiers in nonlinear time series analysis. However, D2 has been used more commonly than K2 as a discriminating measure. One reason for this is that D2 is a static measure and can be easily evaluated from a time series. However, in many cases, especially those involving coloured noise, K2 is regarded as a more useful measure. Here we present an efficient algorithmic scheme to compute K2 directly from time series data and show that K2 can be used as a more effective measure than D2 for analysing practical time series involving coloured noise.
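For orientation only, a textbook way to estimate K2 from a scalar series is the Grassberger-Procaccia ratio of correlation sums at consecutive embedding dimensions, K2 ~ (1/dt) * ln(C_m(r) / C_{m+1}(r)). The sketch below uses that standard estimate, not the efficient scheme the abstract refers to; the function names, the delay of one sample, the maximum norm and the logistic-map test series are all assumptions.

```python
import math

def correlation_sum(series, m, r, n_vectors):
    """Fraction of pairs of m-dimensional delay vectors (delay 1)
    that lie closer than r in the maximum norm."""
    vecs = [series[i:i + m] for i in range(n_vectors)]
    close = 0
    for i in range(n_vectors):
        for j in range(i + 1, n_vectors):
            if max(abs(a - b) for a, b in zip(vecs[i], vecs[j])) < r:
                close += 1
    return 2.0 * close / (n_vectors * (n_vectors - 1))

def k2_estimate(series, m, r, dt=1.0):
    """Return (K2 estimate, C_m, C_{m+1}) using the same vector count for both sums."""
    n = len(series) - m  # number of delay vectors available at dimension m + 1
    c_m = correlation_sum(series, m, r, n)
    c_m1 = correlation_sum(series, m + 1, r, n)
    return math.log(c_m / c_m1) / dt, c_m, c_m1

# Toy input (assumption): the chaotic logistic map x -> 4x(1 - x).
x, series = 0.3, []
for _ in range(400):
    x = 4.0 * x * (1.0 - x)
    series.append(x)

k2, c2, c3 = k2_estimate(series, m=2, r=0.1)
```

Because both sums use the same index pairs and the maximum norm, C_{m+1} can never exceed C_m, so the estimate is non-negative. In practice the estimate is examined over a range of r and m, and it fails if no pair of vectors falls within r.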
Abstract:
This paper studies the relationship between segregation of women across establishments and the wages of males and females. To investigate this issue empirically we use a panel of matched employer-employee data from Brazil. Various longitudinal models are used to assess the wage impact of establishment gender segregation. Overall, the results indicate that the effect of establishment female proportion on the wages of males and females is negative. We also compare these longitudinal results with cross-section estimates, which are the usual ones obtained in the related literature. This comparison suggests that unmeasured, time-invariant worker- and establishment-specific effects are correlated with the establishment female composition.
Abstract:
The 1,3-dipolar cycloaddition of glycine-derived azlactones with maleimides is efficiently catalyzed by the dimeric chiral complex [(Sa)-Binap·AuTFA]2. The alanine-derived oxazolone only reacts with tert-butyl acrylate, giving anomalous regiochemistry, which is explained and supported by Natural Resonance Theory and Nucleus Independent Chemical Shifts calculations. The origin of the high enantiodiscrimination observed with maleimides and tert-butyl acrylate is analyzed using DFT computed at the M06/Lanl2dz//ONIOM(b3lyp/Lanl2dz:UFF) level. Several applications of these cycloadducts in the synthesis of new proline derivatives with a 2,5-trans-arrangement and in the preparation of complex fused polycyclic molecules are described.
Abstract:
Originally presented as the author's thesis (M.A.), University of Illinois at Urbana-Champaign.
Abstract:
A principal components analysis was carried out on neuropathological data collected from 79 cases of Alzheimer's disease (AD) diagnosed in a single centre. The purpose of the study was to determine whether, on neuropathological criteria, there was evidence for clearly defined subtypes of the disease. Two principal components (PC1 and PC2) were extracted from the data. PC1 was considerably more important than PC2, accounting for 72% of the total variance. When plotted in relation to the first two principal components, the majority of cases (65/79) were distributed in a single cluster within which subgroupings were not clearly evident. In addition, there were a number of individual, mainly early-onset cases, which were neither related to each other nor to the main cluster. The distribution of each neuropathological feature was examined in relation to PC1 and PC2. Disease onset, the degree of gross brain atrophy, neuronal loss and the development of senile plaques (SP) and neurofibrillary tangles (NFT) were negatively correlated with PC1. The development of SP and NFT and the degree of brain atherosclerosis were positively correlated with PC2. These results suggested: 1) that there were different forms of AD but no clear division of the cases into subclasses could be made based on the neuropathological criteria used, the cases showing a more continuous distribution from one form to another; 2) that disease onset was an important variable and was associated with a greater development of pathological changes; 3) that familial cases were not a distinct subclass of AD, the cases being widely distributed in relation to PC1 and PC2; and 4) that there may be two forms of late-onset AD which grade into each other, one of which was associated with less SP and NFT development but with a greater degree of brain atherosclerosis.
Abstract:
We propose a model-based approach to unify clustering and network modeling using time-course gene expression data. Specifically, our approach uses a mixture model to cluster genes. Genes within the same cluster share a similar expression profile. The network is built over cluster-specific expression profiles using state-space models. We discuss the application of our model to simulated data as well as to time-course gene expression data arising from animal models of prostate cancer progression. The latter application shows that, with combined statistical and bioinformatics analyses, we are able to extract gene-to-gene relationships supported by the literature as well as new plausible relationships.
Abstract:
Grocery shopping is a routine activity widely considered the responsibility of the female spouse, yet modern social and demographic shifts are causing men to engage in this task. This study develops a retail shopping typology of male grocery shoppers, employing a cluster analysis technique. Five distinct cohorts emerge from data on eight constructs, measured by seventy-one items. One new shopper type emerges from this research. This shopper presented as a younger man, at the commencement of his family lifecycle, attracted by a strong value offer, focusing on price and promotional discounts. Our research contributes to the marketing, consumer behaviour and supermarket retailing disciplines in three ways: by examining and identifying male shopping behaviour in the context of grocery shopping, by developing a retail shopping typology of male grocery shoppers, and by extending and employing cluster analysis to identify distinct groups. This research has implications for gender, segmentation studies and consumer behaviour disciplines in regard to grocery shopping. The identification of specific groups of male grocery shoppers will enable grocery retailers to effectively implement important, targeted marketing strategies.
Abstract:
Based on the theoretical tools of Complex Networks, this work provides a basic descriptive study of a synonyms dictionary, the Spanish Open Thesaurus, represented as a graph. We study the main structural measures of the network compared with those of a random graph. Numerical results show that Open-Thesaurus is a graph whose topological properties approximate a scale-free network, but that it does not seem to present the small-world property because of its sparse structure. We also found that the words of highest betweenness centrality are terms that suggest the vocabulary of psychoanalysis: placer (pleasure), ayudante (in the sense of assistant or worker), and regular (to regulate).
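To make the betweenness centrality mentioned above concrete, here is a minimal sketch of Brandes' algorithm for an undirected, unweighted graph. It is an illustration only: the abstract does not say how the measure was computed, and the four-node path graph and function name are hypothetical, not data from the thesaurus.

```python
from collections import deque

def betweenness(graph):
    """Unnormalised betweenness centrality for an undirected, unweighted
    graph given as {node: set_of_neighbours} (Brandes' algorithm)."""
    score = {v: 0.0 for v in graph}
    for s in graph:
        order = []                          # nodes in order of discovery
        preds = {v: [] for v in graph}      # predecessors on shortest paths
        sigma = {v: 0 for v in graph}       # number of shortest paths from s
        dist = {v: -1 for v in graph}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:                        # breadth-first search from s
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in graph}     # dependency of s on each node
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2.0 for v, b in score.items()}

# Hypothetical toy graph: the path a - b - c - d.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}
scores = betweenness(path)
```

On this path the two inner nodes each lie on two shortest paths between other pairs, while the end nodes lie on none, which is the sense in which high-betweenness words act as bridges in a synonym graph.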
Abstract:
The Inter-American Tropical Tuna Commission (IATTC) staff has been sampling the size distributions of tunas in the eastern Pacific Ocean (EPO) since 1954, and the species composition of the catches since 2000. The IATTC staff use the data from the species composition samples, in conjunction with observer and/or logbook data, and unloading data from the canneries to estimate the total annual catches of yellowfin (Thunnus albacares), skipjack (Katsuwonus pelamis), and bigeye (Thunnus obesus) tunas. These sample data are collected based on a stratified sampling design. I propose an update of the stratification of the EPO into more homogenous areas in order to reduce the variance in the estimates of the total annual catches and incorporate the geographical shifts resulting from the expansion of the floating-object fishery during the 1990s. The sampling model used by the IATTC is a stratified two-stage (cluster) random sampling design with first stage units varying (unequal) in size. The strata are month, area, and set type. Wells, the first cluster stage, are selected to be sampled only if all of the fish were caught in the same month, same area, and same set type. Fish, the second cluster stage, are sampled for lengths, and independently, for species composition of the catch. The EPO is divided into 13 sampling areas, which were defined in 1968, based on the catch distributions of yellowfin and skipjack tunas. This area stratification does not reflect the multi-species, multi-set-type fishery of today. In order to define more homogenous areas, I used agglomerative cluster analysis to look for groupings of the size data and the catch and effort data for 2000–2006. I plotted the results from both datasets against the IATTC Sampling Areas, and then created new areas. I also used the results of the cluster analysis to update the substitution scheme for strata with catch, but no sample. 
I then calculated the total annual catch (and variance) by species by stratifying the data into new Proposed Sampling Areas and compared the results to those reported by the IATTC. Results showed that re-stratifying the areas produced smaller variances of the catch estimates for some species in some years, but the results were not significant.
Abstract:
Drought frequency analysis can be performed with statistical techniques developed for determining recurrence intervals for extreme precipitation and flood events (Linsley et al. 1992). The drought analysis method discussed in this paper uses the log-Pearson Type III distribution, which has been widely used in flood frequency research. Some of the difficulties encountered when using this distribution for drought analysis are investigated.
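As a rough sketch of how a log-Pearson Type III quantile is commonly computed: fit the mean, standard deviation and skew of the base-10 logarithms, then apply a frequency factor. The code below uses the Wilson-Hilferty approximation for that factor, which is an assumption (the abstract does not state the paper's procedure); the function names and the sample of annual minimum flows are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev

def sample_skew(values):
    """Bias-adjusted sample skew coefficient."""
    n, m, s = len(values), mean(values), stdev(values)
    return n * sum((v - m) ** 3 for v in values) / ((n - 1) * (n - 2) * s ** 3)

def frequency_factor(p, skew):
    """Wilson-Hilferty approximation to the Pearson Type III frequency
    factor for non-exceedance probability p."""
    z = NormalDist().inv_cdf(p)
    if abs(skew) < 1e-6:        # zero skew reduces to the normal deviate
        return z
    k = skew / 6.0
    return (2.0 / skew) * ((1.0 + k * z - k * k) ** 3 - 1.0)

def lp3_quantile(values, p):
    """Value with non-exceedance probability p under a log-Pearson III fit."""
    logs = [math.log10(v) for v in values]
    g = sample_skew(logs)
    return 10 ** (mean(logs) + frequency_factor(p, g) * stdev(logs))

# Hypothetical annual minimum flows (arbitrary units).
flows = [12.0, 15.5, 9.8, 20.1, 14.2, 11.3, 17.6, 8.9, 13.4, 16.0]
q10 = lp3_quantile(flows, 0.10)   # low flow with a 10% chance of not being exceeded
q50 = lp3_quantile(flows, 0.50)
q90 = lp3_quantile(flows, 0.90)
```

For droughts the interest is in the lower tail, so small non-exceedance probabilities are used, whereas flood studies work with the upper tail. The approximation degrades for strongly skewed samples, and short records make the skew estimate itself unreliable.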