34 resultados para Cluster Analysis of Variables

em CentAUR: Central Archive University of Reading - UK


Relevância:

100.00% 100.00%

Publicador:

Resumo:

A realistic representation of the North Atlantic tropical cyclone tracks is crucial as it allows, for example, explaining potential changes in US landfalling systems. Here we present a tentative study, which examines the ability of recent climate models to represent North Atlantic tropical cyclone tracks. Tracks from two types of climate models are evaluated: explicit tracks are obtained from tropical cyclones simulated in regional or global climate models with moderate to high horizontal resolution (1° to 0.25°), and downscaled tracks are obtained using a downscaling technique with large-scale environmental fields from a subset of these models. For both configurations, tracks are objectively separated into four groups using a cluster technique, leading to a zonal and a meridional separation of the tracks. The meridional separation largely captures the separation between deep tropical and sub-tropical, hybrid or baroclinic cyclones, while the zonal separation segregates Gulf of Mexico and Cape Verde storms. The properties of the tracks’ seasonality, intensity and power dissipation index in each cluster are documented for both configurations. Our results show that except for the seasonality, the downscaled tracks better capture the observed characteristics of the clusters. We also use three different idealized scenarios to examine the possible future changes of tropical cyclone tracks under 1) warming sea surface temperature, 2) increasing carbon dioxide, and 3) a combination of the two. The response to each scenario is highly variable depending on the simulation considered. Finally, we examine the role of each cluster in these future changes and find no preponderant contribution of any single cluster over the others.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The validity of ensemble averaging on event-related potential (ERP) data has been questioned, due to its assumption that the ERP is identical across trials. Thus, there is a need for preliminary testing for cluster structure in the data. New method: We propose a complete pipeline for the cluster analysis of ERP data. To increase the signalto-noise (SNR) ratio of the raw single-trials, we used a denoising method based on Empirical Mode Decomposition (EMD). Next, we used a bootstrap-based method to determine the number of clusters, through a measure called the Stability Index (SI). We then used a clustering algorithm based on a Genetic Algorithm (GA)to define initial cluster centroids for subsequent k-means clustering. Finally, we visualised the clustering results through a scheme based on Principal Component Analysis (PCA). Results: After validating the pipeline on simulated data, we tested it on data from two experiments – a P300 speller paradigm on a single subject and a language processing study on 25 subjects. Results revealed evidence for the existence of 6 clusters in one experimental condition from the language processing study. Further, a two-way chi-square test revealed an influence of subject on cluster membership.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The variogram is essential for local estimation and mapping of any variable by kriging. The variogram itself must usually be estimated from sample data. The sampling density is a compromise between precision and cost, but it must be sufficiently dense to encompass the principal spatial sources of variance. A nested, multi-stage, sampling with separating distances increasing in geometric progression from stage to stage will do that. The data may then be analyzed by a hierarchical analysis of variance to estimate the components of variance for every stage, and hence lag. By accumulating the components starting from the shortest lag one obtains a rough variogram for modest effort. For balanced designs the analysis of variance is optimal; for unbalanced ones, however, these estimators are not necessarily the best, and the analysis by residual maximum likelihood (REML) will usually be preferable. The paper summarizes the underlying theory and illustrates its application with data from three surveys, one in which the design had four stages and was balanced and two implemented with unbalanced designs to economize when there were more stages. A Fortran program is available for the analysis of variance, and code for the REML analysis is listed in the paper. (c) 2005 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The variogram is essential for local estimation and mapping of any variable by kriging. The variogram itself must usually be estimated from sample data. The sampling density is a compromise between precision and cost, but it must be sufficiently dense to encompass the principal spatial sources of variance. A nested, multi-stage, sampling with separating distances increasing in geometric progression from stage to stage will do that. The data may then be analyzed by a hierarchical analysis of variance to estimate the components of variance for every stage, and hence lag. By accumulating the components starting from the shortest lag one obtains a rough variogram for modest effort. For balanced designs the analysis of variance is optimal; for unbalanced ones, however, these estimators are not necessarily the best, and the analysis by residual maximum likelihood (REML) will usually be preferable. The paper summarizes the underlying theory and illustrates its application with data from three surveys, one in which the design had four stages and was balanced and two implemented with unbalanced designs to economize when there were more stages. A Fortran program is available for the analysis of variance, and code for the REML analysis is listed in the paper. (c) 2005 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A first step in interpreting the wide variation in trace gas concentrations measured over time at a given site is to classify the data according to the prevailing weather conditions. In order to classify measurements made during two intensive field campaigns at Mace Head, on the west coast of Ireland, an objective method of assigning data to different weather types has been developed. Air-mass back trajectories calculated using winds from ECMWF analyses, arriving at the site in 1995–1997, were allocated to clusters based on a statistical analysis of the latitude, longitude and pressure of the trajectory at 12 h intervals over 5 days. The robustness of the analysis was assessed by using an ensemble of back trajectories calculated for four points around Mace Head. Separate analyses were made for each of the 3 years, and for four 3-month periods. The use of these clusters in classifying ground-based ozone measurements at Mace Head is described, including the need to exclude data which have been influenced by local perturbations to the regional flow pattern, for example, by sea breezes. Even with a limited data set, based on 2 months of intensive field measurements in 1996 and 1997, there are statistically significant differences in ozone concentrations in air from the different clusters. The limitations of this type of analysis for classification and interpretation of ground-based chemistry measurements are discussed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Flow in geophysical fluids is commonly summarized by coherent streams, for example conveyor belt flows in extratropical cyclones or jet streaks in the upper troposphere. Typically, parcel trajectories are calculated from the flow field and subjective thresholds are used to distinguish coherent streams of interest. This methodology contribution develops a more objective approach to distinguish coherent airstreams within extratropical cyclones. Agglomerative clustering is applied to trajectories along with a method to identify the optimal number of cluster classes. The methodology is applied to trajectories associated with the low-level jets of a well-studied extratropical cyclone. For computational efficiency, a constraint that trajectories must pass through these jet regions is applied prior to clustering; the partitioning into different airstreams is then performed by the agglomerative clustering. It is demonstrated that the methodology can identify the salient flow structures of cyclones: the warm and cold conveyor belts. A test focusing on the airstreams terminating at the tip of the bent-back front further demonstrates the success of the method in that it can distinguish fine-scale flow structure such as descending sting jet airstreams.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The hepatitis C virus (HCV) non-structural 5A protein (NS5A) contains a highly conserved C-terminal polyproline motif with the consensus sequence Pro-X-X- Pro-X-Arg that is able to interact with the Src-homology 3 (SH3) domains of a variety of cellular proteins. Results: To understand this interaction in more detail we have expressed two N-terminally truncated forms of NS5A in E. coli and examined their interactions with the SH3 domain of the Src-family tyrosine kinase, Fyn. Surface plasmon resonance analysis revealed that NS5A binds to the Fyn SH3 domain with what can be considered a high affinity SH3 domain-ligand interaction (629 nM), and this binding did not require the presence of domain I of NS5A (amino acid residues 32-250). Mutagenic analysis of the Fyn SH3 domain demonstrated the requirement for an acidic cluster at the C-terminus of the RT-Src loop of the SH3 domain, as well as several highly conserved residues previously shown to participate in SH3 domain peptide binding. Conclusion: We conclude that the NS5A: Fyn SH3 domain interaction occurs via a canonical SH3 domain binding site and the high affinity of the interaction suggests that NS5A would be able to compete with cognate Fyn ligands within the infected cell.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

1Urban areas are predicted to grow significantly in the foreseeable future because of increasing human population growth. Predicting the impact of urban development and expansion on mammal populations is of considerable interest due to possible effects on biodiversity and human-wildlife conflict. 2The British government has recently announced a substantial housing programme to meet the demands of its growing population and changing socio-economic profile. This is likely to result in the construction of high-density, low-cost housing with small residential gardens. To assess the potential effects of this programme, we analysed the factors affecting the current pattern of use of residential gardens by a range of mammal species using a questionnaire distributed in wildlife and gardening magazines and via The Mammal Society. 3Twenty-two species/species groups were recorded. However, the pattern of garden use by individual species was limited, with only six species/species groups (bats, red fox Vulpes vulpes, grey squirrel Sciurus carolinensis, hedgehog Erinaceus europaeus, mice, voles) recorded as frequent visitors to > 20% of gardens in the survey. 4There was a high degree of association between the variables recorded in the study, such that it was difficult to quantify the effects of individual variables. However, all species/species groups appeared to be negatively affected by the increased fragmentation and reduced proximity of natural and semi-natural habitats, decreasing garden size and garden structure, but to differing degrees. Patterns of garden use were most clearly affected by house location (city, town, village, rural), with garden use declining with increasing urbanization for the majority of species/species groups, except red foxes and grey squirrels. Increasing urbanization is likely to be related to a wide range of interrelated factors, any or all of which may affect a range of mammal species. 5Overall, the probable effects of the planned housing development programme in Britain are not likely to be beneficial to mammal populations, although the pattern of use examined in this study may represent patterns of habitat selection by species rather than differences in distribution or abundance. Consequently, additional data are required on the factors affecting the density of species within urban environments.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The distribution and activity of communities of sulfate-reducing bacteria (SRB) and methanogenic archaea in two contrasting Antarctic sediments were investigated. Methanogenesis dominated in freshwater Lake Heywood, while sulfate reduction dominated in marine Shallow Bay. Slurry experiments indicated that 90% of the methanogenesis in Lake Heywood was acetoclastic. This finding was supported by the limited diversity of clones detected in a Lake Heywood archaeal clone library, in which most clones were closely related to the obligate acetate-utilizing Methanosaeta concilii. The Shallow Bay archaeal clone library contained clones related to the C-1-utilizing Methanolobus and Methanococcoides and the H-2-utilizing Methanogenium. Oligonucleotide probing of RNA extracted directly from sediment indicated that archaea represented 34% of the total prokaryotic signal in Lake Heywood and that Methanosaeta was a major component (13.2%) of this signal. Archaea represented only 0.2% of the total prokaryotic signal in RNA extracted from Shallow Bay sediments. In the Shallow Bay bacterial clone library, 10.3% of the clones were SRB-like, related to Desulfotalea/Desulforhopalus, Desulfofaba, Desulfosarcina, and Desulfobacter as well as to the sulfur and metal oxidizers comprising the Desulfuromonas cluster. Oligonucleotide probes for specific SRB clusters indicated that SRB represented 14.7% of the total prokaryotic signal, with Desulfotalea/Desulforhopalus being the dominant SRB group (10.7% of the total prokaryotic signal) in the Shallow Bay sediments; these results support previous results obtained for Arctic sediments. Methanosaeta and Desulfotalea/Desulforhopalus appear to be important in Lake Heywood and Shallow Bay, respectively, and may be globally important in permanently low-temperature sediments.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A full dimensional, ab initio-based semiglobal potential energy surface for C2H3+ is reported. The ab initio electronic energies for this molecule are calculated using the spin-restricted, coupled cluster method restricted to single and double excitations with triples corrections [RCCSD(T)]. The RCCSD(T) method is used with the correlation-consistent polarized valence triple-zeta basis augmented with diffuse functions (aug-cc-pVTZ). The ab initio potential energy surface is represented by a many-body (cluster) expansion, each term of which uses functions that are fully invariant under permutations of like nuclei. The fitted potential energy surface is validated by comparing normal mode frequencies at the global minimum and secondary minimum with previous and new direct ab initio frequencies. The potential surface is used in vibrational analysis using the "single-reference" and "reaction-path" versions of the code MULTIMODE. (c) 2006 American Institute of Physics.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We provide a system identification framework for the analysis of THz-transient data. The subspace identification algorithm for both deterministic and stochastic systems is used to model the time-domain responses of structures under broadband excitation. Structures with additional time delays can be modelled within the state-space framework using additional state variables. We compare the numerical stability of the commonly used least-squares ARX models to that of the subspace N4SID algorithm by using examples of fourth-order and eighth-order systems under pulse and chirp excitation conditions. These models correspond to structures having two and four modes simultaneously propagating respectively. We show that chirp excitation combined with the subspace identification algorithm can provide a better identification of the underlying mode dynamics than the ARX model does as the complexity of the system increases. The use of an identified state-space model for mode demixing, upon transformation to a decoupled realization form is illustrated. Applications of state-space models and the N4SID algorithm to THz transient spectroscopy as well as to optical systems are highlighted.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The overall operation and internal complexity of a particular production machinery can be depicted in terms of clusters of multidimensional points which describe the process states, the value in each point dimension representing a measured variable from the machinery. The paper describes a new cluster analysis technique for use with manufacturing processes, to illustrate how machine behaviour can be categorised and how regions of good and poor machine behaviour can be identified. The cluster algorithm presented is the novel mean-tracking algorithm, capable of locating N-dimensional clusters in a large data space in which a considerable amount of noise is present. Implementation of the algorithm on a real-world high-speed machinery application is described, with clusters being formed from machinery data to indicate machinery error regions and error-free regions. This analysis is seen to provide a promising step ahead in the field of multivariable control of manufacturing systems.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper deals with the selection of centres for radial basis function (RBF) networks. A novel mean-tracking clustering algorithm is described as a way in which centers can be chosen based on a batch of collected data. A direct comparison is made between the mean-tracking algorithm and k-means clustering and it is shown how mean-tracking clustering is significantly better in terms of achieving an RBF network which performs accurate function modelling.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper describes the novel use of cluster analysis in the field of industrial process control. The severe multivariable process problems encountered in manufacturing have often led to machine shutdowns, where the need for corrective actions arises in order to resume operation. Production faults which are caused by processes running in less efficient regions may be prevented or diagnosed using a reasoning based on cluster analysis. Indeed the intemal complexity of a production machinery may be depicted in clusters of multidimensional data points which characterise the manufacturing process. The application of a Mean-Tracking cluster algorithm (developed in Reading) to field data acquired from a high-speed machinery will be discussed. The objective of such an application is to illustrate how machine behaviour can be studied, in particular how regions of erroneous and stable running behaviour can be identified.