59 resultados para PCA and HCA
Resumo:
The techniques of principal component analysis (PCA) and partial least squares (PLS) are introduced from the point of view of providing a multivariate statistical method for modelling process plants. The advantages and limitations of PCA and PLS are discussed from the perspective of the type of data and problems that might be encountered in this application area. These concepts are exemplified by two case studies dealing first with data from a continuous stirred tank reactor (CSTR) simulation and second a literature source describing a low-density polyethylene (LDPE) reactor simulation.
Resumo:
Systematic principal component analysis (PCA) methods are presented in this paper for reliable islanding detection for power systems with significant penetration of distributed generations (DGs), where synchrophasors recorded by Phasor Measurement Units (PMUs) are used for system monitoring. Existing islanding detection methods such as Rate-of-change-of frequency (ROCOF) and Vector Shift are fast for processing local information, however with the growth in installed capacity of DGs, they suffer from several drawbacks. Incumbent genset islanding detection cannot distinguish a system wide disturbance from an islanding event, leading to mal-operation. The problem is even more significant when the grid does not have sufficient inertia to limit frequency divergences in the system fault/stress due to the high penetration of DGs. To tackle such problems, this paper introduces PCA methods for islanding detection. Simple control chart is established for intuitive visualization of the transients. A Recursive PCA (RPCA) scheme is proposed as a reliable extension of the PCA method to reduce the false alarms for time-varying process. To further reduce the computational burden, the approximate linear dependence condition (ALDC) errors are calculated to update the associated PCA model. The proposed PCA and RPCA methods are verified by detecting abnormal transients occurring in the UK utility network.
Resumo:
We investigated the role of the C1772T polymorphisms in exon 12 of the Hypoxia-inducible factor-1 alpha (HIF-1alpha) gene C1772T genotype in prostate cancer (PCa) and amplification of the hypoxic response. We identified the heterozygous germline CT genotype as an increased risk factor for clinically localised prostate cancer (Odds ratio = 6.2; p < 0.0001). While immunostaining intensity for HIF-1alpha and VEGF was significantly enhanced in 75% of PCa specimens when compared to matched benign specimens (p < 0.0001), the CT genotype did not modulate the kinetics of HIF-1alpha protein expression in hypoxia in vitro, and was not associated with enhanced expression of hypoxic biomarkers. This study provides the first evidence of an increased risk for clinically localised prostate cancer in men carrying the C1772T HIF-1alpha gene polymorphism. Although our results did not suggest an association between expression of hypoxic biomarkers and genotype status, the correlation may merit further investigation.
Resumo:
The unfolded protein response (UPR) is a homeostatic mechanism to maintain endoplasmic reticulum (ER) function. The UPR is activated by various physiological conditions as well as in disease states, such as cancer. As androgens regulate secretion and development of the normal prostate and drive prostate cancer (PCa) growth, they may affect UPR pathways. Here, we show that the canonical UPR pathways are directly and divergently regulated by androgens in PCa cells, through the androgen receptor (AR), which is critical for PCa survival. AR bound to gene regulatory sites and activated the IRE1α branch, but simultaneously inhibited PERK signaling. Inhibition of the IRE1α arm profoundly reduced PCa cell growth in vitro as well as tumor formation in preclinical models of PCa in vivo. Consistently, AR and UPR gene expression were correlated in human PCa, and spliced XBP-1 expression was significantly upregulated in cancer compared with normal prostate. These data establish a genetic switch orchestrated by AR that divergently regulates the UPR pathways and suggest that targeting IRE1α signaling may have therapeutic utility in PCa.
Resumo:
A compositional multivariate approach is used to analyse regional scale soil geochemical data obtained as part of the Tellus Project generated by the Geological Survey Northern Ireland (GSNI). The multi-element total concentration data presented comprise XRF analyses of 6862 rural soil samples collected at 20cm depths on a non-aligned grid at one site per 2 km2. Censored data were imputed using published detection limits. Using these imputed values for 46 elements (including LOI), each soil sample site was assigned to the regional geology map provided by GSNI initially using the dominant lithology for the map polygon. Northern Ireland includes a diversity of geology representing a stratigraphic record from the Mesoproterozoic, up to and including the Palaeogene. However, the advance of ice sheets and their meltwaters over the last 100,000 years has left at least 80% of the bedrock covered by superficial deposits, including glacial till and post-glacial alluvium and peat. The question is to what extent the soil geochemistry reflects the underlying geology or superficial deposits. To address this, the geochemical data were transformed using centered log ratios (clr) to observe the requirements of compositional data analysis and avoid closure issues. Following this, compositional multivariate techniques including compositional Principal Component Analysis (PCA) and minimum/maximum autocorrelation factor (MAF) analysis method were used to determine the influence of underlying geology on the soil geochemistry signature. PCA showed that 72% of the variation was determined by the first four principal components (PC’s) implying “significant” structure in the data. Analysis of variance showed that only 10 PC’s were necessary to classify the soil geochemical data. To consider an improvement over PCA that uses the spatial relationships of the data, a classification based on MAF analysis was undertaken using the first 6 dominant factors. Understanding the relationship between soil geochemistry and superficial deposits is important for environmental monitoring of fragile ecosystems such as peat. To explore whether peat cover could be predicted from the classification, the lithology designation was adapted to include the presence of peat, based on GSNI superficial deposit polygons and linear discriminant analysis (LDA) undertaken. Prediction accuracy for LDA classification improved from 60.98% based on PCA using 10 principal components to 64.73% using MAF based on the 6 most dominant factors. The misclassification of peat may reflect degradation of peat covered areas since the creation of superficial deposit classification. Further work will examine the influence of underlying lithologies on elemental concentrations in peat composition and the effect of this in classification analysis.
Resumo:
The androgen receptor (AR) is required for prostate cancer (PCa) survival and progression, and ablation of AR activity is the first line of therapeutic intervention for disseminated disease. While initially effective, recurrent tumors ultimately arise for which there is no durable cure. Despite the dependence of PCa on AR activity throughout the course of disease, delineation of the AR-dependent transcriptional network that governs disease progression remains elusive, and the function of AR in mitotically active cells is not well understood. Analyzing AR activity as a function of cell cycle revealed an unexpected and highly expanded repertoire of AR-regulated gene networks in actively cycling cells. New AR functions segregated into two major clusters: those that are specific to cycling cells and retained throughout the mitotic cell cycle ('Cell Cycle Common'), versus those that were specifically enriched in a subset of cell cycle phases ('Phase Restricted'). Further analyses identified previously unrecognized AR functions in major pathways associated with clinical PCa progression. Illustrating the impact of these unmasked AR-driven pathways, dihydroceramide desaturase 1 was identified as an AR-regulated gene in mitotically active cells that promoted pro-metastatic phenotypes, and in advanced PCa proved to be highly associated with development of metastases, recurrence after therapeutic intervention and reduced overall survival. Taken together, these findings delineate AR function in mitotically active tumor cells, thus providing critical insight into the molecular basis by which AR promotes development of lethal PCa and nominate new avenues for therapeutic intervention.
Resumo:
This brief examines the application of nonlinear statistical process control to the detection and diagnosis of faults in automotive engines. In this statistical framework, the computed score variables may have a complicated nonparametric distri- bution function, which hampers statistical inference, notably for fault detection and diagnosis. This brief shows that introducing the statistical local approach into nonlinear statistical process control produces statistics that follow a normal distribution, thereby enabling a simple statistical inference for fault detection. Further, for fault diagnosis, this brief introduces a compensation scheme that approximates the fault condition signature. Experimental results from a Volkswagen 1.9-L turbo-charged diesel engine are included.
Resumo:
Traditional Chinese Medicines (TCMs) derived from animal horns are one of the most important types of Chinese medicine. In the present study, a fast and sensitive analytical method was established for qualitative and quantitative determination of 14 nucleosides and nucleobases in animal horns using hydrophilic interaction ultra-high performance liquid chromatography coupled with triple-quadruple tandem mass spectrometry (HILIC-UPLC-QQQ-MS/MS) in selective reaction monitoring (SRM) mode. The method was optimized and validated, and showed good linearity, precision, repeatability, and accuracy. The method was successfully used to determine contents of the 14 nucleosides and nucleobases in 25 animal horn samples. Hierarchical clustering analysis (HCA) and principal component analysis (PCA) were performed and the 25 samples were thereby divided into two groups, which agreed with taxonomy. The method may enable quick and effective search of substitutes for precious horns.
Resumo:
This paper proposes a method for the detection and classification of multiple events in an electrical power system in real-time, namely; islanding, high frequency events (loss of load) and low frequency events (loss of generation). This method is based on principal component analysis of frequency measurements and employs a moving window approach to combat the time-varying nature of power systems, thereby increasing overall situational awareness of the power system. Numerical case studies using both real data, collected from the UK power system, and simulated case studies, constructed using DigSilent PowerFactory, for islanding events, as well as both loss of load and generation dip events, are used to demonstrate the reliability of the proposed method.
Resumo:
Quantitative examination of prostate histology offers clues in the diagnostic classification of lesions and in the prediction of response to treatment and prognosis. To facilitate the collection of quantitative data, the development of machine vision systems is necessary. This study explored the use of imaging for identifying tissue abnormalities in prostate histology. Medium-power histological scenes were recorded from whole-mount radical prostatectomy sections at × 40 objective magnification and assessed by a pathologist as exhibiting stroma, normal tissue (nonneoplastic epithelial component), or prostatic carcinoma (PCa). A machine vision system was developed that divided the scenes into subregions of 100 × 100 pixels and subjected each to image-processing techniques. Analysis of morphological characteristics allowed the identification of normal tissue. Analysis of image texture demonstrated that Haralick feature 4 was the most suitable for discriminating stroma from PCa. Using these morphological and texture measurements, it was possible to define a classification scheme for each subregion. The machine vision system is designed to integrate these classification rules and generate digital maps of tissue composition from the classification of subregions; 79.3% of subregions were correctly classified. Established classification rates have demonstrated the validity of the methodology on small scenes; a logical extension was to apply the methodology to whole slide images via scanning technology. The machine vision system is capable of classifying these images. The machine vision system developed in this project facilitates the exploration of morphological and texture characteristics in quantifying tissue composition. It also illustrates the potential of quantitative methods to provide highly discriminatory information in the automated identification of prostatic lesions using computer vision.
Resumo:
This paper builds on work presented in the first paper, Part 1 [1] and is of equal significance. The paper proposes a novel compensation method to preserve the integrity of step-fault signatures prevalent in various processes that can be masked during the removal of both auto- and cross correlation. Using industrial data, the paper demonstrates the benefit of the proposed method, which is applicable to chemical, electrical, and mechanical process monitoring. This paper, (and Part 1 [1]), has led to further work supported by EPSRC grant GR/S84354/01 involving kernel PCA methods.
Resumo:
A 1.2 m sediment core from Lake Forsyth, Canterbury, New Zealand, records the development of the catchment/lake system over the last 7000 years, and its response to anthropogenic disturbance following European settlement c. 1840 AD. Pollen was used to reconstruct catchment vegetation history, while foraminifera, chironomids, Trichoptera, and the abundance of Pediastrum simplex colonies were used to infer past environmental conditions within the lake. The basal 30 cm of core records the transition of the Lake Forsyth Basin from a tidal embayment to a brackish coastal lake. Timing of closure of the lake mouth could not be accurately determined, but it appears that Lake Forsyth had stabilised as a slightly brackish, oligo mesotrophic shallow lake by about 500 years BP. Major deforestation occurred on Banks Peninsula between 1860 AD and 1890 AD. This deforestation is marked by the rapid decline in the main canopy trees (Prumnopitys taxifolia (matai) and Podocarpus totara/hallii (totara/mountain totara), an increase in charcoal, and the appearance of grasses. At around 1895 AD, pine appears in the record while a willow (Salix spp.) appears somewhat later. Redundancy analysis (RDA) of the pollen and aquatic species data revealed a significant relationship between regional vegetation and the abundance of aquatic taxa, with the percentage if disturbance pollen explaining most (14.8%) of the constrained variation in the aquatic species data. Principle components analysis (PCA) of aquatic species data revealed that the most significant period of rapid biological change in the lakes history corresponded to the main period of human disturbance in the catchment. Deforestation led to increased sediment and nutrient input into the lake which was accompanied by a major reduction in salinity. These changes are inferred from the appearance and proliferation of freshwater algae (Pediastrum simplex), an increase in abundance and diversity of chironomids, and the abundance of cases and remains from the larvae of the caddisfly, Oecetis unicolor. Eutrophication accompanied by increasing salinity of the lake is inferred from a significant peak and then decline of P. simplex, and a reduction in the abundance and diversity of aquatic invertebrates. The artificial opening of the lake to the Pacific Ocean, which began in the late 1800s, is the likely cause of the recent increase in salinity. An increase in salinity may have also encouraged blooms of the halotolerant and hepatotoxic cyanobacteria Nodularia spumigena.
Resumo:
This paper introduces a fast algorithm for moving window principal component analysis (MWPCA) which will adapt a principal component model. This incorporates the concept of recursive adaptation within a moving window to (i) adapt the mean and variance of the process variables, (ii) adapt the correlation matrix, and (iii) adjust the PCA model by recomputing the decomposition. This paper shows that the new algorithm is computationally faster than conventional moving window techniques, if the window size exceeds 3 times the number of variables, and is not affected by the window size. A further contribution is the introduction of an N-step-ahead horizon into the process monitoring. This implies that the PCA model, identified N-steps earlier, is used to analyze the current observation. For monitoring complex chemical systems, this work shows that the use of the horizon improves the ability to detect slowly developing drifts.