866 resultados para INDEPENDENT COMPONENT ANALYSIS (ICA)
Resumo:
We propose an algorithm for solving optimization problems defined on a subset of the cone of symmetric positive semidefinite matrices. This algorithm relies on the factorization X = Y Y T , where the number of columns of Y fixes an upper bound on the rank of the positive semidefinite matrix X. It is thus very effective for solving problems that have a low-rank solution. The factorization X = Y Y T leads to a reformulation of the original problem as an optimization on a particular quotient manifold. The present paper discusses the geometry of that manifold and derives a second-order optimization method with guaranteed quadratic convergence. It furthermore provides some conditions on the rank of the factorization to ensure equivalence with the original problem. In contrast to existing methods, the proposed algorithm converges monotonically to the sought solution. Its numerical efficiency is evaluated on two applications: the maximal cut of a graph and the problem of sparse principal component analysis. © 2010 Society for Industrial and Applied Mathematics.
Resumo:
This work considers the problem of fitting data on a Lie group by a coset of a compact subgroup. This problem can be seen as an extension of the problem of fitting affine subspaces in n to data which can be solved using principal component analysis. We show how the fitting problem can be reduced for biinvariant distances to a generalized mean calculation on an homogeneous space. For biinvariant Riemannian distances we provide an algorithm based on the Karcher mean gradient algorithm. We illustrate our approach by some examples on SO(n). © 2010 Springer -Verlag Berlin Heidelberg.
Resumo:
This study consisted of sampling benthic algae at 32 sites in the Gangqu River, an important upstream tributary of the Yangtze River. Our aims were to characterize the benthic algae communities and relationships with environmental variables. Among the 162 taxa observed, Achnanthes linearis and Achnanthes lanceolata var. elliptica were the dominant species (17.10% and 14.30% of the total relative abundance, respectively). Major gradients and principal patterns of variation within the environmental variables were detected by principal component analysis (PCA). Then non-metric multidimensional scaling (NMS) divided all the sites into three groups, which were validated by multi-response permutation procedures (MRPP). Canonical correspondence analysis (CCA) indicated that three environmental variables (TN, TDS, and TP) significantly affected the distribution of benthic algae. Weighted averaging regression and cross-calibration produced strong models for predicting TN and TDS concentration, which enabled selection of algae taxa as potentially sensitive indicators of certain TN and TDS levels: for TN, Achnanthes lanceolata, Achnanthes lanceolata var. elliptica, and Cymbella ventricosa var. semicircularis; for TDS, Cocconeis placentula, Cymbella alpina var. minuta, and Fragilaria virescens. The present study represents an early step in establishing baseline conditions. Further monitoring is suggested to gain a better understanding of this region.
Resumo:
Critical swimming speeds (U-crit) and morphological characters were compared between the F-4 generation of GH-transgenic common carp Cyprinus carpio and the non-transgenic controls. Transgenic fish displayed a mean absolute U-crit value 22.3% lower than the controls. Principal component analysis identified variations in body shape, with transgenic fish having significantly deeper head, longer caudal length of the dorsal region, longer standard length (L-S) and shallower body and caudal region, and shorter caudal length of the ventral region. Swimming speeds were related to the combination of deeper body and caudal region, longer caudal length of the ventral region, shallower head depth, shorter caudal length of dorsal region and L-S. These findings suggest that morphological variations which are poorly suited to produce maximum thrust and minimum drag in GH-transgenic C. carpio may be responsible for their lower swimming abilities in comparison with non-transgenic controls.
Resumo:
The purpose of the research is to study the seasonal succession of protozoa community and the effect of water quality on the protozoa community to characterize biochemical processes occurring at a eutrophic Lake Donghu, a large shallow lake in Wuhan City, China. Samples of protozoa communities were obtained monthly at three stations by PFU (polyurethane foam unit) method over a year. Synchronously, water samples also were taken from the stations for the water chemical quality analysis. Six major variables were examined in a principal component analysis (PCA), which indicate the fast changes of water quality in this station I and less within-year variation and a comparatively stable water quality in stations II and III. The community data were analyzed using multivariate techniques, and we show that clusters are rather mixed and poorly separated, suggesting that the community structure is changing gradually, giving a slight merging of clusters form the summer to the autumn and the autumn to the winter. Canonical correspondence analysis (CCA) was used to infer the relationship between water quality variables and phytoplankton community structure, which changed substantially over the survey period. From the analysis of cluster and CCA, coupled by community pollution value (CPV), it is concluded that the key factors driving the change in protozoa community composition in Lake Donghu was water qualities rather than seasons. (c) 2006 Elsevier Ltd. All rights reserved.
Resumo:
Although the peritrichous ciliate Carchesium polypinum is common in freshwater, its population genetic structure is largely unknown. We used inter-simple sequence repeat (ISSR) fingerprinting to analyze the genetic structure of 48 different isolates of the species from four lakes in Wuhan, central China. Using eight polymorphic primers, 81 discernible DNA fragments were detected, among which 76 (93.83%) were polymorphic, indicating high genetic diversity at the isolate level. Further, Nei's gene diversity (h) and Shannon's Information index (I) between the different isolates both revealed a remarkable genetic diversity, higher than previously indicated by their morphology. At the same time, substantial gene flow was found. So the main factors responsible for the high level of diversity within populations are probably due to conjugation (sexual reproduction) and wide distribution of swarmers. Analysis of molecular variance (AMOVA) showed that there was low genetic differentiation among the four populations probably due to common ancestry and flooding events. The cluster analysis and principal component analysis (PCA) suggested that genotypes isolated from the same lake displayed a higher genetic similarity than those from different lakes. Both analyses separated C. polypinum isolates into subgroups according to the geographical locations. However, there is only a weak positive correlation between the genetic distance and geographical distance, suggesting a minor effect of geographical distance on the distribution of genetic diversity between populations of C. polypinum at the local level. In conclusion, our studies clearly demonstrated that a single morphospecies may harbor high levels of genetic diversity, and that the degree of resolution offered by morphology as a marker for measuring distribution patterns of genetically distinct entities is too low.
Resumo:
The contamination and distribution of polychlorinated dibeinizo-p-dioxins and dibenzofurans (PCDD/Fs) from two agricultural fields of a heavily polluted lake area in China (Ya-Er Lake) are presented. The vertical distribution pattern of total PCDD/Fs in soil cores reveals that the maximum concentration was in the layer of 20-30 cm. The concentrations in the top layer of soil at the two sites were similar (17.48 ng/kg at Site 1 and 18.10 ng/kg at Site 2), but the maximum concentration of Site 1 (120.8 ng/kg) was two times higher than that of Site 2 (64.39 ng/kg). The maximum concentration of PCDD/Fs in mud cores in rice fields (0-50 cm) at Sites 1 and 2 was in the layer of 0-10 cm. The maximum PCDD/F concentration in the top layer in mud at Site 1 (203.1 ng/kg) was higher than that at Site 2: (143.3 ng/kg). Significant correlations were found between the mind PCDD/Fs and the organic carbon content (R = 0.9743, P< 0,05 at Site 1; R = 0.9821, P< 0.05 at Site 2), the two variables being highly correlated (R = 0.9049, P< 0.05, at Site 1; R = 0.9916, P< 0.05 at Site 2). All correlation coefficients were significant at the 95% level. Concentrations were highly correlated with organic carbon, indicating that sorption to organic carbon was the dominant mechanism. Using principal component analysis, the homologue profiles of soil, mud, and plants (rice and radish) were compared. The PCDD/F patterns in plants were found not to be correlated to those in soil and mud. This suggests that atmospheric deposition may be the main source of PCDD/Fs in rice grain. However, mixed exposure involving uptake mechanisms and atmospheric deposition is considered main the source of PCDD/F pollution in radishes. (C) 2002 Elsevier Science (USA).
Resumo:
The sediment of Ya-Er Lake had been heavily polluted by polychlorinated dibenzo-p-dioxins and dibenzofurans (PCDD/Fs) from the former chloralkali industry. The total amounts of PCDD/Fs and I-TEQ decreased along the water flow direction and also decreased from top to bottom layers of sediment cores. Sediment of Pond 1 was dominated by PCDF, especially TCDF. In contrast, in the other four ponds, PCDD dominated in all layers and octachlorinated dibenzo-p-dioxin (OCDD) predominated in all of the homologues. When homologue profiles from sediments and water samples were compared using principal component analysis (PCA), the first two principal components represented 95.2% of the variance in the data. The first component explained 75.9% of the variance and the second one 19.3%. Two clusters were most distinct, presenting a shift in PCDD/Fs composition from PCDF to heptachlorinated dibenzo-p-dioxin (HpCDD) and OCDD in sediments and water from Pond I to Ponds 2-5. The pattern variation between Pond 1 and Ponds 2-5 in Ya-Er Lake was most likely due to the change of process in the chemical plant after the dams between the ponds were built. The results of the present study also showed that log K-oc of PCDD/Fs calculated from data of sediment and water in the field were comparable with theoretical log K-oc. The results also implied that the concentrations of PCDD/Fs in water and sediments could be predicted from each other by log K-oc. (C) 2001 Elsevier Science Ltd. All rights reserved.
Resumo:
The existing methods for the discrimination of varieties of commodity corn seed are unable to process batch data and speed up identification, and very time consuming and costly. The present paper developed a new approach to the fast discrimination of varieties of commodity corn by means of near infrared spectral data. Firstly, the experiment obtained spectral data of 37 varieties of commodity corn seed with the Fourier transform near infrared spectrometer in the wavenurnber range from 4 000 to 12 000 cm (1). Secondly, the original data were pretreated using statistics method of normalization in order to eliminate noise and improve the efficiency of models. Thirdly, a new way based on sample standard deviation was used to select the characteristic spectral regions, and it can search very different wavenumbers among all wavenumbers and reduce the amount of data in part. Fourthly, principal component analysis (PCA) was used to compress spectral data into several variables, and the cumulate reliabilities of the first ten components were more than 99.98%. Finally, according to the first ten components, recognition models were established based on BPR. For every 25 samples in each variety, 15 samples were randomly selected as the training set. The remaining 10 samples of the same variety were used as the first testing set, and all the 900 samples of the other varieties were used as the second testing set. Calculation results showed that the average correctness recognition rate of the 37 varieties of corn seed was 94.3%. Testing results indicate that the discrimination method had higher precision than the discrimination of various kinds of commodity corn seed. In short, it is feasible to discriminate various varieties of commodity corn seed based on near infrared spectroscopy and BPR.
Resumo:
A new discrimination method for the maize seed varieties based on the near-infrared spectroscopy was proposed. The reflectance spectra of maize seeds were obtained by a FT-NIR spectrometer (12 000-4 000 cm(-1)). The original spectra data were preprocessed by first derivative method. Then the principal component analysis (PCA) was used to compress the spectra data. The principal components with the cumulate reliabilities more than 80% were used to build the discrimination models. The model was established by Psi-3 neuron based on biomimetic pattern recognition (BPR). Especially, the parameter of the covering index was proposed to assist to discriminating the variety of a seed sample. The authors tested the discrimination capability of the model through four groups of experiments. There were 10, 18, 26 and 34 varieties training the discrimination models in these experiments, respectively. Additionally, another seven maize varieties and nine wheat varieties were used to test the capability of the models to reject the varieties not participating in training the models. Each group of the experiment was repeated three times by selecting different training samples at random. The correct classification rates of the models in the four-group experiments were above 91. 8%. The correct rejection rates for the varieties not participating in training the models all attained above 95%. Furthermore, the performance of the discrimination models did not change obviously when using the different training samples. The results showed that this discrimination method can not only effectively recognize the maize seed varieties, but also reject the varieties not participating in training the model. It may be practical in the discrimination of maize seed varieties.
Resumo:
重构是软件系统不断演化的关键之一,也是一项复杂而又困难的活动.传统的定位重构代码方法依赖开发者的观察和主观意识,耗时耗力,尤其在重构代码较多时.因此,提出了一套自动化定位重构的方法.该方法利用基于面向对象软件度量指标获取代码特征信息,使用相关性检验查验特征信息数据,应用主成分分析压缩和解释特征信息,应用聚类分析分类相似代码段,迅速准确定位重构.一个简单的实例表明该方法是简单有效的,并且优于传统方法.
Resumo:
对黄土丘陵沟壑区安塞纸坊沟和县南沟、延安燕沟3个流域不同恢复年限的植物群落的土壤抗蚀性和侵蚀程度进行了研究。对12个土壤抗蚀性指标进行主成分分析表明,土壤抗蚀性(主成分综合指数)强弱为灌木群落阶段>多年生草本和蒿类群落阶段>一二年生草本群落阶段,与一二年生草本群落阶段相比,灌木群落阶段与多年生草本和蒿类群落阶段的土壤抗蚀性分别增加了362.29%~673.33%和574.71%~930.00%;野外调查结果分析表明,随着植被的恢复演替,土壤侵蚀量呈现明显的下降趋势,灌木群落阶段的土壤侵蚀量仅为演替初期的1.42%~5.59%;通过回归分析,土壤侵蚀量和水稳性团聚类因子,以及有机质含量之间分别存在极显著与显著相关关系,鉴于土壤分析的易获性,可选择>0.5mm水稳性团聚体与有机质含量作为反映土壤侵蚀程度的指标。
Resumo:
Principal Component and Canonical Correlation Analysis of the Environmental Factors Influencing the Growth of Caragana korshinskii Kom. in Grassland
Resumo:
选择黄土高原子午岭地区植被自然恢复1~140 a不同年限的阳坡坡地为研究对象,通过相关性分析筛选土壤表层(0~20 cm)16项表征土壤物理、化学、生物学性质的指标,运用主成分分析计算土壤质量综合指数,评价子午岭地区植被恢复过程对土壤质量的影响。结果表明:土壤总孔隙度、平均重量直径(MWD)、有机质质量分数、速效磷质量分数、蔗糖酶活性、碱性磷酸酶活性、真菌数量、微生物总量构成土壤质量评价指标体系;土壤质量综合指数随植被恢复年限的增加而增加;植被自然恢复1~140 a间,土壤质量综合指数变化范围为0.155 9~0.850 9,较裸露休闲地增加4.2~27.5倍;根据土壤质量综合指数变化规律,可将140 a植被恢复过程中的土壤质量演变过程分为3个阶段,即植被恢复初期(1~20 a)的土壤质量综合指数呈快速增长,植被恢复中期(20~40 a)的土壤质量综合指数呈波动性增长,植被恢复后期(40~140 a)的土壤质量综合指数呈稳定增长。植被演替过程中不同植被生活型土壤质量综合指数表现为乔木林地>灌木林地>草地。
Resumo:
研究了半干旱沙区不同滴灌带埋设深度下紫花苜蓿的生长特性。通过试验研究分析了滴灌带埋设深度对紫花苜蓿植株高度、茎粗、分枝数、根系生长、根系密度和产量等生长特性的影响。采用主成分分析法对不同滴灌带埋设深度的紫花苜蓿等生长特性进行了综合评价。结果表明,滴灌带不同埋设深度对苜蓿各个生育期生长特性指标影响不同。在苗期,埋设深度为10 cm的处理,有利于苜蓿生长。从分枝期起,埋设深度为30 cm的处理优于其它处理;在整个生育期内,不同埋设深度对苜蓿生长特性影响的综合评判结果为:埋深30 cm>埋深20 cm>埋深10 cm>埋深40 cm。