873 resultados para agglomerative clustering
Resumo:
Cylindrospermopsis raciborskii is a toxic-bloom-forming cyanobacterium that is commonly found in tropical to subtropical climatic regions worldwide, but it is also recognized as a common component of cyanobacterial communities in temperate climates. Genetic profiles of C. raciborskii were examined in 19 cultured isolates originating from geographically diverse regions of Australia and represented by two distinct morphotypes. A 609-bp region of rpoC1, a DNA-dependent RNA polymerase gene, was amplified by PCR from these isolates with cyanobacterium-specific primers. Sequence analysis revealed that all isolates belonged to the same species, including morphotypes with straight or coiled trichomes. Additional rpoC1 gene sequences obtained for a range of cyanobacteria highlighted clustering of C. raciborskii with other heterocyst-producing cyanobacteria (orders Nostocales and Stigonematales). In contrast, randomly amplified polymorphic DNA and short tandemly repeated repetitive sequence profiles revealed a greater level of genetic heterogeneity among C. raciborskii isolates than did rpoC1 gene analysis, and unique band profiles were also found among each of the cyanobacterial genera examined. A PCR test targeting a region of the rpoC1 gene unique to C. raciborskii was developed for the specific identification of C. raciborskii from both purified genomic DNA and environmental samples. The PCR was evaluated with a number of cyanobacterial isolates, but a PCR-positive result was only achieved with C, raciborskii. This method provides an accurate alternative to traditional morphological identification of C. raciborskii.
Resumo:
Normal mixture models are being increasingly used to model the distributions of a wide variety of random phenomena and to cluster sets of continuous multivariate data. However, for a set of data containing a group or groups of observations with longer than normal tails or atypical observations, the use of normal components may unduly affect the fit of the mixture model. In this paper, we consider a more robust approach by modelling the data by a mixture of t distributions. The use of the ECM algorithm to fit this t mixture model is described and examples of its use are given in the context of clustering multivariate data in the presence of atypical observations in the form of background noise.
Resumo:
This paper develops an interactive approach for exploratory spatial data analysis. Measures of attribute similarity and spatial proximity are combined in a clustering model to support the identification of patterns in spatial information. Relationships between the developed clustering approach, spatial data mining and choropleth display are discussed. Analysis of property crime rates in Brisbane, Australia is presented. A surprising finding in this research is that there are substantial inconsistencies in standard choropleth display options found in two widely used commercial geographical information systems, both in terms of definition and performance. The comparative results demonstrate the usefulness and appeal of the developed approach in a geographical information system environment for exploratory spatial data analysis.
Resumo:
Examples from the Murray-Darling basin in Australia are used to illustrate different methods of disaggregation of reconnaissance-scale maps. One approach for disaggregation revolves around the de-convolution of the soil-landscape paradigm elaborated during a soil survey. The descriptions of soil ma units and block diagrams in a soil survey report detail soil-landscape relationships or soil toposequences that can be used to disaggregate map units into component landscape elements. Toposequences can be visualised on a computer by combining soil maps with digital elevation data. Expert knowledge or statistics can be used to implement the disaggregation. Use of a restructuring element and k-means clustering are illustrated. Another approach to disaggregation uses training areas to develop rules to extrapolate detailed mapping into other, larger areas where detailed mapping is unavailable. A two-level decision tree example is presented. At one level, the decision tree method is used to capture mapping rules from the training area; at another level, it is used to define the domain over which those rules can be extrapolated. (C) 2001 Elsevier Science B.V. All rights reserved.
Resumo:
Using data from the H I Parkes All Sky Survey (HIPASS), we have searched for neutral hydrogen in galaxies in a region similar to25x25 deg(2) centred on NGC 1399, the nominal centre of the Fornax cluster. Within a velocity search range of 300-3700 km s(-1) and to a 3sigma lower flux limit of similar to40 mJy, 110 galaxies with H I emission were detected, one of which is previously uncatalogued. None of the detections has early-type morphology. Previously unknown velocities for 14 galaxies have been determined, with a further four velocity measurements being significantly dissimilar to published values. Identification of an optical counterpart is relatively unambiguous for more than similar to90 per cent of our H I galaxies. The galaxies appear to be embedded in a sheet at the cluster velocity which extends for more than 30degrees across the search area. At the nominal cluster distance of similar to20 Mpc, this corresponds to an elongated structure more than 10 Mpc in extent. A velocity gradient across the structure is detected, with radial velocities increasing by similar to500 km s(-1) from south-east to north-west. The clustering of galaxies evident in optical surveys is only weakly suggested in the spatial distribution of our H I detections. Of 62 H I detections within a 10degrees projected radius of the cluster centre, only two are within the core region (projected radius
Resumo:
We introduced a spectral clustering algorithm based on the bipartite graph model for the Manufacturing Cell Formation problem in [Oliveira S, Ribeiro JFF, Seok SC. A spectral clustering algorithm for manufacturing cell formation. Computers and Industrial Engineering. 2007 [submitted for publication]]. It constructs two similarity matrices; one for parts and one for machines. The algorithm executes a spectral clustering algorithm on each separately to find families of parts and cells of machines. The similarity measure in the approach utilized limited information between parts and between machines. This paper reviews several well-known similarity measures which have been used for Group Technology. Computational clustering results are compared by various performance measures. (C) 2008 The Society of Manufacturing Engineers. Published by Elsevier Ltd. All rights reserved.
Resumo:
The habit of inducing plant galls has evolved multiple times among insects but most species diversity occurs in only a few groups, such as gall midges and gall wasps. This phylogenetic clustering may reflect adaptive radiations in insect groups in which the trait has evolved. Alternatively, multiple independent origins of galling may suggest a selective advantage to the habit. We use DNA sequence data to examine the origins of galling among the most speciose group of gall-inducing scale insects, the eriococcids. We determine that the galling habit has evolved multiple times, including four times in Australian taxa, suggesting that there has been a selective advantage to galling in Australia. Additionally, although most gall-inducing eriococcid species occur on Myrtaceae, we found that lineages feeding on Myrtaceae are no more likely to have evolved the galling habit than those feeding on other plant groups. However, most gall-inducing species-richness is clustered in only two clades (Apiomorpha and Lachnodius + Opisthoscelis), all of which occur exclusively on Eucalyptus s.s. The Eriococcidae and the large genus Eriococcus were determined to be non-monophyletic and each will require revision. (C) 2004 The Linnean Society of London.
Resumo:
Tissue-nonspecific alkaline phosphatase (TNAP), present on the surface of chondrocyte- and osteoblast-derived matrix vesicles (MVs), plays key enzymatic functions during endochondral ossification. Many studies have shown that MVs are enriched in TNAP and also in cholesterol compared to the plasma membrane. Here we have studied the influence of cholesterol on the reconstitution of TNAP into dipalmitoylphosphatidylcholine (DPPC)-liposomes, monitoring the changes in lipid critical transition temperature (T(c)) and enthalpy variation (Delta H) using differential scanning calorimetry (DSC). DPPC-liposomes revealed a T(c) of 41.5 degrees C and Delta H of 7.63 Kcal mol(-1). The gradual increase in cholesterol concentration decrease Delta H values, reaching a Delta H of 0.87 Kcal mol(-1) for DPPC: cholesterol system with 36 mol% of cholesterol. An increase in T(c), up to 47 degrees C for the DPPC:cholesterol liposomes (36 mol% of Chol), resulted from the increase in the area per molecule in the gel phase. TNAP (0.02 mg/mL) reconstitution was done with protein:lipid 1:10,000 (molar ratio), resulting in 85% of the added enzyme being incorporated. The presence of cholesterol reduced the incorporation of TNAP to 42% of the added enzyme when a lipid composition of 36 mol% of Chol was used. Furthermore, the presence of TNAP in proteoliposomes resulted in a reduction in Delta H. The gradual proportional increase of cholesterol in liposomes results in broadening of the phase transition peak and eventually eliminates the cooperative gel-to-liquid-crystalline phase transition of phospholipids bilayers. Thus, the formation of microdomains may facilitate the clustering of enzymes and transporters known to be functional in MVs during endochondral ossification. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
Objective: To examine the quality of diabetes care and prevention of cardiovascular disease (CVD) in Australian general practice patients with type 2 diabetes and to investigate its relationship with coronary heart disease absolute risk (CHDAR). Methods: A total of 3286 patient records were extracted from registers of patients with type 2 diabetes held by 16 divisions of general practice (250 practices) across Australia for the year 2002. CHDAR was estimated using the United Kingdom Prospective Diabetes Study algorithm with higher CHDAR set at a 10 year risk of >15%. Multivariate multilevel logistic regression investigated the association between CHDAR and diabetes care. Results: 47.9% of diabetic patient records had glycosylated haemoglobin (HbA1c) >7%, 87.6% had total cholesterol >= 4.0 mmol/l, and 73.8% had blood pressure (BP) >= 130/85 mm Hg. 57.6% of patients were at a higher CHDAR, 76.8% of whom were not on lipid modifying medication and 66.2% were not on antihypertensive medication. After adjusting for clustering at the general practice level and age, lipid modifying medication was negatively related to CHDAR (odds ratio (OR) 0.84) and total cholesterol. Antihypertensive medication was positively related to systolic BP but negatively related to CHDAR (OR 0.88). Referral to ophthalmologists/optometrists and attendance at other health professionals were not related to CHDAR. Conclusions: At the time of the study the diabetes and CVD preventive care in Australian general practice was suboptimal, even after a number of national initiatives. The Australian Pharmaceutical Benefits Scheme (PBS) guidelines need to be modified to improve CVD preventive care in patients with type 2 diabetes.
Resumo:
Purpose: To evaluate the influence of cross-sectional arc calcification on the diagnostic accuracy of computed tomography (CT) angiography compared with conventional coronary angiography for the detection of obstructive coronary artery disease (CAD). Materials and Methods: Institutional Review Board approval and written informed consent were obtained from all centers and participants for this HIPAA-compliant study. Overall, 4511 segments from 371 symptomatic patients (279 men, 92 women; median age, 61 years [interquartile range, 53-67 years]) with clinical suspicion of CAD from the CORE-64 multi-center study were included in the analysis. Two independent blinded observers evaluated the percentage of diameter stenosis and the circumferential extent of calcium (arc calcium). The accuracy of quantitative multidetector CT angiography to depict substantial (>50%) stenoses was assessed by using quantitative coronary angiography (QCA). Cross-sectional arc calcium was rated on a segment level as follows: noncalcified or mild (<90 degrees), moderate (90 degrees-180 degrees), or severe (>180 degrees) calcification. Univariable and multivariable logistic regression, receiver operation characteristic curve, and clustering methods were used for statistical analyses. Results: A total of 1099 segments had mild calcification, 503 had moderate calcification, 338 had severe calcification, and 2571 segments were noncalcified. Calcified segments were highly associated (P < .001) with disagreement between CTA and QCA in multivariable analysis after controlling for sex, age, heart rate, and image quality. The prevalence of CAD was 5.4% in noncalcified segments, 15.0% in mildly calcified segments, 27.0% in moderately calcified segments, and 43.0% in severely calcified segments. A significant difference was found in area under the receiver operating characteristic curves (noncalcified: 0.86, mildly calcified: 0.85, moderately calcified: 0.82, severely calcified: 0.81; P < .05). Conclusion: In a symptomatic patient population, segment-based coronary artery calcification significantly decreased agreement between multidetector CT angiography and QCA to detect a coronary stenosis of at least 50%.
Resumo:
In breast cancer patients, primary chemotherapy is associated with the same survival benefits as adjuvant chemotherapy. Residual tumors represent a clinical challenge, Lis they may be resistant to additional cycles of the same drugs. Our aim was to identify differential transcripts expressed in residual tumors, after neoadjuvant chemotherapy, that might be related with tumor resistance. Hence, 16 patients with paired tumor samples, collected before and after treatment (4 cycles doxorubicin/cyclophosphamide, AC) had their gene expression evaluated on cDNA microarray slides containing 4,608 genes. Three hundred and eighty-nine genes were differentially expressed (paired Student`s t-test, pFDR<0.01) between pre- and post-chemotherapy samples and among the regulated functions were the JNK cascade and cell death. Unsupervised hierarchical clustering identified one branch comprising exclusively, eight pre-chemotherapy samples and another branch, including the former correspondent eight post-chemotherapy samples and other 16 paired pre/post-chemotherapy samples. No differences in clinical and tumor parameters could explain this clustering. Another group of I I patients with paired samples had expression of selected genes determined by real-time RT-PCR and CTGF and DUSP1 were confirmed more expressed in post- as compared to pre-chemotherapy samples. After neoadjuvant chemotherapy some residual samples may retain their molecular signature while others present significant changes in their gene expression, probably induced by the treatment. CTGF and DUSP1 overexpression in residual samples may be a reflection of resistance to further administration of AC regimen.
Resumo:
Hepatitis C virus (HCV) transmission has decreased with the adoption of universal blood donor screening and social policies to reduce the risk of infection in intravenous drug users, but remains a worldwide health problem. The objective of this study was to evaluate the phylogenetic relationships among sequences from different HCV genomic regions from sexual partners of infected patients. Nine couples with a stable relationship and without other risk factors for HCV infection and 42 control patients were selected, and the NS3 and NS5B regions were analysed. Phylogenetic analysis showed that viruses from five of the couples had a common origin, clustering in the same monophyletic group, with bootstrap values greater than 70. For the other couples, monophyletic groups were observed, but without bootstrap support. Thus, using two different viral genome regions, a common source of infection was observed in both members of five couples. These data strongly support HCV transmission within couples.
Resumo:
The traditional methods employed to detect atherosclerotic lesions allow for the identification of lesions; however, they do not provide specific characterization of the lesion`s biochemistry. Currently, Raman spectroscopy techniques are widely used as a characterization method for unknown substances, which makes this technique very important for detecting atherosclerotic lesions. The spectral interpretation is based on the analysis of frequency peaks present in the signal; however, spectra obtained from the same substance can show peaks slightly different and these differences make difficult the creation of an automatic method for spectral signal analysis. This paper presents a signal analysis method based on a clustering technique that allows for the classification of spectra as well as the inference of a diagnosis about the arterial wall condition. The objective is to develop a computational tool that is able to create clusters of spectra according to the arterial wall state and, after data collection, to allow for the classification of a specific spectrum into its correct cluster.
Resumo:
Objective. To explore the relationship between biomarkers of pulmonary arterial hypertension (PAH), interferon (IFN)-regulated gene expression, and the alternative activation pathway in systemic sclerosis (SSc). Methods. Peripheral blood mononuclear cells (PBMCs) were purified from healthy controls, patients with idiopathic PAH, and SSc patients (classified as having diffuse cutaneous SSc, limited cutaneous SSc [lcSSc] without PAH, and lcSSc with PAH). IFN-regulated and ""PAH biomarker"" genes were compared after supervised hierarchical clustering. Messenger RNA levels of selected IFN-regulated genes (Siglec1 and MX1), biomarker genes (IL13RA1, CCR1, and JAK2), and the alternative activation marker gene (MRC1) were analyzed on PBMCs and on CD14- and CD14+ cell populations. Interleukin-13 (IL-13) and IL-4 concentrations were measured in plasma by immunoassay. CD14, MRC1, and IL13RA1 surface expression was analyzed by flow cytometry. Results. Increased PBMC expression of both IFN-regulated and biomarker genes distinguished SSc patients from healthy controls. Expression of genes in the biomarker cluster, but not in the IFN-regulated cluster, distinguished lcSSc with PAH from lcSSc without PAH. The genes CCR1 (P < 0.001) and JAK2 (P < 0.001) were expressed more highly in lcSSc patients with PAH compared with controls and mainly by CD14+ cells. MRC1 expression was increased exclusively in lcSSc patients with PAH (P < 0.001) and correlated strongly with pulmonary artery pressure (r = 0.52, P = 0.03) and higher mortality (P = 0.02). MRC1 expression was higher in CD14+ cells and was greatly increased by stimulation with IL-13. IL-13 concentrations in plasma were most highly increased in lcSSc patients with PAH (P < 0.001). Conclusion. IFN-regulated and biomarker genes represent distinct, although related, clusters in lcSSc patients with PAH. MRC1, a marker for the effect of IL-13 on alternative monocyte/macrophage activation, is associated with this severe complication and is related to mortality.
Resumo:
The expression of peripheral tissue antigens (PTAs) in the thymus by medullary thymic epithelial cells (mTECs) is essential for the central self-tolerance in the generation of the T cell repertoire. Due to heterogeneity of autoantigen representation, this phenomenon has been termed promiscuous gene expression (PGE), in which the autoimmune regulator (Aire) gene plays a key role as a transcription factor in part of these genes. Here we used a microarray strategy to access PGE in cultured murine CD80(+) 3.10 mTEC line. Hierarchical clustering of the data allowed observation that PTA genes were differentially expressed being possible to found their respective induced or repressed mRNAs. To further investigate the control of PGE, we tested the hypothesis that genes involved in this phenomenon might also be modulated by transcriptional network. We then reconstructed such network based on the microarray expression data, featuring the guanylate cyclase 2d (Gucy2d) gene as a main node. In such condition, we established 167 positive and negative interactions with downstream PTA genes. Silencing Aire by RNA interference, Gucy2d while down regulated established a larger number (355) of interactions with PTA genes. T- and G-boxes corresponding to AIRE protein binding sites located upstream to ATG codon of Gucy2d supports this effect. These findings provide evidence that Aire plays a role in association with Gucy2d, which is connected to Several PTA genes and establishes a cascade-like transcriptional control of promiscuous gene expression in mTEC cells. (C) 2009 Elsevier Ltd. All rights reserved.