35 resultados para classification aided by clustering


Relevância:

40.00% 40.00%

Publicador:

Resumo:

Understanding the ecological role of benthic microalgae, a highly productive component of coral reef ecosystems, requires information on their spatial distribution. The spatial extent of benthic microalgae on Heron Reef (southern Great Barrier Reef, Australia) was mapped using data from the Landsat 5 Thematic Mapper sensor. integrated with field measurements of sediment chlorophyll concentration and reflectance. Field-measured sediment chlorophyll concentrations. 2 ranging from 23-1.153 mg chl a m(2), were classified into low, medium, and high concentration classes (1-170, 171-290, and > 291 mg chl a m(-2)) using a K-means clustering algorithm. The mapping process assumed that areas in the Thematic Mapper image exhibiting similar reflectance levels in red and blue bands would correspond to areas of similar chlorophyll a levels. Regions of homogenous reflectance values corresponding to low, medium, and high chlorophyll levels were identified over the reef sediment zone by applying a standard image classification algorithm to the Thematic Mapper image. The resulting distribution map revealed large-scale ( > 1 km 2) patterns in chlorophyll a levels throughout the sediment zone of Heron Reef. Reef-wide estimates of chlorophyll a distribution indicate that benthic Microalgae may constitute up to 20% of the total benthic chlorophyll a at Heron Reef. and thus contribute significantly to total primary productivity on the reef.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

beta-turns are important topological motifs for biological recognition of proteins and peptides. Organic molecules that sample the side chain positions of beta-turns have shown broad binding capacity to multiple different receptors, for example benzodiazepines. beta-turns have traditionally been classified into various types based on the backbone dihedral angles (phi 2, psi 2, phi 3 and psi 3). Indeed, 57-68% of beta-turns are currently classified into 8 different backbone families (Type I, Type II, Type I', Type II', Type VIII, Type VIa1, Type VIa2 and Type VIb and Type IV which represents unclassified beta-turns). Although this classification of beta-turns has been useful, the resulting beta-turn types are not ideal for the design of beta-turn mimetics as they do not reflect topological features of the recognition elements, the side chains. To overcome this, we have extracted beta-turns from a data set of non-homologous and high-resolution protein crystal structures. The side chain positions, as defined by C-alpha-C-beta vectors, of these turns have been clustered using the kth nearest neighbor clustering and filtered nearest centroid sorting algorithms. Nine clusters were obtained that cluster 90% of the data, and the average intra-cluster RMSD of the four C-alpha-C-beta vectors is 0.36. The nine clusters therefore represent the topology of the side chain scaffold architecture of the vast majority of beta-turns. The mean structures of the nine clusters are useful for the development of beta-turn mimetics and as biological descriptors for focusing combinatorial chemistry towards biologically relevant topological space.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In a first step toward understanding the molecular basis of pineapple fruit development, a sequencing project was initiated to survey a range of expressed sequences from green unripe and yellow ripe fruit tissue. A highly abundant metallothionein transcript was identified during library construction, and was estimated to account for up to 50% of all EST library clones. Library clones with metallothionein subtracted were sequenced, and 408 unripe green and 1140 ripe yellow edited EST clone sequences were retrieved. Clone redundancy was high, with the combined 1548 clone sequences clustering into just 634 contigs comprising 191 consensus sequences and 443 singletons. Half of the EST clone sequences clustered within 13.5% and 9.3% of contigs from green unripe and yellow ripe libraries, respectively, indicating that a small subset of genes dominate the majority of the transcriptome. Furthermore, sequence cluster analysis, northern analysis, and functional classification revealed major differences between genes expressed in the unripe green and ripe yellow fruit tissues. Abundant genes identified from the green fruit include a fruit bromelain and a bromelain inhibitor. Abundant genes identified in the yellow fruit library include a MADS box gene, and several genes normally associated with protein synthesis, including homologues of ribosomal L10 and the translation factors SUI1 and eIF5A. Both the green unripe and yellow ripe libraries contained high proportions of clones associated with oxidative stress responses and the detoxification of free radicals.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We consider the statistical problem of catalogue matching from a machine learning perspective with the goal of producing probabilistic outputs, and using all available information. A framework is provided that unifies two existing approaches to producing probabilistic outputs in the literature, one based on combining distribution estimates and the other based on combining probabilistic classifiers. We apply both of these to the problem of matching the HI Parkes All Sky Survey radio catalogue with large positional uncertainties to the much denser SuperCOSMOS catalogue with much smaller positional uncertainties. We demonstrate the utility of probabilistic outputs by a controllable completeness and efficiency trade-off and by identifying objects that have high probability of being rare. Finally, possible biasing effects in the output of these classifiers are also highlighted and discussed.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The most common human cancers are malignant neoplasms of the skin(1,2). Incidence of cutaneous melanoma is rising especially steeply, with minimal progress in non-surgical treatment of advanced disease(3,4). Despite significant effort to identify independent predictors of melanoma outcome, no accepted histopathological, molecular or immunohistochemical marker defines subsets of this neoplasm(2,3). Accordingly, though melanoma is thought to present with different 'taxonomic' forms, these are considered part of a continuous spectrum rather than discrete entities(2). Here we report the discovery of a subset of melanomas identified by mathematical analysis of gene expression in a series of samples. Remarkably, many genes underlying the classification of this subset are differentially regulated in invasive melanomas that form primitive tubular networks in vitro, a feature of some highly aggressive metastatic melanomas(5). Global transcript analysis can identify unrecognized subtypes of cutaneous melanoma and predict experimentally verifiable phenotypic characteristics that may be of importance to disease progression.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the context of cancer diagnosis and treatment, we consider the problem of constructing an accurate prediction rule on the basis of a relatively small number of tumor tissue samples of known type containing the expression data on very many (possibly thousands) genes. Recently, results have been presented in the literature suggesting that it is possible to construct a prediction rule from only a few genes such that it has a negligible prediction error rate. However, in these results the test error or the leave-one-out cross-validated error is calculated without allowance for the selection bias. There is no allowance because the rule is either tested on tissue samples that were used in the first instance to select the genes being used in the rule or because the cross-validation of the rule is not external to the selection process; that is, gene selection is not performed in training the rule at each stage of the cross-validation process. We describe how in practice the selection bias can be assessed and corrected for by either performing a cross-validation or applying the bootstrap external to the selection process. We recommend using 10-fold rather than leave-one-out cross-validation, and concerning the bootstrap, we suggest using the so-called. 632+ bootstrap error estimate designed to handle overfitted prediction rules. Using two published data sets, we demonstrate that when correction is made for the selection bias, the cross-validated error is no longer zero for a subset of only a few genes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The high-affinity receptors for human granulocyte-macrophage colony-stimulating factor (GM-CSF), interleukin-1 (IL-3), and IL-5 are heterodimeric complexes consisting of cytokine-specific alpha subunits and a common signal-transducing beta subunit (h beta c). We have previously demonstrated the oncogenic potential of this group of receptors by identifying constitutively activating point mutations in the extracellular and transmembrane domains of h beta c. We report here a comprehensive screen of the entire h beta c molecule that has led to the identification of additional constitutive point mutations by virtue of their ability to confer factor independence on murine FDC-P1 cells. These mutations were clustered exclusively in a central region of h beta c that encompasses the extracellular membrane-proximal domain, transmembrane domain, and membrane-proximal region of the cytoplasmic domain. Interestingly, most h beta c mutants exhibited cell type-specific constitutive activity, with only two transmembrane domain mutants able to confer factor independence on both murine FDC-P1 and BAF-B03 cells. Examination of the biochemical properties of these mutants in FDC-P1 cells indicated that MAP kinase (ERK1/2), STAT, and JAK2 signaling molecules were constitutively activated. In contrast, only some of the mutant beta subunits were constitutively tyrosine phosphorylated. Taken together; these results highlight key regions involved in h beta c activation, dissociate h beta c tyrosine phosphorylation from MAP kinase and STAT activation, and suggest the involvement of distinct mechanisms by which proliferative signals can be generated by h beta c. (C) 1998 by The American Society of Hematology.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Strain-dependent hydraulic conductivities are uniquely defined by an environmental factor, representing applied normal and shear strains, combined with intrinsic material parameters representing mass and component deformation moduli, initial conductivities, and mass structure. The components representing mass moduli and structure are defined in terms of RQD (rock quality designation) and RMR (rock mass rating) to represent the response of a whole spectrum of rock masses, varying from highly fractured (crushed) rock to intact rock. These two empirical parameters determine the hydraulic response of a fractured medium to the induced-deformations The constitutive relations are verified against available published data and applied to study one-dimensional, strain-dependent fluid flow. Analytical results indicate that both normal and shear strains exert a significant influence on the processes of fluid flow and that the magnitude of this influence is regulated by the values of RQD and RMR.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Examples from the Murray-Darling basin in Australia are used to illustrate different methods of disaggregation of reconnaissance-scale maps. One approach for disaggregation revolves around the de-convolution of the soil-landscape paradigm elaborated during a soil survey. The descriptions of soil ma units and block diagrams in a soil survey report detail soil-landscape relationships or soil toposequences that can be used to disaggregate map units into component landscape elements. Toposequences can be visualised on a computer by combining soil maps with digital elevation data. Expert knowledge or statistics can be used to implement the disaggregation. Use of a restructuring element and k-means clustering are illustrated. Another approach to disaggregation uses training areas to develop rules to extrapolate detailed mapping into other, larger areas where detailed mapping is unavailable. A two-level decision tree example is presented. At one level, the decision tree method is used to capture mapping rules from the training area; at another level, it is used to define the domain over which those rules can be extrapolated. (C) 2001 Elsevier Science B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The purpose of this study was to estimate the extent of association of cervical screening in NSW women with socio-economic status (SES), rurality, and proportions of non-English speaking background (NESB) and Indigenous status. Data on women who had at least one Pap test over two years (January 1998-December 1999) were obtained from the NSW Pap test Register. Each local government area (LGA) was allocated to categories of population proportions of NESB and Indigenous status, a rurality classification based on population density and remoteness, and to an SES quintile. The odds ratios (OR) of having a Pap test were estimated and confounding adjusted by multiple logistic regression analysis. Implied Pap test rates in urban NESB and in rural Indigenous women were estimated from the modelled estimates. The adjusted OR for a Pap test in large rural centres (1.14) was significantly higher than those for metropolitan or capital city residents (0.9 and 1.0 respectively). Adjusted OR for a Pap test in other rural centres (0.73) and other remote areas (0.64) were significantly lower than those for metropolitan or capital city residents. In urban populations the lowest OR were in areas with both low SES and high proportion of NESB. The lowest OR for Pap screening in rural populations occurred in the most remote areas with the highest proportion of Indigenous women. For urban NESB women the biennial Pap test rate was estimated as 50%, and for rural Indigenous women 29%, compared with the NSW average of 59%.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The immunosurveillance of transformed cells by the immune system remains one of the most controversial and poorly understood areas of immunity. Gene-targeted mice have greatly aided our understanding of the key effector molecules in tumor immunity. Herein, we describe spontaneous tumor development in gene-targeted mice lacking interferon (IFN)-gamma and/or perform (pfp), or the immunoregulatory cytokines, interleukin (IL)-12, IL-18, and tumor necrosis factor (TNF). Both IFN-gamma and pfp were critical for suppression of lymphomagenesis, however the level of protection afforded by IFN-gamma was strain specific. Lymphomas arising in IFN-gamma deficient mice were very nonimmunogenic compared with those derived from pfp-deficient mice, suggesting a comparatively weaker immunoselection pressure by IFN-gamma. Single loss of IL-12, IL-18, or TNF was not sufficient for spontaneous tumor development. A significant incidence of late onset adenocarcinoma observed in both IFN-gamma- and pfp-deficient mice indicated that some epithelial tissues were also subject to immunosurveillance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Aim: To determine acceptability of a health advocacy intervention, the Ask Diary and the comprehensive health assessment program (CHAP). Method: We performed a two by two designed randomised controlled trial of the Ask Diary and the CHAP tool in  adults with intellectual disability. Results of interviews of self-advocates and caregiver advocates, both families and paid carers, will be presented. Results: The interviews found strong support for the Ask Diary and the CHAP tool among selfadvocates and family caregivers. There was clear indication that the Ask Diary improved advocacy, aided in the organisation of health matters and was easy to use. It was reported that the health assessment resulted in benefits for the person’s health and high acceptability by carers. There was less support for the interventions where the person was supported through government services. Conclusions: Self-advocates and family caregivers welcome and use a personalised health advocacy diary and also a health assessment. However paid carers used the diary less but were supportive of the health assessment.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

CysView is a web-based application tool that identifies and classifies proteins according to their disulfide connectivity patterns. It accepts a dataset of annotated protein sequences in various formats and returns a graphical representation of cysteine pairing patterns. CysView displays cysteine patterns for those records in the data with disulfide annotations. It allows the viewing of records grouped by connectivity patterns. CysView's utility as an analysis tool was demonstrated by the rapid and correct classification of scorpion toxin entries from GenPept on the basis of their disulfide pairing patterns. It has proved useful for rapid detection of irrelevant and partial records, or those with incomplete annotations. CysView can be used to support distant homology between proteins. CysView is publicly available at http://research.i2r.a-star.edu.sg/CysView/.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The phylogenetic relationships of members of Eudorylini (Diptera: Pipunculidae: Pipunculinae) were explored. Two hundred and fifty-seven species of Eudorylini from all biogeographical regions and all known genera were examined. Sixty species were included in an exemplar-based phylogeny for the tribe. Two new genera are described, Clistoabdominalis and Dasydorylas. The identity of Eudorylas Aczél, the type genus for Eudorylini, has been obscure since its inception. The genus is re-diagnosed and a proposal to stabilize the genus and tribal names is discussed. An illustrated key to the genera of Pipunculidae is presented and all Eudorylini genera are diagnosed. Numerous new generic synonyms are proposed. Moriparia nigripennis Kozánek & Kwon is preoccupied by Congomyia nigripennis Hardy when both are transferred to Claraeola, so Cla. koreana Skevington is proposed as a new name for Mo. nigripennis.