89 results for Query clustering


Relevance:

10.00%

Publisher:

Abstract:

Cylindrospermopsis raciborskii is a toxic-bloom-forming cyanobacterium that is commonly found in tropical to subtropical climatic regions worldwide, but it is also recognized as a common component of cyanobacterial communities in temperate climates. Genetic profiles of C. raciborskii were examined in 19 cultured isolates originating from geographically diverse regions of Australia and represented by two distinct morphotypes. A 609-bp region of rpoC1, a DNA-dependent RNA polymerase gene, was amplified by PCR from these isolates with cyanobacterium-specific primers. Sequence analysis revealed that all isolates belonged to the same species, including morphotypes with straight or coiled trichomes. Additional rpoC1 gene sequences obtained for a range of cyanobacteria highlighted clustering of C. raciborskii with other heterocyst-producing cyanobacteria (orders Nostocales and Stigonematales). In contrast, randomly amplified polymorphic DNA and short tandemly repeated repetitive sequence profiles revealed a greater level of genetic heterogeneity among C. raciborskii isolates than did rpoC1 gene analysis, and unique band profiles were also found among each of the cyanobacterial genera examined. A PCR test targeting a region of the rpoC1 gene unique to C. raciborskii was developed for the specific identification of C. raciborskii from both purified genomic DNA and environmental samples. The PCR was evaluated with a number of cyanobacterial isolates, but a PCR-positive result was only achieved with C. raciborskii. This method provides an accurate alternative to traditional morphological identification of C. raciborskii.

Relevance:

10.00%

Publisher:

Abstract:

This paper presents the unique collection of additional features of Qu-Prolog, a variant of the AI programming language Prolog, and illustrates how they can be used for implementing DAI applications. By this we mean applications comprising communicating information servers, expert systems, or agents, with sophisticated reasoning capabilities and internal concurrency. Such an application exploits the key features of Qu-Prolog: support for the programming of sound non-clausal inference systems, multi-threading, and high-level inter-thread message communication between Qu-Prolog query threads anywhere on the internet. The inter-thread communication uses email-style symbolic names for threads, allowing easy construction of distributed applications using public names for threads. How threads react to received messages is specified by a disjunction of reaction rules which the thread periodically executes. A communications API allows smooth integration of components written in C, which, to Qu-Prolog, look like remote query threads.

Relevance:

10.00%

Publisher:

Abstract:

Normal mixture models are being increasingly used to model the distributions of a wide variety of random phenomena and to cluster sets of continuous multivariate data. However, for a set of data containing a group or groups of observations with longer than normal tails or atypical observations, the use of normal components may unduly affect the fit of the mixture model. In this paper, we consider a more robust approach by modelling the data by a mixture of t distributions. The use of the ECM algorithm to fit this t mixture model is described and examples of its use are given in the context of clustering multivariate data in the presence of atypical observations in the form of background noise.
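
The robustness claim in the abstract above can be illustrated with a minimal sketch. It assumes the standard weight used when fitting a t distribution by EM-type algorithms with fixed degrees of freedom, u = (nu + p) / (nu + d^2), where d^2 is the squared standardized distance of an observation from the component mean and p is the dimension. The abstract does not state this formula; the function names, the univariate setting (p = 1), and the data are hypothetical. The code shows a single weighted-mean update, not the full ECM algorithm.

```python
def t_weight(y, mu, sigma, nu):
    """Weight of observation y for a univariate t component (p = 1).

    Assumed form: u = (nu + 1) / (nu + d^2), with d^2 the squared
    standardized distance. Distant observations get small weights.
    """
    d2 = ((y - mu) / sigma) ** 2
    return (nu + 1.0) / (nu + d2)


def weighted_mean(ys, mu, sigma, nu):
    """One update of the location estimate using the t weights."""
    ws = [t_weight(y, mu, sigma, nu) for y in ys]
    return sum(w * y for w, y in zip(ws, ys)) / sum(ws)


# Hypothetical data: four typical points and one atypical observation.
data = [0.1, -0.2, 0.05, 0.3, 25.0]
plain_mean = sum(data) / len(data)  # pulled towards the outlier
robust_mean = weighted_mean(data, mu=0.0, sigma=1.0, nu=4.0)
```

With normal components every observation would carry equal weight in the mean update, which is why a single atypical point can distort the fit; here the outlier's weight is close to zero.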

Relevance:

10.00%

Publisher:

Abstract:

This paper develops an interactive approach for exploratory spatial data analysis. Measures of attribute similarity and spatial proximity are combined in a clustering model to support the identification of patterns in spatial information. Relationships between the developed clustering approach, spatial data mining and choropleth display are discussed. Analysis of property crime rates in Brisbane, Australia is presented. A surprising finding in this research is that there are substantial inconsistencies in standard choropleth display options found in two widely used commercial geographical information systems, both in terms of definition and performance. The comparative results demonstrate the usefulness and appeal of the developed approach in a geographical information system environment for exploratory spatial data analysis.
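
As a rough illustration of combining attribute similarity with spatial proximity, the sketch below blends the two distances with a weight. This is not the clustering model of the paper, whose details the abstract does not give; the weight `alpha`, the function names, and the data are all hypothetical.

```python
import math


def combined_distance(a, b, alpha=0.5):
    """Blend attribute and spatial distance between two areas.

    a, b: tuples (x, y, attribute). alpha = 1 uses the attribute only,
    alpha = 0 uses location only. Both parts are assumed to be on
    comparable scales already.
    """
    spatial = math.hypot(a[0] - b[0], a[1] - b[1])
    attribute = abs(a[2] - b[2])
    return alpha * attribute + (1.0 - alpha) * spatial


def assign(areas, seeds, alpha=0.5):
    """Assign each area to the index of its closest seed."""
    return [
        min(range(len(seeds)), key=lambda k: combined_distance(p, seeds[k], alpha))
        for p in areas
    ]


# Hypothetical areas: (x, y, crime rate).
areas = [(0.0, 0.0, 1.0), (1.0, 0.0, 9.0), (10.0, 0.0, 1.5), (11.0, 0.0, 9.5)]
seeds = [areas[0], areas[3]]
by_space = assign(areas, seeds, alpha=0.0)  # groups neighbouring areas
by_attr = assign(areas, seeds, alpha=1.0)   # groups areas with similar rates
```

Varying `alpha` between the two extremes changes which areas are grouped together, which is the kind of trade-off an interactive exploratory tool would expose.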

Relevance:

10.00%

Publisher:

Abstract:

Examples from the Murray-Darling basin in Australia are used to illustrate different methods of disaggregation of reconnaissance-scale maps. One approach for disaggregation revolves around the de-convolution of the soil-landscape paradigm elaborated during a soil survey. The descriptions of soil map units and block diagrams in a soil survey report detail soil-landscape relationships or soil toposequences that can be used to disaggregate map units into component landscape elements. Toposequences can be visualised on a computer by combining soil maps with digital elevation data. Expert knowledge or statistics can be used to implement the disaggregation. Use of a restructuring element and k-means clustering are illustrated. Another approach to disaggregation uses training areas to develop rules to extrapolate detailed mapping into other, larger areas where detailed mapping is unavailable. A two-level decision tree example is presented. At one level, the decision tree method is used to capture mapping rules from the training area; at another level, it is used to define the domain over which those rules can be extrapolated. (C) 2001 Elsevier Science B.V. All rights reserved.

Relevance:

10.00%

Publisher:

Abstract:

Using data from the H I Parkes All Sky Survey (HIPASS), we have searched for neutral hydrogen in galaxies in a region ∼25 × 25 deg² centred on NGC 1399, the nominal centre of the Fornax cluster. Within a velocity search range of 300-3700 km s⁻¹ and to a 3σ lower flux limit of ∼40 mJy, 110 galaxies with H I emission were detected, one of which is previously uncatalogued. None of the detections has early-type morphology. Previously unknown velocities for 14 galaxies have been determined, with a further four velocity measurements being significantly dissimilar to published values. Identification of an optical counterpart is relatively unambiguous for more than ∼90 per cent of our H I galaxies. The galaxies appear to be embedded in a sheet at the cluster velocity which extends for more than 30° across the search area. At the nominal cluster distance of ∼20 Mpc, this corresponds to an elongated structure more than 10 Mpc in extent. A velocity gradient across the structure is detected, with radial velocities increasing by ∼500 km s⁻¹ from south-east to north-west. The clustering of galaxies evident in optical surveys is only weakly suggested in the spatial distribution of our H I detections. Of 62 H I detections within a 10° projected radius of the cluster centre, only two are within the core region (projected radius

Relevance:

10.00%

Publisher:

Abstract:

Spatial data has now been used extensively in the Web environment, providing online customized maps and supporting map-based applications. The full potential of Web-based spatial applications, however, has yet to be achieved due to performance issues related to the large sizes and high complexity of spatial data. In this paper, we introduce a multiresolution approach to spatial data management and query processing such that the database server can choose spatial data at the right resolution level for different Web applications. One highly desirable property of the proposed approach is that the server-side processing cost and network traffic can be reduced when the level of resolution required by applications is low. Another advantage is that our approach pushes complex multiresolution structures and algorithms into the spatial database engine. That is, the developer of spatial Web applications need not be concerned with such complexity. This paper explains the basic idea, technical feasibility and applications of multiresolution spatial databases.

Relevance:

10.00%

Publisher:

Abstract:

The habit of inducing plant galls has evolved multiple times among insects but most species diversity occurs in only a few groups, such as gall midges and gall wasps. This phylogenetic clustering may reflect adaptive radiations in insect groups in which the trait has evolved. Alternatively, multiple independent origins of galling may suggest a selective advantage to the habit. We use DNA sequence data to examine the origins of galling among the most speciose group of gall-inducing scale insects, the eriococcids. We determine that the galling habit has evolved multiple times, including four times in Australian taxa, suggesting that there has been a selective advantage to galling in Australia. Additionally, although most gall-inducing eriococcid species occur on Myrtaceae, we found that lineages feeding on Myrtaceae are no more likely to have evolved the galling habit than those feeding on other plant groups. However, most gall-inducing species-richness is clustered in only two clades (Apiomorpha and Lachnodius + Opisthoscelis), all of which occur exclusively on Eucalyptus s.s. The Eriococcidae and the large genus Eriococcus were determined to be non-monophyletic and each will require revision. (C) 2004 The Linnean Society of London.

Relevance:

10.00%

Publisher:

Abstract:

Objective: To examine the quality of diabetes care and prevention of cardiovascular disease (CVD) in Australian general practice patients with type 2 diabetes and to investigate its relationship with coronary heart disease absolute risk (CHDAR). Methods: A total of 3286 patient records were extracted from registers of patients with type 2 diabetes held by 16 divisions of general practice (250 practices) across Australia for the year 2002. CHDAR was estimated using the United Kingdom Prospective Diabetes Study algorithm with higher CHDAR set at a 10 year risk of >15%. Multivariate multilevel logistic regression investigated the association between CHDAR and diabetes care. Results: 47.9% of diabetic patient records had glycosylated haemoglobin (HbA1c) >7%, 87.6% had total cholesterol >= 4.0 mmol/l, and 73.8% had blood pressure (BP) >= 130/85 mm Hg. 57.6% of patients were at a higher CHDAR, 76.8% of whom were not on lipid modifying medication and 66.2% were not on antihypertensive medication. After adjusting for clustering at the general practice level and age, lipid modifying medication was negatively related to CHDAR (odds ratio (OR) 0.84) and total cholesterol. Antihypertensive medication was positively related to systolic BP but negatively related to CHDAR (OR 0.88). Referral to ophthalmologists/optometrists and attendance at other health professionals were not related to CHDAR. Conclusions: At the time of the study the diabetes and CVD preventive care in Australian general practice was suboptimal, even after a number of national initiatives. The Australian Pharmaceutical Benefits Scheme (PBS) guidelines need to be modified to improve CVD preventive care in patients with type 2 diabetes.

Relevance:

10.00%

Publisher:

Abstract:

With the proliferation of relational database programs for PCs and other platforms, many business end-users are creating, maintaining, and querying their own databases. More importantly, business end-users use the output of these queries as the basis for operational, tactical, and strategic decisions. Inaccurate data reduce the expected quality of these decisions. Implementing various input validation controls, including higher levels of normalisation, can reduce the number of data anomalies entering the databases. Even in well-maintained databases, however, data anomalies will still accumulate. To improve the quality of data, databases can be queried periodically to locate and correct anomalies. This paper reports the results of two experiments that investigated the effects of different data structures on business end-users' abilities to detect data anomalies in a relational database. The results demonstrate that both unnormalised data structures and higher levels of normalisation lower the effectiveness and efficiency of queries relative to the first normal form. First normal form databases appear to provide the most effective and efficient data structure for business end-users formulating queries to detect data anomalies.
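
A minimal sketch of the kind of periodic anomaly-detection query the abstract describes, using Python's built-in `sqlite3` module. The table `orders`, its columns, and the rows are hypothetical and are not taken from the experiments; the query flags customers whose city is recorded inconsistently in a single first-normal-form table.

```python
import sqlite3

# In-memory database holding one hypothetical first-normal-form table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, customer_id INTEGER, customer_city TEXT)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, 1, "Brisbane"),
        (2, 1, "Brisbane"),
        (3, 2, "Sydney"),
        (4, 2, "Sidney"),  # inconsistent spelling: an update anomaly
    ],
)

# A customer stored with more than one distinct city is a candidate anomaly.
anomalies = conn.execute(
    "SELECT customer_id, COUNT(DISTINCT customer_city) "
    "FROM orders GROUP BY customer_id "
    "HAVING COUNT(DISTINCT customer_city) > 1"
).fetchall()
conn.close()
```

Because all attributes sit in one table, the check needs no joins; with the same data split across several normalised tables the equivalent query would need at least one join.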

Relevance:

10.00%

Publisher:

Abstract:

Cerebral Autosomal Dominant Arteriopathy with Subcortical Infarcts and Leucoencephalopathy (CADASIL) is a recently described cause of stroke or stroke-like episodes. It is caused by mutations in the Notch3 gene on chromosome 19p. We sought to demonstrate mutations of the Notch3 gene in Australian patients suspected of having CADASIL. Patients from several families were referred to the study. A diagnosis was determined clinically and by neuroimaging. Those suspected of having CADASIL had sequencing of exons 3 and 4 of the Notch3 gene. Eight patients, two of whom were siblings, were suspected of having CADASIL. Five patients (including the siblings) had mutations. Because of strong clustering of Notch3 mutations in CADASIL, this has potential as a reliable test for the disease in Australian patients. (C) 2001 Harcourt Publishers Ltd.

Relevance:

10.00%

Publisher:

Abstract:

When the data consist of certain attributes measured on the same set of items in different situations, they would be described as a three-mode three-way array. A mixture likelihood approach can be implemented to cluster the items (i.e., one of the modes) on the basis of both of the other modes simultaneously (i.e., the attributes measured in different situations). In this paper, it is shown that this approach can be extended to handle three-mode three-way arrays where some of the data values are missing at random in the sense of Little and Rubin (1987). The methodology is illustrated by clustering the genotypes in a three-way soybean data set where various attributes were measured on genotypes grown in several environments.

Relevance:

10.00%

Publisher:

Abstract:

Most mammalian defensins are cationic peptides 29-42 amino acids long, stabilized by three disulfide bonds. However, recently Tang et al. (1999, Science 286, 498-502) reported the isolation of a new defensin type found in the leukocytes of rhesus macaques. In contrast to all the other defensins found so far, rhesus theta defensin-1 (RTD-1) is composed of just 18 amino acids with the backbone cyclized through peptide bonds. Antibacterial activities of both the native cyclic peptide and a linear form were examined, showing that the cyclic form was 3-fold more active than the open chain analogue [Tang et al. (1999) Science 286, 498-502]. To elucidate the three-dimensional structure of RTD-1 and its open chain analogue, both peptides were synthesized using solid-phase peptide synthesis and tert-butyloxycarbonyl chemistry. The structures of both peptides in aqueous solution were determined from two-dimensional ¹H NMR data recorded at 500 and 750 MHz. Structural constraints consisting of interproton distances and dihedral angles were used as input for simulated-annealing calculations and water refinement with the program CNS. RTD-1 and its open chain analogue oRTD-1 adopt very similar structures in water. Both comprise an extended β-hairpin structure with turns at one or both ends. The turns are well defined within themselves and seem to be flexible with respect to the extended regions of the molecules. Although the two strands of the β-sheet are connected by three disulfide bonds, this region displays a degree of flexibility. The structural similarity of RTD-1 and its open chain analogue oRTD-1, as well as their comparable degree of flexibility, support the theory that the additional charges at the termini of the open chain analogue, rather than overall differences in structure or flexibility, are the cause of oRTD-1's lower antimicrobial activity. In contrast to numerous other antimicrobial peptides, RTD-1 does not display any amphiphilic character, even though surface models of RTD-1 exhibit a certain clustering of positive charges. Some amide protons of RTD-1 that should be solvent-exposed in monomeric β-sheet structures show low temperature coefficients, suggesting the possible presence of weak intermolecular hydrogen bonds.

Relevance:

10.00%

Publisher:

Abstract:

A multivariate model using hierarchical clustering and discriminant analysis is used to identify clusters of community opportunity and community vulnerability across Australia's mega metropolitan regions. Variables used in the model measure aspects of structural economic change, occupational change, human capital, income, unemployment, family/household disadvantage, and housing stress. A nine-cluster solution is used to categorise communities across metropolitan space. Significant between-city variations in the incidence of these clusters of opportunity and vulnerability are apparent, suggesting the emergence of marked differentiation between Australia's mega metropolitan regions in their adjustments to changing economic and social conditions. JEL classification: C49, R11, R12.

Relevance:

10.00%

Publisher:

Abstract:

Most Internet search engines are keyword-based. They are not efficient for queries where geographical location is important, such as finding hotels within an area or close to a place of interest. A natural interface for spatial searching is a map, which can be used not only to display locations of search results but also to assist in forming search conditions. A map-based search engine requires a well-designed visual interface that is intuitive to use yet flexible and expressive enough to support various types of spatial queries as well as aspatial queries. Similar to hyperlinks for text and images in an HTML page, spatial objects in a map should support hyperlinks. Such an interface needs to be scalable with the size of the geographical regions and the number of websites it covers. Despite typically handling a very large amount of spatial data, a map-based search interface should meet the expectation of fast response time for interactive applications. In this paper we discuss general requirements and the design for a new map-based web search interface, focusing on integration with the WWW and visual spatial query interface. A number of current and future research issues are discussed, and a prototype for the University of Queensland is presented. (C) 2001 Published by Elsevier Science Ltd.
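
The "close to a place of interest" query mentioned in the abstract can be sketched as a radius filter over point locations. This is only an illustration, not the prototype's implementation: the great-circle (haversine) distance is a swapped-in standard technique, and the place names, coordinates, and function names are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (
        math.sin(dphi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    )
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def within_radius(places, lat, lon, radius_km):
    """Names of places no farther than radius_km from (lat, lon)."""
    return [
        name
        for name, plat, plon in places
        if haversine_km(lat, lon, plat, plon) <= radius_km
    ]


# Hypothetical hotels: (name, latitude, longitude).
places = [("Hotel A", -27.470, 153.030), ("Hotel B", -33.870, 151.210)]
nearby = within_radius(places, -27.497, 153.013, 10.0)
```

A linear scan like this is fine for a handful of points; meeting the fast response times the abstract calls for on large data sets would normally require a spatial index.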