33 resultados para categorical and mix datasets


Relevância:

30.00% 30.00%

Publicador:

Resumo:

One of the challenges in scientific visualization is to generate software libraries suitable for the large-scale data emerging from tera-scale simulations and instruments. We describe the efforts currently under way at SDSC and NPACI to address these challenges. The scope of the SDSC project spans data handling, graphics, visualization, and scientific application domains. Components of the research focus on the following areas: intelligent data storage, layout and handling, using an associated “Floor-Plan” (meta data); performance optimization on parallel architectures; extension of SDSC’s scalable, parallel, direct volume renderer to allow perspective viewing; and interactive rendering of fractional images (“imagelets”), which facilitates the examination of large datasets. These concepts are coordinated within a data-visualization pipeline, which operates on component data blocks sized to fit within the available computing resources. A key feature of the scheme is that the meta data, which tag the data blocks, can be propagated and applied consistently. This is possible at the disk level, in distributing the computations across parallel processors; in “imagelet” composition; and in feature tagging. The work reflects the emerging challenges and opportunities presented by the ongoing progress in high-performance computing (HPC) and the deployment of the data, computational, and visualization Grids.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Data mining is the process to identify valid, implicit, previously unknown, potentially useful and understandable information from large databases. It is an important step in the process of knowledge discovery in databases, (Olaru & Wehenkel, 1999). In a data mining process, input data can be structured, seme-structured, or unstructured. Data can be in text, categorical or numerical values. One of the important characteristics of data mining is its ability to deal data with large volume, distributed, time variant, noisy, and high dimensionality. A large number of data mining algorithms have been developed for different applications. For example, association rules mining can be useful for market basket problems, clustering algorithms can be used to discover trends in unsupervised learning problems, classification algorithms can be applied in decision-making problems, and sequential and time series mining algorithms can be used in predicting events, fault detection, and other supervised learning problems (Vapnik, 1999). Classification is among the most important tasks in the data mining, particularly for data mining applications into engineering fields. Together with regression, classification is mainly for predictive modelling. So far, there have been a number of classification algorithms in practice. According to (Sebastiani, 2002), the main classification algorithms can be categorized as: decision tree and rule based approach such as C4.5 (Quinlan, 1996); probability methods such as Bayesian classifier (Lewis, 1998); on-line methods such as Winnow (Littlestone, 1988) and CVFDT (Hulten 2001), neural networks methods (Rumelhart, Hinton & Wiliams, 1986); example-based methods such as k-nearest neighbors (Duda & Hart, 1973), and SVM (Cortes & Vapnik, 1995). Other important techniques for classification tasks include Associative Classification (Liu et al, 1998) and Ensemble Classification (Tumer, 1996).

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This study describes the pedagogical impact of real-world experimental projects undertaken as part of an advanced undergraduate Fluid Mechanics subject at an Australian university. The projects have been organised to complement traditional lectures and introduce students to the challenges of professional design, physical modelling, data collection and analysis. The physical model studies combine experimental, analytical and numerical work in order to develop students’ abilities to tackle real-world problems. A first study illustrates the differences between ideal and real fluid flow force predictions based upon model tests of buildings in a large size wind tunnel used for research and professional testing. A second study introduces the complexity arising from unsteady non-uniform wave loading on a sheltered pile. The teaching initiative is supported by feedback from undergraduate students. The pedagogy of the course and projects is discussed with reference to experiential, project-based and collaborative learning. The practical work complements traditional lectures and tutorials, and provides opportunities which cannot be learnt in the classroom, real or virtual. Student feedback demonstrates a strong interest for the project phases of the course. This was associated with greater motivation for the course, leading in turn to lower failure rates. In terms of learning outcomes, the primary aim is to enable students to deliver a professional report as the final product, where physical model data are compared to ideal-fluid flow calculations and real-fluid flow analyses. Thus the students are exposed to a professional design approach involving a high level of expertise in fluid mechanics, with sufficient academic guidance to achieve carefully defined learning goals, while retaining sufficient flexibility for students to construct there own learning goals. The overall pedagogy is a blend of problem-based and project-based learning, which reflects academic research and professional practice. The assessment is a mix of peer-assessed oral presentations and written reports that aims to maximise student reflection and development. Student feedback indicated a strong motivation for courses that include a well-designed project component.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A Geographic Information System (GIS) was used to model datasets of Leyte Island, the Philippines, to identify land which was suitable for a forest extension program on the island. The datasets were modelled to provide maps of the distance of land from cities and towns, land which was a suitable elevation and slope for smallholder forestry and land of various soil types. An expert group was used to assign numeric site suitabilities to the soil types and maps of site suitability were used to assist the selection of municipalities for the provision of extension assistance to smallholders. Modelling of the datasets was facilitated by recent developments of the ArcGIS® suite of computer programs and derivation of elevation and slope was assisted by the availability of digital elevation models (DEM) produced by the Shuttle Radar Topography (SRTM) mission. The usefulness of GIS software as a decision support tool for small-scale forestry extension programs is discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The area of private land suitable and available for growing hoop pine (Araucaria cunninghamii) on the Atherton Tablelands in North Queensland was modelled using a geographic information system (GIS). In Atherton, Eacham and Herberton shires, approximately 64,700 ha of privately owned land were identified as having a mean annual rainfall and soil type similar to Forestry Plantations Queensland (FPQ) hoop pine growth plots with an approximate growth rate of 20 m3 per annum. Land with slope of over 25° and land covered with native vegetation were excluded in the estimation. If land which is currently used for high-value agriculture is also excluded, the net area of land potentially suitable and available for expansion of hoop pine plantations is approximately 22,900 ha. Expert silvicultural advice emphasized the role of site preparation and weed control in affecting the long-term growth rate of hoop pine. Hence, sites with less than optimal fertility and rainfall may be considered as being potentially suitable for growing hoop pine at a lower growth rate. The datasets had been prepared at various scales and differing precision for their description of land attributes. Therefore, the results of this investigation have limited applicability for planning at the individual farm level but are useful at the regional level to target areas for plantation expansion.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In order to examine whether different populations show the same pattern of onset in the Southern Hemisphere, we examined the age-at-first-admission distribution for schizophrenia based on mental health registers from Australia and Brazil. Data on age-at-first-admission for individuals with schizophrenia were extracted from two names-linked registers, (1) the Queensland Mental Health Statistics System, Australia (N=7651, F= 3293, M=4358), and (2) a psychiatric hospital register in Pelotas, Brazil (N=4428, F=2220, M=2208). Age distributions were derived for males and females for both datasets. The general population structure tbr both countries was also obtained. There were significantly more males in the Queensland dataset (gz = 56.9, df3, p < 0.0001 ). Both dataset distributions were skewed to the right. Onset rose steeply after puberty to reach a modal age group of 20-29 for men and women, with a more gradual tail toward the older age groups. In Queensland 68% of women with schizophrenia had their first admissions after age 30, while the proportion from Brazil was 58%. Compared to the Australian dataset, the Brazilian dataset had a slightly greater proportion of first admissions under the age 30 and a slightly smaller proportion over the age of 60 years. This reflects the underlying age distributions of the two populations. This study confirms the wide age range and gender differences in age-at-first-admission distributions for schizophrenia and identified a significant difference in the gender ratio between the two datasets. Given widely differing health services, cultural practices, ethic variability, and the different underlying population distributions, the age-at-first-admission in Queensland and Brazil showed more similarities than differences. Acknowledgments: The Stanley Foundation supported this project.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objective To determine the costs and benefits of interventions for maternal and newborn health to assess the appropriateness of current strategies and guide future plans to attain the millennium development goals. Design Cost effectiveness analysis. Setting Two regions classified by the World Health Organization according to their epidemiological grouping: Afr-E, those countries in sub-Saharan Africa with very high adult and high child mortality, and Sear-D, comprising countries in South East Asia with high adult and high child mortality. Data sources Effectiveness data from several sources, including trials, observational studies, and expert opinion. For resource inputs, quantifies came from WHO guidelines, literature, and expert opinion, and prices from the WHO choosing interventions that are cost effective database. Main outcome measures Cost per disability adjusted life year (DALY) averted in year 2000 international dollars. Results The most cost effective mix of interventions was similar in Afr-E and Sear-D. These were the community based newborn care package, followed by antenatal care (tetanus toxoid, screening for pre-eclampsia, screening and treatment of asymptomatic bacteriuria and syphilis); skilled attendance at birth, offering first level maternal and neonatal care around childbirth; and emergency obstetric and neonatal care around and after birth. Screening and treatment of maternal syphilis, community based management of neonatal pneumonia, and steroids given during the antenatal period were relatively less cost effective in Sear-D. Scaling up all of the included interventions to 95% coverage would halve neonatal and maternal deaths. Conclusion Preventive interventions at the community level for newborn babies and at the primary care level for mothers and newborn babies are extremely cost effective, but the millennium development goals for maternal and child health will not be achieved without universal access to clinical services as well.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Phytophthora-resistant lucerne cultivars do not always perform well under conditions of high disease pressure in the field. To determine whether resistance expression remains stable under different infection intensities, tetraploid and diploid lucerne genotypes, genotypically defined for their reactions to Phytophthora medicaginis, were clonally propagated, and the influence of different reproducible inoculum levels (0 . 5 and 5 . 0 g dry weight mycelium/kg dry weight potting mix), the period of exposure to these levels (10-60 days), and temperature (16/22 degrees C and 24/30 degrees C) on disease expression was determined in controlled environments. Generally, expression of resistance by resistant genotypes, remained stable under these conditions. Biotic (e.g. Aphanomyces eutiches) or abiotic factors other than P. medicaginis may be responsible for the poorer than expected performance under field conditions in some instances, or the percentage of resistant plants in some cultivars currently classified as resistant is insufficient to provide buffering against productivity reductions under severe epidemics. Further research is needed to clarify the situation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We have examined the effect of tubal sterilisation and hysterectomy on risk of ovarian cancer in a large case-control study in eastern Australia involving 824 women aged 18-79 years, diagnosed with epithelial ovarian cancer between 1990 and 1993, and 855 controls randomly selected from the electoral roll. Relative risks for ovarian cancer were estimated using multiple categorical regression to adjust for age, parity, oral contraceptive use and other risk factors. Tubal sterilisation was associated with a 39% reduction in risk of ovarian cancer (RR 0.61, 95% Cl 0.46-0.85) and hysterectomy with a 36% reduction (RR 0.64, 95% Cl 0.48-0.85). Risk remained low 25 years after surgery and was reduced irrespective of sterilisation technique, and estimates were similar among various types of epithelial ovarian cancer. The greatest reduction (74%) was observed among women with primary peritoneal tumours. Pelvic infection and use of vaginal sprays or contraceptive foams were not related to ovarian cancer, while use of talc in the perineal region slightly but significantly increased risk among women with patent fallopian tubes. Reportedly heavy or painful menses, perhaps associated with retrograde flow, were associated with ovarian cancer, and reduction in risk of disease after hysterectomy was greatest among women who had heavy periods. Our findings support the theory that contaminants from the vagina, such as talc, and from the uterus, such as endometrium, gain access to the peritoneal cavity through patent fallopian tubes and may enhance the malignant transformation of ovarian surface epithelium. Surgical tubal occlusion may reduce the risk of ovarian cancer by preventing the access of such agents. (C) 1997 Wiley-Liss, Inc.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We inferred the phylogeny of 33 species of ticks from the subfamilies Rhipicephalinae and Hyalomminae from analyses of nuclear and mitochondrial DNA and morphology. We used nucleotide sequences from 12S rRNA, cytochrome c oxidase I, internal transcribed spacer 2 of the nuclear rRNA, and 18S rRNA. Nucleotide sequences and morphology were analyzed separately and together in a total-evidence analysis. Analyses of the five partitions together (3303 characters) gave the best-resolved and the best-supported hypothesis so far for the phylogeny of ticks in the Rhipicephalinae and Hyalomminae, despite the fact that some partitions did not have data for some taxa. However, most of the hidden conflict (lower support in the total-evidence analyses compared to that in the individual analyses) was found in those partitions that had taxa without data. The partitions with complete taxonomic sampling had more hidden support (higher support in the total-evidence analyses compared to that in the separate-partition analyses) than hidden conflict. Mapping of geographic origins of ticks onto our phylogeny indicates an African origin for the Rhipicephalinae sensu lato (i.e., including Hyalomma spp.), the Rhipicephalus-Boophilus lineage, the Dermacentor-Anocentor lineage, and the Rhipicephalus-Booophilus-Nosomma-Hyalomma-Rhipicentor lineage. The Nosomma-Hyalomma lineage appears to have evolved in Asia. Our total-evidence phylogeny indicates that (i) the genus Rhipicephalus is paraphyletic with respect to the genus Boophilus, (ii) the genus Dermacentor is paraphyletic with respect to the genus Anocentor, and (iii) some subgenera of the genera Hyalomma and Rhipicephalus are paraphyletic with respect to other subgenera in these genera. Study of the Rhipicephalinae and Hyalomminae over the last 7 years has shown that analyses of individual datasets (e.g., one gene or morphology) seldom resolve many phylogenetic relationships, but analyses of more than one dataset can generate well-resolved phylogenies for these ticks. (C) 2001 Academic Press.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Generally employment has been studied in terms of changes in the types of goods and services that the economy is purchasing. Far less attention has been given to the occupational aggregates that go into producing these goods and services. The few studies that did investigate this area found that the mix of tabour inputs appear to have been changing over time in a systematic pattern. The increasing prevalence of white-collar, information workers gave rise to the assertion that many societies had entered a post-industrial information age. Deals first of aff with some issues of measurement in the context of the Australian labour force, then looks at trends in various occupational groups using a non-standard four-sector classification of the labour force. Finally suggests an application in relation to the link between education and training and its ability to reduce structural unemployment.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The principal aim of this paper is to measure the amount by which the profit of a multi-input, multi-output firm deviates from maximum short-run profit, and then to decompose this profit gap into components that are of practical use to managers. In particular, our interest is in the measurement of the contribution of unused capacity, along with measures of technical inefficiency, and allocative inefficiency, in this profit gap. We survey existing definitions of capacity and, after discussing their shortcomings, we propose a new ray economic capacity measure that involves short-run profit maximisation, with the output mix held constant. We go on to describe how the gap between observed profit and maximum profit can be calculated and decomposed using linear programming methods. The paper concludes with an empirical illustration, involving data on 28 international airline companies. The empirical results indicate that these airline companies achieve profit levels which are on average US$815m below potential levels, and that 70% of the gap may be attributed to unused capacity. (C) 2002 Elsevier Science B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Molecular evolution has been considered to be essentially a stochastic process, little influenced by the pace of phenotypic change. This assumption was challenged by a study that demonstrated an association between rates of morphological and molecular change estimated for total-evidence phylogenies, a finding that led some researchers to challenge molecular date estimates of major evolutionary radiations. Here we show that Omland's (1997) result is probably due to methodological bias, particularly phylogenetic nonindependence, rather than being indicative of an underlying evolutionary phenomenon. We apply three new methods specifically designed to overcome phylogenetic bias to 13 published phylogenetic datasets for vertebrate taxa, each of which includes both morphological characters and DNA sequence data. We find no evidence of an association between rates of molecular and morphological rates of change.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The mechanism underlying segregation in liquid fluidized beds is investigated in this paper, A binary fluidized bed system not at a stable equilibrium condition. is modelled in the literature as forming a mixed part-corresponding to stable mixture-at the bottom of the bed and a pure layer of excess components always floating on the mixed part. On the basis of this model: (0 comprehensive criteria for binary particles of any type to mix/segregate, and (ii) mixing, segregation regime map in terms of size ratio and density ratio of the particles for a given fluidizing medium, are established in this work. Therefore, knowing the properties of given particles, a second type of particles can be chosen in order to avoid or to promote segregation according to the particular process requirements. The model is then advanced for multicomponent fluidized beds and validated against experimental results observed for ternary fluidized beds. (C) 2002 Elsevier Science B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objectives: To compare the population modelling programs NONMEM and P-PHARM during investigation of the pharmacokinetics of tacrolimus in paediatric liver-transplant recipients. Methods: Population pharmacokinetic analysis was performed using NONMEM and P-PHARM on retrospective data from 35 paediatric liver-transplant patients receiving tacrolimus therapy. The same data were presented to both programs. Maximum likelihood estimates were sought for apparent clearance (CL/F) and apparent volume of distribution (V/F). Covariates screened for influence on these parameters were weight, age, gender, post-operative day, days of tacrolimus therapy, transplant type, biliary reconstructive procedure, liver function tests, creatinine clearance, haematocrit, corticosteroid dose, and potential interacting drugs. Results: A satisfactory model was developed in both programs with a single categorical covariate - transplant type - providing stable parameter estimates and small, normally distributed (weighted) residuals. In NONMEM, the continuous covariates - age and liver function tests - improved modelling further. Mean parameter estimates were CL/F (whole liver) = 16.3 1/h, CL/F (cut-down liver) = 8.5 1/h and V/F = 565 1 in NONMEM, and CL/F = 8.3 1/h and V/F = 155 1 in P-PHARM. Individual Bayesian parameter estimates were CL/F (whole liver) = 17.9 +/- 8.8 1/h, CL/F (cutdown liver) = 11.6 +/- 18.8 1/h and V/F = 712 792 1 in NONMEM, and CL/F (whole liver) = 12.8 +/- 3.5 1/h, CL/F (cut-down liver) = 8.2 +/- 3.4 1/h and V/F = 221 1641 in P-PHARM. Marked interindividual kinetic variability (38-108%) and residual random error (approximately 3 ng/ml) were observed. P-PHARM was more user friendly and readily provided informative graphical presentation of results. NONMEM allowed a wider choice of errors for statistical modelling and coped better with complex covariate data sets. Conclusion: Results from parametric modelling programs can vary due to different algorithms employed to estimate parameters, alternative methods of covariate analysis and variations and limitations in the software itself.