914 resultados para Data selection


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Staphylococcus aureus (S. aureus) is a prominent human and livestock pathogen investigated widely using omic technologies. Critically, due to availability, low visibility or scattered resources, robust network and statistical contextualisation of the resulting data is generally under-represented. Here, we present novel meta-analyses of freely-accessible molecular network and gene ontology annotation information resources for S. aureus omics data interpretation. Furthermore, through the application of the gene ontology annotation resources we demonstrate their value and ability (or lack-there-of) to summarise and statistically interpret the emergent properties of gene expression and protein abundance changes using publically available data. This analysis provides simple metrics for network selection and demonstrates the availability and impact that gene ontology annotation selection can have on the contextualisation of bacterial omics data.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This research identifies the commuting mode choice behaviour of 3537 adults living in different types of transit oriented development (TOD) in Brisbane by disentangling the effects of their “evil twin” transit adjacent developments (TADs), and by also controlling for residential self-selection, travel attitudes and preferences, and socio-demographic effects. A TwoStep cluster analysis was conducted to identify the natural groupings of respondents’ living environment based on six built environment indicators. The analysis resulted in five types of neighbourhoods: urban TODs, activity centre TODs, potential TODs, TADs, and traditional suburbs. HABITAT survey data were used to derive the commute mode choice behaviour of people living in these neighbourhoods. In addition, statements reflecting both respondents’ travel attitudes and living preferences were also collected as part of the survey. Factor analyses were conducted based on these statements and these derived factors were then used to control for residential self-selection. Four binary logistic regression models were estimated, one for each of the travel modes used (e.g. public transport, active transport, less sustainable transport such as the car/taxi, and other), to differentiate between the commuting behaviour of people living in the five types of neighbourhoods. The findings verify that urban TODs enhance the use of public transport and reduce car usage. No significant difference was found in the commuting behaviour between respondents living in traditional suburbs and TADs. The results confirm the hypothesis that TADs are the “evil twin” of TODs. The data indicates that TADs and the mode choices of residents in these neighbourhoods is a missed transport policy opportunity. Further policy efforts are required for a successive transition of TADs into TODs in order to realise the full benefits of these. TOD policy should also be integrated with context specific TOD design principles.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background Spatial analysis is increasingly important for identifying modifiable geographic risk factors for disease. However, spatial health data from surveys are often incomplete, ranging from missing data for only a few variables, to missing data for many variables. For spatial analyses of health outcomes, selection of an appropriate imputation method is critical in order to produce the most accurate inferences. Methods We present a cross-validation approach to select between three imputation methods for health survey data with correlated lifestyle covariates, using as a case study, type II diabetes mellitus (DM II) risk across 71 Queensland Local Government Areas (LGAs). We compare the accuracy of mean imputation to imputation using multivariate normal and conditional autoregressive prior distributions. Results Choice of imputation method depends upon the application and is not necessarily the most complex method. Mean imputation was selected as the most accurate method in this application. Conclusions Selecting an appropriate imputation method for health survey data, after accounting for spatial correlation and correlation between covariates, allows more complete analysis of geographic risk factors for disease with more confidence in the results to inform public policy decision-making.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this report an artificial neural network (ANN) based automated emergency landing site selection system for unmanned aerial vehicle (UAV) and general aviation (GA) is described. The system aims increase safety of UAV operation by emulating pilot decision making in emergency landing scenarios using an ANN to select a safe landing site from available candidates. The strength of an ANN to model complex input relationships makes it a perfect system to handle the multicriteria decision making (MCDM) process of emergency landing site selection. The ANN operates by identifying the more favorable of two landing sites when provided with an input vector derived from both landing site's parameters, the aircraft's current state and wind measurements. The system consists of a feed forward ANN, a pre-processor class which produces ANN input vectors and a class in charge of creating a ranking of landing site candidates using the ANN. The system was successfully implemented in C++ using the FANN C++ library and ROS. Results obtained from ANN training and simulations using randomly generated landing sites by a site detection simulator data verify the feasibility of an ANN based automated emergency landing site selection system.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Spatial data analysis has become more and more important in the studies of ecology and economics during the last decade. One focus of spatial data analysis is how to select predictors, variance functions and correlation functions. However, in general, the true covariance function is unknown and the working covariance structure is often misspecified. In this paper, our target is to find a good strategy to identify the best model from the candidate set using model selection criteria. This paper is to evaluate the ability of some information criteria (corrected Akaike information criterion, Bayesian information criterion (BIC) and residual information criterion (RIC)) for choosing the optimal model when the working correlation function, the working variance function and the working mean function are correct or misspecified. Simulations are carried out for small to moderate sample sizes. Four candidate covariance functions (exponential, Gaussian, Matern and rational quadratic) are used in simulation studies. With the summary in simulation results, we find that the misspecified working correlation structure can still capture some spatial correlation information in model fitting. When the sample size is large enough, BIC and RIC perform well even if the the working covariance is misspecified. Moreover, the performance of these information criteria is related to the average level of model fitting which can be indicated by the average adjusted R square ( [GRAPHICS] ), and overall RIC performs well.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We investigate methods for data-based selection of working covariance models in the analysis of correlated data with generalized estimating equations. We study two selection criteria: Gaussian pseudolikelihood and a geodesic distance based on discrepancy between model-sensitive and model-robust regression parameter covariance estimators. The Gaussian pseudolikelihood is found in simulation to be reasonably sensitive for several response distributions and noncanonical mean-variance relations for longitudinal data. Application is also made to a clinical dataset. Assessment of adequacy of both correlation and variance models for longitudinal data should be routine in applications, and we describe open-source software supporting this practice.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Efficiency of analysis using generalized estimation equations is enhanced when intracluster correlation structure is accurately modeled. We compare two existing criteria (a quasi-likelihood information criterion, and the Rotnitzky-Jewell criterion) to identify the true correlation structure via simulations with Gaussian or binomial response, covariates varying at cluster or observation level, and exchangeable or AR(l) intracluster correlation structure. Rotnitzky and Jewell's approach performs better when the true intracluster correlation structure is exchangeable, while the quasi-likelihood criteria performs better for an AR(l) structure.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Statistical methods are often used to analyse commercial catch and effort data to provide standardised fishing effort and/or a relative index of fish abundance for input into stock assessment models. Achieving reliable results has proved difficult in Australia's Northern Prawn Fishery (NPF), due to a combination of such factors as the biological characteristics of the animals, some aspects of the fleet dynamics, and the changes in fishing technology. For this set of data, we compared four modelling approaches (linear models, mixed models, generalised estimating equations, and generalised linear models) with respect to the outcomes of the standardised fishing effort or the relative index of abundance. We also varied the number and form of vessel covariates in the models. Within a subset of data from this fishery, modelling correlation structures did not alter the conclusions from simpler statistical models. The random-effects models also yielded similar results. This is because the estimators are all consistent even if the correlation structure is mis-specified, and the data set is very large. However, the standard errors from different models differed, suggesting that different methods have different statistical efficiency. We suggest that there is value in modelling the variance function and the correlation structure, to make valid and efficient statistical inferences and gain insight into the data. We found that fishing power was separable from the indices of prawn abundance only when we offset the impact of vessel characteristics at assumed values from external sources. This may be due to the large degree of confounding within the data, and the extreme temporal changes in certain aspects of individual vessels, the fleet and the fleet dynamics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objective To attenuate two strains of Eimeria tenella by selecting for precocious development and evaluate the strains in characterisation trials and by field evaluation, to choose one precocious line for incorporation into an Australian live coccidiosis vaccine for poultry. Design Two strains from non-commercial flocks were passaged through chickens while selecting for precocious development. Each strain was characterised for drug sensitivity, pathogenicity, protection against homologous and heterologous challenge, and oocyst output in replicated experiments in which the experimental unit was a cage of three birds. Oocyst output and/or body weight gain data collected over a 10 to 12 day period following final inoculation were measured. Feed conversion ratios were also calculated where possible. Results Fifteen passages resulted in prepatent periods reduced by 24 h for the Redlands strain (from 144 h to 120 h)and 23 h for the Darryl strain (from 139 h to 116 h). Characterisation trials demonstrated that each precocious line was significantly less pathogenic than its parent strain and each effectively induced immunity that protected chickens against challenge with both the parent strain and other virulent field strains. Both lines had oocyst outputs that, although significantly reduced relative to the parent strains, remained sufficiently high for commercial vaccine production, and both showed susceptibility to coccidiostats. Conclusion Two attenuated lines have been produced that exhibit the appropriate characteristics for use in an Australian live coccidiosis vaccine.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Modeling of cultivar x trial effects for multienvironment trials (METs) within a mixed model framework is now common practice in many plant breeding programs. The factor analytic (FA) model is a parsimonious form used to approximate the fully unstructured form of the genetic variance-covariance matrix in the model for MET data. In this study, we demonstrate that the FA model is generally the model of best fit across a range of data sets taken from early generation trials in a breeding program. In addition, we demonstrate the superiority of the FA model in achieving the most common aim of METs, namely the selection of superior genotypes. Selection is achieved using best linear unbiased predictions (BLUPs) of cultivar effects at each environment, considered either individually or as a weighted average across environments. In practice, empirical BLUPs (E-BLUPs) of cultivar effects must be used instead of BLUPs since variance parameters in the model must be estimated rather than assumed known. While the optimal properties of minimum mean squared error of prediction (MSEP) and maximum correlation between true and predicted effects possessed by BLUPs do not hold for E-BLUPs, a simulation study shows that E-BLUPs perform well in terms of MSEP.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Black point in wheat has the potential to cost the Australian industry $A30.4 million a year. It is difficult and expensive to screen for resistance, so the aim of this study was to validate 3 previously identified quantitative trait loci (QTLs) for black point resistance on chromosomes 2B, 4A, and 3D of the wheat variety Sunco. Black point resistance data and simple sequence repeat (SSR) markers, linked to the resistance QTLs and suited to high-throughput assay, were analysed in the doubled haploid population, Batavia (susceptible) × Pelsart (resistant). Sunco and Pelsart both have Cook in their pedigree and both have the Triticum timopheevii translocation on 2B. SSR markers identified for the 3 genetic regions were gwm319 (2B, T. timopheevii translocation), wmc048 (4AS), and gwm341 (3DS). Gwm319 and wmc048 were associated with black point resistance in the validation population. Gwm341 may have an epistatic influence on the trait because when resistance alleles were present at both gwm319 and wmc048, the Batavia-derived allele at gwm341 was associated with a higher proportion of resistant lines. Data are presented showing the level of enrichment achieved for black point resistance, using 1, 2, or 3 of these molecular markers, and the number of associated discarded resistant lines. The level of population enrichment was found to be 1.83-fold with 6 of 17 resistant lines discarded when gwm319 and wmc048 were both used for selection. Interactions among the 3 QTLs appear complex and other genetic and epigenetic factors influence susceptibility to black point. Polymorphism was assessed for these markers within potential breeding material. This indicated that alternative markers to wmc048 may be required for some parental combinations. Based on these results, marker-assisted selection for the major black point resistance QTLs can increase the rate of genetic gain by improving the selection efficiency and may facilitate stacking of black point resistances from different sources.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The value of CLIMEX models to inform biocontrol programs was assessed, including predicting the potential distribution of biocontrol agents and their subsequent population dynamics, using bioclimatic models for the weed Parkinsonia aculeata, two Lantana camara biocontrol agents, and five Mimosa pigra biocontrol agents. The results showed the contribution of data types to CLIMEX models and the capacity of these models to inform and improve the selection, release and post release evaluation of biocontrol agents. Foremost among these was the quality of spatial and temporal information as well as the extent to which overseas range data samples the species’ climatic envelope. Post hoc evaluation and refinement of these models requires improved long-term monitoring of introduced agents and their dynamics at well selected study sites. The authors described the findings of these case studies, highlighted their implications, and considered how to incorporate models effectively into biocontrol programs.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The variation in liveweight gain in grazing beef cattle as influenced by pasture type, season and year effects has important economic implications for mixed crop-livestock systems and the ability to better predict such variation would benefit beef producers by providing a guide for decision making. To identify key determinants of liveweight change of Brahman-cross steers grazing subtropical pastures, measurements of pasture quality and quantity, and diet quality in parallel with liveweight were made over two consecutive grazing seasons (48 and 46 weeks, respectively), on mixed Clitoria ternatea/grass, Stylosanthes seabrana/grass and grass swards (grass being a mixture of Bothriochloa insculpta cv. Bisset, Dichanthium sericeum and Panicum maximum var. trichoglume cv. Petrie). Steers grazing the legume-based pastures had the highest growth rate and gained between 64 and 142 kg more than those grazing the grass pastures in under 12 months. Using an exponential model, green leaf mass, green leaf %, adjusted green leaf % (adjusted for inedible woody legume stems), faecal near infrared reflectance spectroscopy predictions of diet crude protein and diet dry matter digestibility, accounted for 77, 74, 80, 63 and 60%, respectively, of the variation in daily weight gain when data were pooled across pasture types and grazing seasons. The standard error of the regressions indicated that 95% prediction intervals were large (+/- 0.42-0.64 kg/head.day) suggesting that derived regression relationships have limited practical application for accurately estimating growth rate. In this study, animal factors, especially compensatory growth effects, appeared to have a major influence on growth rate in relation to pasture and diet attributes. It was concluded that predictions of growth rate based only on pasture or diet attributes are unlikely to be accurate or reliable. Nevertheless, key pasture attributes such as green leaf mass and green leaf% provide a robust indication of what proportion of the potential growth rate of the grazing animals can be achieved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The selection of different patch types for grazing by cattle in tropical savannas is well documented. Advances in high resolution satellite imagery and computing power now allow us to identify patch types over an entire paddock, combined with GPS collars as a non instrusive method of capturing positional data, an accurate and comprehensive picture of landscape use by cattle can be quantified.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Patch selection by grazing animals is difficult to quantify, particularly in large, extensive paddocks like those in northern Australia. However, advances in high resolution satellite imagery now allow identification of patch types over an entire paddock which combined with GPS collars to capture positional data, can give an accurate and comprehensive picture of landscape use by cattle.