17 resultados para Blog datasets
em eResearch Archive - Queensland Department of Agriculture
Resumo:
Marker ordering during linkage map construction is a critical component of QTL mapping research. In recent years, high-throughput genotyping methods have become widely used, and these methods may generate hundreds of markers for a single mapping population. This poses problems for linkage analysis software because the number of possible marker orders increases exponentially as the number of markers increases. In this paper, we tested the accuracy of linkage analyses on simulated recombinant inbred line data using the commonly used Map Manager QTX (Manly et al. 2001: Mammalian Genome 12, 930-932) software and RECORD (Van Os et al. 2005: Theoretical and Applied Genetics 112, 30-40). Accuracy was measured by calculating two scores: % correct marker positions, and a novel, weighted rank-based score derived from the sum of absolute values of true minus observed marker ranks divided by the total number of markers. The accuracy of maps generated using Map Manager QTX was considerably lower than those generated using RECORD. Differences in linkage maps were often observed when marker ordering was performed several times using the identical dataset. In order to test the effect of reducing marker numbers on the stability of marker order, we pruned marker datasets focusing on regions consisting of tightly linked clusters of markers, which included redundant markers. Marker pruning improved the accuracy and stability of linkage maps because a single unambiguous marker order was produced that was consistent across replications of analysis. Marker pruning was also applied to a real barley mapping population and QTL analysis was performed using different map versions produced by the different programs. While some QTLs were identified with both map versions, there were large differences in QTL mapping results. Differences included maximum LOD and R-2 values at QTL peaks and map positions, thus highlighting the importance of marker order for QTL mapping
Resumo:
Many statistical forecast systems are available to interested users. In order to be useful for decision-making, these systems must be based on evidence of underlying mechanisms. Once causal connections between the mechanism and their statistical manifestation have been firmly established, the forecasts must also provide some quantitative evidence of `quality’. However, the quality of statistical climate forecast systems (forecast quality) is an ill-defined and frequently misunderstood property. Often, providers and users of such forecast systems are unclear about what ‘quality’ entails and how to measure it, leading to confusion and misinformation. Here we present a generic framework to quantify aspects of forecast quality using an inferential approach to calculate nominal significance levels (p-values) that can be obtained either by directly applying non-parametric statistical tests such as Kruskal-Wallis (KW) or Kolmogorov-Smirnov (KS) or by using Monte-Carlo methods (in the case of forecast skill scores). Once converted to p-values, these forecast quality measures provide a means to objectively evaluate and compare temporal and spatial patterns of forecast quality across datasets and forecast systems. Our analysis demonstrates the importance of providing p-values rather than adopting some arbitrarily chosen significance levels such as p < 0.05 or p < 0.01, which is still common practice. This is illustrated by applying non-parametric tests (such as KW and KS) and skill scoring methods (LEPS and RPSS) to the 5-phase Southern Oscillation Index classification system using historical rainfall data from Australia, The Republic of South Africa and India. The selection of quality measures is solely based on their common use and does not constitute endorsement. We found that non-parametric statistical tests can be adequate proxies for skill measures such as LEPS or RPSS. The framework can be implemented anywhere, regardless of dataset, forecast system or quality measure. Eventually such inferential evidence should be complimented by descriptive statistical methods in order to fully assist in operational risk management.
Resumo:
Background: Molecular marker technologies are undergoing a transition from largely serial assays measuring DNA fragment sizes to hybridization-based technologies with high multiplexing levels. Diversity Arrays Technology (DArT) is a hybridization-based technology that is increasingly being adopted by barley researchers. There is a need to integrate the information generated by DArT with previous data produced with gel-based marker technologies. The goal of this study was to build a high-density consensus linkage map from the combined datasets of ten populations, most of which were simultaneously typed with DArT and Simple Sequence Repeat (SSR), Restriction Enzyme Fragment Polymorphism (RFLP) and/or Sequence Tagged Site (STS) markers. Results: The consensus map, built using a combination of JoinMap 3.0 software and several purpose-built perl scripts, comprised 2,935 loci (2,085 DArT, 850 other loci) and spanned 1,161 cM. It contained a total of 1,629 'bins' (unique loci), with an average inter-bin distance of 0.7 ± 1.0 cM (median = 0.3 cM). More than 98% of the map could be covered with a single DArT assay. The arrangement of loci was very similar to, and almost as optimal as, the arrangement of loci in component maps built for individual populations. The locus order of a synthetic map derived from merging the component maps without considering the segregation data was only slightly inferior. The distribution of loci along chromosomes indicated centromeric suppression of recombination in all chromosomes except 5H. DArT markers appeared to have a moderate tendency toward hypomethylated, gene-rich regions in distal chromosome areas. On the average, 14 ± 9 DArT loci were identified within 5 cM on either side of SSR, RFLP or STS loci previously identified as linked to agricultural traits. Conclusion: Our barley consensus map provides a framework for transferring genetic information between different marker systems and for deploying DArT markers in molecular breeding schemes. The study also highlights the need for improved software for building consensus maps from high-density segregation data of multiple populations.
Resumo:
Whilst the topic of soil salinity has received a substantive research effort over the years, the accurate measurement and interpretation of salinity tolerance data remain problematic. The tolerance of four perennial grass species (non-halophytes) to sodium chloride (NaCl) dominated salinity was determined in a free-flowing sand culture system. Although the salinity tolerance of non-halophytes is often represented by the threshold salinity model (bent-stick model), none of the species in the current study displayed any observable salinity threshold. Further, the observed yield decrease was not linear as suggested by the model. On re-examination of earlier datasets, we conclude that the threshold salinity model does not adequately describe the physiological processes limiting growth of non-halophytes in saline soils. Therefore, the use of the threshold salinity model is not recommended for non-halophytes, but rather, a model which more accurately reflects the physiological response observed in these saline soils, such as an exponential regression curve.
Resumo:
Cultivation and cropping of soils results in a decline in soil organic carbon and soil nitrogen, and can lead to reduced crop yields. The CENTURY model was used to simulate the effects of continuous cultivation and cereal cropping on total soil organic matter (C and N), carbon pools, nitrogen mineralisation, and crop yield from 6 locations in southern Queensland. The model was calibrated for each replicate from the original datasets, allowing comparisons for each replicate rather than site averages. The CENTURY model was able to satisfactorily predict the impact of long-term cultivation and cereal cropping on total organic carbon, but was less successful in simulating the different fractions and nitrogen mineralisation. The model firstly over-predicted the initial (pre-cropping) soil carbon and nitrogen concentration of the sites. To account for the unique shrinking and swelling characteristics of the Vertosol soils, the default annual decomposition rates of the slow and passive carbon pools were doubled, and then the model accurately predicted initial conditions. The ability of the model to predict carbon pool fractions varied, demonstrating the difficulty inherent in predicting the size of these conceptual pools. The strength of the model lies in the ability to closely predict the starting soil organic matter conditions, and the ability to predict the impact of clearing, cultivation, fertiliser application, and continuous cropping on total soil carbon and nitrogen.
Resumo:
Land condition monitoring information is required for the strategic management of grazing land and for a better understanding of ecosystem processes. Yet, for policy makers and those land managers whose properties are situated within north-eastern Australia's vast Great Barrier Reef catchments, there has been a general lack of geospatial land condition monitoring information. This paper provides an overview of integrated land monitoring activity in rangeland areas of two major Reef catchments in Queensland: the Burdekin and Fitzroy regions. The project aims were to assemble land condition monitoring datasets that would assist grazing land management and support decision-makers investing public funds; and deliver these data to natural resource management(NRM) community groups, which had been given increased responsibility for delivering local environmental outcomes. We describe the rationale and processes used to produce new land condition monitoring datasets derived from remotely sensed Landsat thematic mapper (TM) and high resolution SPOT 5 satellite imagery and from rapid land condition ground assessment. Specific products include subcatchment groundcover change maps, regional mapping of indicative very poor land condition, and stratified land condition site summaries. Their application, integration, and limitations are discussed. The major innovation is a better understanding of NRM issues with respect to land condition across vast regional areas, and the effective transfer of decision-making capacity to the local level. Likewise, with an increased ability to address policy questions from an evidence-based position, combined with increased cooperation between community, industry and all levels of government, a new era has emerged for decision-makers in rangeland management.
Resumo:
Long-running datasets from aerial surveys of kangaroos (Macropus giganteus, Macropus [uliginosus, Macropus robustus and Macropus rufus) across Queensland, New South Wales and South Australia have been analysed, seeking better predictors of rates of increase which would allow aerial surveys to be undertaken less frequently than annually. Early models of changes in kangaroo numbers in response to rainfall had shown great promise, but much variability. We used normalised difference vegetation index (NDVI) instead, reasoning that changes in pasture condition would provide a better predictor than rainfall. However, except at a fine scale, NDVI proved no better; although two linked periods of rainfall proved useful predictors of rates of increase, this was only in some areas for some species. The good correlations reported in earlier studies were a consequence of data dominated by large droughtinduced adult mortality, whereas over a longer time frame and where changes between years are less dramatic, juvenile survival has the strongest influence on dynamics. Further, harvesting, density dependence and competition with domestic stock are additional and important influences and it is now clear that kangaroo movement has a greater influence on population dynamics than had been assumed. Accordingly, previous conclusions about kangaroo populations as simple systems driven by rainfall need to be reassessed. Examination of this large dataset has permitted descriptions of shifts in distribution of three species across eastern Australia, changes in dispersion in response to rainfall, and an evaluation of using harvest statistics as an index of density and harvest rate. These results have been combined into a risk assessment and decision theory framework to identify optimal monitoring strategies.
Resumo:
The Rhipicephalus micro plus genome is large and complex in structure, making it difficult to assemble a genome sequence and costly to resource the required bioinformatics. In light of this, a consortium of international collaborators was formed to pool resources to begin sequencing this genome. We have acquired and assembled genomic DNA into contigs that represent over 1.8 Gigabase pairs of DNA from gene-enriched regions of the R. micro plus genome. We also have several datasets containing transcript sequences from a number of gene expression experiments conducted by the consortium. A web-based resource was developed to enable the scientific community to access our datasets and conduct analysis through a web-based bioinformatics environment called YABI. The collective bioinformatics resource is termed CattleTickBase. Our consortium has acquired genomic and transcriptomic sequence data at approximately 0.9X coverage of the gene-coding regions of the R. microplus genome. The YABI tool will facilitate access and manipulation of cattle tick genome sequence data as the genome sequencing of R. microplus proceeds. During this process the CattleTickBase resource will continue to be updated. Published by Elsevier Ltd. on behalf of Australian Society for Parasitology Inc.
Resumo:
Background Next-generation sequencing technology is an important tool for the rapid, genome-wide identification of genetic variations. However, it is difficult to resolve the ‘signal’ of variations of interest and the ‘noise’ of stochastic sequencing and bioinformatic errors in the large datasets that are generated. We report a simple approach to identify regional linkage to a trait that requires only two pools of DNA to be sequenced from progeny of a defined genetic cross (i.e. bulk segregant analysis) at low coverage (<10×) and without parentage assignment of individual SNPs. The analysis relies on regional averaging of pooled SNP frequencies to rapidly scan polymorphisms across the genome for differential regional homozygosity, which is then displayed graphically. Results Progeny from defined genetic crosses of Tribolium castaneum (F4 and F19) segregating for the phosphine resistance trait were exposed to phosphine to select for the resistance trait while the remainders were left unexposed. Next generation sequencing was then carried out on the genomic DNA from each pool of selected and unselected insects from each generation. The reads were mapped against the annotated T. castaneum genome from NCBI (v3.0) and analysed for SNP variations. Since it is difficult to accurately call individual SNP frequencies when the depth of sequence coverage is low, variant frequencies were averaged across larger regions. Results from regional SNP frequency averaging identified two loci, tc_rph1 on chromosome 8 and tc_rph2 on chromosome 9, which together are responsible for high level resistance. Identification of the two loci was possible with only 5-7× average coverage of the genome per dataset. These loci were subsequently confirmed by direct SNP marker analysis and fine-scale mapping. Individually, homozygosity of tc_rph1 or tc_rph2 results in only weak resistance to phosphine (estimated at up to 1.5-2.5× and 3-5× respectively), whereas in combination they interact synergistically to provide a high-level resistance >200×. The tc_rph2 resistance allele resulted in a significant fitness cost relative to the wild type allele in unselected beetles over eighteen generations. Conclusion We have validated the technique of linkage mapping by low-coverage sequencing of progeny from a simple genetic cross. The approach relied on regional averaging of SNP frequencies and was used to successfully identify candidate gene loci for phosphine resistance in T. castaneum. This is a relatively simple and rapid approach to identifying genomic regions associated with traits in defined genetic crosses that does not require any specialised statistical analysis.
Resumo:
To quantify the impact that planting indigenous trees and shrubs in mixed communities (environmental plantings) have on net sequestration of carbon and other environmental or commercial benefits, precise and non-biased estimates of biomass are required. Because these plantings consist of several species, estimation of their biomass through allometric relationships is a challenging task. We explored methods to accurately estimate biomass through harvesting 3139 trees and shrubs from 22 plantings, and collating similar datasets from earlier studies, in non-arid (>300mm rainfallyear-1) regions of southern and eastern Australia. Site-and-species specific allometric equations were developed, as were three types of generalised, multi-site, allometric equations based on categories of species and growth-habits: (i) species-specific, (ii) genus and growth-habit, and (iii) universal growth-habit irrespective of genus. Biomass was measured at plot level at eight contrasting sites to test the accuracy of prediction of tonnes dry matter of above-ground biomass per hectare using different classes of allometric equations. A finer-scale analysis tested performance of these at an individual-tree level across a wider range of sites. Although the percentage error in prediction could be high at a given site (up to 45%), it was relatively low (<11%) when generalised allometry-predictions of biomass was used to make regional- or estate-level estimates across a range of sites. Precision, and thus accuracy, increased slightly with the level of specificity of allometry. Inclusion of site-specific factors in generic equations increased efficiency of prediction of above-ground biomass by as much as 8%. Site-and-species-specific equations are the most accurate for site-based predictions. Generic allometric equations developed here, particularly the generic species-specific equations, can be confidently applied to provide regional- or estate-level estimates of above-ground biomass and carbon. © 2013 Elsevier B.V.
Resumo:
The aim of this review is to report changes in irrigated cotton water use from research projects and on-farm practice-change programs in Australia, in relation to both plant-based and irrigation engineering disciplines. At least 80% of the Australian cotton-growing area is irrigated using gravity surface-irrigation systems. This review found that, over 23 years, cotton crops utilise 6-7ML/ha of irrigation water, depending on the amount of seasonal rain received. The seasonal evapotranspiration of surface-irrigated crops averaged 729mm over this period. Over the past decade, water-use productivity by Australian cotton growers has improved by 40%. This has been achieved by both yield increases and more efficient water-management systems. The whole-farm irrigation efficiency index improved from 57% to 70%, and the crop water use index is >3kg/mm.ha, high by international standards. Yield increases over the last decade can be attributed to plant-breeding advances, the adoption of genetically modified varieties, and improved crop management. Also, there has been increased use of irrigation scheduling tools and furrow-irrigation system optimisation evaluations. This has reduced in-field deep-drainage losses. The largest loss component of the farm water balance on cotton farms is evaporation from on-farm water storages. Some farmers are changing to alternative systems such as centre pivots and lateral-move machines, and increasing numbers of these alternatives are expected. These systems can achieve considerable labour and water savings, but have significantly higher energy costs associated with water pumping and machine operation. The optimisation of interactions between water, soils, labour, carbon emissions and energy efficiency requires more research and on-farm evaluations. Standardisation of water-use efficiency measures and improved water measurement techniques for surface irrigation are important research outcomes to enable valid irrigation benchmarks to be established and compared. Water-use performance is highly variable between cotton farmers and farming fields and across regions. Therefore, site-specific measurement is important. The range in the presented datasets indicates potential for further improvement in water-use efficiency and productivity on Australian cotton farms.
Resumo:
Endoraecium (Raveneliaceae, Pucciniales) is a genus of rust that infects several species of Acacia (Fabaceae) in Australia, south-east Asia and Hawaii. Thirteen species of Endoraecium have been described, including seven species that are endemic to Australia, one species to south-east Asia and five to Hawaii. This study investigated the systematics of Endoraecium from 50 specimens in Australia and south-east Asia with a combined morphological and molecular approach. Phylogenetic analyses were conducted on combined datasets of the SSU, ITS and LSU regions of rDNA. The recovered phylogeny (i) supported a recent division of Endoraecium digitatum into five separate species based on morphology and host specificity and (ii) found lineages that did not correspond with known species.
Resumo:
Background and Aim The etiology of Crohn's disease (CD) implicates both genetic and environmental factors. Smoking behavior is one environmental risk factor to play a role in the development of CD. The study aimed to assess the contribution of the interleukin 23 receptor (IL23R) in determining disease susceptibility in two independent cohorts of CD, and to investigate the interactions between IL23R variants, smoking behavior, and CD-associated genes, NOD2 and ATG16L1. Methods Ten IL23R single-nucleotide polymorphisms (SNPs) were genotyped in 675 CD cases, and 1255 controls from Brisbane, Australia (dataset 1). Six of these SNPs were genotyped in 318 CD cases and 533 controls from Canterbury, New Zealand (dataset 2). Case–control analysis of genotype and allele frequencies, and haplotype analysis for all SNPs was conducted. Results We demonstrate a strong increased CD risk for smokers in both datasets (odds ratio 3.77, 95% confidence interval 2.88–4.94), and an additive interaction between IL23R SNPs and cigarette smoking. Ileal involvement was a consistent marker of strong SNP–CD association (P ≤ 0.001), while the lowest minor allele frequencies for location were found in those with colonic CD (L2). Three haplotype blocks were identified across the 10 IL23R SNPs conferring different risk of CD. Haplotypes conferred no further risk of CD when compared with single SNP analyses. Conclusion IL23R gene variants determine CD susceptibility in the Australian and New Zealand population, particularly ileal CD. A strong additive interaction exists between IL23R SNPs and smoking behavior resulting in a dramatic increase in disease risk depending upon specific genetic background.
Resumo:
There is an increasing requirement for more astute land resource management through efficiencies in agricultural inputs in a sugar cane production system. A precision agriculture (PA) approach can provide a pathway for a sustainable sugarcane production system. One of the impediments to the adoption of PA practices is access to paddock-scale mapping layers displaying variability in soil properties, crop growth and surface drainage. Variable rate application (VRA) of nutrients is an important component of PA. However, agronomic expertise within PA systems has fallen well behind significant advances in PA technologies. Generally, advisers in the sugar industry have a poor comprehension of the complex interaction of variables that contribute to within-paddock variations in crop growth. This is regarded as a significant impediment to the progression of PA in sugarcane and is one of the reasons for the poor adoption of VRA of nutrients in a PA approach to improved sugar cane production. This project therefore has established a number of key objectives which will contribute to the adoption of PA and the staged progression of VRA supported by relevant and practical agronomic expertise. These objectives include provision of base soils attribute mapping that can be determined using Veris 3100 Electrical Conductivity (EC) and digital elevation datasets using GPS mapping technology for a large sector of the central cane growing region using analysis of archived satellite imagery to determine the location and stability of yield patterns over time and in varying seasonal conditions on selected project study sites. They also include the stablishment of experiments to determine appropriate VRA nitrogen rates on various soil types subjected to extended anaerobic conditions, and the establishment of trials to determine nitrogen rates applicable to a declining yield potential associated with the aging of ratoons in the crop cycle. Preliminary analysis of archived yield estimation data indicates that yield patterns remain relatively stable overtime. Results also indicate the where there is considerable variability in EC values there is also significant variation in yield.
Resumo:
An integrated approach of using strandings and bycatch data may provide an indicator of long-term trends for data-limited cetaceans. Strandings programs can give a faithful representation of the species composition of cetacean assemblages, while standardised bycatch rates can provide a measure of relative abundance. Comparing the two datasets may also facilitate managing impacts by understanding which species, sex or sizes are the most vulnerable to interactions with fisheries gear. Here we apply this approach to two long-term datasets in East Australia, bycatch in the Queensland Shark Control Program QSCP, 1992–2012) and strandings in the Queensland Marine Wildlife Strandings and Mortality Program StrandNet, 1996–2012). Short-beaked common dolphins, Delphinus delphis, were markedly more frequent in bycatch than in the strandings dataset, suggesting that they are more prone to being incidentally caught than other cetacean species in the region. The reverse was true for humpback whales, Megaptera novaeangliae, bottlenose dolphins, Tursiops spp.; and species predominantly found in offshore waters. QSCP bycatch was strongly skewed towards females for short-beaked common dolphins, and towards smaller sizes for Australian humpback dolphins, Sousa sahulensis. Overall, both datasets demonstrated similar seasonality and a similar long-term increase from 1996 until 2008. Analysis on a species-by-species basis was then used to explore potential explanations for long-term trends, which ranged from a recovering stock (humpback whales) to a shift in habitat use (short-beaked common dolphins).