18 results for SNPs
in eResearch Archive - Queensland Department of Agriculture
Abstract:
White clover (Trifolium repens L.) is an obligate outbreeding allotetraploid forage legume. Gene-associated SNPs provide the optimum genetic system for improvement of such crop species. An EST resource obtained from multiple cDNA libraries constructed from numerous genotypes of a single cultivar has been used for in silico SNP discovery and validation. A total of 58 from 236 selected sequence clusters (24.5%) were fully validated as containing polymorphic SNPs by genotypic analysis across the parents and progeny of several two-way pseudo-testcross mapping families. The clusters include genes belonging to a broad range of predicted functional categories. Polymorphic SNP-containing ESTs have also been used for comparative genomic analysis by comparison with whole genome data from model legume species, as well as Arabidopsis thaliana. A total of 29 (50%) of the 58 clusters detected putative ortholoci with known chromosomal locations in Medicago truncatula, which is closely related to white clover within the Trifolieae tribe of the Fabaceae. This analysis provides access to translational data from model species. The efficiency of in silico SNP discovery in white clover is limited by paralogous and homoeologous gene duplication effects, which are resolved unambiguously by the transmission test. This approach will also be applicable to other agronomically important cross-pollinating allopolyploid plant species.
Abstract:
Sustainable management of sea mullet (Mugil cephalus) fisheries needs to account for recent observations of regional-scale differentiation. Population genetic analysis is sought to assess the situation of this ecologically and economically important fish species in eastern Australian waters. Here, we report (i) new population genetic markers [single nucleotide polymorphisms (SNPs) and potential microsatellites], (ii) first estimates of spatial genetic differentiation and (iii) prospective power tests for designing more comprehensive studies. Six DNA samples from three sampling regions (North Queensland, South Queensland and central New South Wales) on the eastern coast of Australia were used to prepare restriction site associated DNA (RAD) tag libraries from genomic DNA digested with EcoRI and MseI. A pooled sample of regional RAD tag libraries was sequenced using the Roche GS-FLX Titanium platform. A total of 172837 raw reads (17.4Mbp) were retrieved, 95500 of which were used to discover 1267 SNPs and 1417 microsatellites. A subset of 161 SNPs was validated based on 63 additional DNA samples genotyped using the Sequenom MassArray (iPLEX Gold chemistry). Altogether 92 SNPs (57%) were confirmed, with 40% of these marking fixed variants between northern and southern sampling regions. Our preliminary findings indicate a multispecies fishery stock of M. cephalus in eastern Australian waters, but suggest that strong genetic differentiation occurs north of major fishing grounds. Low potential differentiation within major fishing grounds (e.g. FST=0.0025) can be resolved with a likely power 67% by using standard sample sizes of 50 and validated subsets of available markers.
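As a rough illustration of the differentiation statistic quoted above, the following minimal sketch computes Wright's F_ST for a single biallelic SNP in two equally weighted populations. The function name and the example allele frequencies are hypothetical and are not taken from the study, which may have used a different estimator.

```python
def fst_two_pops(p1: float, p2: float) -> float:
    """Wright's F_ST for one biallelic SNP in two equally weighted populations.

    p1 and p2 are the frequencies of the same allele in each population.
    This is the textbook (H_T - H_S) / H_T form, without sample-size correction.
    """
    p_bar = (p1 + p2) / 2.0
    # Expected heterozygosity if both populations were pooled.
    h_total = 2.0 * p_bar * (1.0 - p_bar)
    # Mean expected heterozygosity within the two populations.
    h_within = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0
    if h_total == 0.0:
        # Same allele fixed in both populations: nothing to differentiate.
        return 0.0
    return (h_total - h_within) / h_total


if __name__ == "__main__":
    # A variant fixed for different alleles in north and south (hypothetical).
    print(fst_two_pops(1.0, 0.0))
    # Very weak differentiation, of the order discussed for the fishing grounds.
    print(fst_two_pops(0.525, 0.475))
```

A fixed difference between regions gives the maximum value of 1, whereas the second, hypothetical pair of frequencies gives a value close to 0.0025, which shows how small an allele-frequency shift such a low F_ST represents.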
Abstract:
The development of innovative methods of stock assessment is a priority for State and Commonwealth fisheries agencies. It is driven by the need to facilitate sustainable exploitation of naturally occurring fisheries resources for the current and future economic, social and environmental well-being of Australia. This project was initiated in this context and took advantage of considerable recent achievements in genomics that are shaping our comprehension of the DNA of humans and animals. The basic idea behind this project was that genetic estimates of effective population size, which can be made from empirical measurements of genetic drift, were equivalent to estimates of the number of successful spawners, which is an important parameter in the process of fisheries stock assessment. The broad objectives of this study were to: (1) critically evaluate a variety of mathematical methods of calculating effective spawner numbers (Ne) by (a) conducting comprehensive computer simulations and (b) analysing empirical data collected from the Moreton Bay population of tiger prawns (P. esculentus); (2) lay the groundwork for the application of the technology in the northern prawn fishery (NPF); and (3) produce software for the calculation of Ne and make it widely available. The project pulled together a range of mathematical models for estimating current effective population size from diverse sources. Some of them had been recently implemented with the latest statistical methods (e.g. the Bayesian framework of Berthier, Beaumont et al. 2002), while others had lower profiles (e.g. Pudovkin, Zaykin et al. 1996; Rousset and Raymond 1995). Computer code, and later software with a user-friendly interface (NeEstimator), was produced to implement the methods. This was used as a basis for simulation experiments to evaluate the performance of the methods with an individual-based model of a prawn population.
Following the guidelines suggested by computer simulations, the tiger prawn population in Moreton Bay (south-east Queensland) was sampled for genetic analysis with eight microsatellite loci in three successive spring spawning seasons in 2001, 2002 and 2003. As predicted by the simulations, the estimates had non-infinite upper confidence limits, which is a major achievement for the application of the method to a naturally-occurring, short generation, highly fecund invertebrate species. The genetic estimate of the number of successful spawners was around 1000 individuals in two consecutive years. This contrasts with about 500,000 prawns participating in spawning. It is not possible to distinguish successful from non-successful spawners so we suggest a high level of protection for the entire spawning population. We interpret the difference in numbers between successful and non-successful spawners as a large variation in the number of offspring per family that survive – a large number of families have no surviving offspring, while a few have a large number. We explored various ways in which Ne can be useful in fisheries management. It can be a surrogate for spawning population size, assuming the ratio between Ne and spawning population size has been previously calculated for that species. Alternatively, it can be a surrogate for recruitment, again assuming that the ratio between Ne and recruitment has been previously determined. The number of species that can be analysed in this way, however, is likely to be small because of species-specific life history requirements that need to be satisfied for accuracy. The most universal approach would be to integrate Ne with spawning stock-recruitment models, so that these models are more accurate when applied to fisheries populations. A pathway to achieve this was established in this project, which we predict will significantly improve fisheries sustainability in the future. 
Regardless of the success of integrating Ne into spawning stock-recruitment models, Ne could be used as a fisheries monitoring tool. Declines in spawning stock size or increases in natural or harvest mortality would be reflected by a decline in Ne. This would be good for data-poor fisheries and provides fishery-independent information; however, we suggest a species-by-species approach. Some species may be too numerous or experiencing too much migration for the method to work. During the project two important theoretical studies of the simultaneous estimation of effective population size and migration were published (Vitalis and Couvet 2001b; Wang and Whitlock 2003). These methods, combined with collection of preliminary genetic data from the tiger prawn population in the southern Gulf of Carpentaria and a computer simulation study that evaluated the effect of differing reproductive strategies on genetic estimates, suggest that this technology could make an important contribution to the stock assessment process in the northern prawn fishery (NPF). Advances in the genomics world are rapid and already a cheaper, more reliable substitute for microsatellite loci in this technology is available. Digital data from single nucleotide polymorphisms (SNPs) are likely to supersede ‘analogue’ microsatellite data, making it cheaper and easier to apply the method to species with large population sizes.
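To make the idea of estimating Ne from genetic drift concrete, here is a minimal sketch of one classic moment-based temporal estimator: Nei and Tajima's standardised variance Fc, corrected for sampling noise. It is offered only as an illustration and is not necessarily any of the methods evaluated in the project. The function names, the example allele frequencies and the sample sizes are hypothetical.

```python
def nei_tajima_fc(freqs_t0, freqs_t1):
    """Standardised variance in allele frequency change (Fc), averaged over alleles.

    freqs_t0 and freqs_t1 hold the frequency of each allele in the first and
    second temporal sample, in the same order.
    """
    terms = []
    for x, y in zip(freqs_t0, freqs_t1):
        denom = (x + y) / 2.0 - x * y
        if denom > 0.0:  # skip alleles absent from (or fixed in) both samples
            terms.append((x - y) ** 2 / denom)
    if not terms:
        raise ValueError("no informative alleles")
    return sum(terms) / len(terms)


def temporal_ne(fc, generations, s0, st):
    """Moment-based temporal estimate of Ne.

    generations: generations elapsed between the two samples.
    s0, st: number of individuals sampled at each time point.
    The terms 1/(2*s0) and 1/(2*st) remove the expected sampling noise.
    """
    drift = fc - 1.0 / (2.0 * s0) - 1.0 / (2.0 * st)
    if drift <= 0.0:
        # Sampling noise explains all observed change: estimate is unbounded.
        return float("inf")
    return generations / (2.0 * drift)


if __name__ == "__main__":
    # Hypothetical biallelic locus sampled one generation apart, 50 animals each.
    fc = nei_tajima_fc([0.5, 0.5], [0.6, 0.4])
    print(fc, temporal_ne(fc, generations=1, s0=50, st=50))
```

When the observed frequency change is no larger than expected from sampling alone, the estimate becomes infinite, which is why finite upper confidence limits are notable for a large, highly fecund population.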
Abstract:
Barley (Hordeum vulgare) genotypes were sequenced for polymorphism in the hardness genes, these being the three hordoindoline (hin a, hin b1 and hin b2) genes. The variation in haplotype was determined by sequencing for single nucleotide polymorphisms (SNPs). Polymorphism between each gene was then compared to grain hardness (three methods), malt quality characteristics (hot water extract and friability) and cattle feed quality. Two haplotypes were found in a set of forty barley genotypes. For hin a, two alleles were present, namely hin a1 and hin a2. However, there was no specific hin a allele that was associated with grain hardness, malt and feed quality. Barley has two hin b genes, namely hin b1 and hin b2, and the genotypes tested here had one of two alleles for each gene. However, there were no obvious effects on hardness or quality from either of these hin b alleles. Unlike wheat, where a clear relationship has been demonstrated between a number of SNPs in the wheat hardness genes and quality (soft or hard wheat), there was no such relationship for barley. Despite the wide range in hardness, malt and feed quality, there were only two haplotypes for each of the hin a, hin b1 and hin b2 genes and there was no clear relationship between grain hardness, malt or feed quality. The genotypes used in this study demonstrated that there was a low level of polymorphism in hardness genes in current commercial varieties as well as breeding lines and these polymorphisms had no impact on quality.
Abstract:
The principal objective of this study was to determine if Campylobacter jejuni genotyping methods based upon resolution optimised sets of single nucleotide polymorphisms (SNPs) and binary genetic markers were capable of identifying epidemiologically linked clusters of chicken-derived isolates. Eighty-eight C. jejuni isolates of known flaA RFLP type were included in the study. They encompassed three groups of ten isolates that were obtained at the same time and place and possessed the same flaA type. These were regarded as being epidemiologically linked. Twenty-six unlinked C. jejuni flaA type I isolates were included to test the ability of SNP and binary typing to resolve isolates that were not resolved by flaA RFLP. The remaining isolates were of different flaA types. All isolates were typed by real-time PCR interrogation of the resolution optimised sets of SNPs and binary markers. According to each typing method, the three epidemiologically linked clusters were three different clones that were well resolved from the other isolates. The 26 unlinked C. jejuni flaA type I isolates were resolved into 14 SNP-binary types, indicating that flaA typing can be unreliable for revealing epidemiological linkage. Comparison of the data with data from a fully typed set of isolates associated with human infection revealed that abundant lineages in the chicken isolates that were also found in the human isolates belonged to clonal complex (CC) -21 and CC-353, with the usually rare CC-353 member ST-524 being especially abundant in the chicken collection. The chicken isolates selected to be diverse according to flaA were also diverse according to SNP and binary typing. It was observed that CC-48 was absent in the chicken isolates, despite being very common in Australian human infection isolates, indicating that this may be a major cause of human disease that is not chicken associated.
Abstract:
Using an established genetic map, a single gene conditioning covered smut resistance, Ruh.7H, was mapped to the telomere region of chromosome 7HS in an Alexis/Sloop doubled haploid barley population. The closest marker to Ruh.7H, abg704, was 7.5 cM away. Thirteen loci on the distal end of 7HS with potential to contain single nucleotide polymorphisms (SNPs) were identified by applying a comparative genomics approach using rice sequence data. Of these, one locus produced polymorphic co-dominant bands of different size while two further loci contained SNPs that were identified using the recently developed high resolution melting (HRM) technique. Two of these markers flanked Ruh.7H with the proximal marker located 3.8 cM and the distal marker 2.7 cM away. This is the first report on the application of the HRM technique to SNP detection and to rapid scoring of known cleaved amplified polymorphic sequence (CAPS) markers in plants. This simple, precise post-PCR technique should find widespread use in the fine-mapping of genetic regions of interest in complex cereal and other plant genomes.
Abstract:
The highly variable flagellin-encoding flaA gene has long been used for genotyping Campylobacter jejuni and Campylobacter coli. High-resolution melting (HRM) analysis is emerging as an efficient and robust method for discriminating DNA sequence variants. The objective of this study was to apply HRM analysis to flaA-based genotyping. The initial aim was to identify a suitable flaA fragment. It was found that the PCR primers commonly used to amplify the flaA short variable repeat (SVR) yielded a mixed PCR product unsuitable for HRM analysis. However, a PCR primer set composed of the upstream primer used to amplify the fragment used for flaA restriction fragment length polymorphism (RFLP) analysis and the downstream primer used for flaA SVR amplification generated a very pure PCR product, and this primer set was used for the remainder of the study. Eighty-seven C. jejuni and 15 C. coli isolates were analyzed by flaA HRM and also partial flaA sequencing. There were 47 flaA sequence variants, and all were resolved by HRM analysis. The isolates used had previously also been genotyped using single-nucleotide polymorphisms (SNPs), binary markers, CRISPR HRM, and flaA RFLP. flaA HRM analysis provided resolving power multiplicative to the SNPs, binary markers, and CRISPR HRM and largely concordant with the flaA RFLP. It was concluded that HRM analysis is a promising approach to genotyping based on highly variable genes.
Abstract:
The Juvenile Wood Initiative (JWI) project has been running successfully since July 2003 under a Research Agreement with FWPA and Letters of Association with the consortium partners STBA (Southern Tree Breeding Association), ArborGen and FPQ (Forestry Plantations Queensland). Over the last five and a half years, JWI scientists in CSIRO, FPQ, and STBA have completed all 12 major milestones and 28 component milestones according to the project schedule. We have made benchmark progress in understanding the genetic control of wood formation and interrelationships among wood traits. The project has made 15 primary scientific findings and several results have been adopted by industry as summarized below. This progress was detailed in 10 technical reports to funding organizations and industry clients. Team scientists produced 16 scientific manuscripts (8 published, 1 in press, 2 submitted, and several others in the process of submission) and 15 conference papers or presentations.
Primary Scientific Findings. The 15 major scientific findings related to wood science, inheritance and the genetic basis of juvenile wood traits are:
1. An optimal method to predict stiffness of standing trees in slash/Caribbean pine is to combine gravimetric basic density from 12 mm increment cores with a standing tree prediction of MoE using a time of flight acoustic tool. This was the most accurate and cheapest way to rank trees for breeding selection for slash/Caribbean hybrid pine. This method was also recommended for radiata pine.
2. Wood density breeding values were predicted for the first time in the STBA breeding population using a large sample of 7,078 trees (increment cores) and it was estimated that selection of the best 250 trees for deployment will produce wood density gains of 12.4%.
3. Large genetic variation for a suite of wood quality traits including density, MFA, spiral grain, shrinkage, acoustic and non-acoustic stiffness (MoE) for clear wood and standing trees was observed. Genetic gains of between 8 and 49% were predicted for these wood quality traits with selection intensity between 1 and 10% for radiata pine.
4. Site had a major effect on juvenile-mature wood transition age and the effect of selective breeding for a shorter juvenile wood formation phase was only moderate (about 10% genetic gain with 10% selection intensity, equivalent to about 2 years reduction of juvenile wood).
5. The study found no usable site by genotype interactions for the wood quality traits of density, MFA and MoE for both radiata and slash/Caribbean pines, suggesting that assessment of wood properties on one or two sites will provide reliable estimates of the genetic worth of individuals for use in future breeding.
6. There were significant and sizable genotype by environment interactions between the mainland and Tasmanian regions and within Tasmania for DBH and branch size.
7. Strong genetic correlations between rings for density, MFA and MoE for both radiata and slash/Caribbean pines were observed. This suggests that selection for improved wood properties in the innermost rings would also result in improvement of wood properties in the subsequent rings, as well as improved average performance of the entire core.
8. Strong genetic correlations between pure species and hybrid performance for each of the wood quality traits were observed in the hybrid pines. Parental performance can be used to identify the hybrid families which are most likely to have superior juvenile wood properties of the slash/Caribbean F1 hybrid in southeast Queensland.
9. Large unfavourable genetic correlations between growth and wood quality traits were a prominent feature in radiata pine, indicating that overcoming this unfavourable genetic correlation will be a major technical issue in progressing radiata pine breeding.
10. The project created the first radiata pine 18 k cDNA microarray and generated 5,952 radiata pine xylogenesis expressed sequence tags (ESTs) which assembled into 3,304 unigenes.
11. A total of 348 genes were identified as preferentially expressed genes in earlywood or latewood while a total of 168 genes were identified as preferentially expressed genes in either juvenile or mature wood.
12. Juvenile earlywood has a distinct transcriptome relative to other stages of wood development.
13. Rapid decay of linkage disequilibrium (LD) was discovered in radiata pine, with LD decaying to approximately 50% within 1,700 base pairs (within a typical gene). A total of 913 SNPs from sequencing 177,380 base pairs were identified for association genetic studies.
14. 149 SNPs from 44 genes and 255 SNPs from a further 51 genes (total 95 genes) were selected for association analysis with 62 wood traits, and 30 SNPs were shortlisted for their significant association with variation of wood quality traits (density, MFA and MoE) with individual significant SNPs accounting for between 1.9 and 9.7% of the total genetic variation in traits.
15. Index selection using breeding objectives was the most profitable selection method for radiata pine, but in the long term it may not be the most effective in dealing with negative genetic correlations between wood volume and quality traits. A combination of economic and biological approaches may be needed to deal with the strong adverse correlation.
Abstract:
DArTseq technology is potentially the most appropriate system to discover hundreds of polymorphic genomic loci, scoring thousands of unique genome-wide DNA fragments in one single experiment, without requiring existing DNA sequence information. The DArT complexity reduction approach in combination with Illumina short read sequencing (Hiseq2000) was applied. To test the application of DArTseq technology in pineapple, a reference population of 13 Ananas genotypes from primitive wild accessions to modern cultivars was used. In a comparison of 3 systems, the combination of restriction enzymes PstI and MseI performed the best, producing 18,900 DArT markers and close to 20,000 SNPs. Based on these markers genetic relationships between the samples were identified and a dendrogram was generated. The topology of the tree corresponds with our understanding of the genetic relationships between the genotypes. Importantly, the replicated samples of all genotypes have a dissimilarity of close to 0.0 and occupy the same positions on the tree, confirming high reproducibility of the markers detected. Eventually it is planned that molecular markers will be identified that are associated with resistance to Phytophthora cinnamomi (Pc), the most economically important pathogen of pineapple in Australia, as genetic resistance is known to exist within the genus Ananas. Marker assisted selection can then be utilized in a pineapple breeding program to develop cultivars resistant to Pc.
Abstract:
Age at puberty is an important component of reproductive performance in beef cattle production systems. Brahman cattle are typically late-pubertal relative to Bos taurus cattle and so it is of economic relevance to select for early age at puberty. To assist selection and elucidate the genes underlying puberty, we performed a genome-wide association study (GWAS) using the BovineSNP50 chip (~54 000 polymorphisms) in Brahman bulls (n = 1105) and heifers (n = 843); the heifers were previously analysed in a different study. In a new attempt to generate unbiased estimates of single-nucleotide polymorphism (SNP) effects and proportion of variance explained by each SNP, the available data were halved on the basis of year and month of birth into a calibration and validation set. The traits that defined age at puberty were, in heifers, the age at which the first corpus luteum was detected (AGECL, h² = 0.56 ± 0.11) and in bulls, the age at a scrotal circumference of 26 cm (AGE26, h² = 0.78 ± 0.10). At puberty, heifers were on average older (751 ± 142 days) than bulls (555 ± 101 days), but AGECL and AGE26 were genetically correlated (r = 0.20 ± 0.10). There were 134 SNPs associated with AGECL and 146 SNPs associated with AGE26 (P < 0.0001). From these SNPs, 32 (~22%) were associated (P < 0.0001) with both traits. These top 32 SNPs were all located on Chromosome BTA 14, between 21.95 Mb and 28.4 Mb. These results suggest that the genes located in that region of BTA 14 play a role in pubertal development in Brahman cattle. There are many annotated genes underlying this region of BTA 14 and these are the subject of current research. Further, we identified a region on Chromosome X where markers were associated (P < 1.00E-8) with AGE26, but not with AGECL. Information about specific genes and markers adds value to our understanding of puberty and potentially contributes to genomic selection.
Therefore, identifying these genes contributing to genetic variation in AGECL and AGE26 can assist with the selection for early onset of puberty.
Abstract:
Wood is an important biological resource which contributes to nutrient and hydrology cycles through ecosystems, and provides structural support at the plant level. Thousands of genes are involved in wood development, yet their effects on phenotype are not well understood. We have exploited the low genomic linkage disequilibrium (LD) and abundant phenotypic variation of forest trees to explore allelic diversity underlying wood traits in an association study. Candidate gene allelic diversity was modelled against quantitative variation to identify SNPs influencing wood properties, growth and disease resistance across three populations of Corymbia citriodora subsp. variegata, a forest tree of eastern Australia. Nine single nucleotide polymorphism (SNP) associations from six genes were identified in a discovery population (833 individuals). Associations were subsequently tested in two smaller populations (130-160 individuals), validating our findings in three cases for actin 7 (ACT7) and COP1 interacting protein 7 (CIP7). The results imply a functional role for these genes in mediating wood chemical composition and growth, respectively. A flip in the effect of ACT7 on pulp yield between populations suggests gene by environment interactions are at play. Existing evidence of gene function lends strength to the observed associations, and in the case of CIP7 supports a role in cortical photosynthesis.
Abstract:
Japanese isolates of Candidatus Liberibacter asiaticus have been shown to be clearly differentiated by simple sequence repeat (SSR) profiles at four loci. In this study, 25 SSR loci, including these four loci, were selected from the whole-genome sequence and were used to differentiate non-Japanese samples of Ca. Liberibacter asiaticus (13 Indian, 3 East Timorese, 1 Papuan and 8 Floridian samples). Out of the 25 SSR loci, 13 were polymorphic. Dendrogram analysis using SSR loci showed that the clusters were mostly consistent with the geographical origins of the isolates. When single nucleotide polymorphisms (SNPs) were searched around these 25 loci, only the upstream region of locus 091 exhibited polymorphism. Phylogenetic tree analysis of the SNPs in the upstream region of locus 091 showed that Floridian samples were clustered into one group as shown by dendrogram analysis using SSR loci. The differences in nucleotide sequences were not associated with differences in the citrus hosts (lime, mandarin, lemon and sour orange) from which the isolates were originally derived.
Abstract:
Background: Next-generation sequencing technology is an important tool for the rapid, genome-wide identification of genetic variations. However, it is difficult to resolve the ‘signal’ of variations of interest and the ‘noise’ of stochastic sequencing and bioinformatic errors in the large datasets that are generated. We report a simple approach to identify regional linkage to a trait that requires only two pools of DNA to be sequenced from progeny of a defined genetic cross (i.e. bulk segregant analysis) at low coverage (<10×) and without parentage assignment of individual SNPs. The analysis relies on regional averaging of pooled SNP frequencies to rapidly scan polymorphisms across the genome for differential regional homozygosity, which is then displayed graphically.
Results: Progeny from defined genetic crosses of Tribolium castaneum (F4 and F19) segregating for the phosphine resistance trait were exposed to phosphine to select for the resistance trait while the remainders were left unexposed. Next generation sequencing was then carried out on the genomic DNA from each pool of selected and unselected insects from each generation. The reads were mapped against the annotated T. castaneum genome from NCBI (v3.0) and analysed for SNP variations. Since it is difficult to accurately call individual SNP frequencies when the depth of sequence coverage is low, variant frequencies were averaged across larger regions. Results from regional SNP frequency averaging identified two loci, tc_rph1 on chromosome 8 and tc_rph2 on chromosome 9, which together are responsible for high level resistance. Identification of the two loci was possible with only 5-7× average coverage of the genome per dataset. These loci were subsequently confirmed by direct SNP marker analysis and fine-scale mapping. Individually, homozygosity of tc_rph1 or tc_rph2 results in only weak resistance to phosphine (estimated at up to 1.5-2.5× and 3-5× respectively), whereas in combination they interact synergistically to provide a high-level resistance >200×. The tc_rph2 resistance allele resulted in a significant fitness cost relative to the wild type allele in unselected beetles over eighteen generations.
Conclusion: We have validated the technique of linkage mapping by low-coverage sequencing of progeny from a simple genetic cross. The approach relied on regional averaging of SNP frequencies and was used to successfully identify candidate gene loci for phosphine resistance in T. castaneum. This is a relatively simple and rapid approach to identifying genomic regions associated with traits in defined genetic crosses that does not require any specialised statistical analysis.
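The regional averaging described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions and not the authors' pipeline: the input layout (position, allele frequency in the selected pool, allele frequency in the unselected pool), the fixed non-overlapping windows, the 100 kb default and the function name are all hypothetical.

```python
from collections import defaultdict


def regional_mean_difference(snps, window_bp=100_000):
    """Average per-SNP frequency differences between two pools within windows.

    snps: iterable of (position, freq_selected, freq_unselected) tuples for
          one chromosome; frequencies are pooled allele frequencies in [0, 1].
    Returns a dict mapping each window start (bp) to the mean absolute
    difference in allele frequency between the pools in that window.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for pos, f_sel, f_unsel in snps:
        window = pos // window_bp  # index of the non-overlapping window
        sums[window] += abs(f_sel - f_unsel)
        counts[window] += 1
    return {w * window_bp: sums[w] / counts[w] for w in sorted(sums)}


if __name__ == "__main__":
    # Hypothetical SNPs: the first window differs between pools, the second does not.
    example = [(10, 1.0, 0.5), (20, 0.9, 0.5), (150, 0.5, 0.5)]
    for start, diff in regional_mean_difference(example, window_bp=100).items():
        print(start, round(diff, 3))
```

Averaging over many SNPs in a window dampens the noise of individual low-coverage frequency calls, so windows with a consistently high mean difference point to regions that may be linked to the selected trait.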
Abstract:
Sorghum is a food and feed cereal crop adapted to heat and drought and a staple for 500 million of the world’s poorest people. Its small diploid genome and phenotypic diversity make it an ideal C4 grass model as a complement to C3 rice. Here we present high coverage (16-45×) resequenced genomes of 44 sorghum lines representing the primary gene pool and spanning dimensions of geographic origin, end-use and taxonomic group. We also report the first resequenced genome of S. propinquum, identifying 8 M high-quality SNPs, 1.9 M indels and specific gene loss and gain events in S. bicolor. We observe strong racial structure and a complex domestication history involving at least two distinct domestication events. These assembled genomes enable the leveraging of existing cereal functional genomics data against the novel diversity available in sorghum, providing an unmatched resource for the genetic improvement of sorghum and other grass species.
Abstract:
Tillering in sorghum can be associated with either the carbon supply–demand (S/D) balance of the plant or an intrinsic propensity to tiller (PTT). Knowledge of the genetic control of tillering could assist breeders in selecting germplasm with tillering characteristics appropriate for their target environments. The aims of this study were to identify QTL for tillering and component traits associated with the S/D balance or PTT, to develop a framework model for the genetic control of tillering in sorghum. Four mapping populations were grown in a number of experiments in south east Queensland, Australia. The QTL analysis suggested that the contribution of traits associated with either the S/D balance or PTT to the genotypic differences in tillering differed among populations. Thirty-four tillering QTL were identified across the populations, of which 15 were novel to this study. Additionally, half of the tillering QTL co-located with QTL for component traits. A comparison of tillering QTL and candidate gene locations identified numerous coincident QTL and gene locations across populations, including the identification of common non-synonymous SNPs in the parental genotypes of two mapping populations in a sorghum homologue of MAX1, a gene involved in the control of tiller bud outgrowth through the production of strigolactones. Combined with a framework for crop physiological processes that underpin genotypic differences in tillering, the co-location of QTL for tillering and component traits and candidate genes allowed the development of a framework QTL model for the genetic control of tillering in sorghum.