963 results for Genome-specific Sequence


Relevance: 30.00%

Abstract:

Blood cells participate in vital physiological processes, and their numbers are tightly regulated so that homeostasis is maintained. Disruption of key regulatory mechanisms underlies many blood-related Mendelian diseases but also contributes to more common disorders, including atherosclerosis. We searched for quantitative trait loci (QTL) for hematology traits through a whole-genome association study, because these could provide new insights into both hemopoietic and disease mechanisms. We tested 1.8 million variants for association with 13 hematology traits measured in 6015 individuals from the Australian and Dutch populations. These traits included hemoglobin composition, platelet counts, and red blood cell and white blood cell indices. We identified three regions of strong association that, to our knowledge, have not been previously reported in the literature. The first was located in an intergenic region of chromosome 9q31 near LPAR1, explaining 1.5% of the variation in monocyte counts (best SNP rs7023923, p = 8.9 × 10^-14). The second locus was located on chromosome 6p21 and associated with mean cell erythrocyte volume (rs12661667, p = 1.2 × 10^-9, 0.7% variance explained) in a region that spanned five genes, including CCND3, a member of the D-cyclin gene family that is involved in hematopoietic stem cell expansion. The third region was also associated with erythrocyte volume and was located in an intergenic region on chromosome 6q24 (rs592423, p = 5.3 × 10^-9, 0.6% variance explained). All three loci replicated in an independent panel of 1543 individuals (p values = 0.001, 9.9 × 10^-5, and 7 × 10^-5, respectively). The identification of these QTL provides new opportunities for furthering our understanding of the mechanisms regulating hemopoietic cell fate.

Relevance: 30.00%

Abstract:

Multidrug-resistant Escherichia coli sequence type 131 (ST131) has recently emerged as a globally distributed cause of extraintestinal infections in humans. Diverse factors have been investigated as explanations for ST131's rapid and successful dissemination, including transmission through animal contact and consumption of food, as suggested by the detection of ST131 in a number of nonhuman species. For example, ST131 has recently been identified as a cause of clinical infection in companion animals and poultry, and both host groups have been confirmed as faecal carriers of ST131. Moreover, a high degree of similarity has been shown among certain ST131 isolates from humans, companion animals, and poultry based on resistance characteristics and genomic background, and human and companion animal ST131 isolates tend to exhibit similar virulence genotypes. However, most ST131 isolates from poultry appear to possess specific virulence genes that are typically absent from human and companion animal isolates, including genes associated with avian pathogenic E. coli. Since the number of reported animal and food-associated ST131 isolates is quite small, the role of nonhuman host species in the emergence, dissemination, and transmission of ST131 to humans remains unclear. Nevertheless, given the profound public health importance of the emergent ST131 clonal group, even the limited available evidence indicates a pressing need for further careful study of this significant question.

Relevance: 30.00%

Abstract:

Khaya senegalensis (African mahogany or dry-zone mahogany) is a high-value hardwood timber species with great potential for forest plantations in northern Australia. The species is distributed across the sub-Saharan belt from Senegal to Sudan and Uganda. Because of heavy exploitation and constraints on natural regeneration and sustainable planting, it is now classified as a vulnerable species. Here, we describe the development of microsatellite markers for K. senegalensis using next-generation sequencing to assess its intra-specific diversity across its natural range, which is key to successful breeding programs and effective conservation management of the species. Next-generation sequencing yielded 93943 sequences with an average read length of 234 bp. The assembled sequences contained 1030 simple sequence repeats, with primers designed for 522 microsatellite loci. Twenty-one microsatellite loci were tested, with 11 showing reliable amplification and polymorphism in K. senegalensis. The 11 novel microsatellites, together with one previously published, were used to assess 73 accessions belonging to the Australian K. senegalensis domestication program, sampled from across the natural range of the species. STRUCTURE analysis shows two major clusters, one comprising mainly accessions from west Africa (Senegal to Benin) and the second based in the far eastern limits of the range in Sudan and Uganda. Higher levels of genetic diversity were found in material from western Africa. This suggests that new seed collections from this region may yield more diverse genotypes than those originating from Sudan and Uganda in eastern Africa.
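
The abstract above does not say which tool was used to detect simple sequence repeats in the assembled reads. As a rough illustration only, the sketch below scans a DNA string for tandem repeats with a regular expression. The function name `find_ssrs`, the 2-6 bp motif range, and the minimum of five repeats are assumptions made for this example, not parameters reported by the authors.

```python
import re


def find_ssrs(sequence, min_repeats=5):
    """Return (start, motif, repeat_count) for tandem repeats of 2-6 bp motifs.

    Hypothetical helper: the thresholds are illustrative assumptions.
    The lazy quantifier makes the regex prefer the shortest motif, so a
    homopolymer run such as AAAAAAAAAA would be reported with motif "AA".
    """
    # Group 1 is the motif; \1{n,} requires at least n further tandem copies.
    pattern = re.compile(r"([ACGT]{2,6}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(sequence.upper()):
        motif = match.group(1)
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


# Toy sequence (invented for illustration): six AG repeats flanked by TT.
toy_sequence = "TT" + "AG" * 6 + "TT"
ssr_hits = find_ssrs(toy_sequence)
```

Dedicated SSR finders handle compound and imperfect repeats, which this sketch ignores.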

Relevance: 30.00%

Abstract:

CONTEXT People meeting diagnostic criteria for anxiety or depressive disorders tend to score high on the personality scale of neuroticism. Studying this personality dimension can give insights into the etiology of these important psychiatric disorders. OBJECTIVES To undertake a comprehensive genome-wide linkage study of neuroticism using large study samples that have been measured multiple times and to compare the results between countries for replication and across time within countries for consistency. DESIGN Genome-wide linkage scan. SETTING Twin individuals and their family members from Australia and the Netherlands. PARTICIPANTS Nineteen thousand six hundred thirty-five sibling pairs completed self-report questionnaires for neuroticism up to 5 times over a period of up to 22 years. Five thousand sixty-nine sibling pairs were genotyped with microsatellite markers. METHODS Nonparametric linkage analyses were conducted in MERLIN-REGRESS for the mean neuroticism scores averaged across time. Additional analyses were conducted for the time-specific measures of neuroticism from each country to investigate consistency of linkage results. RESULTS Three chromosomal regions exceeded empirically derived thresholds for suggestive linkage using mean neuroticism scores: 10p 5 Kosambi cM (cM) (Dutch study sample), 14q 103 cM (Dutch study sample), and 18q 117 cM (combined Australian and Dutch study sample), but only 14q retained significance after correction for multiple testing. These regions all showed evidence for linkage in individual time-specific measures of neuroticism and 1 (18q) showed some evidence for replication between countries. Linkage intervals for these regions all overlap with regions identified in other studies of neuroticism or related traits and/or in studies of anxiety in mice. 
CONCLUSIONS Our results demonstrate the value of the availability of multiple measures over time and add to the optimism reported in recent reviews for replication of linkage regions for neuroticism. These regions are likely to harbor causal variants for neuroticism and its related psychiatric disorders and can inform prioritization of results from genome-wide association studies.

Relevance: 30.00%

Abstract:

Japanese isolates of Candidatus Liberibacter asiaticus have been shown to be clearly differentiated by simple sequence repeat (SSR) profiles at four loci. In this study, 25 SSR loci, including these four loci, were selected from the whole-genome sequence and were used to differentiate non-Japanese samples of Ca. Liberibacter asiaticus (13 Indian, 3 East Timorese, 1 Papuan and 8 Floridian samples). Out of the 25 SSR loci, 13 were polymorphic. Dendrogram analysis using SSR loci showed that the clusters were mostly consistent with the geographical origins of the isolates. When single nucleotide polymorphisms (SNPs) were searched around these 25 loci, only the upstream region of locus 091 exhibited polymorphism. Phylogenetic tree analysis of the SNPs in the upstream region of locus 091 showed that Floridian samples were clustered into one group as shown by dendrogram analysis using SSR loci. The differences in nucleotide sequences were not associated with differences in the citrus hosts (lime, mandarin, lemon and sour orange) from which the isolates were originally derived.

Relevance: 30.00%

Abstract:

The silver gemfish Rexea solandri is an important economic resource but vulnerable to overfishing in Australian waters. The complete mitochondrial genome sequence is described from 1.6 million reads obtained via next-generation sequencing. The total length of the mitogenome is 16,350 bp, comprising 2 rRNA, 13 protein-coding genes, 22 tRNA and 2 non-coding regions. The mitogenome sequence was validated against sequences of PCR fragments and BLAST queries of GenBank. Gene order was equivalent to that found in marine fishes.

Relevance: 30.00%

Abstract:

Background Next-generation sequencing technology is an important tool for the rapid, genome-wide identification of genetic variations. However, it is difficult to resolve the ‘signal’ of variations of interest and the ‘noise’ of stochastic sequencing and bioinformatic errors in the large datasets that are generated. We report a simple approach to identify regional linkage to a trait that requires only two pools of DNA to be sequenced from progeny of a defined genetic cross (i.e. bulk segregant analysis) at low coverage (<10×) and without parentage assignment of individual SNPs. The analysis relies on regional averaging of pooled SNP frequencies to rapidly scan polymorphisms across the genome for differential regional homozygosity, which is then displayed graphically. Results Progeny from defined genetic crosses of Tribolium castaneum (F4 and F19) segregating for the phosphine resistance trait were exposed to phosphine to select for the resistance trait while the remainders were left unexposed. Next generation sequencing was then carried out on the genomic DNA from each pool of selected and unselected insects from each generation. The reads were mapped against the annotated T. castaneum genome from NCBI (v3.0) and analysed for SNP variations. Since it is difficult to accurately call individual SNP frequencies when the depth of sequence coverage is low, variant frequencies were averaged across larger regions. Results from regional SNP frequency averaging identified two loci, tc_rph1 on chromosome 8 and tc_rph2 on chromosome 9, which together are responsible for high level resistance. Identification of the two loci was possible with only 5-7× average coverage of the genome per dataset. These loci were subsequently confirmed by direct SNP marker analysis and fine-scale mapping. 
Individually, homozygosity of tc_rph1 or tc_rph2 results in only weak resistance to phosphine (estimated at up to 1.5-2.5× and 3-5× respectively), whereas in combination they interact synergistically to provide a high-level resistance >200×. The tc_rph2 resistance allele resulted in a significant fitness cost relative to the wild type allele in unselected beetles over eighteen generations. Conclusion We have validated the technique of linkage mapping by low-coverage sequencing of progeny from a simple genetic cross. The approach relied on regional averaging of SNP frequencies and was used to successfully identify candidate gene loci for phosphine resistance in T. castaneum. This is a relatively simple and rapid approach to identifying genomic regions associated with traits in defined genetic crosses that does not require any specialised statistical analysis.
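
The regional averaging idea described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `regional_frequency_difference`, the fixed non-overlapping windows, the use of the absolute frequency difference between pools, and the toy data are all hypothetical.

```python
from collections import defaultdict


def regional_frequency_difference(snps, window_size=1_000_000):
    """Average per-SNP allele-frequency differences within fixed windows.

    snps: iterable of (chromosome, position, freq_selected, freq_unselected),
    where each frequency is the alternate-allele fraction in one DNA pool.
    Returns {(chromosome, window_index): mean absolute difference}.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for chrom, pos, f_sel, f_unsel in snps:
        key = (chrom, pos // window_size)  # assign the SNP to its window
        sums[key] += abs(f_sel - f_unsel)
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


# Toy data (invented for illustration, not taken from the study).
toy_snps = [
    ("chr8", 100, 0.9, 0.5),
    ("chr8", 200, 1.0, 0.4),
    ("chr9", 150, 0.5, 0.5),
]
averages = regional_frequency_difference(toy_snps, window_size=1000)
```

Averaging over many SNPs per window damps the noise in individual low-coverage frequency estimates, so windows with a consistently large difference between the selected and unselected pools stand out as candidate linked regions.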

Relevance: 30.00%

Abstract:

Sorghum is a food and feed cereal crop adapted to heat and drought and a staple for 500 million of the world’s poorest people. Its small diploid genome and phenotypic diversity make it an ideal C4 grass model as a complement to C3 rice. Here we present high coverage (16-45×) resequenced genomes of 44 sorghum lines representing the primary gene pool and spanning dimensions of geographic origin, end-use and taxonomic group. We also report the first resequenced genome of S. propinquum, identifying 8 M high-quality SNPs, 1.9 M indels and specific gene loss and gain events in S. bicolor. We observe strong racial structure and a complex domestication history involving at least two distinct domestication events. These assembled genomes enable the leveraging of existing cereal functional genomics data against the novel diversity available in sorghum, providing an unmatched resource for the genetic improvement of sorghum and other grass species.

Relevance: 30.00%

Abstract:

NMR spectroscopy enables the study of biomolecules from peptides and carbohydrates to proteins at atomic resolution. The technique uniquely allows for structure determination of molecules in the solution state. It also gives insights into dynamics and intermolecular interactions important for determining biological function. Detailed molecular information is entangled in the nuclear spin states. The information can be extracted by pulse sequences designed to measure the desired molecular parameters. Advancement of pulse sequence methodology therefore plays a key role in the development of biomolecular NMR spectroscopy. A range of novel pulse sequences for solution-state NMR spectroscopy is presented in this thesis. The pulse sequences are described in relation to the molecular information they provide. The pulse sequence experiments represent several advances in NMR spectroscopy with particular emphasis on applications for proteins. Some of the novel methods focus on methyl-containing amino acids, which are pivotal for structure determination. Methyl-specific assignment schemes are introduced for increasing the size range of 13C,15N labeled proteins amenable to structure determination without resorting to more elaborate labeling schemes. Furthermore, cost-effective means are presented for monitoring amide and methyl correlations simultaneously. Residual dipolar couplings can be applied for structure refinement as well as for studying dynamics. Accurate methods for measuring residual dipolar couplings in small proteins are devised along with special techniques applicable when proteins require high pH or high temperature solvent conditions. Finally, a new technique is demonstrated to diminish strong-coupling induced artifacts in HMBC, a routine experiment for establishing long-range correlations in unlabeled molecules. The presented experiments facilitate structural studies of biomolecules by NMR spectroscopy.

Relevance: 30.00%

Abstract:

Background Increased disease resistance is a key target of cereal breeding programs, with disease outbreaks continuing to threaten global food production, particularly in Africa. Of the disease resistance gene families, the nucleotide-binding site plus leucine-rich repeat (NBS-LRR) family is the most prevalent and ancient and is also one of the largest gene families known in plants. The sequence diversity in NBS-encoding genes was explored in sorghum, a critical food staple in Africa, with comparisons to rice and maize and to fungal pathogen resistance QTL. Results In sorghum, NBS-encoding genes had significantly higher diversity in comparison to non NBS-encoding genes and were significantly enriched in regions of the genome under purifying and balancing selection, both through domestication and improvement. Ancestral genes, pre-dating species divergence, were more abundant in regions with signatures of selection than in regions not under selection. Sorghum NBS-encoding genes were also significantly enriched in the regions of the genome containing fungal pathogen disease resistance QTL, with the diversity of the NBS-encoding genes influenced by the type of co-locating biotic stress resistance QTL. Conclusions NBS-encoding genes are under strong selection pressure in sorghum, through the contrasting evolutionary processes of purifying and balancing selection. Such contrasting evolutionary processes have impacted ancestral genes more than species-specific genes. Fungal disease resistance hot-spots in the genome, with resistance against multiple pathogens, provide further insight into the mechanisms that cereals use in the “arms race” with rapidly evolving pathogens, in addition to providing plant breeders with selection targets for fast-tracking the development of high performing varieties with more durable pathogen resistance.

Relevance: 30.00%

Abstract:

Brassica napus is one of the most important oil crops in the world, and stem rot caused by the fungus Sclerotinia sclerotiorum results in major losses in yield and quality. To elucidate resistance genes and pathogenesis-related genes, genome-wide association analysis of 347 accessions was performed using the Illumina 60K Brassica SNP (single nucleotide polymorphism) array. In addition, the detached stem inoculation assay was used to select five highly resistant (R) and susceptible (S) B. napus lines, 48 h postinoculation with S. sclerotiorum for transcriptome sequencing. We identified 17 significant associations for stem resistance on chromosomes A8 and C6, five of which were on A8 and 12 on C6. The SNPs identified on A8 were located in a 409-kb haplotype block, and those on C6 were consistent with previous QTL mapping efforts. Transcriptome analysis suggested that S. sclerotiorum infection activates the immune system, sulphur metabolism, especially glutathione (GSH) and glucosinolates in both R and S genotypes. Genes found to be specific to the R genotype related to the jasmonic acid pathway, lignin biosynthesis, defence response, signal transduction and encoding transcription factors. Twenty-four genes were identified in both the SNP-trait association and transcriptome sequencing analyses, including a tau class glutathione S-transferase (GSTU) gene cluster. This study provides useful insight into the molecular mechanisms underlying the plant's response to S. sclerotiorum.

Relevance: 30.00%

Abstract:

This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved with the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. In further inquiry, the set of highest scoring putative cis-regulatory elements were found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution. 
The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight into the transcription level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyses is stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.

Relevance: 30.00%

Abstract:

Sorghum (Sorghum bicolor) is one of the most important cereal crops globally and a potential energy plant for biofuel production. To explore genetic gain for a range of important quantitative traits, such as drought and heat tolerance, grain yield, stem sugar accumulation, and biomass production, via molecular breeding and genomic selection strategies, knowledge of the available genetic variation and the underlying sequence polymorphisms is required.

Relevance: 30.00%

Abstract:

Two complete mitochondrial genomes of the black marlin Istiompax indica were assembled from approximately 3.5 and 2.5 million reads produced by Ion Torrent next-generation sequencing. The complete genomes were 16,531 bp and 16,532 bp in length, consisting of 2 rRNA, 13 protein-coding genes, 22 tRNA and 2 coding regions. They demonstrated a similar A + T base content (52.6%) to other teleosts. Intraspecific sequence variation was 99.5% for three I. indica mitogenomes and 99.7% for X. gladius. A lower value (85%) was found for the I. platypterus mitogenomes from GenBank and was attributed to inadvertent inclusion of gene regions from a con-familial species in one record, highlighting the need for cautious downstream use of GenBank data. © 2014 Informa UK Ltd.
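
For readers unfamiliar with the summary statistics quoted above, the sketch below shows one way an A + T percentage and a pairwise percentage between two aligned sequences could be computed. It assumes the reported intraspecific percentages are pairwise identities over aligned positions, which the abstract does not state. The function names and toy sequences are hypothetical.

```python
def at_content(sequence):
    """Percentage of A and T bases in a DNA sequence."""
    seq = sequence.upper()
    return 100.0 * (seq.count("A") + seq.count("T")) / len(seq)


def percent_identity(aligned_a, aligned_b):
    """Percentage of matching bases between two pre-aligned sequences.

    Columns where either sequence has a gap ("-") are skipped; this is
    one common convention, assumed here for illustration.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a != "-" and b != "-"
    ]
    matches = sum(1 for a, b in compared if a == b)
    return 100.0 * matches / len(compared)


# Toy sequences (invented for illustration, not mitogenome data).
toy_at = at_content("ATGC")
toy_identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
```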

Relevance: 30.00%

Abstract:

The mango industry in Australia is worth in excess of $150 million annually, with the Kensington Pride (KP) cultivar capturing 60% of the domestic market. Valued by consumers for desirable taste and colour characteristics, KP has been used extensively as a parent in the Department of Agriculture and Fisheries’ (Queensland, Australia) mango breeding program, with over 400 hybrid trees sharing KP as the male parent. In order to gain a better understanding of Australia’s most significant mango variety, Horticulture Innovation Australia has led an international collaboration between the Queensland Department of Agriculture and Fisheries (Australia), the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT, India) and the Beijing Genomics Institute (China) to sequence the KP genome. Preliminary de novo assembly of Illumina short-read sequence data suggests that the KP genome is highly heterozygous and has an estimated genome size of 407 Mb. As refinements and additional sequence data are added to the assembly, a more complete picture of the mango genome will be elucidated.