996 resultados para Genome resources
Resumo:
Background Next-generation sequencing technology is an important tool for the rapid, genome-wide identification of genetic variations. However, it is difficult to resolve the ‘signal’ of variations of interest and the ‘noise’ of stochastic sequencing and bioinformatic errors in the large datasets that are generated. We report a simple approach to identify regional linkage to a trait that requires only two pools of DNA to be sequenced from progeny of a defined genetic cross (i.e. bulk segregant analysis) at low coverage (<10×) and without parentage assignment of individual SNPs. The analysis relies on regional averaging of pooled SNP frequencies to rapidly scan polymorphisms across the genome for differential regional homozygosity, which is then displayed graphically. Results Progeny from defined genetic crosses of Tribolium castaneum (F4 and F19) segregating for the phosphine resistance trait were exposed to phosphine to select for the resistance trait while the remainders were left unexposed. Next generation sequencing was then carried out on the genomic DNA from each pool of selected and unselected insects from each generation. The reads were mapped against the annotated T. castaneum genome from NCBI (v3.0) and analysed for SNP variations. Since it is difficult to accurately call individual SNP frequencies when the depth of sequence coverage is low, variant frequencies were averaged across larger regions. Results from regional SNP frequency averaging identified two loci, tc_rph1 on chromosome 8 and tc_rph2 on chromosome 9, which together are responsible for high level resistance. Identification of the two loci was possible with only 5-7× average coverage of the genome per dataset. These loci were subsequently confirmed by direct SNP marker analysis and fine-scale mapping. Individually, homozygosity of tc_rph1 or tc_rph2 results in only weak resistance to phosphine (estimated at up to 1.5-2.5× and 3-5× respectively), whereas in combination they interact synergistically to provide a high-level resistance >200×. The tc_rph2 resistance allele resulted in a significant fitness cost relative to the wild type allele in unselected beetles over eighteen generations. Conclusion We have validated the technique of linkage mapping by low-coverage sequencing of progeny from a simple genetic cross. The approach relied on regional averaging of SNP frequencies and was used to successfully identify candidate gene loci for phosphine resistance in T. castaneum. This is a relatively simple and rapid approach to identifying genomic regions associated with traits in defined genetic crosses that does not require any specialised statistical analysis.
Resumo:
Sorghum is a food and feed cereal crop adapted to heat and drought and a staple for 500 million of the world’s poorest people. Its small diploid genome and phenotypic diversity make it an ideal C4 grass model as a complement to C3 rice. Here we present high coverage (16-45 × ) resequenced genomes of 44 sorghum lines representing the primary gene pool and spanning dimensions of geographic origin, end-use and taxonomic group. We also report the first resequenced genome of S. propinquum, identifying 8 M high-quality SNPs, 1.9 M indels and specific gene loss and gain events in S. bicolor. We observe strong racial structure and a complex domestication history involving at least two distinct domestication events. These assembled genomes enable the leveraging of existing cereal functional genomics data against the novel diversity available in sorghum, providing an unmatched resource for the genetic improvement of sorghum and other grass species.
Resumo:
The first complete genome sequence of capsicum chlorosis virus (CaCV) from Australia was determined using a combination of Illumina HiSeq RNA and Sanger sequencing technologies. Australian CaCV had a tripartite genome structure like other CaCV isolates. The large (L) RNA was 8913 nucleotides (nt) in length and contained a single open reading frame (ORF) of 8634 nt encoding a predicted RNA-dependent RNA polymerase (RdRp) in the viral-complementary (vc) sense. The medium (M) and small (S) RNA segments were 4846 and 3944 nt in length, respectively, each containing two non-overlapping ORFs in ambisense orientation, separated by intergenic regions (IGR). The M segment contained ORFs encoding the predicted non-structural movement protein (NSm; 927 nt) and precursor of glycoproteins (GP; 3366 nt) in the viral sense (v) and vc strand, respectively, separated by a 449-nt IGR. The S segment coded for the predicted nucleocapsid (N) protein (828 nt) and non-structural suppressor of silencing protein (NSs; 1320 nt) in the vc and v strand, respectively. The S RNA contained an IGR of 1663 nt, being the largest IGR of all CaCV isolates sequenced so far. Comparison of the Australian CaCV genome with complete CaCV genome sequences from other geographic regions showed highest sequence identity with a Taiwanese isolate. Genome sequence comparisons and phylogeny of all available CaCV isolates provided evidence for at least two highly diverged groups of CaCV isolates that may warrant re-classification of AIT-Thailand and CP-China isolates as unique tospoviruses, separate from CaCV.
Resumo:
Territoriality is a central issue in indigenous peoples struggles. The territorial struggles involve struggles over the control of natural resources and over political participation and representation, but also over the perception of territorial rights and the symbolic representation of the territory. These struggles are carried through both in material and symbolic ways through recurring to different discourses and representations that provide legitimation for the territorial claims of the group. The study is located in the Northern Autonomous Atlantic Region of Nicaragua. The study concerns the territorial strategies, conceptions and practices of the indigenous people and other actors. Territorial conflicts exist between the autonomous region and the central government of Nicaragua, between mestizo settlers and indigenous people, between different indigenous groups, and between these and development agents such as conservation projects. The study focuses on how territorial discourses and representations are used to legitimate territorial control. Environmental, historical and cartographical discourses are the most important discourses recurred to. The influence of discourses and representations on the territorial practices and policies of the different actors, the links between the local struggles and global processes, and the broader structural factors impacting on the territorial struggles are also analysed. Among the structural factors are the problems related to land tenure and management and the use of natural resources, the advance of the agricultural frontier, the institutional weaknesses of the central and regional governments and the legislative processes. The territorial discourses are both recurred to in a strategic way and also grounded in local ideals and practices. The discourses have produced real effects for example in legislation, land tenure systems, political representation and environmental practices. Although the use of discourses and representations are an important power tool in territorial struggles, territorial control cannot be effectively accomplished merely through representing territorial claims in a legitimate way or through reforming legislation, as the conflicts are also largely a result of structural factors affecting the region. The fieldwork was carried out during a total of twelve months between 2000 and 2002. The research methods used were semi-structured interviews, participant observation and participatory research methods. A broad range of literary sources were also used to collect data. The study is located within the field of critical political geography with a discursive political ecology approach. It can be called a critical realist approach to the discursive analysis of indigenous territoriality.
Resumo:
Summary We have determined the full-length 14,491-nucleotide genome sequence of a new plant rhabdovirus, alfalfa dwarf virus (ADV). Seven open reading frames (ORFs) were identified in the antigenomic orientation of the negative-sense, single-stranded viral RNA, in the order 3′-N-P-P3-M-G-P6-L-5′. The ORFs are separated by conserved intergenic regions and the genome coding region is flanked by complementary 3′ leader and 5′ trailer sequences. Phylogenetic analysis of the nucleoprotein amino acid sequence indicated that this alfalfa-infecting rhabdovirus is related to viruses in the genus Cytorhabdovirus. When transiently expressed as GFP fusions in Nicotiana benthamiana leaves, most ADV proteins accumulated in the cell periphery, but unexpectedly P protein was localized exclusively in the nucleus. ADV P protein was shown to have a homotypic, and heterotypic nuclear interactions with N, P3 and M proteins by bimolecular fluorescence complementation. ADV appears unique in that it combines properties of both cytoplasmic and nuclear plant rhabdoviruses.
Resumo:
A limited number of plant rhabdovirus genomes have been fully sequenced, making taxonomic classification, evolutionary analysis and molecular characterization of this virus group difficult. We have for the first time determined the complete genome sequence of 13,188 nucleotides of Datura yellow vein nucleorhabdovirus (DYVV). DYVV genome organization resembles that of its closest relative, Sonchus yellow net virus (SYNV), with six ORFs in antigenomic orientation, separated by highly conserved intergenic regions and flanked by complementary 3′ leader and 5′ trailer sequences. As is typical for nucleorhabdoviruses, all viral proteins, except the glycoprotein, which is targeted to the endoplasmic reticulum, are localized to the nucleus. Nucleocapsid (N) protein, matrix (M) protein and polymerase, as components of nuclear viroplasms during replication, have predicted strong canonical nuclear localization signals, and N and M proteins exclusively localize to the nucleus when transiently expressed as GFP fusions. As in all nucleorhabdoviruses studied so far, N and phosphoprotein P interact when co-expressed, significantly increasing P nuclear localization in the presence of N protein. This research adds to the list of complete genomes of plant-infecting rhabdoviruses, provides molecular tools for further characterization and supports classification of DYVV as a nucleorhabdovirus closely related to but with some distinct differences from SYNV.
Resumo:
Brassica napus is one of the most important oil crops in the world, and stem rot caused by the fungus Sclerotinia sclerotiorum results in major losses in yield and quality. To elucidate resistance genes and pathogenesis-related genes, genome-wide association analysis of 347 accessions was performed using the Illumina 60K Brassica SNP (single nucleotide polymorphism) array. In addition, the detached stem inoculation assay was used to select five highly resistant (R) and susceptible (S) B. napus lines, 48 h postinoculation with S. sclerotiorum for transcriptome sequencing. We identified 17 significant associations for stem resistance on chromosomes A8 and C6, five of which were on A8 and 12 on C6. The SNPs identified on A8 were located in a 409-kb haplotype block, and those on C6 were consistent with previous QTL mapping efforts. Transcriptome analysis suggested that S. sclerotiorum infection activates the immune system, sulphur metabolism, especially glutathione (GSH) and glucosinolates in both R and S genotypes. Genes found to be specific to the R genotype related to the jasmonic acid pathway, lignin biosynthesis, defence response, signal transduction and encoding transcription factors. Twenty-four genes were identified in both the SNP-trait association and transcriptome sequencing analyses, including a tau class glutathione S-transferase (GSTU) gene cluster. This study provides useful insight into the molecular mechanisms underlying the plant's response to S. sclerotiorum.
Resumo:
Metabolism is the cellular subsystem responsible for generation of energy from nutrients and production of building blocks for larger macromolecules. Computational and statistical modeling of metabolism is vital to many disciplines including bioengineering, the study of diseases, drug target identification, and understanding the evolution of metabolism. In this thesis, we propose efficient computational methods for metabolic modeling. The techniques presented are targeted particularly at the analysis of large metabolic models encompassing the whole metabolism of one or several organisms. We concentrate on three major themes of metabolic modeling: metabolic pathway analysis, metabolic reconstruction and the study of evolution of metabolism. In the first part of this thesis, we study metabolic pathway analysis. We propose a novel modeling framework called gapless modeling to study biochemically viable metabolic networks and pathways. In addition, we investigate the utilization of atom-level information on metabolism to improve the quality of pathway analyses. We describe efficient algorithms for discovering both gapless and atom-level metabolic pathways, and conduct experiments with large-scale metabolic networks. The presented gapless approach offers a compromise in terms of complexity and feasibility between the previous graph-theoretic and stoichiometric approaches to metabolic modeling. Gapless pathway analysis shows that microbial metabolic networks are not as robust to random damage as suggested by previous studies. Furthermore the amino acid biosynthesis pathways of the fungal species Trichoderma reesei discovered from atom-level data are shown to closely correspond to those of Saccharomyces cerevisiae. In the second part, we propose computational methods for metabolic reconstruction in the gapless modeling framework. We study the task of reconstructing a metabolic network that does not suffer from connectivity problems. Such problems often limit the usability of reconstructed models, and typically require a significant amount of manual postprocessing. We formulate gapless metabolic reconstruction as an optimization problem and propose an efficient divide-and-conquer strategy to solve it with real-world instances. We also describe computational techniques for solving problems stemming from ambiguities in metabolite naming. These techniques have been implemented in a web-based sofware ReMatch intended for reconstruction of models for 13C metabolic flux analysis. In the third part, we extend our scope from single to multiple metabolic networks and propose an algorithm for inferring gapless metabolic networks of ancestral species from phylogenetic data. Experimenting with 16 fungal species, we show that the method is able to generate results that are easily interpretable and that provide hypotheses about the evolution of metabolism.
Resumo:
Sorghum (Sorghum bicolor) is one of the most important cereal crops globally and a potential energy plant for biofuel production. In order to explore genetic gain for a range of important quantitative traits, such as drought and heat tolerance, grain yield, stem sugar accumulation, and biomass production, via the use of molecular breeding and genomic selection strategies, knowledge of the available genetic variation and the underlying sequence polymorphisms, is required.
Resumo:
Two complete mitochondrial genomes of the black marlin Istiompax indica were assembled from approximately 3.5 and 2.5 million reads produced by Ion Torrent next generation sequencing. The complete genomes were 16,531 bp and 16,532 bp in length consisting of 2 rRNA, 13 protein-coding genes, 22tRNA and 2 coding regions. They demonstrated a similar A + T base (52.6%) to other teleosts. Intraspecific sequence variation was 99.5% for three I. indica mitogenomes and 99.7% for X. gladius. A lower value (85%) was found for the I. platypterus mitogenomes from genbank and accredited to inadvertent inclusion of gene regions from a con-familial species in one record, highlighting the need for cautious downstream use of genbank data. © 2014 Informa UK Ltd.
Resumo:
The mango industry in Australia is worth in excess of $150 million annually with the Kensington Pride (KP) cultivar capturing 60% of the domestic market. Valued by consumers for desirable taste and colour characteristics, KP has been used extensively as a parent in the Department of Agriculture and Fisheries’ (Queensland, Australia) mango breeding program with over 400 hybrid trees sharing KP as the male parent. In order to gain a better understanding of Australia’s most significant mango variety, Horticulture Innovation Australia had led an international collaboration between the Queensland Department of Agriculture and Fisheries (Australia), the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT, India) and the Beijing Genomics Institute (China) to sequence the KP genome. Preliminary de novo assembly of illumina short read sequence data suggests that the KP genome is highly heterozygous and has an estimated genome size of 407 Mb. As refinements and additional sequence data are added to the assembly, a more complete picture of the mango genome will be elucidated.
Resumo:
We present here the complete genome sequences of a novel polerovirus from Trifolium subterraneum (subterranean clover) and Cicer arietinum (chickpea) and compare these to a partial viral genome sequence obtained from Macroptilium lathyroides (phasey bean). We propose the name phasey bean mild yellows virus for this novel polerovirus.
Resumo:
The forest tree species Khaya senegalensis (Desr.) A. Juss. occurs in a belt across 20 African countries from Senegal-Guinea to Sudan-Uganda where it is a highly important resource. However, it is listed as Vulnerable (IUCN 2015-3). Since introduction in northern Australia around 1959, the species has been planted widely, yielding high-value products. The total area of plantations of the species in Australia exceeds 15,000 ha, mostly planted in the Northern Territory since 2006, and includes substantial areas across 60-70 woodlots and industrial plantations established in north-eastern Queensland since the early-1990s and during 2005-2007 respectively. Collaborative conservation and tree improvement by governments began in the Northern Territory and Queensland in 2001 based on provenance and other trials of the 1960s-1970s. This work has developed a broad base of germplasm in clonal seed orchards, hedge gardens and trials (clone and progeny). Several of the trials were established collaboratively on private land. Since the mid-2000s, commercial growers have introduced large numbers of provenance-bulk and individual-tree seedlots to establish industrial plantations and trials, several of the latter in collaboration with the Queensland Government. Provenance bulks (>140) and families (>400) from 17 African countries are established in Australia, considered the largest genetic base of the species in a single country outside Africa. Recently the annual rate of industrial planting of the species in Australia has declined, and R&D has been suspended by governments and reduced by the private sector. However, new commercial plantings in the Northern Territory and Queensland are proposed. In domesticating a species, the strategic importance of a broad genetic base is well known. The wide range of first- and advanced-generation germplasm of the species established in northern Australia and documented in this paper provides a sound basis for further domestication and industrial plantation and woodlot expansion, when investment conditions are favourable