911 results for Molecular Sequence Data.
Abstract:
Novel species of Cercospora and Pseudocercospora are described from Australian native plant species. These taxa are Cercospora ischaemi sp. nov. on Ischaemum australe (Poaceae); Pseudocercospora airliensis sp. nov. on Polyalthia nitidissima (Annonaceae); Pseudocercospora proiphydis sp. nov. on Proiphys amboinensis (Amaryllidaceae); and Pseudocercospora jagerae sp. nov. on Jagera pseudorhus var. pseudorhus (Sapindaceae). These species were characterised by morphology and an analysis of partial nucleotide sequence data for the three gene loci, ITS, LSU and EF-1α. Recent divergence of closely related Australian species of Pseudocercospora on native plants is proposed.
Abstract:
This thesis, which consists of an introduction and four peer-reviewed original publications, studies the problems of haplotype inference (haplotyping) and local alignment significance. The problems studied here belong to the broad area of bioinformatics and computational biology. The presented solutions are computationally fast and accurate, which makes them practical in high-throughput sequence data analysis. Haplotype inference is a computational problem where the goal is to estimate haplotypes from a sample of genotypes as accurately as possible. This problem is important because the direct measurement of haplotypes is difficult, whereas genotypes are easier to quantify. Haplotypes are key players when studying, for example, the genetic causes of diseases. In this thesis, three methods are presented for the haplotype inference problem, referred to as HaploParser, HIT, and BACH. HaploParser is based on a combinatorial mosaic model and hierarchical parsing that together mimic recombinations and point mutations in a biologically plausible way. In this mosaic model, the current population is assumed to have evolved from a small founder population. Thus, the haplotypes of the current population are recombinations of the (implicit) founder haplotypes with some point mutations. HIT (Haplotype Inference Technique) uses a hidden Markov model for haplotypes, and efficient algorithms are presented to learn this model from genotype data. The model structure of HIT is analogous to the mosaic model of HaploParser with founder haplotypes. Therefore, it can be seen as a probabilistic model of recombinations and point mutations. BACH (Bayesian Context-based Haplotyping) utilizes a context tree weighting algorithm to efficiently sum over all variable-length Markov chains to evaluate the posterior probability of a haplotype configuration. Algorithms are presented that find haplotype configurations with high posterior probability. 
BACH is the most accurate method presented in this thesis and has comparable performance to the best available software for haplotype inference. Local alignment significance is a computational problem where one is interested in whether the local similarities in two sequences arise because the sequences are related or just by chance. Similarity of sequences is measured by their best local alignment score, and from that a p-value is computed. This p-value is the probability of picking two sequences from the null model whose best local alignment score is at least as good. Local alignment significance is used routinely, for example in homology searches. In this thesis, a general framework is sketched that allows one to compute a tight upper bound for the p-value of a local pairwise alignment score. Unlike previous methods, the presented framework is not affected by so-called edge effects and can handle gaps (deletions and insertions) without troublesome sampling and curve fitting.
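The p-value definition above can be illustrated with a minimal sketch. It is not the framework of the thesis, which computes an upper bound and explicitly avoids sampling; it is a plain Monte Carlo estimate shown only to make the definition concrete. The scoring parameters, the uniform i.i.d. null model over `ACGT`, and the function names `local_alignment_score` and `empirical_p_value` are all assumptions made for illustration.

```python
import random


def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score (Smith-Waterman with a linear gap penalty).

    The scoring values are illustrative assumptions, not taken from the thesis.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, prev[j - 1] + sub, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def empirical_p_value(a, b, trials=200, alphabet="ACGT", seed=0):
    """Monte Carlo estimate of P(null score >= observed score).

    Assumed null model: independent, uniformly distributed letters, with
    random sequences of the same lengths as the inputs.
    """
    rng = random.Random(seed)
    observed = local_alignment_score(a, b)
    hits = 0
    for _ in range(trials):
        x = "".join(rng.choice(alphabet) for _ in a)
        y = "".join(rng.choice(alphabet) for _ in b)
        if local_alignment_score(x, y) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (trials + 1)
```

A sampling estimate of this kind cannot resolve p-values much smaller than one over the number of trials, which is one reason an analytic upper bound is attractive for routine homology searches.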
Abstract:
The mango industry in Australia is worth in excess of $150 million annually with the Kensington Pride (KP) cultivar capturing 60% of the domestic market. Valued by consumers for desirable taste and colour characteristics, KP has been used extensively as a parent in the Department of Agriculture and Fisheries’ (Queensland, Australia) mango breeding program with over 400 hybrid trees sharing KP as the male parent. In order to gain a better understanding of Australia’s most significant mango variety, Horticulture Innovation Australia has led an international collaboration between the Queensland Department of Agriculture and Fisheries (Australia), the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT, India) and the Beijing Genomics Institute (China) to sequence the KP genome. Preliminary de novo assembly of Illumina short read sequence data suggests that the KP genome is highly heterozygous and has an estimated genome size of 407 Mb. As refinements and additional sequence data are added to the assembly, a more complete picture of the mango genome will be elucidated.
Abstract:
Phylogenetic studies of cyanobacterial lichens
Lichens are symbiotic assemblages between fungi (mycobiont) and green algae (phycobiont) and/or cyanobacteria (cyanobiont). Fossil records show that lichen-like symbioses already occurred 600 million years ago. Lichen symbiosis has since then become an important life strategy for the Fungi, particularly for species in the phylum Ascomycota, as approximately 98% of the lichenized fungal species are ascomycetes. The taxonomy of lichen associations is based on the mycobiont. Using DNA sequence data, we reconstructed hypotheses of phylogenetic relationships of lichen-forming fungi that include species associated with cyanobacteria. These hypotheses of phylogeny should form the basis for the taxonomy. They also allowed studies of the origin and the evolution of specific symbioses. Genetic diversity and phylogenetic relationships of symbiotic cyanobionts were also studied in order to examine selectivity of cyanobionts and mycobionts as well as possible co-evolution between partners involved in lichen associations. The suggested circumscription of the family Stereocaulaceae to include Stereocaulon and Lepraria is supported. The recently described crustose Stereocaulon species seem to be correctly placed in the genus, although Stereocaulon traditionally included only fruticose species. The monospecific crustose genus Muhria is also shown to be best placed in Stereocaulon. Family Lobariaceae as currently delimited is monophyletic. Within Lobariaceae, the genus Sticta, including Dendriscocaulon dendroides, forms a monophyletic group, while the genera Lobaria and Pseudocyphellaria are non-monophyletic. A new classification of Lobariaceae is clearly needed. Further studies are, however, required before a final proposal for a new classification can be made. Our results show that the cyanobacterial symbiotic state has been gained repeatedly in the Ascomycota, while losses of symbiotic cyanobacteria appear to be rare. 
The symbiosis with green algae is confirmed to have been gained repeatedly in Ascomycota but also repeatedly lost. Cyanobacterial symbioses therefore seem to be more stable than green algal associations. Cyanobacteria are perhaps more beneficial for the lichen fungi and therefore maintained. The results indicate a dynamic association of the lichen symbiosis. This evolutionary instability may be important for the lichen fungi, as the utilization of options may enable lichens to colonize new substrates and survive environmental changes. Some cyanobacterial lichen genera seem to be highly selective towards the cyanobiont while others form symbioses with a broad spectrum of cyanobacteria. No evidence of co-evolution between fungi and cyanobacteria in cyanolichens could be demonstrated.
Abstract:
Lead contamination in the environment is of particular concern, as it is a known toxin. Until recently, however, much less attention has been given to the local contamination caused by activities at shooting ranges compared to large-scale industrial contamination. In Finland, more than 500 tons of Pb is produced each year for shotgun ammunition. The contaminant threatens various organisms, ground water and the health of human populations. However, the forest at shooting ranges usually shows no visible sign of stress compared to nearby clean environments. The aboveground biota normally reflects the belowground ecosystem. Thus, the soil microbial communities appear to bear strong resistance to contamination, despite the influence of lead. The studies forming this thesis investigated a shooting range site at Hälvälä in Southern Finland, which is heavily contaminated by lead pellets. Previously it was experimentally shown that the growth of grasses and degradation of litter are retarded. Measurements of acute toxicity of the contaminated soil or soil extracts gave conflicting results, as enchytraeid worms used as toxicity reporters were strongly affected, while reporter bacteria showed no or very minor decreases in viability. Measurements using sensitive inducible luminescent reporter bacteria suggested that the bioavailability of lead in the soil is indeed low, and this notion was supported by the very low water extractability of the lead. Nevertheless, the frequency of lead-resistant cultivable bacteria was elevated based on the isolation of cultivable strains. The bacterial and fungal diversity in heavily lead contaminated shooting sectors were compared with those of pristine sections of the shooting range area. The bacterial 16S rRNA gene and fungal ITS rRNA gene were amplified, cloned and sequenced using total DNA extracted from the soil humus layer as the template. 
Altogether, 917 sequenced bacterial clones and 649 sequenced fungal clones revealed a high soil microbial diversity. No effect of lead contamination was found on bacterial richness or diversity, while fungal richness and diversity significantly differed between lead contaminated and clean control areas. However, even in the case of fungi, genera that were deemed sensitive were not totally absent from the contaminated area: only their relative frequency was significantly reduced. Some operational taxonomic units (OTUs) assigned to Basidiomycota were clearly affected, and were much rarer in the lead contaminated areas. The studies of this thesis surveyed EcM sporocarps, analyzed morphotyped EcM root tips by direct sequencing, and 454-pyrosequenced fungal communities in in-growth bags. A total of 32 EcM fungi that formed conspicuous sporocarps, 27 EcM fungal OTUs from 294 root tips, and 116 EcM fungal OTUs from a total of 8 194 ITS2 454 sequences were recorded. The ordination analyses by non-parametric multidimensional scaling (NMS) indicated that Pb enrichment induced a shift in the EcM community composition. This was visible as indicative trends in the sporocarp and root tip datasets, but explicitly clear in the communities observed in the in-growth bags. The compositional shift in the EcM community was mainly attributable to an increase in the frequencies of OTUs assigned to the genus Thelephora, and to a decrease in the OTUs assigned to Pseudotomentella, Suillus and Tylospora in Pb-contaminated areas when compared to the control. The enrichment of Thelephora in contaminated areas was also observed when examining the total fungal communities in soil using DNA cloning and sequencing technology. While the compositional shifts are clear, their functional consequences for the dominant trees or soil ecosystem remain undetermined. The results indicate that at the Hälvälä shooting range, lead influences the fungal communities but not the bacterial communities. 
The forest ecosystem shows apparent functional redundancy, since no significant effects were seen on forest trees. With 454 pyrosequencing, the number of sequences in a single analysis run can now be up to one million. The technique has been applied in microbial ecology studies to characterize microbial communities. The handling of sequence data with traditional programs is becoming difficult and exceedingly time consuming, and novel tools are needed to handle the vast amounts of data being generated. The field of microbial ecology has recently benefited from the availability of a number of tools for describing and comparing microbial communities using robust statistical methods. However, although these programs provide methods for rapid calculation, it has become necessary to make them more amenable to larger datasets and numbers of samples from pyrosequencing. As part of this thesis, a new program was developed, MuSSA (Multi-Sample Sequence Analyser), to handle sequence data from novel high-throughput sequencing approaches in microbial community analyses. The greatest advantage of the program is that large volumes of sequence data can be manipulated, and general OTU series with a frequency value can be calculated among a large number of samples.
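The idea of calculating OTU frequencies across many samples can be sketched as follows. This is not MuSSA's implementation, which the abstract does not describe; it assumes that every read has already been assigned to a sample and an OTU, and the function name `otu_frequency_table` and the input layout are hypothetical.

```python
from collections import Counter, defaultdict


def otu_frequency_table(assignments):
    """Build per-sample relative OTU frequencies.

    assignments: iterable of (sample_id, otu_id) pairs, one pair per read.
    Returns {sample_id: {otu_id: relative frequency}}, with every OTU seen
    in any sample present as a key in every sample (0.0 when absent).
    """
    counts = defaultdict(Counter)
    for sample, otu in assignments:
        counts[sample][otu] += 1

    # Shared, sorted list of OTUs so all samples have the same columns.
    otus = sorted({otu for per_sample in counts.values() for otu in per_sample})

    table = {}
    for sample, per_sample in counts.items():
        total = sum(per_sample.values())
        table[sample] = {otu: per_sample[otu] / total for otu in otus}
    return table


# Made-up reads for illustration only; the labels are not data from the thesis.
reads = [
    ("control", "OTU_A"), ("control", "OTU_A"), ("control", "OTU_B"),
    ("lead", "OTU_B"), ("lead", "OTU_B"),
]
table = otu_frequency_table(reads)
```

Keeping the same OTU columns in every sample makes the resulting table directly usable as input for ordination or other between-sample comparisons.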
Abstract:
Understanding the overwhelming diversity of life calls for complex organisational schemes. The field of systematics may thus be seen as the cornerstone of evolutionary biology. In the last few decades, systematics has been rejuvenated through the introduction of molecular methods such as DNA barcoding and multi-gene phylogenetic approaches. These methods may shed new light on established taxonomic ideas and problems. For example, the classification of ants has aroused much debate due to reinterpretation of morphological characters or contradictions between molecular data and morphology. Only in the last few years has a consensus been reached regarding the phylogeny of ant subfamilies. However, the situation remains deplorable for lower taxonomic ranks such as subfamilies, tribes and genera. This thesis describes the systematics and evolution of the Holarctic ant genus Myrmica and the tribe to which it belongs, Myrmicini. Using barcoding, molecular-phylogenetic data and divergence time estimations, it addresses questions regarding the taxonomy, morphology and biogeography of this group. Furthermore, the interrelationships between socially parasitic Myrmica species and their hosts (other species in the genus) were inferred. The phylogeny suggests that social parasitism evolved several times in Myrmica. Finally, this thesis investigated whether coevolution shaped the phylogeny of socially parasitic Maculinea butterflies that live inside Myrmica colonies. No evidence was found for coevolution.
Abstract:
Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
Abstract:
Social behaviour affects dispersal of animals and is an important modifier of genetic population structures. The female sex is often philopatric, which maintains coancestry within the breeding groups and promotes cooperative behaviours. This also enables inclusive fitness returns from altruism and explains why some individuals sacrifice personal reproduction for the good of others in social insects such as ants. However, reduced dispersal and population substructuring at the level of colonies may also entail inbreeding, loss of genetic diversity, and vulnerability. In addition, the most vulnerable ants are species that have evolved to parasitize colonies of other ants, and which compromise between the ability to disperse and the efficiency with which they parasitize the host. On the other hand, certain social organisations of ant colonies may help a species to disperse outside its natural range and become a pest. Altogether, knowledge of the genetic structuring of ant populations, as well as the evolution of their life histories, can contribute to conservation biology and population management. The aim of this thesis was to investigate population structures and phylogenetic evolution of the ant Plagiolepis pygmaea and its two obligatory, workerless social parasites (inquilines) P. xene and P. grassei with genetic markers and DNA sequence data. The results support the general assumption that populations of inquiline parasites are highly fragmented and genetically vulnerable. Comparison of the two parasites suggests that differences in their relative abundance may follow from their interaction with the host, i.e. how well the species is adapted to reproduce in the host colonies. The results also indicate that the most recent free-living ancestor of these two parasite species is their common host. This is considered to provide evidence for the controversial issue of sympatric speciation. 
Further, given that the level of adaptations to parasitic life history depends on the evolutionary time since the free-living ancestor, the results establish a link between species rarity and its evolutionary age. The populations of the host species P. pygmaea displayed significantly reduced dispersal among both the females (queens) and the males, and high levels of inbreeding, which may enhance worker altruism. In addition, the queens were found to mate with multiple males. Given the high relatedness between the queens and their mates, this probably occurs for non-genetic reasons, i.e. without the benefits associated with genetically more diverse offspring. The results hence caution that the contribution of non-genetic factors to the prevailing mating patterns and genetic population structures should not be underestimated.
Novel TBK1 truncating mutation in a familial amyotrophic lateral sclerosis patient of Chinese origin
Abstract:
Missense and frameshift mutations in TRAF family member-associated NF-kappa-B activator (TANK)-binding kinase 1 (TBK1) have been reported in European sporadic and familial amyotrophic lateral sclerosis (ALS) cohorts. To assess the role of TBK1 in ALS patient cohorts of wider ancestry, we have analyzed whole-exome sequence data from an Australian cohort of familial ALS (FALS) patients and controls. We identified a novel TBK1 deletion (c.1197delC) in a FALS patient of Chinese origin. This frameshift mutation (p.L399fs) likely results in a truncated protein that lacks functional domains required for adapter protein binding, as well as protein activation and structural integrity. No novel or reported TBK1 mutations were identified in FALS patients of European ancestry. This is the first report of a TBK1 mutation in an ALS patient of Asian origin and indicates that sequence variations in TBK1 are a rare cause of FALS in Australia. © 2015 Elsevier Inc.
Abstract:
The mechanism of translation in eubacteria and organelles is thought to be similar. In eubacteria, the three initiation factors IF1, IF2, and IF3 are vital. Although the homologs of IF2 and IF3 are found in mammalian mitochondria, an IF1 homolog has never been detected. Here, we show that bovine mitochondrial IF2 (IF2mt) complements E. coli containing a deletion of the IF2 gene (E. coli ΔinfB). We find that IF1 is no longer essential in an IF2mt-supported E. coli ΔinfB strain. Furthermore, biochemical and molecular modeling data show that a conserved insertion of 37 amino acids in the IF2mt substitutes for the function of IF1. Deletion of this insertion from IF2mt supports E. coli for the essential function of IF2. However, in this background, IF1 remains essential. These observations provide strong evidence that a single factor (IF2mt) in mammalian mitochondria performs the functions of two eubacterial factors, IF1 and IF2.
Abstract:
Dimethyl sulphoxide complexes of lanthanide and yttrium nitrates of the general formula M(DMSO)n(NO3)3 where M = La, Ce, Pr, Nd, Sm or Gd; n = 4 and M = Y, Ho or Yb; n = 3 have been isolated and characterized. The i.r. data besides excluding the presence of D3h nitrate, reveal co-ordination through the oxygen atom of the dimethyl sulphoxide. The complexes are monomeric in acetonitrile. Molecular conductance data in acetone, acetonitrile, dimethyl formamide and dimethyl sulphoxide suggest a co-ordination number of eight for the lighter lanthanides and seven for yttrium and the heavier lanthanides.
Abstract:
Rare earth perchlorate-antipyrine (ap) complexes of the formula Ln(ClO4)3·6ap have been prepared and characterised. Infrared and electronic spectra showed co-ordination through the carbonyl oxygen. Conductivity and molecular weight data indicated a co-ordination number of six for these complexes.
Abstract:
Gene expression is one of the most critical factors influencing the phenotype of a cell. As a result of several technological advances, measuring gene expression levels has become one of the most common molecular biological measurements to study the behaviour of cells. The scientific community has produced an enormous and constantly increasing collection of gene expression data from various human cells, from both healthy and pathological conditions. However, while each of these studies is informative and enlightening in its own context and research setup, diverging methods and terminologies make it very challenging to integrate existing gene expression data into a more comprehensive view of human transcriptome function. On the other hand, bioinformatic science advances only through data integration and synthesis. The aim of this study was to develop biological and mathematical methods to overcome these challenges and to construct an integrated database of the human transcriptome, as well as to demonstrate its usage. Methods developed in this study can be divided into two distinct parts. First, the biological and medical annotation of the existing gene expression measurements needed to be encoded by systematic vocabularies. There was no single existing biomedical ontology or vocabulary suitable for this purpose. Thus, new annotation terminology was developed as a part of this work. The second part was to develop mathematical methods correcting the noise and systematic differences/errors in the data caused by various array generations. Additionally, there was a need to develop suitable computational methods for sample collection and archiving, unique sample identification, database structures, data retrieval and visualization. Bioinformatic methods were developed to analyze gene expression levels and putative functional associations of human genes by using the integrated gene expression data. 
A method to interpret individual gene expression profiles across all the healthy and pathological tissues of the reference database was also developed. As a result of this work, 9783 human gene expression samples measured by Affymetrix microarrays were integrated to form a unique human transcriptome resource, GeneSapiens. This makes it possible to analyse expression levels of 17330 genes across 175 types of healthy and pathological human tissues. Application of this resource to interpret individual gene expression measurements allowed identification of tissue of origin with 92.0% accuracy among 44 healthy tissue types. Systematic analysis of transcriptional activity levels of 459 kinase genes was performed across 44 healthy and 55 pathological tissue types, and a genome-wide analysis of kinase gene co-expression networks was done. This analysis revealed biologically and medically interesting data on putative kinase gene functions in health and disease. Finally, we developed a method for alignment of gene expression profiles (AGEP) to perform analysis for individual patient samples to pinpoint gene- and pathway-specific changes in the test sample in relation to the reference transcriptome database. We also showed how large-scale gene expression data resources can be used to quantitatively characterize changes in the transcriptomic program of differentiating stem cells. Taken together, these studies indicate the power of systematic bioinformatic analyses to infer biological and medical insights from existing published datasets as well as to facilitate the interpretation of new molecular profiling data from individual patients.
Abstract:
Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow rapid phylogenetic characterization of these new viruses. Often, however, complete viral genomes are not recovered, but rather several distinct contigs derived from a single entity are, some of which have no sequence homology to any known proteins. De novo assembly of single viruses from a metagenome is challenging, not only because of the lack of a reference genome, but also because of intrapopulation variation and uneven or insufficient coverage. Here we explored different assembly algorithms, remote homology searches, genome-specific sequence motifs, k-mer frequency ranking, and coverage profile binning to detect and obtain viral target genomes from metagenomes. All methods were tested on 454-generated sequencing datasets containing three recently described RNA viruses with relatively large genomes, which were divergent from previously known viruses of the viral families Rhabdoviridae and Coronaviridae. Depending on specific characteristics of the target virus and the metagenomic community, different assembly and in silico gap closure strategies were successful in obtaining near complete viral genomes.
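One way to picture k-mer frequency ranking is the sketch below. The abstract does not specify how the ranking was done, so this is only an assumed variant: each contig is summarised by its normalised k-mer frequency vector, and contigs are ranked by Euclidean distance to a seed contig already attributed to the target virus. The function names `kmer_profile` and `rank_contigs`, the choice of distance, and the default `k=4` are illustrative assumptions.

```python
import math
from collections import Counter
from itertools import product


def kmer_profile(seq, k=4):
    """Normalised frequency vector over all 4**k DNA k-mers (ACGT only)."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # k-mers containing ambiguous bases (e.g. N) are ignored.
    total = sum(counts[km] for km in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[km] / total for km in kmers]


def rank_contigs(seed, contigs, k=4):
    """Return contig names sorted by k-mer profile distance to the seed.

    seed: sequence assumed to belong to the target genome.
    contigs: dict mapping contig name to sequence.
    """
    ref = kmer_profile(seed, k)

    def distance(name):
        return math.dist(ref, kmer_profile(contigs[name], k))

    return sorted(contigs, key=distance)


# Toy sequences for illustration only.
seed = "ACGTACGTACGTACGT"
contigs = {"similar": "ACGTACGTACGT", "different": "GGGGGGGGCCCCCCCC"}
ranking = rank_contigs(seed, contigs, k=2)
```

Composition-based ranking of this kind needs no homology to known proteins, which is why it can help group contigs that homology searches alone would miss; short contigs give noisy profiles, so in practice it would be combined with coverage information.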
Abstract:
A developmental series of larval and pelagic juvenile pygmy rockfish (Sebastes wilsoni) from central California is illustrated and described. Sebastes wilsoni is a non-commercially, but ecologically, important rockfish, and the ability to differentiate its young stages will aid researchers in population abundance studies. Pigment patterns, meristic characters, morphometric measurements, and head spination were recorded from specimens that ranged from 8.1 to 34.4 mm in standard length. Larvae were identified initially by meristic characters and the absence of ventral and lateral midline pigment. Pelagic juveniles developed a prominent pigment pattern of three body bars that did not extend to the ventral surface. Species identification was confirmed subsequently by using mitochondrial sequence data of four representative specimens of various sizes. As determined from the examination of otoliths, the growth rate of larval and pelagic juvenile pygmy rockfish was 0.28 mm/day, which is relatively slow in comparison to the growth rate of other species of Sebastes. These data will aid researchers in determining species abundance.