911 results for Molecular Sequence Data.
Abstract:
We describe a novel approach to explore DNA nucleotide sequence data, aiming to produce high-level categorical and structural information about the underlying chromosomes, genomes and species. The article starts by analyzing chromosomal data through histograms using fixed length DNA sequences. After creating the DNA-related histograms, a correlation between pairs of histograms is computed, producing a global correlation matrix. These data are then used as input to several data processing methods for information extraction and tabular/graphical output generation. A set of 18 species is processed and the extensive results reveal that the proposed method is able to generate significant and diversified outputs, in good accordance with current scientific knowledge in domains such as genomics and phylogenetics.
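The pipeline in the abstract above (fixed-length word histograms, then a pairwise correlation matrix) can be sketched as follows. This is a minimal illustration and not the authors' implementation: the word length `k`, the use of Pearson correlation, and the function names `kmer_histogram`, `pearson` and `correlation_matrix` are all assumptions, since the abstract does not specify them.

```python
from collections import Counter
from itertools import product
import math


def kmer_histogram(seq, k):
    """Relative frequency of every length-k word over the alphabet ACGT.

    Words containing other symbols (e.g. N) are ignored (assumption).
    """
    words = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[w] for w in words)
    return [counts[w] / total if total else 0.0 for w in words]


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if one is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_matrix(seqs, k=3):
    """Correlation between every pair of k-mer histograms."""
    hists = [kmer_histogram(s, k) for s in seqs]
    return [[pearson(a, b) for b in hists] for a in hists]


if __name__ == "__main__":
    # Toy sequences, purely illustrative (not real chromosomal data).
    toy = ["ACGTACGTTTGACA", "ACGTACGTTTGACC", "GGGGCCCCGGGGCC"]
    for row in correlation_matrix(toy, k=2):
        print(["%.2f" % v for v in row])
```

The resulting matrix is symmetric with ones on the diagonal, so it can feed directly into clustering or visualization steps of the kind the abstract mentions.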
Abstract:
Proteins are biochemical entities consisting of one or more blocks typically folded in a 3D pattern. Each block (a polypeptide) is a single linear sequence of amino acids that are biochemically bonded together. The amino acid sequence in a protein is defined by the sequence of a gene or several genes encoded in the DNA-based genetic code. This genetic code typically uses twenty amino acids, but in certain organisms the genetic code can also include two other amino acids. After linking the amino acids during protein synthesis, each amino acid becomes a residue in a protein, which is then chemically modified, ultimately changing and defining the protein function. In this study, the authors analyze the amino acid sequence using alignment-free methods, aiming to identify structural patterns in sets of proteins and in the proteome, without any other previous assumptions. The paper starts by analyzing amino acid sequence data by means of histograms using fixed length amino acid words (tuples). After creating the initial relative frequency histograms, they are transformed and processed in order to generate quantitative results for information extraction and graphical visualization. Selected samples from two reference datasets are used, and results reveal that the proposed method is able to generate relevant outputs in accordance with current scientific knowledge in domains like protein sequence/proteome analysis.
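The first step described above, relative frequency histograms of fixed-length amino acid words, might look like the sketch below. It is an illustration under stated assumptions rather than the study's code: the default word length `k=2`, the choice to skip words containing non-standard residue codes, and the name `word_frequencies` are hypothetical.

```python
from collections import Counter

# One-letter codes of the twenty standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def word_frequencies(protein, k=2, alphabet=STANDARD):
    """Relative frequency of each length-k word observed in a protein sequence.

    Words containing symbols outside `alphabet` (e.g. X) are skipped,
    which is an assumption made for this sketch.
    """
    words = (protein[i:i + k] for i in range(len(protein) - k + 1))
    counts = Counter(w for w in words if set(w) <= alphabet)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


if __name__ == "__main__":
    # Toy peptide, purely illustrative.
    for word, freq in sorted(word_frequencies("MKMKV").items()):
        print(word, round(freq, 3))
```

A sparse dictionary is used instead of a dense vector because the number of possible words grows as 20^k. To cover organisms that use the two additional amino acids, the `alphabet` argument can be extended with their one-letter codes (U and O).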
Abstract:
The ability of Mycobacterium tuberculosis to establish a latent infection (LTBI) in humans confounds the treatment of tuberculosis. Consequently, there is a need to discover new therapeutic agents that can kill M. tuberculosis both during active disease and LTBI. The streptomycin-dependent strain of M. tuberculosis, 18b, provides a useful tool for this purpose since upon removal of streptomycin (STR) it enters a non-replicating state that mimics latency both in vitro and in animal models. The 4.41 Mb genome sequence of M. tuberculosis 18b was determined and this revealed the strain to belong to clade 3 of the ancient ancestral lineage of the Beijing family. STR-dependence was attributable to insertion of a single cytosine in the 530 loop of the 16S rRNA and to a single amino acid insertion in the N-terminal domain of initiation factor 3. RNA-seq was used to understand the genetic programme activated upon STR-withdrawal and hence to gain insight into LTBI. This revealed reconfiguration of gene expression and metabolic pathways showing strong similarities between non-replicating 18b and M. tuberculosis residing within macrophages, and with the core stationary phase and microaerophilic responses. The findings of this investigation confirm the validity of 18b as a model for LTBI, and provide insight into both the evolution of tubercle bacilli and the functioning of the ribosome.
Abstract:
Whole-genome sequencing (WGS) of 228 isolates was used to elucidate the origin and dynamics of a long-term outbreak of methicillin-resistant Staphylococcus aureus (MRSA) sequence type 228 (ST228) SCCmec I that involved 1,600 patients in a tertiary care hospital between 2008 and 2012. Combining the sequence data with detailed metadata on patient admission and movement confirmed that the outbreak was due to the transmission of a single clonal variant of ST228, rather than repeated introductions of this clone into the hospital. We note that this clone is significantly more frequently recovered from groin and rectal swabs than other clones (P < 0.0001) and is also significantly more transmissible between roommates (P < 0.01). Unrecognized MRSA carriers, together with movements of patients within the hospital, also seem to have played a major role. These atypical colonization and transmission dynamics can help explain how the outbreak was maintained over the long term. This "stealthy" asymptomatic colonization of the gut, combined with heightened transmissibility (potentially reflecting a role for environmental reservoirs), means the dynamics of this outbreak share some properties with enteric pathogens such as vancomycin-resistant enterococci or Clostridium difficile. IMPORTANCE: Using whole-genome sequencing, we showed that a large and prolonged outbreak of methicillin-resistant Staphylococcus aureus was due to the clonal spread of a specific strain with genetic elements adapted to the hospital environment. Unrecognized MRSA carriers, the movement of patients within the hospital, and the low detection with clinical specimens were also factors that played a role in this occurrence. The atypical colonization of the gut means the dynamics of this outbreak may share some properties with enteric pathogens.
Abstract:
Although a substantial amount of research has been done on all aspects of Heliconius biology and their ecological interactions with Passiflora, there has not hitherto been a phylogenetic examination of this association for coevolution. To test the Heliconius/Passiflora association for coevolutionary congruence, phylogenies for each group were established and compared. The phylogeny for 14 species of Heliconiinae from Costa Rica was based on combined sequence data from rRNA ITS 2 and partial EF-1α gene regions. For the Passifloraceae, 17 host plant species were utilized to establish a phylogeny based on tRNA-Leucine and ITS 1/5.8S/ITS 2 sequence data. The phylogenies for both groups were largely in agreement with current classification (for Passifloraceae) and previously established phylogenies. Associations with the large subgenera Passiflora and Decaloba correspond with the two major Advanced Radiation groups in Heliconius. Although strict congruence above subgenus level was not observed, broad scale congruence was evident. One main host shift as well as other possible explanations for lack of strict congruence are suggested.
Abstract:
Agaricus bisporus is the most commonly cultivated mushroom in North America and has great economic value. Green mould is a serious disease of A. bisporus and causes major reductions in mushroom crop production. The causative agent of green mould disease in North America was identified as Trichoderma aggressivum f. aggressivum. Variations in disease resistance have been shown in the different commercial mushroom strains. The purpose of this study is to continue investigations of the interactions between T. aggressivum and A. bisporus during the development of green mould disease. The main focus of the research was to study the roles of cell wall degrading enzymes in green mould disease resistance and pathogenesis. First, we tried to isolate and sequence the N-acetylglucosaminidase from A. bisporus to understand the defensive mechanism of the mushroom against the disease. However, the lack of genomic and proteomic information of A. bisporus limited our efforts. Next, T. aggressivum cell wall degrading enzymes that are thought to attack Agaricus and mediate the disease development were examined. The three cell wall degrading enzyme genes, encoding endochitinase (ech42), glucanase (β-1,3 glucanase) and protease (prb1), were isolated and sequenced from T. aggressivum f. aggressivum. The sequence data showed significant homology with the corresponding genes from other fungi including Trichoderma species. The transcription levels of the three T. aggressivum cell wall degrading enzymes were studied during the in vitro co-cultivation with A. bisporus using RT-qPCR. The transcription levels of the three genes were significantly upregulated compared to the solitary culture levels but were upregulated to a lesser extent in co-cultivation with a resistant strain of A. bisporus than with a sensitive strain. An Agrobacterium tumefaciens transformation system was developed for T. aggressivum and was used to transform three silencing plasmids to construct three new T. aggressivum phenotypes, each with a silenced cell wall degrading enzyme. The silencing efficiency was determined by RT-qPCR during the individual in vitro co-cultivation of each of the new phenotypes with A. bisporus. The results showed that the expression of the three enzymes was significantly decreased during the in vitro co-cultivation when compared with the wild type. The phenotypes were co-cultivated with A. bisporus on compost while the progression of green mould disease was monitored. The data indicated that the prb1 and ech42 genes are more important in disease progression than the β-1,3 glucanase gene. Finally, the present study emphasises the role of the three cell wall degrading enzymes in green mould disease infection and may provide a promising tool for disease management.
Abstract:
"Thesis presented to the Faculty of Graduate Studies with a view to obtaining the degree of Master of Laws, Research Option"
Abstract:
Diversification of insect herbivores is often associated with coevolution between plant toxins and insect countermeasures, resulting in a specificity that restricts host plant shifts. Gall inducers, however, bypass plant toxins and the factors influencing host plant associations in these specialized herbivores remain unclear. We reconstructed the evolution of host plant associations in Western Palaearctic oak gallwasps (Cynipidae: Cynipini), a species-rich lineage of specialist herbivores on oak (Quercus). (1) Bayesian analyses of sequence data for three genes revealed extreme host plant conservatism, with inferred shifts between major oak lineages (sections Cerris and Quercus) closely matching the minimum required to explain observed diversity. It thus appears that the coevolutionary demands of gall induction constrain host plant shifts, both in cases of mutualism (e.g., fig wasps, yucca moths) and parasitism (oak gallwasps). (2) Shifts between oak sections occurred independently in sexual and asexual generations of the gallwasp lifecycle, implying that these can evolve independently. (3) Western Palaearctic gallwasps associated with sections Cerris and Quercus diverged at least 20 million years ago (mya), prior to the arrival of oaks in the Western Palaearctic from Asia 5-7 mya. This implies an Asian origin for Western Palaearctic gallwasps, with independent westwards range expansion by multiple lineages.
Abstract:
A longstanding debate in evolutionary biology concerns whether species diverge gradually through time or by rapid punctuational bursts at the time of speciation. The theory of punctuated equilibrium states that evolutionary change is characterised by short periods of rapid evolution followed by longer periods of stasis in which no change occurs. Despite years of work seeking evidence for punctuational change in the fossil record, the theory remains contentious. Further there is little consensus as to the size of the contribution of punctuational changes to overall evolutionary divergence. Here we review recent developments which show that punctuational evolution is common and widespread in gene sequence data.
Abstract:
The DNA barcode potential of three regions (the nuclear ribosomal ITS and the plastid psbA-trnH and trnT-trnL intergenic spacers) was investigated for the plant genus Aspalathus L. (Fabaceae: Crotalarieae). Aspalathus is a large genus (278 species) that revealed low levels of DNA variation in phylogenetic studies. In a 51-species dataset for the psbA-trnH and ITS regions, 45% and 16% of sequences respectively were identical to the sequence of at least one other species, with two species undiscriminated even when the two regions were combined. In contrast, trnT-trnL discriminated between all species in this dataset. In a larger ITS and trnT-trnL dataset, including a further 82 species, 7 species in five pairwise comparisons remained undiscriminated when the two regions were combined. Four of the five pairs of species not discriminated by sequence data were readily distinguished using a combination of qualitative and quantitative morphological data. The difficulty of barcoding in this group is increased by the presence of intraspecific variation in all three regions studied. In the case of psbA-trnH, three intraspecific samples had a sequence identical to at least one other species. Overall, psbA-trnH, currently a candidate for plant barcoding, was the least discriminatory region in our study.
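The discrimination statistic reported above (the share of species whose sequence is identical to that of at least one other species, for a single region or for regions combined) can be illustrated with a short sketch. It rests on simplifying assumptions that are not taken from the study: one sequence per species, exact string identity as the criterion, and hypothetical function names `undiscriminated_fraction` and `combine_regions`. It ignores the intraspecific variation the abstract mentions.

```python
from collections import Counter


def undiscriminated_fraction(seq_by_species):
    """Fraction (and names) of species sharing their sequence with another species."""
    if not seq_by_species:
        return 0.0, []
    counts = Counter(seq_by_species.values())
    shared = sorted(sp for sp, s in seq_by_species.items() if counts[s] > 1)
    return len(shared) / len(seq_by_species), shared


def combine_regions(region_a, region_b):
    """Pair the two regions' sequences for species present in both datasets."""
    return {sp: (region_a[sp], region_b[sp]) for sp in region_a if sp in region_b}


if __name__ == "__main__":
    # Toy data, purely illustrative (not Aspalathus sequences).
    its = {"sp1": "ACGT", "sp2": "ACGT", "sp3": "ACGA", "sp4": "TTTT"}
    spacer = {"sp1": "GG", "sp2": "GC", "sp3": "GG", "sp4": "GG"}
    print(undiscriminated_fraction(its))
    print(undiscriminated_fraction(combine_regions(its, spacer)))
```

Combining regions as tuples means two species count as undiscriminated only if they match in every region, which mirrors the "when the two regions were combined" comparison in the abstract.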
Abstract:
The Fox genes are united by encoding a fork head domain, a deoxyribonucleic acid (DNA)-binding domain of the winged-helix type that marks these genes as encoding transcription factors. Vertebrate Fox genes are classified into 23 subclasses named from FoxA to FoxS. We have surveyed the genome of the amphioxus Branchiostoma floridae, identifying 32 distinct Fox genes representing 21 of these 23 subclasses. The missing subclasses, FoxR and FoxS, are specific to vertebrates, and in addition, B. floridae has one further group, FoxAB, that is not found in vertebrates. Hence, we conclude B. floridae has maintained a high level of Fox gene diversity. Expressed sequence tag and complementary DNA sequence data support the expression of 23 genes. Several linkages between B. floridae Fox genes were noted, including some that have evolved relatively recently via tandem duplication in the amphioxus lineage and others that are more ancient.
Abstract:
In this study, complementary species-level and intraspecific phylogenies were used to better circumscribe the original native range and history of translocation of the invasive tree Parkinsonia aculeata. Species-level phylogenies were reconstructed using three chloroplast gene regions, and amplified fragment length polymorphism (AFLP) markers were used to reconstruct the intraspecific phylogeny. Together, these phylogenies revealed the timescale of transcontinental lineage divergence and the likely source of recent introductions of the invasive. The sequence data showed that divergence between North American and Argentinean P. aculeata occurred at least 5.7 million years ago, refuting previous hypotheses of recent dispersal between North and South America. AFLP phylogenies revealed the most likely sources of naturalized populations. The AFLP data also identified putatively introgressed plants, underlining the importance of wide sampling of AFLPs and of comparison with uniparentally inherited marker data when investigating hybridizing groups. Although P. aculeata has generally been considered North American, these data show that the original native range of P. aculeata included South America; recent introductions to Africa and Australia are most likely to have occurred from South American populations.
Abstract:
The Cape Floristic Region is exceptionally species-rich both for its area and latitude, and this diversity is highly unevenly distributed among genera. The modern flora is hypothesized to result largely from recent (post-Oligocene) speciation, and it has long been speculated that particular species-poor lineages pre-date this burst of speciation. Here, we employ molecular phylogenetic data in combination with fossil calibrations to estimate the minimum duration of Cape occupation by 14 unrelated putative relicts. Estimates vary widely between lineages (7-101 Myr ago), and when compared with the estimated timing of onset of the modern flora's radiation, it is clear that many, but possibly not all, of these lineages pre-date its establishment. Statistical comparisons of diversities with lineage age show that low species diversity of many of the putative relicts results from a lower rate of diversification than in dated Cape radiations. In other putative relicts, however, we cannot reject the possibility that they diversify at the same underlying rate as the radiations, but have been present in the Cape for insufficient time to accumulate higher diversity. Although the extremes in diversity of currently dated Cape lineages fall outside expectations under a single underlying diversification rate, sampling of all Cape lineages would be required to reject this null hypothesis.
Abstract:
Phylogenetic hypotheses for the largely South African genus Pelargonium L'Hér. (Geraniaceae) were derived based on DNA sequence data from nuclear, chloroplast and mitochondrial encoded regions. The datasets were unequally represented and comprised cpDNA trnL-F sequences for 152 taxa, nrDNA ITS sequences for 55 taxa, and mtDNA nad1 b/c exons for 51 taxa. Phylogenetic hypotheses derived from the separate three datasets were overall congruent. A single hypothesis synthesising the information in the three datasets was constructed following a total evidence approach and implementing dataset specific stepmatrices in order to correct for substitution biases. Pelargonium was found to consist of five main clades, some with contrasting evolutionary patterns with respect to biogeographic distributions, dispersal capacity, pollination biology and karyological diversification. The five main clades are structured in two (subgeneric) clades that correlate with chromosome size. One of these clades includes a "winter rainfall clade" containing more than 70% of all currently described Pelargonium species, and all restricted to the South African Cape winter rainfall region. Apart from (woody) shrubs and small herbaceous rosette subshrubs, this clade comprises a large "xerophytic" clade including geophytes, stem and leaf succulents, harbouring in total almost half of the genus. This clade is considered to be the result of in situ proliferation, possibly in response to late-Miocene and Pliocene aridification events. Nested within it is a radiation comprising c. 80 species from the geophytic Pelargonium section Hoarea, all characterised by the possession of (a series of) tunicate tubers.
Abstract:
Nine different classifications have been produced in the last 70 years for the horticulturally valuable genus Cyclamen, a small genus with fewer than 30 species. These classifications, generated by intuitive methods and cladistic analyses, incorporated a total of four infrageneric ranks above that of species and were based on data from morphology, cytology and DNA sequencing. Our results, based on cladistic analyses of three independent data sources (nrDNA ITS, cpDNA trnL intron and morphological data), reveal good resolution only in nrDNA sequence data. However, when these three data sources are combined they provide stronger resolution and support for three major clades, only one of which, subgenus Psilanthum, has been consistently supported in previous classifications. The differing infrageneric classifications produced in Cyclamen result from varying taxon sampling, differing interpretation of morphological data, changes in the sources and analysis of data, and inconsistent application of names. Extensive subdivision of small genera in the absence of adequate data that could provide evidence for consistent patterns of relationship is premature and leads to a proliferation of names. © 2004 The Linnean Society of London, Botanical Journal of the Linnean Society, 2004, 146, 339-349.