955 resultados para phylogenetic analysis, complete genome, composition vector, correlation-related distance metric
Resumo:
Symbiotic associations with microorganisms are pivotal in many insects. Yet, the functional roles of obligate symbionts have been difficult to study because it has not been possible to cultivate these organisms in vitro. The medically important tsetse fly (Diptera: Glossinidae) relies on its obligate endosymbiont, Wigglesworthia glossinidia, a member of the Enterobacteriaceae, closely related to Escherichia coli, for fertility and possibly nutrition. We show here that the intracellular Wigglesworthia has a reduced genome size smaller than 770 kb. In an attempt to understand the composition of its genome, we used the gene arrays developed for E. coli. We were able to identify 650 orthologous genes in Wigglesworthia corresponding to ≈85% of its genome. The arrays were also applied for expression analysis using Wigglesworthia cDNA and 61 gene products were detected, presumably coding for some of its most abundant products. Overall, genes involved in cell processes, DNA replication, transcription, and translation were found largely retained in the small genome of Wigglesworthia. In addition, genes coding for transport proteins, chaperones, biosynthesis of cofactors, and some amino acids were found to comprise a significant portion, suggesting an important role for these proteins in its symbiotic life. Based on its expression profile, we predict that Wigglesworthia may be a facultative anaerobic organism that utilizes ammonia as its major source of nitrogen. We present an application of E. coli gene arrays to obtain broad genome information for a closely related organism in the absence of complete genome sequence data.
Resumo:
Os rotavírus do grupo A (RVA) são importantes causadores de diarreias virais em crianças e animais jovens de diferentes espécies, com impactos na saúde pública e animal. Visando contribuir para o entendimento e prevenção das rotaviroses assim como suas possíveis relações zoonóticas, caracterizou-se os 11 segmentos de dsRNA de rotavírus codificadores das proteínas estruturais e não estruturais presentes em amostras fecais positivas de suínos coletadas nos anos de 2012-2013, em 2 estados brasileiros. Mediante o emprego de RT-PCR, sequenciamento nucleotídico e análises filogenéticas, todos os segmentos genéticos oriundos de 12 amostras de RVA detectados em suínos foram analisados e comparados com os de outras amostras descritas previamente. As sequências obtidas para os genes codificadores das proteínas NSP2, NSP3 e VP6 contemplaram a open reading frame (ORF) completa do gene, enquanto que a ORF parcial foi determinada para os genes codificadores das proteínas VP1, VP2, VP3, VP4, VP7, NSP1, NSP4, NSP5 e NSP6. Os genotipos de rotavírus suíno provenientes das regiões amostradas concordam com os mais frequentemente descritos nesta espécie animal, apresentando, assim, uma matriz genética suína com a maioria dos segmentos pertencentes à constelação genotípica 1, com exceção dos genes codificadores das proteínas VP6 e NSP1, os quais foram os genotipos I5 e A8, respectivamente. Apesar de predominar o genotipo 1 (Wa-like) nas sequências deste estudo, a análise genômica sugere a existência de uma variação intragenotípica no genoma do rotavírus do grupo A atualmente circulante nas populações suína amostradas dos estados de São Paulo e Mato Grosso. Adicionalmente, buscou-se identificar os aminoácidos relacionados com a adaptação dos rotavírus no hospedeiro e assinaturas genéticas que distinguissem RVA suíno e humano. Para isso, as sequências obtidas neste estudo foram comparadas com outras cepas de RVA detectadas nestas duas espécies e pertencentes ao genotipo 1 (Wa-like) disponíveis no Genbank. Como resultados foram encontrados mais de 75 sítios de mudanças deaminoácidos que diferenciam RVA suíno e humano além de sítios de substituiçãopresentes em algumas proteínas virais que frequentemente covariaram entre elas. Estes resultados proporcionam um maior entendimento da diversidade viral circulante em unidades de produção suína e uma melhor compreensão dos animaiscomo reservatórios genéticos de cepas de rotavírus emergentes em humanos.
Resumo:
The diversity of the culturable microbial communities was examined in two sponge species-Pseudoceratina clavata and Rhabdastrella globostellata. Isolates were characterized by 16S rRNA gene sequencing and phylogenetic analysis. The bacterial community structures represented in both sponges were found to be similar at the phylum level by the same four phyla in this study and also at a finer scale at the species level in both Firmicutes and Alphaproteobacteria. The majority of the Alphaproteobacteria isolates were most closely related to isolates from other sponge species including alpha proteobacterium NW001 sp. and alpha proteobacterium MBIC3368. Members of the low %G + C gram-positive (phylum Firmicutes), high %G + C gram-positive (phylum Actinobacteria), and Cytophaga-Flavobacterium-Bacteroides (phylum Bacteroidetes) phyla of domain Bacteria were also represented in both sponges. In terms of culturable organisms, taxonomic diversity of the microbial community in the two sponge species displays similar structure at phylum level. Within phyla, isolates often belonged to the same genus-level monophyletic group. Community structure and taxonomic composition in the two sponge species P. clavata and Rha. globostellata share significant features with those of other sponge species including those from widely separated geographical and climatic regions of the sea.
Resumo:
We completed the genome sequence of Lettuce necrotic yellows virus (LNYV) by determining the nucleotide sequences of the 4a (putative phosphoprotein), 4b, M (matrix protein), G (glycoprotein) and L (polymerase) genes. The genome consists of 12,807 nucleotides and encodes six genes in the order 3' leader-N-4a(P)-4b-M-G-L-5' trailer. Sequences were derived from clones of a cDNA library from LNYV genomic RNA and from fragments amplified using reverse transcription-polymerase chain reaction. The 4a protein has a low isoelectric point characteristic for rhabdovirus phosphoproteins. The 4b protein has significant sequence similarities with the movement proteins of capillo- and trichoviruses and may be involved in cell-to-cell movement. The putative G protein sequence contains a predicted 25 amino acids signal peptide and endopeptidase cleavage site, three predicted glycosylation sites and a putative transmembrane domain. The deduced L protein sequence shows similarities with the L proteins of other plant rhabdoviruses and contains polymerase module motifs characteristic for RNA-dependent RNA polymerases of negative-strand RNA viruses. Phylogenetic analysis of this motif among rhabdoviruses placed LNYV in a group with other sequenced cytorhabdoviruses, most closely related to Strawberry crinkle virus. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
In Late summer 1999, an outbreak of human encephalitis occurred in the northeastern United States that was concurrent with extensive mortality in crows (Corvus species) as well as the deaths of several exotic birds at a zoological park in the same area. Complete genome sequencing of a flavivirus isolated from the brain of a dead Chilean flamingo (Phoenicopterus chilensis), together with partial sequence analysis of envelope glycoprotein (E-glycoprotein) genes amplified from several other species including mosquitoes and two fatal human cases, revealed that West Nile (WN) virus circulated in natural transmission cycles and was responsible for the human disease. Antigenic mapping with E-glycoprotein-specific monoclonal antibodies and E-glycoprotein phylogenetic analysis confirmed these viruses as WN. This North American WN virus was most closely related to a WN virus isolated from a dead goose in Israel in 1998.
Resumo:
This thesis describes two newly sequenced B. longum subsp. longum genomes and subsequent comparative analysis with publicly available B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis genomes (Chapter 2). The acquired data revealed a closed pan-genome for this bifidobacterial species and furthermore facilitated the definition of the B. longum core genome. The comparative analysis also highlights differences in the potential metabolic abilities of all three sub-species. Interestingly, phylogenetic analysis of the B. longum core genome indicated the existence of a novel B. longum subspecies. Characterisation of restriction-modification systems from two B. longum subsp. longum strains is described in Chapter 3. These defence mechanisms limit the uptake of genetic material, which was successfully demonstrated for some of the identified systems. When these systems were by-passed by methylation of DNA prior to the transformation procedure, the resulting transformation efficiency of both B. longum subsp. longum strains was increased to a level that allowed for the generation of mutants via homologous recombination. Arabinoxylan metabolism by B. longum subsp. longum NCIMB 8809 was investigated in Chapter 4 of this thesis. Transcriptome analysis allowed the identification of a number of genes involved in the degradation, uptake and utilisation of arabinoxylan. Biochemical analysis revealed that three of the identified genes encode arabinofuranosidase activity. Phenotypic assessment of a number of insertion mutants in genes identified by the transcriptome analysis revealed the essential role of two of these enzymes in arabinoxylan metabolism, and a third enzyme in the metabolism of debranched arabinan. Furthermore, this investigation revealed that B. longum subsp. longum NCIMB 8809 does not completely degrade arabinoxylan, but utilises the arabinose substitutions only, while leaving the xylan backbone untouched.Finally, Chapter 5 outlines that B. longum subsp. longum NCIMB 8809 is capable of removing ferulic and p-coumaric acid substitutions that originate from arabinoxylan. Analysis of the genome sequence led to the identification of a candidate gene for this activity, which was subsequently cloned and expressed in E. coli. Biochemical analysis revealed that the enzyme, designated here as FaeA, is indeed capable of releasing both ferulic and p-coumaric acid from arabinoxylan. Furthermore, it is shown that a derivative of B. longum subsp. longum NCIMB 8809 carrying an insertion mutation in faeA had lost the ability to release ferulic and p-coumaric acid from arabinoxylan, and that growth of this mutant strain is negatively affected when cultivated on growth-limiting levels of arabinoxylan.
Resumo:
The use of whole-genome phylogenetic analysis has revolutionized our understanding of the evolution and spread of many important bacterial pathogens due to the high resolution view it provides. However, the majority of such analyses do not consider the potential role of accessory genes when inferring evolutionary trajectories. Moreover, the recently discovered importance of the switching of gene regulatory elements suggests that an exhaustive analysis, combining information from core and accessory genes with regulatory elements could provide unparalleled detail of the evolution of a bacterial population. Here we demonstrate this principle by applying it to a worldwide multi-host sample of the important pathogenic E. coli lineage ST131. Our approach reveals the existence of multiple circulating subtypes of the major drug–resistant clade of ST131 and provides the first ever population level evidence of core genome substitutions in gene regulatory regions associated with the acquisition and maintenance of different accessory genome elements.
Resumo:
The genus Streptobacillus (S.) remained monotypic for almost 90 years until two new species were recently described. The type species, S. moniliformis, is one of the two etiological agents of rat bite fever, an under-diagnosed, worldwide occurring zoonosis. In a polyphasic approach field isolates and reference strains of S. moniliformis, S. hongkongensis, S. felis as well as divergent isolates were characterized by comparison of molecular data (n = 29) and from the majority also by their physiological as well as proteomic properties (n = 22). Based on growth-independent physiological profiling using VITEK2-compact, API ZYM and the Micronaut system fastidious growth-related difficulties could be overcome and streptobacilli could definitively be typed despite generally few differences. While differing in their isolation sites and dates, S. moniliformis isolates were found to possess almost identical spectra in matrix-assisted laser desorption ionization-time of flight mass spectrometry and Fourier transform infrared spectroscopy. Spectroscopic methods facilitated differentiation of S. moniliformis, S. hongkongensis and S. felis as well as one divergent isolate. Sequencing of 16S rRNA gene as well as functional genes groEL, recA and gyrB revealed only little intraspecific variability, but generally proved suitable for interspecies discrimination between all three taxa and two groups of divergent isolates.
Resumo:
The last decades of the 20th century defined the genetic engineering advent, climaxing in the development of techniques, such as PCR and Sanger sequencing. This, permitted the appearance of new techniques to sequencing whole genomes, identified as next-generation sequencing. One of the many applications of these techniques is the in silico search for new secondary metabolites, synthesized by microorganisms exhibiting antimicrobial properties. The peptide antibiotics compounds can be classified in two classes, according to their biosynthesis, in ribosomal or nonribosomal peptides. Lanthipeptides are the most studied ribosomal peptides and are characterized by the presence of lanthionine and methylanthionine that result from posttranslational modifications. Lanthipeptides are divided in four classes, depending on their biosynthetic machinery. In class I, a LanB enzyme dehydrate serine and threonine residues in the C-terminus precursor peptide. Then, these residues undergo a cyclization step performed by a LanC enzyme, forming the lanthionine rings. The cleavage and the transport of the peptide is achieved by the LanP and LanT enzymes, respectively. Although, in class II only one enzyme, LanM, is responsible for the dehydration and cyclization steps and also only one enzyme performs the cleavage and transport, LanT. Pedobacter sp. NL19 is a Gram-negative bacterium, isolated from sludge of an abandon uranium mine, in Viseu (Portugal). Antibacterial activity in vitro was detected against several Gram-positive and Gram-negative bacteria. Sequencing and in silico analysis of NL19 genome revealed the presence of 21 biosynthetic clusters for secondary metabolites, including nonribosomal and ribosomal peptides biosynthetic clusters. Four lanthipeptides clusters were predicted, comprising the precursor peptides, the modifying enzymes (LanB and LanC), and also a bifunctional LanT. This result revealed the hybrid nature of the clusters, comprising characteristics from two distinct classes, which are poorly described in literature. The phylogenetic analysis of their enzymes showed that they clustered within the bacteroidetes clade. Furthermore, hybrid gene clusters were also found in other species of this phylum, revealing that it is a common characteristic in this group. Finally, the analysis of NL19 colonies by MALDI-TOF MS allowed the identification of a 3180 Da mass that corresponds to the predicted mass of a lanthipeptide encoded in one of the clusters. However, this result is not fully conclusive and further experiments are needed to understand the full potential of the compounds encoded in this type of clusters. In conclusion, it was determined that NL19 strain has the potential to produce diverse secondary metabolites, including lanthipeptides that were not functionally characterized so far.
Resumo:
We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility ( het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer ( HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated ""gene dumps'' and, perhaps, simultaneously, as "" gene factories''.
Resumo:
Background: Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. Results: The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110) diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs) revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp), a target of juvenile hormone (JH). The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste-and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Conclusions: Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body and gonads suggest that, in addition to their primary role in supplying amino acids for metamorphosis, hexamerins serve as storage proteins for gonad development, egg production, and to support foraging activity. A phylogenetic analysis including the four deduced hexamerins and related proteins revealed a complex pattern of evolution, with independent radiation in insect orders.
Resumo:
The dengue virus has a single-stranded positive-sense RNA genome of similar to 10.700 nucleotides with a single open reading frame that encodes three structural (C, prM, and E) and seven nonstructural (NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5) proteins. It possesses four antigenically distinct serotypes (DENV 1-4). Many phylogenetic studies address particularities of the different serotypes using convenience samples that are not conducive to a spatio-temporal analysis in a single urban setting. We describe the pattern of spread of distinct lineages of DENV-3 circulating in Sao Jose do Rio Preto, Brazil, during 2006. Blood samples from patients presenting dengue-like symptoms were collected for DENV testing. We performed M-N-PCR using primers based on NS5 for virus detection and identification. The fragments were purified from PCR mixtures and sequenced. The positive dengue cases were geo-coded. To type the sequenced samples, 52 reference sequences were aligned. The dataset generated was used for iterative phylogenetic reconstruction with the maximum likelihood criterion. The best demographic model, the rate of growth, rate of evolutionary change, and Time to Most Recent Common Ancestor (TMRCA) were estimated. The basic reproductive rate during the epidemics was estimated. We obtained sequences from 82 patients among 174 blood samples. We were able to geo-code 46 sequences. The alignment generated a 399-nucleotide-long dataset with 134 taxa. The phylogenetic analysis indicated that all samples were of DENV-3 and related to strains circulating on the isle of Martinique in 2000-2001. Sixty DENV-3 from Sao Jose do Rio Preto formed a monophyletic group (lineage 1), closely related to the remaining 22 isolates (lineage 2). We assumed that these lineages appeared before 2006 in different occasions. By transforming the inferred exponential growth rates into the basic reproductive rate, we obtained values for lineage 1 of R(0) = 1.53 and values for lineage 2 of R(0) = 1.13. Under the exponential model, TMRCA of lineage 1 dated 1 year and lineage 2 dated 3.4 years before the last sampling. The possibility of inferring the spatio-temporal dynamics from genetic data has been generally little explored, and it may shed light on DENV circulation. The use of both geographic and temporally structured phylogenetic data provided a detailed view on the spread of at least two dengue viral strains in a populated urban area.
Resumo:
We present here the sequence of the mitochondrial genome of the basidiomycete phytopathogenic hemibiotrophic fungus Moniliophthora perniciosa, causal agent of the Witches` Broom Disease in Theobroma cacao. The DNA is a circular molecule of 109103 base pairs, with 31.9 % GC, and is the largest sequenced so far. This size is due essentially to the presence of numerous non-conserved hypothetical ORFs. It contains the 14 genes coding for proteins involved in the oxidative phosphorylation, the two rRNA genes, one ORF coding for a ribosomal protein (rps3), and a set of 26 tRNA genes that recognize codons for all amino acids. Seven homing endonucleases are located inside introns. Except atp8, all conserved known genes are in the same orientation. Phylogenetic analysis based on the cox genes agrees with the commonly accepted fungal taxonomy. An uncommon feature of this mitochondrial genome is the presence of a region that contains a set of four, relatively small, nested, inverted repeats enclosing two genes coding for polymerases with an invertron-type structure and three conserved hypothetical genes interpreted as the stable integration of a mitochondrial linear plasmid. The integration of this plasmid seems to be a recent evolutionary event that could have implications in fungal biology. This sequence is available under GenBank accession number AY376688. (c) 2008 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Resumo:
The proposed method to analyze the composition of the cost of electricity is based on the energy conversion processes and the destruction of the exergy through the several thermodynamic processes that comprise a combined cycle power plant. The method uses thermoeconomics to evaluate and allocate the cost of exergy throughout the processes, considering costs related to inputs and investment in equipment. Although the concept may be applied to any combined cycle or cogeneration plant, this work develops only the mathematical modeling for three-pressure heat recovery steam generator (HRSG) configurations and total condensation of the produced steam. It is possible to study any n x 1 plant configuration (n sets of gas turbine and HRSGs associated to one steam turbine generator and condenser) with the developed model, assuming that every train operates identically and in steady state. The presented model was conceived from a complex configuration of a real power plant, over which variations may be applied in order to adapt it to a defined configuration under study [Borelli SJS. Method for the analysis of the composition of electricity costs in combined cycle thermoelectric power plants. Master in Energy Dissertation, Interdisciplinary Program of Energy, Institute of Eletro-technical and Energy, University of Sao Paulo, Sao Paulo, Brazil, 2005 (in Portuguese)]. The variations and adaptations include, for instance, use of reheat, supplementary firing and partial load operation. It is also possible to undertake sensitivity analysis on geometrical equipment parameters. (C) 2007 Elsevier Ltd. All rights reserved.
Resumo:
Bioethanol is a biofuel produced mainly from the fermentation of carbohydrates derived from agricultural feedstocks by the yeast Saccharomyces cerevisiae. One of the most widely adopted strains is PE-2, a heterothallic diploid naturally adapted to the sugar cane fermentation process used in Brazil. Here we report the molecular genetic analysis of a PE-2 derived diploid (JAY270), and the complete genome sequence of a haploid derivative (JAY291). The JAY270 genome is highly heterozygous (similar to 2 SNPs/kb) and has several structural polymorphisms between homologous chromosomes. These chromosomal rearrangements are confined to the peripheral regions of the chromosomes, with breakpoints within repetitive DNA sequences. Despite its complex karyotype, this diploid, when sporulated, had a high frequency of viable spores. Hybrid diploids formed by outcrossing with the laboratory strain S288c also displayed good spore viability. Thus, the rearrangements that exist near the ends of chromosomes do not impair meiosis, as they do not span regions that contain essential genes. This observation is consistent with a model in which the peripheral regions of chromosomes represent plastic domains of the genome that are free to recombine ectopically and experiment with alternative structures. We also explored features of the JAY270 and JAY291 genomes that help explain their high adaptation to industrial environments, exhibiting desirable phenotypes such as high ethanol and cell mass production and high temperature and oxidative stress tolerance. The genomic manipulation of such strains could enable the creation of a new generation of industrial organisms, ideally suited for use as delivery vehicles for future bioenergy technologies.