968 resultados para EUKARYOTIC GENOMES
Resumo:
The predicted secondary structure of sub-genomic RNA in dengue virus defective interfering (D.I.) particles from patients, or generated in vitro, resembled that of the 3′ and 5′ regions of wild type dengue virus (DENV) genomes. While these structures in the sub-genomic RNA were found to be essential for its replication, their nucleotide sequences were not, so long as any new sequences maintained wild type RNA secondary structure. These observations suggested that these sub-genomic fragments of RNA from dengue viruses were replicated in the same manner as the full length genomes of their wild type, “helper”, viruses and that they probably represent the smallest fragments of DENV RNA that can be replicated during a natural infection. While D.I. particles containing sub-genomic RNA are completely parasitic, the relationship between wild type and D.I. DENV may be symbiotic, with the D.I. particles enhancing the transmission of infectious DENV.
Resumo:
Extracellular polysaccharides are major immunogenic components of the bacterial cell envelope. However, little is known about their biosynthesis in the genus Acinetobacter, which includes A. baumannii, an important nosocomial pathogen. Whether Acinetobacter sp. produce a capsule or a lipopolysaccharide carrying an O antigen or both is not resolved. To explore these issues, genes involved in the synthesis of complex polysaccharides were located in 10 complete A. baumannii genome sequences, and the function of each of their products was predicted via comparison to enzymes with a known function. The absence of a gene encoding a WaaL ligase, required to link the carbohydrate polymer to the lipid A-core oligosaccharide (lipooligosaccharide) forming lipopolysaccharide, suggests that only a capsule is produced. Nine distinct arrangements of a large capsule biosynthesis locus, designated KL1 to KL9, were found in the genomes. Three forms of a second, smaller variable locus, likely to be required for synthesis of the outer core of the lipid A-core moiety, were designated OCL1 to OCL3 and also annotated. Each K locus includes genes for capsule export as well as genes for synthesis of activated sugar precursors, and for glycosyltransfer, glycan modification and oligosaccharide repeat-unit processing. The K loci all include the export genes at one end and genes for synthesis of common sugar precursors at the other, with a highly variable region that includes the remaining genes in between. Five different capsule loci, KL2, KL6, KL7, KL8 and KL9 were detected in multiply antibiotic resistant isolates belonging to global clone 2, and two other loci, KL1 and KL4, in global clone 1. This indicates that this region is being substituted repeatedly in multiply antibiotic resistant isolates from these clones.
Resumo:
Lipooligosaccharide (LOS) is a complex surface structure that is linked to many pathogenic properties of Acinetobacter baumannii. In A. baumannii, the genes responsible for the synthesis of the outer core (OC) component of the LOS are located between ilvE and aspS. The content of the OC locus is usually variable within a species, and examination of 6 complete and 227 draft A. baumannii genome sequences available in GenBank non-redundant and Whole Genome Shotgun databases revealed nine distinct new types, OCL4-OCL12, in addition to the three known ones. The twelve gene clusters fell into two distinct groups, designated Group A and Group B, based on similarities in the genes present. OCL6 (Group B) was unique in that it included genes for the synthesis of L-Rhamnosep. Genetic exchange of the different configurations between strains has occurred as some OC forms were found in several different sequence types (STs). OCL1 (Group A) was the most widely distributed being present in 18 STs, and OCL6 was found in 16 STs. Variation within clones was also observed, with more than one OC locus type found in the two globally disseminated clones, GC1 and GC2, that include the majority of multiply antibiotic resistant isolates. OCL1 was the most abundant gene cluster in both GC1 and GC2 genomes but GC1 isolates also carried OCL2, OCL3 or OCL5, and OCL3 was also present in GC2. As replacement of the OC locus in the major global clones indicates the presence of sub-lineages, a PCR typing scheme was developed to rapidly distinguish Group A and Group B types, and to distinguish the specific forms found in GC1 and GC2 isolates.
Resumo:
A phylogenetic hypothesis for the lepidopteran superfamily Noctuoidea was inferred based on the complete mitochondrial (mt) genomes of 12 species (six newly sequenced). The monophyly of each noctuoid family in the latest classification was well supported. Novel and robust relationships were recovered at the family level, in contrast to previous analyses using nuclear genes. Erebidae was recovered as sister to (Nolidae+(Euteliidae+Noctuidae)), while Notodontidae was sister to all these taxa (the putatively basalmost lineage Oenosandridae was not included). In order to improve phylogenetic resolution using mt genomes, various analytical approaches were tested: Bayesian inference (BI) vs. maximum likelihood (ML), excluding vs. including RNA genes (rRNA or tRNA), and Gblocks treatment. The evolutionary signal within mt genomes had low sensitivity to analytical changes. Inference methods had the most significant influence. Inclusion of tRNAs positively increased the congruence of topologies, while inclusion of rRNAs resulted in a range of phylogenetic relationships varying depending on other analytical factors. The two Gblocks parameter settings had opposite effects on nodal support between the two inference methods. The relaxed parameter (GBRA) resulted in higher support values in BI analyses, while the strict parameter (GBDH) resulted in higher support values in ML analyses.
Resumo:
Termites have colonized many habitats and are among the most abundant animals in tropical ecosystems, which they modify considerably through their actions. The timing of their rise in abundance and of the dispersal events that gave rise to modern termite lineages is not well understood. To shed light on termite origins and diversification, we sequenced the mitochondrial genome of 48 termite species and combined them with 18 previously sequenced termite mitochondrial genomes for phylogenetic and molecular clock analyses using multiple fossil calibrations. The 66 genomes represent most major clades of termites. Unlike previous phylogenetic studies based on fewer molecular data, our phylogenetic tree is fully resolved for the lower termites. The phylogenetic positions of Macrotermitinae and Apicotermitinae are also resolved as the basal groups in the higher termites, but in the crown termitid groups, including Termitinae + Syntermitinae + Nasutitermitinae + Cubitermitinae, the position of some nodes remains uncertain. Our molecular clock tree indicates that the lineages leading to termites and Cryptocercus roaches diverged 170 Ma (153-196 Ma 95% confidence interval [CI]), that modern Termitidae arose 54 Ma (46-66 Ma 95% CI), and that the crown termitid group arose 40 Ma (35-49 Ma 95% CI). This indicates that the distribution of basal termite clades was influenced by the final stages of the breakup of Pangaea. Our inference of ancestral geographic ranges shows that the Termitidae, which includes more than 75% of extant termite species, most likely originated in Africa or Asia, and acquired their pantropical distribution after a series of dispersal and subsequent diversification events.
Resumo:
Whole genome sequences are generally accepted as excellent tools for studying evolutionary relationships. Due to the problems caused by the uncertainty in alignment, existing tools for phylogenetic analysis based on multiple alignments could not be directly applied to the whole-genome comparison and phylogenomic studies. There has been a growing interest in alignment-free methods for phylogenetic analysis using complete genome data. The “distances” used in these alignment-free methods are not proper distance metrics in the strict mathematical sense. In this study, we first review them in a more general frame — dissimilarity. Then we propose some new dissimilarities for phylogenetic analysis. Last three genome datasets are employed to evaluate these dissimilarities from a biological point of view.
Resumo:
The uses of genetic sequences to inform, enable or create products or services for human biomedicine are substantially different from their uses in crop-based agriculture. Here, we explore what similarities and differences may emerge in patent use and strategies, and map patent-disclosed sequences onto three important plant genomes: maize (corn), rice and soybean. We focus on those referenced in the granted patent claims to compare their uses to the approach used in human gene patenting.
Resumo:
Guanylyl cyclases (GCs) are enzymes that generate cyclic GMP and regulate different physiologic and developmental processes in a number of organisms. GCs possess sequence similarity to class III adenylyl cyclases (ACs) and are present as either membrane-bound receptor GCs or cytosolic soluble GCs. We sought to determine the evolution of GCs using a large-scale bioinformatic analysis and found multiple lineage-specific expansions of GC genes in the genomes of many eukaryotes. Moreover, a few GC-like proteins were identified in prokaryotes, which come fused to a number of different domains, suggesting allosteric regulation of nucleotide cyclase activity Eukaryotic receptor GCs are associated with a kinase homology domain (KHD), and phylogenetic analysis of these proteins suggest coevolution of the KHD and the associated cyclase domain as well as a conservation of the sequence and the size of the linker region between the KHD and the associated cyclase domain. Finally, we also report the existence of mimiviral proteins that contain putative active kinase domains associated with a cyclase domain, which could suggest early evolution of the fusion of these two important domains involved in signa transduction.
Resumo:
Postnatal myofibre characteristics and muscle mass are largely determined during fetal development and may be significantly affected by epigenetic parent-of-origin effects. However, data on such effects in prenatal muscle development that could help understand unexplained variation in postnatal muscle traits are lacking. In a bovine model we studied effects of distinct maternal and paternal genomes, fetal sex, and non-genetic maternal effects on fetal myofibre characteristics and muscle mass. Data from 73 fetuses (Day153, 54% term) of four genetic groups with purebred and reciprocal cross Angus and Brahman genetics were analyzed using general linear models. Parental genomes explained the greatest proportion of variation in myofibre size of Musculus semitendinosus (80-96%) and in absolute and relative weights of M. supraspinatus, M. longissimus dorsi, M. quadriceps femoris and M. semimembranosus (82-89% and 56-93%, respectively). Paternal genome in interaction with maternal genome (P<0.05) explained most genetic variation in cross sectional area (CSA) of fast myotubes (68%), while maternal genome alone explained most genetic variation in CSA of fast myofibres (93%, P<0.01). Furthermore, maternal genome independently (M. semimembranosus, 88%, P<0.0001) or in combination (M. supraspinatus, 82%; M. longissimus dorsi, 93%; M. quadriceps femoris, 86%) with nested maternal weight effect (5-6%, P<0.05), was the predominant source of variation for absolute muscle weights. Effects of paternal genome on muscle mass decreased from thoracic to pelvic limb and accounted for all (M. supraspinatus, 97%, P<0.0001) or most (M. longissimus dorsi, 69%, P<0.0001; M. quadriceps femoris, 54%, P<0.001) genetic variation in relative weights. An interaction between maternal and paternal genomes (P<0.01) and effects of maternal weight (P<0.05) on expression of H19, a master regulator of an imprinted gene network, and negative correlations between H19 expression and fetal muscle mass (P<0.001), suggested imprinted genes and miRNA interference as mechanisms for differential effects of maternal and paternal genomes on fetal muscle.
Resumo:
We report here the structures and properties of heat-stable, non-protein, and mammalian cell-toxic compounds produced by spore-forming bacilli isolated from indoor air of buildings and from food. Little information is available on the effects and occurrence of heat-stable non-protein toxins produced by bacilli in moisture-damaged buildings. Bacilli emit spores that move in the air and can serve as the carriers of toxins, in a manner similar to that of the spores of toxic fungi found in contaminated indoor air. Bacillus spores in food cause problems because they tolerate the temperatures applied in food manufacture and the spores later initiate growth when food storage conditions are more favorable. Detection of the toxic compounds in Bacillus is based on using the change in mobility of boar spermatozoa as an indicator of toxic exposure. GC, LC, MS, and nuclear magnetic resonance NMR spectroscopy were used for purification, detection, quantitation, and analysis of the properties and structures of the compounds. Toxicity and the mechanisms of toxicity of the compounds were studied using boar spermatozoa, feline lung cells, human neural cells, and mitochondria isolated from rat liver. The ionophoric properties were studied using the BLM (black-lipid membrane) method. One novel toxin, forming ion channels permeant to K+ > Na+ > Ca2+, was found and named amylosin. It is produced by B. amyloliquefaciens isolated from indoor air of moisture-damaged buildings. Amylosin was purified with an RP-HPLC and a monoisotopic mass of 1197 Da was determined with ESI-IT-MS. Furthermore, acid hydrolysis of amylosin followed by analysis of the amino acids with the GS-MS showed that it was a peptide. The presence of a chromophoric polyene group was found using a NMR spectroscopy. The quantification method developed for amylosin based on RP-HPLC-UV, using the macrolactone polyene, amphotericin B (MW 924), as a reference compound. The B. licheniformis strains isolated from a food poisoning case produced a lipopeptide, lichenysin A, that ruptured mammalian cell membranes and was purified with a LC. Lichenysin A was identified by its protonated molecules and sodium- and potassium- cationized molecules with MALDI-TOF-MS. Its protonated forms were observed at m/z 1007, 1021 and 1035. The amino acids of lichenysin A were analyzed with ESI-TQ-MS/MS and, after acid hydrolysis, the stereoisomeric forms of the amino acids with RP-HPLC. The indoor air isolates of the strain of B. amyloliquefaciens produced not only amylosin but also lipopeptides: the cell membrane-damaging surfactin and the fungicidal fengycin. They were identified with ESI-IT-MS observing their protonated molecules, the sodium- and potassium-cationized molecules and analysing the MS/MS spectra. The protonated molecules of surfactin and fengycin showed m/z values of 1009, 1023, and 1037 and 1450, 1463, 1493, and 1506, respectively. Cereulide (MW 1152) was purified with RP-HPLC from a food poisoning strain of B. cereus. Cereulide was identified with ESI-TQ-MS according to the protonated molecule observed at m/z 1154 and the ammonium-, sodium- and potassium-cationized molecules observed at m/z 1171, 1176, and 1192, respectively. The fragment ions of the MS/MS spectrum obtained from the protonated molecule of cereulide at m/z 1154 were also interpreted. We developed a quantification method for cereulide, using RP-HPLC-UV and valinomycin (MW 1110, which structurally resembles cereulide) as the reference compound. Furthermore, we showed empirically, using the BLM method, that the emetic toxin cereulide is a specific and effective potassium ionophore of whose toxicity target is especially the mitochondria.
Resumo:
Viral genomes are encapsidated within protective protein shells. This encapsidation can be achieved either by a co-condensation reaction of the nucleic acid and coat proteins, or by first forming empty viral particles which are subsequently packaged with nucleic acid, the latter mechanism being typical for many dsDNA bacteriophages. Bacteriophage PRD1 is an icosahedral, non-tailed dsDNA virus that has an internal lipid membrane, the hallmark of the Tectiviridae family. Although PRD1 has been known to assemble empty particles into which the genome is subsequently packaged, the mechanism for this has been unknown, and there has been no evidence for a separate packaging vertex, similar to the portal structures used for packaging in the tailed bacteriophages and herpesviruses. In this study, a unique DNA packaging vertex was identified for PRD1, containing the packaging ATPase P9, packaging factor P6 and two small membrane proteins, P20 and P22, extending the packaging vertex to the internal membrane. Lack of small membrane protein P20 was shown to totally abolish packaging, making it an essential part of the PRD1 packaging mechanism. The minor capsid proteins P6 was shown to be an important packaging factor, its absence leading to greatly reduced packaging efficiency. An in vitro DNA packaging mechanism consisting of recombinant packaging ATPase P9, empty procapsids and mutant PRD1 DNA with a LacZ-insert was developed for the analysis of PRD1 packaging, the first such system ever for a virus containing an internal membrane. A new tectiviral sequence, a linear plasmid called pBClin15, was identified in Bacillus cereus, providing material for sequence analysis of the tectiviruses. Analysis of PRD1 P9 and other putative tectiviral ATPase sequences revealed several conserved sequence motifs, among them a new tectiviral packaging ATPase motif. Mutagenesis studies on PRD1 P9 were used to confirm the significance of the motifs. P9-type putative ATPase sequences carrying a similar sequence motif were identified in several other membrane containing dsDNA viruses of bacterial, archaeal and eukaryotic hosts, suggesting that these viruses may have similar packaging mechanisms. Interestingly, almost the same set of viruses that were found to have similar putative packaging ATPases had earlier been found to share similar coat protein folds and capsid structures, and a common origin for these viruses had been suggested. The finding in this study of similar packaging proteins further supports the idea that these viruses are descendants of a common ancestor.
Resumo:
The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool `PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.
Resumo:
DNA topoisomerases are ubiquitous nuclear enzymes that govern the topological interconversions of DNA by transiently breaking/rejoining the phosphodiester backbone of one (type I) or both (type II) strands of the double helix. Consistent with these functions, topoisomerases play key roles in many aspects of DNA metabolism. Type II DNA topoisomerase (topo II) is vital for various nuclear processes, including DNA replication, chromosome segregation, and maintenance of chromosome structure. Topo II expression is regulated at multiple stages, including transcriptional, posttranscriptional, and posttranslational levels, by a multitude of signaling factors. Topo II is also the cellular target for a variety of clinically relevant anti-tumor drugs. Despite significant progress in our understanding of the role of topo II in diverse nuclear processes, several important aspects of topo II function, expression, and regulation are poorly understood. We have focused this review specifically on eukaryotic DNA topoisomerase II, with an emphasis on functional and regulatory characteristics.
Resumo:
The csrA is a carbon storage regulator gene that encodes a protein with multiple RNA interaction sites. Bacterial non-coding small RNAs like csrB, csrC and their counterparts in diverse bacterial genus are identified to control the regulatory activities of CsrA and its orthologs. An attempt has been made in this study to identify 'novel' non-coding small RNAs that are involved in the regulatory activities of csrA gene. All CsrA-interacting small RNAs are computationally fingerprinted to have multiple occurrence of 7-nucleotide CsrA interacting repeats [CAGGA(U/A/C)G] along with a 18-nucleotide upstream binding site. However, in several of the genomes like Haemophilus spp, the upstream binding site is not identified. The current methodology overcomes this difficulty by identifying small RNA-specific orphan transcriptional units within the intergenic regions of the genome. The results could identify all known CsrA-interacting small RNAs in E. coli, Vibrio cholerae and Pseudomonas aeruginosa genomes and additionally has picked six new possible CsrA-interacting small RNA regions in E. coli. Our computational analysis indicates that known rygD and rprA sRNAs in E. coli could possibly interact with CsrA proteins. The study is extended to three of the Haemophilus genomes that could identify seven new possible CsrA interacting small RNAs.
Resumo:
Rpb4, the fourth largest subunit of the eukaryotic RNA polymerase II (RNAPII), is required for growth at extreme temperatures and for an appropriate response to nutrient starvation in yeast. Sequence homologs of Rpb4 are found in most sequenced genomes from yeast to humans. To elucidate the role of this subunit in nutrient starvation, we chose Dictyostelium discoideum, a soil amoeba, which responds to nutrient deprivation by undergoing a complex developmental program. Here we report the identification of homolog of Saccharomyces cerevisiae RPB4 in D. discoideum. Localization and complementation studies suggest that Rpb4 is functionally conserved. DdRPB4 transcript and protein levels are developmentally regulated. Although DdRPB4 could not be deleted, overexpression revealed that the Rpb4 protein is essential for cell survival and is regulated stringently at the post-transcriptional level in D. discoideum. Thus maintaining a critical level of Rpb4 is important for this organism.