18 results for sequence similarity searches
in CentAUR: Central Archive at the University of Reading - UK
Abstract:
The genome of Salmonella enterica serovar Enteritidis was shown to possess three IS3-like insertion elements, designated IS1230A, B and C, and each was cloned and their respective deoxynucleotide sequences determined. Mutations in elements IS1230A and B resulted in frameshifts in the open reading frames encoding a putative transposase, presumably rendering it inactive. IS1230C was truncated at nucleotide 774 relative to IS1230B and therefore did not possess the 3' terminal inverted repeat. The three IS1230 derivatives were closely related to each other based on nucleotide sequence similarity. IS1230A was located adjacent to the sef operon encoding SEF14 fimbriae located at minute 97 of the genome of S. Enteritidis. IS1230B was located adjacent to the umuDC operon at minute 42.5 on the genome, itself located near to one terminus of an 815-kb genome inversion of S. Enteritidis relative to S. Typhimurium. IS1230C was located next to attB, the bacteriophage P22 attachment site, and proB, encoding gamma-glutamyl phosphate reductase. A truncated 3' remnant of IS1230, designated IS1230T, was identified in a clinical isolate of S. Typhimurium DT193 strain 2391. This element was located next to attB adjacent to which were bacteriophage P22-like sequences. Southern hybridisation of total genomic DNA from eighteen phage types of S. Enteritidis and eighteen definitive types of S. Typhimurium showed similar, if not identical, restriction fragment profiles in the respective serovars when probed with IS1230A.
Abstract:
BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a powerful tool for genome-wide transcription studies. Unlike microarrays, it has the ability to detect novel forms of RNA such as alternatively spliced and antisense transcripts, without the need for prior knowledge of their existence. One limitation of using SAGE on an organism with a complex genome and lacking detailed sequence information, such as the hexaploid bread wheat Triticum aestivum, is accurate annotation of the tags generated. Without accurate annotation it is impossible to fully understand the dynamic processes involved in such complex polyploid organisms. Hence we have developed and utilised novel procedures to characterise, in detail, SAGE tags generated from the whole grain transcriptome of hexaploid wheat. RESULTS: Examination of 71,930 Long SAGE tags generated from six libraries derived from two wheat genotypes grown under two different conditions suggested that SAGE is a reliable and reproducible technique for use in studying the hexaploid wheat transcriptome. However, our results also showed that in poorly annotated and/or poorly sequenced genomes, such as hexaploid wheat, considerably more information can be extracted from SAGE data by carrying out a systematic analysis of both perfect and "fuzzy" (partially matched) tags. This detailed analysis of the SAGE data shows first that while there is evidence of alternative polyadenylation this appears to occur exclusively within the 3' untranslated regions. Secondly, we found no strong evidence for widespread alternative splicing in the developing wheat grain transcriptome. However, analysis of our SAGE data shows that antisense transcripts are probably widespread within the transcriptome and appear to be derived from numerous locations within the genome. 
Examination of antisense transcripts showing sequence similarity to the Puroindoline a and Puroindoline b genes suggests that such antisense transcripts might have a role in the regulation of gene expression. CONCLUSION: Our results indicate that the detailed analysis of transcriptome data, such as SAGE tags, is essential to understand fully the factors that regulate gene expression and that such analysis of the wheat grain transcriptome reveals that antisense transcripts may be widespread and hence probably play a significant role in the regulation of gene expression during grain development.
Abstract:
It is known that germin, which is a marker of the onset of growth in germinating wheat, is an oxalate oxidase, and also that germins possess sequence similarity with legumin and vicilin seed storage proteins. These two pieces of information have been combined in order to generate a 3D model of germin based on the structure of vicilin and to examine the model with regard to a potential oxalate oxidase active site. A cluster of three histidine residues has been located within the conserved beta-barrel structure. While there is a relatively low level of overall sequence similarity between the model and the vicilin structures, the conservation of amino acids important in maintaining the scaffold of the beta-barrel lends confidence to the juxtaposition of the histidine residues. The cluster is similar structurally to those found in copper amine oxidase and other proteins, leading to the suggestion that it defines a metal-binding location within the oxalate oxidase active site. It is also proposed that the structural elements involved in intermolecular interactions in vicilins may play a role in oligomer formation in germin/oxalate oxidase.
Abstract:
There is a strong desire to exploit transcriptomics data from model species for the genetic improvement of non-model crops. Here, we use gene expression profiles from the commercial model Pinus taeda to identify candidate genes implicated in juvenile-mature wood transition in the non-model relative, P. sylvestris. Re-analysis of 'public domain' SAGE data from xylem tissues of P. taeda revealed 283 mature-abundant and 396 juvenile-abundant tags (P < 0.01), of which 70 and 137, respectively, matched to genes with known function. Based on sequence similarity, we then isolated 16 putative homologues of genes that in P. taeda exhibited widest divergence in expression between juvenile and mature samples. Candidate expression levels in P. sylvestris were almost invariably differential between juvenile and mature woody tissue samples among two cohorts of five trees collected from the same seed source and selected for genetic uniformity by genetic distance analysis. However, the direction of differential expression was not always consistent with that described in the original P. taeda SAGE data. Correlation was observed between gene expression and juvenile-mature wood anatomical characteristics by OPLS analysis. Four candidates (alpha-tubulin, porin MIP1, lipid transfer protein and aquaporin-like protein) apparently had greatest influence on the wood traits measured. Speculative function of these genes in relation to juvenile-mature wood transition is briefly explored. Thus, we demonstrate the feasibility of exploiting SAGE data from a model species to identify consistently differentially expressed candidates in a related non-model species.
Abstract:
Four Gram-positive-staining, strictly anaerobic, non-spore-forming, rod-shaped organisms were isolated from a pig manure storage pit. Comparative 16S rRNA gene sequence analysis revealed that the isolates belonged to two related but distinct groups. Sequence analysis showed that the two groups of isolates were highly related to each other (approx. 97% 16S rRNA gene sequence similarity), forming a distinct cluster within the Clostridium coccoides suprageneric rDNA grouping. Biochemical and physiological studies confirmed the division of the isolates into two related, albeit distinct, groups. Based on both phenotypic and phylogenetic evidence, it is proposed that the unidentified rod-shaped isolates from pig manure should be classified in a novel genus, Hespellia gen. nov., as Hespellia stercorisuis sp. nov. and Hespellia porcina sp. nov. The type species of the novel genus is H. stercorisuis (type strain, PC18(T) = NRRL B-23456(T) = CCUG 46279(T) = ATCC BAA-677(T)) and the type strain of H. porcina is PC80(T) (= NRRL B-23458(T) = ATCC BAA-674(T)).
Abstract:
Unusual Gram-negative, catalase- and oxidase-positive, coccus-shaped bacteria isolated from the lungs of two lambs were characterized by phenotypic and molecular-genetic methods. Comparative 16S rRNA gene sequencing studies demonstrated that the unknown isolates were genealogically highly related to each other (99.8% sequence similarity) and represent a novel subline within the genus Psychrobacter. The unknown bacterium was phylogenetically closely related to, but distinct from, Psychrobacter phenylpyruvicus, Psychrobacter immobilis, Psychrobacter glacincola and Psychrobacter urativorans. The novel Psychrobacter isolates were readily distinguished from all other Psychrobacter species and other Gram-negative, oxidase-positive bacteria usually responsible for lung infections in sheep by physiological and biochemical tests. Based on molecular-genetic and phenotypic evidence, it is proposed that the unknown Psychrobacter isolates from lambs be classified as Psychrobacter pulmonis sp. nov. The type strain is strain S-606(T) (= CECT 5989(T) = CCUG 46240(T)).
Abstract:
Seven obligately anaerobic, gram-positive, rod-shaped, spore-forming organisms isolated from human sources were characterized using phenotypic and molecular taxonomic methods. Comparative 16S rRNA gene sequencing showed that the strains were genetically highly related to each other (displaying >99% sequence similarity) and represent a previously unknown sub-line within the Clostridium coccoides rRNA group of organisms. Strains of the unidentified bacterium used carbohydrate as fermentable substrates, producing acetic acid and lactic acid as the major products of glucose metabolism. The closest described species to the novel bacterium corresponded to Clostridium clostridioforme, although a 16S rRNA sequence divergence of 3% demonstrated they represent different species. Genomic DNA-DNA pairing studies confirmed the separateness of the unknown species and Clostridium clostridioforme. Based on phenotypic and phylogenetic evidence, it is therefore proposed that the unknown bacterium be classified as Clostridium bolteae sp. nov. The type strain of Clostridium bolteae is WAL 16351(T) (= ATCC BAA-613(T) = CCUG 46953(T)).
Abstract:
Phenotypic and phylogenetic studies were performed on four isolates of an unidentified gram-negative, microaerotolerant, non-spore-forming, rod-shaped bacterium isolated from the feces of children. The unknown organism was bile resistant and produced acetic acid as the major end product of metabolism of peptides and carbohydrates. It possessed a low DNA G + C content of 31 mol %. Comparative 16S rRNA gene sequencing demonstrated that the four isolates were phylogenetically identical (100% 16S rRNA sequence similarity) and represent a hitherto unknown sub-line within the genus Cetobacterium. The novel bacterium displayed approximately 5% sequence divergence with Cetobacterium ceti, and can be readily distinguished from the latter by physiological and biochemical criteria. Based on phylogenetic and phenotypic evidence, it is proposed that the unknown fecal bacterium be classified in the genus Cetobacterium, as Cetobacterium somerae sp. nov. The proposed type strain of Cetobacterium somerae is WAL 14325(T) (= ATCC BAA-474(T) = CCUG 46254(T)).
Synapsing variable length crossover: An algorithm for crossing and comparing variable length genomes
Abstract:
The Synapsing Variable Length Crossover (SVLC) algorithm provides a biologically inspired method for performing meaningful crossover between variable length genomes. In addition to providing a rationale for variable length crossover it also provides a genotypic similarity metric for variable length genomes enabling standard niche formation techniques to be used with variable length genomes. Unlike other variable length crossover techniques which consider genomes to be rigid inflexible arrays and where some or all of the crossover points are randomly selected, the SVLC algorithm considers genomes to be flexible and chooses non-random crossover points based on the common parental sequence similarity. The SVLC Algorithm recurrently "glues" or synapses homogenous genetic sub-sequences together. This is done in such a way that common parental sequences are automatically preserved in the offspring with only the genetic differences being exchanged or removed, independent of the length of such differences. In a variable length test problem the SVLC algorithm is shown to outperform current variable length crossover techniques. The SVLC algorithm is also shown to work in a more realistic robot neural network controller evolution application.
Abstract:
The synapsing variable-length crossover (SVLC) algorithm provides a biologically inspired method for performing meaningful crossover between variable-length genomes. In addition to providing a rationale for variable-length crossover, it also provides a genotypic similarity metric for variable-length genomes, enabling standard niche formation techniques to be used with variable-length genomes. Unlike other variable-length crossover techniques which consider genomes to be rigid inflexible arrays and where some or all of the crossover points are randomly selected, the SVLC algorithm considers genomes to be flexible and chooses non-random crossover points based on the common parental sequence similarity. The SVLC algorithm recurrently "glues" or synapses homogenous genetic subsequences together. This is done in such a way that common parental sequences are automatically preserved in the offspring with only the genetic differences being exchanged or removed, independent of the length of such differences. In a variable-length test problem, the SVLC algorithm compares favorably with current variable-length crossover techniques. The variable-length approach is further advocated by demonstrating how a variable-length genetic algorithm (GA) can obtain a high fitness solution in fewer iterations than a traditional fixed-length GA in a two-dimensional vector approximation task.
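The idea described in the two SVLC abstracts above can be illustrated with a small Python sketch. It is not the published SVLC implementation: the function names (`synapse`, `crossover`, `similarity`), the `min_len` threshold, the use of `difflib` to find common substrings, and the similarity formula are all assumptions made for illustration. The sketch recursively "synapses" two parent genomes on their longest common substring, keeps the shared segments in the child, and picks each differing segment from one parent at random.

```python
import random
from difflib import SequenceMatcher


def synapse(a, b, min_len=2):
    """Recursively align two genomes on their longest common substring.

    Returns a list of (segment_a, segment_b, shared) tuples in genome order.
    Shared segments are the "synapses"; the others are parental differences.
    """
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    m = matcher.find_longest_match(0, len(a), 0, len(b))
    if m.size < min_len:
        # No usable common substring: the whole span is one difference.
        return [(a, b, False)] if (a or b) else []
    left = synapse(a[:m.a], b[:m.b], min_len)
    right = synapse(a[m.a + m.size:], b[m.b + m.size:], min_len)
    shared = a[m.a:m.a + m.size]
    return left + [(shared, shared, True)] + right


def crossover(a, b, rng, min_len=2):
    """Build a child: keep shared segments, pick each difference from one parent."""
    child = []
    for seg_a, seg_b, shared in synapse(a, b, min_len):
        child.append(seg_a if shared else rng.choice((seg_a, seg_b)))
    return "".join(child)


def similarity(a, b, min_len=2):
    """Fraction of both genomes covered by synapsed segments (assumed metric)."""
    if not (a or b):
        return 1.0
    shared = sum(len(s) for s, _, is_shared in synapse(a, b, min_len) if is_shared)
    return 2 * shared / (len(a) + len(b))


if __name__ == "__main__":
    rng = random.Random(0)
    parent_a, parent_b = "ABCDEFGH", "ABXXEFGHZZ"
    print(synapse(parent_a, parent_b))
    print(crossover(parent_a, parent_b, rng))
    print(similarity(parent_a, parent_b))
```

Because shared segments are copied unchanged, a child of two identical parents equals the parents, and parents of different lengths can still be recombined without choosing arbitrary cut points.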
Abstract:
YqjH is a cytoplasmic FAD-containing protein from Escherichia coli; based on homology to ViuB of Vibrio cholerae, it potentially acts as a ferri-siderophore reductase. This work describes its overexpression, purification, crystallization and structure solution at 3.0 Å resolution. YqjH shares high sequence similarity with a number of known siderophore-interacting proteins and its structure was solved by molecular replacement using the siderophore-interacting protein from Shewanella putrefaciens as the search model. The YqjH structure resembles those of other members of the NAD(P)H:flavin oxidoreductase superfamily.
Abstract:
Escherichia fergusonii has been associated with a wide variety of intestinal and extra-intestinal infections in both humans and animals but, despite strong circumstantial evidence, the degree to which the organism is responsible for the pathologies identified remains uncertain. Thirty isolates of E. fergusonii collected between 2003 and 2004 were screened using an Escherichia coli virulence gene array to test for the presence of homologous virulence genes in E. fergusonii. The iss (increased serum survival) gene was present in 13/30 (43%) of the test strains and the prfB (P-related fimbriae regulatory) and ireA (siderophore receptor IreA) genes were also detected jointly in 3/30 (10%) strains. No known virulence genes were detected in 14/30 (47%) of strains. Following confirmatory PCR and sequence analysis, the E. fergusonii prfB, iss and ireA genes shared a high degree of sequence similarity to their counterparts in E. coli, and a particular resemblance was noted with the E. coli strain APEC O1 pathogenicity island. In tissue culture adherence assays, nine E. fergusonii isolates associated with HEp-2 cells with a 'localised adherence' or 'diffuse adherence' phenotype, and they proved to be moderately invasive. The E. fergusonii isolates in this study possess some phenotypic and genotypic features linked to known pathotypes of E. coli, and support existing evidence that strains of E. fergusonii may act as opportunistic pathogens, although their specific virulence factors may need to be explored. Crown Copyright (c) 2008 Published by Elsevier Ltd. All rights reserved.
Abstract:
The human pathogen enterohemorrhagic Escherichia coli (EHEC) O157:H7 colonizes human and animal gut via formation of attaching and effacing lesions. EHEC strains use a type III secretion system to translocate a battery of effector proteins into the mammalian host cell, which subvert diverse signal transduction pathways implicated in actin dynamics, phagocytosis, and innate immunity. The genomes of sequenced EHEC O157:H7 strains contain two copies of the effector protein gene nleH, which share 49% sequence similarity with the gene for the Shigella effector OspG, recently implicated in inhibition of migration of the transcriptional regulator NF-kappa B to the nucleus. In this study we investigated the role of NleH during EHEC O157:H7 infection of calves and lambs. We found that while EHEC Delta nleH colonized the bovine gut more efficiently than the wild-type strain, in lambs the wild-type strain exhibited a competitive advantage over the mutant during mixed infection. Using the mouse pathogen Citrobacter rodentium, which shares many virulence factors with EHEC O157:H7, including NleH, we observed that the wild-type strain exhibited a competitive advantage over the mutant during mixed infection. We found no measurable differences in T-cell infiltration or hyperplasia in colons of mice inoculated with the wild-type or the nleH mutant strain. Using NF-kappa B reporter mice carrying a transgene containing a luciferase reporter driven by three NF-kappa B response elements, we found that NleH causes an increase in NF-kappa B activity in the colonic mucosa. Consistent with this, we found that the nleH mutant triggered a significantly lower tumor necrosis factor alpha response than the wild-type strain.
Abstract:
Three strains of a Gram-positive, catalase-positive, fermentative, non-lipophilic, previously unknown bacterium were isolated from urogenital samples taken from mares in Scotland (M401624/00/1) and Sweden (VM 2074 and VM 2298T). All were deposited with the CCUG with tentative identifications as Corynebacterium spp. The strains were characterized using a polyphasic taxonomic approach. Biochemically, the strains were very similar to each other, but phylogenetically distinct from Corynebacterium species with validly published names (≤95% sequence similarity). rpoB gene sequence data confirmed the strains belonged to the same species (>99% sequence similarity) and were distinct from species with validly published names (>13% sequence divergence). On the basis of phenotypic and sequence data, the strains represent a novel species within the genus Corynebacterium, for which the name Corynebacterium uterequi is proposed. The type strain is VM 2298T (=CCUG 61235T = DSM 45634T), isolated from equine uterus.
Abstract:
The rulAB operon of Pseudomonas spp. confers fitness traits on the host and has been suggested to be a hotspot for insertion of mobile elements that carry avirulence genes. Here, for the first time, we show that rulB on plasmid pWW0 is a hotspot for the active site-specific integration of related integron-like elements (ILEs) found in six environmental pseudomonads (strains FH1–FH6). Integration into rulB on pWW0 occurred at position 6488 generating a 3 bp direct repeat. ILEs from FH1 and FH5 were 9403 bp in length and contained eight open reading frames (ORFs), while the ILE from FH4 was 16 233 bp in length and contained 16 ORFs. In all three ILEs, the first 5.1 kb (containing ORFs 1–4) were structurally conserved and contained three predicted site-specific recombinases/integrases and a tetR homologue. Downstream of these resided ORFs of the ‘variable side’ with structural and sequence similarity to those encoding survival traits on the fitness enhancing plasmid pGRT1 (ILEFH1 and ILEFH5) and the NR-II virulence region of genomic island PAGI-5 (ILEFH4). Collectively, these ILEs share features with the previously described type III protein secretion system effector ILEs and are considered important to host survival and transfer of fitness enhancing and (a)virulence genes between bacteria.