972 resultados para Applied microbiology
Resumo:
Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteosome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly to appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
Resumo:
The laser diode (LD) is a unique light source that can efficiently produce all radiant energy within the narrow wavelength range used most effectively by a photosynthetic microorganism. We have investigated the use of a single type of LID for the cultivation of the well-studied anoxygenic photosynthetic bacterium, Rhodobacter capsulatus (Rb. capsulatus). An array of vertical-cavity surface-emitting lasers (VCSELs) was driven with a current of 25 mA, and delivered radiation at 860 nm with 0.4 nm linewidth. The emitted light was found to be a suitable source of radiant energy for the cultivation of Rb. capsulatus. The dependence of growth rate on incident irradiance was quantified. Despite the unusual nearly monochromatic light source used in these experiments, no significant changes in the pigment composition and in the distribution of bacteriochlorophyll between LHII and LHI-RC were detected in bacterial cells transferred from incandescent light to laser light. We were also able to show that to achieve a given growth rate in a light-limited culture, the VCSEL required only 30% of the electricity needed by an incandescent bulb, which is of great significance for the potential use of laser-devices in biotechnological applications and photobioreactor construction. (c) 2006 Wiley Periodicals, Inc.
Resumo:
Enhanced biological phosphorus removal (EBPR) is one of the best-studied microbially mediated industrial processes because of its ecological and economic relevance. Despite this, it is not well understood at the metabolic level. Here we present a metagenomic analysis of two lab-scale EBPR sludges dominated by the uncultured bacterium, Candidatus Accumulibacter phosphatis.'' The analysis sheds light on several controversies in EBPR metabolic models and provides hypotheses explaining the dominance of A. phosphatis in this habitat, its lifestyle outside EBPR and probable cultivation requirements. Comparison of the same species from different EBPR sludges highlights recent evolutionary dynamics in the A. phosphatis genome that could be linked to mechanisms for environmental adaptation. In spite of an apparent lack of phylogenetic overlap in the flanking communities of the two sludges studied, common functional themes were found, at least one of them complementary to the inferred metabolism of the dominant organism. The present study provides a much needed blueprint for a systems-level understanding of EBPR and illustrates that metagenomics enables detailed, often novel, insights into even well-studied biological systems.
Resumo:
We have determined the three-dimensional structure of the protein complex between latexin and carboxypeptidase A using a combination of chemical cross-linking, mass spectrometry and molecular docking. The locations of three intermolecular cross-links were identified using mass spectrometry and these constraints were used in combination with a speed-optimised docking algorithm allowing us to evaluate more than 3 x 10(11) possible conformations. While cross-links represent only limited structural constraints, the combination of only three experimental cross-links with very basic molecular docking was sufficient to determine the complex structure. The crystal structure of the complex between latexin and carboxypeptidase A4 determined recently allowed us to assess the success of this structure determination approach. Our structure was shown to be within 4 angstrom r.m.s. deviation of C alpha atoms of the crystal structure. The study demonstrates that cross-linking in combination with mass spectrometry can lead to efficient and accurate structural modelling of protein complexes.
Resumo:
Gateway technology is a powerful system for converting a single entry vector into a wide variety of expression vectors. We expressed recombinant influenza matrix protein M1 (FMP), a potent antigen for cytotoxic T cells, using the Gateway vector pET-DEST42 containing the FMP cDNA, and purified the expressed FMP as a single 32 kDa recombinant protein. N-terminal and internal protein sequencing, however, showed that the recombinant FMP contained an extra 10 amino acids fused to the N-terminal of native FMP. Further investigation of the DNA sequence adjacent to the 5'-FMP cDNA indicated that the TTG in the attB1 site (30bp upstream of the ATG in the 5'-FMP cDNA) behaved as a dominant translation start site, resulting in a 10 amino acid extension of the recombinant FMP. Thus, it is possible that recombinant proteins produced by this Gateway vector contain unexpected vector-derived peptides, which may affect experimental outcomes. (c) 2006 Elsevier Inc. All rights reserved.
Resumo:
The recently described process of simultaneous nitrification, denitrification and phosphorus removal (SNDPR) has a great potential to save capital and operating costs for wastewater treatment plants. However, the presence of glycogen-accumulating organisms (GAOs) and the accumulation of nitrous oxide (N2O) can severely compromise the advantages of this process. In this study, these two issues were investigated using a lab-scale sequencing batch reactor performing SNDPR over a 5-month period. The reactor was highly enriched in polyphosphate-accumulating organisms (PAOs) and GAOs representing around 70% of the total microbial community. PAOs were the dominant population at all times and their abundance increased, while GAOs population decreased over the study period. Anoxic batch tests demonstrated that GAOs rather than denitrifying PAOs were responsible for denitrification. NO accumulated from denitrification and more than half of the nitrogen supplied in a reactor cycle was released into the atmosphere as NO. After mixing SNDPR sludge with other denitrifying sludge, N2O present in the bulk liquid was reduced immediately if external carbon was added. We therefore suggest that the N2O accumulation observed in the SNDPR reactor is an artefact of the low microbial diversity facilitated by the use of synthetic wastewater with only a single carbon source. (C) 2005 Elsevier B.V. All rights reserved.
Resumo:
Motivation: Conformational flexibility is essential to the function of many proteins, e.g. catalytic activity. To assist efforts in determining and exploring the functional properties of a protein, it is desirable to automatically identify regions that are prone to undergo conformational changes. It was recently shown that a probabilistic predictor of continuum secondary structure is more accurate than categorical predictors for structurally ambivalent sequence regions, suggesting that such models are suited to characterize protein flexibility. Results: We develop a computational method for identifying regions that are prone to conformational change directly from the amino acid sequence. The method uses the entropy of the probabilistic output of an 8-class continuum secondary structure predictor. Results for 171 unique amino acid sequences with well-characterized variable structure (identified in the 'Macromolecular movements database') indicate that the method is highly sensitive at identifying flexible protein regions, but false positives remain a problem. The method can be used to explore conformational flexibility of proteins (including hypothetical or synthetic ones) whose structure is yet to be determined experimentally.
Resumo:
The Thames Estuary, UK, and the Brisbane River, Australia, are comparable in size and catchment area. Both are representative of the large and growing number of the world's estuaries associated with major cities. Principle differences between the two systems relate to climate and human population pressures. In order to assess the potential phytotoxic impact of herbicide residues in the estuaries, surface waters were analysed with a PAM fluorometry-based bioassay that employs the photosynthetic efficiency (photosystem II quantum yield) of laboratory cultured microalgae, as an endpoint measure of phytotoxicity. In addition, surface waters were chemically analysed for a limited number of herbicides. Diuron atrazine and simazine were detected in both systems at comparable concentrations. In contrast, bioassay results revealed that whilst detected herbicides accounted for the observed phytotoxicity of Brisbane River extracts with great accuracy, they consistently explained only around 50% of the phytotoxicity induced by Thames Estuary extracts. Unaccounted for phytotoxicity in Thames surface waters is indicative of unidentified phytotoxins. The greatest phytotoxic response was measured at Charing Cross, Thames Estuary, and corresponded to a diuron equivalent concentration of 180 ng L-1. The study employs relative potencies (REP) of PSII impacting herbicides and demonstrates that chemical analysis alone is prone to omission of valuable information. Results of the study provide support for the incorporation of bioassays into routine monitoring programs where bioassay data may be used to predict and verify chemical contamination data, alert to unidentified compounds and provide the user with information regarding cumulative toxicity of complex mixtures. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
Background: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models. Results: Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues. Conclusion: Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods.