4 resultados para Biology, Molecular|Biology, Cell|Engineering, Biomedical
em Bucknell University Digital Commons - Pensilvania - USA
Resumo:
Background: In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new substitution-based scoring function for identifying discriminative n-grams that are highly specific to a class. Results: We present a scoring function based on discriminative n-grams that can effectively discriminate between classes. The scoring function, initially, harvests the entire set of 4- to 8-grams from the protein sequences of different classes in the dataset. Similar n-grams of the same size are combined to form new n-grams, where the similarity is defined by positive amino acid substitution scores in the BLOSUM62 matrix. Substitution has resulted in a large increase in the number of discriminatory n-grams harvested. Due to the unbalanced nature of the dataset, the frequencies of the n-grams are normalized using a dampening factor, which gives more weightage to the n-grams that appear in fewer classes and vice-versa. After the n-grams are normalized, the scoring function identifies discriminative 4- to 8-grams for each class that are frequent enough to be above a selection threshold. By mapping these discriminative n-grams back to the protein sequences, we obtained contiguous n-grams that represent short class-specific motifs in protein sequences. Our method fared well compared to an existing motif finding method known as Wordspy. We have validated our enriched set of class-specific motifs against the functionally important motifs obtained from the NLSdb, Prosite and ELM databases. We demonstrate that this method is very generic; thus can be widely applied to detect class-specific motifs in many protein sequence classification tasks. Conclusion: The proposed scoring function and methodology is able to identify class-specific motifs using discriminative n-grams derived from the protein sequences. The implementation of amino acid substitution scores for similarity detection, and the dampening factor to normalize the unbalanced datasets have significant effect on the performance of the scoring function. Our multipronged validation tests demonstrate that this method can detect class-specific motifs from a wide variety of protein sequence classes with a potential application to detecting proteome-specific motifs of different organisms.
Resumo:
Talk of different types of cells is commonplace in the biological sciences. We know a great deal, for example, about human muscle cells by studying the same type of cells in mice. Information about cell type is apparently largely projectible across species boundaries. But what defines cell type? Do cells come pre-packaged into different natural kinds? Philosophical attention to these questions has been extremely limited [see e.g., Wilson (Species: New Interdisciplinary Essays, pp 187-207, 1999; Genes and the Agents of Life, 2005; Wilson et al. Philos Top 35(1/2): 189-215, 2007)]. On the face of it, the problems we face in individuating cellular kinds resemble those biologists and philosophers of biology encountered in thinking about species: there are apparently many different (and interconnected) bases on which we might legitimately classify cells. We could, for example, focus on their developmental history (a sort of analogue to a species' evolutionary history); or we might divide on the basis of certain structural features, functional role, location within larger systems, and so on. In this paper, I sketch an approach to cellular kinds inspired by Boyd's Homeostatic Property Cluster Theory, applying some lessons from this application back to general questions about the nature of natural kinds.
Resumo:
Early embryonic exposure to maternal glucocorticoids can broadly impact physiology and behaviour across phylogenetically diverse taxa. The transfer of maternal glucocorticoids to offspring may be an inevitable cost associated with poor environmental conditions, or serve as a maternal effect that alters offspring phenotype in preparation for a stressful environment. Regardless, maternal glucocorticoids are likely to have both costs and benefits that are paid and collected over different developmental time periods. We manipulated yolk corticosterone (cort) in domestic chickens (Gallus domesticus) to examine the potential impacts of embryonic exposure to maternal stress on the juvenile stress response and cellular ageing. Here, we report that juveniles exposed to experimentally increased cort in ovo had a protracted decline in cort during the recovery phase of the stress response. All birds, regardless of treatment group, shifted to oxidative stress during an acute stress response. In addition, embryonic exposure to cort resulted in higher levels of reactive oxygen metabolites and an over-representation of short telomeres compared with the control birds. In many species, individuals with higher levels of oxidative stress and shorter telomeres have the poorest survival prospects. Given this, long-term costs of glucocorticoid-induced phenotypes may include accelerated ageing and increased mortality.
Performance Tuning Non-Uniform Sampling for Sensitivity Enhancement of Signal-Limited Biological NMR
Resumo:
Non-uniform sampling (NUS) has been established as a route to obtaining true sensitivity enhancements when recording indirect dimensions of decaying signals in the same total experimental time as traditional uniform incrementation of the indirect evolution period. Theory and experiments have shown that NUS can yield up to two-fold improvements in the intrinsic signal-to-noise ratio (SNR) of each dimension, while even conservative protocols can yield 20-40 % improvements in the intrinsic SNR of NMR data. Applications of biological NMR that can benefit from these improvements are emerging, and in this work we develop some practical aspects of applying NUS nD-NMR to studies that approach the traditional detection limit of nD-NMR spectroscopy. Conditions for obtaining high NUS sensitivity enhancements are considered here in the context of enabling H-1,N-15-HSQC experiments on natural abundance protein samples and H-1,C-13-HMBC experiments on a challenging natural product. Through systematic studies we arrive at more precise guidelines to contrast sensitivity enhancements with reduced line shape constraints, and report an alternative sampling density based on a quarter-wave sinusoidal distribution that returns the highest fidelity we have seen to date in line shapes obtained by maximum entropy processing of non-uniformly sampled data.