6 results for Clustering a large document collection
in National Center for Biotechnology Information - NCBI
Abstract:
Statistically significant charge clusters (basic, acidic, or of mixed charge) in tertiary protein structures are identified by new methods from a large representative collection of protein structures. About 10% of protein structures show at least one charge cluster, mostly of mixed type involving about equally anionic and cationic residues. Positive charge clusters are very rare. Negative (or histidine-acidic) charge clusters often coordinate calcium, magnesium, or zinc ions [e.g., thermolysin (PDB code: 3tln), mannose-binding protein (2msb), aminopeptidase (1amp)]. Mixed-charge clusters are prominent at interchain contacts where they stabilize quaternary protein formation [e.g., glutathione S-transferase (2gst), catalase (8act), and fructose-1,6-bisphosphate aldolase (1fba)]. They are also involved in protein-protein interaction and in substrate binding. For example, the mixed-charge cluster of aspartate carbamoyl-transferase (8atc) envelops the aspartate carbonyl substrate in a flexible manner (alternating tense and relaxed states) where charge associations can vary from weak to strong. Other proteins with charge clusters include the P450 cytochrome family (BM-3, Terp, Cam), several flavocytochromes, neuraminidase, hemagglutinin, the photosynthetic reaction center, and annexin. In each case in Table 2 we discuss the possible role of the charge clusters with respect to protein structure and function.
Abstract:
Eukaryotic genome similarity relationships are inferred using sequence information derived from large aggregates of genomic sequences. Comparisons within and between species sample sequences are based on the profile of dinucleotide relative abundance values (the profile is ρ*XY = f*XY/(f*X f*Y) for all XY, where f*X denotes the frequency of the nucleotide X and f*XY denotes the frequency of the dinucleotide XY, both computed from the sequence concatenated with its inverted complement). Previous studies with respect to prokaryotes and this study document that profiles of different DNA sequence samples (sample size ≥50 kb) from the same organism are generally much more similar to each other than they are to profiles from other organisms, and that closely related organisms generally have more similar profiles than do distantly related organisms. On this basis we refer to the collection {ρ*XY} as the genome signature. This paper identifies ρ*XY extremes and compares genome signature differences for a diverse range of eukaryotic species. Interpretations on the mechanisms maintaining these profile differences center on genome-wide replication, repair, DNA structures, and context-dependent mutational biases. It is also observed that mitochondrial genome signature differences between species parallel the corresponding nuclear genome signature differences despite large differences between corresponding mitochondrial and nuclear signatures. The genome signature differences also have implications for contrasts between rodents and other mammals, and between monocot and dicot plants, as well as providing evidence for similarities among fungi and the diversity of protists.
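The profile defined in this abstract can be illustrated with a short Python sketch. It is not the authors' code: the function names are hypothetical, and one simplification is assumed, namely that the two strands are counted separately and pooled rather than literally concatenated, which avoids counting the single artificial dinucleotide at the junction. Characters other than A, C, G, and T are ignored in the frequencies.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the inverted complement of a DNA string (A<->T, C<->G, reversed)."""
    return seq.translate(_COMPLEMENT)[::-1]


def dinucleotide_relative_abundance(seq):
    """Sketch of rho*_XY = f*_XY / (f*_X * f*_Y) for all 16 dinucleotides XY.

    Frequencies are pooled over the sequence and its inverted complement.
    Assumption: strands are counted separately instead of being concatenated.
    """
    seq = seq.upper()
    mono = Counter()
    di = Counter()
    for strand in (seq, reverse_complement(seq)):
        mono.update(strand)
        di.update(strand[i:i + 2] for i in range(len(strand) - 1))

    # Normalise over unambiguous bases only, so N and similar codes drop out.
    n_mono = sum(mono[b] for b in BASES)
    n_di = sum(di[x + y] for x, y in product(BASES, repeat=2))
    if n_mono == 0 or n_di == 0:
        raise ValueError("sequence has too few A/C/G/T characters")

    profile = {}
    for x, y in product(BASES, repeat=2):
        fx = mono[x] / n_mono
        fy = mono[y] / n_mono
        fxy = di[x + y] / n_di
        # Undefined when a base never occurs; report NaN rather than guess.
        profile[x + y] = fxy / (fx * fy) if fx and fy else float("nan")
    return profile
```

Because both strands are pooled, the resulting profile is symmetric under reverse complementation: for example, the values for AC and GT coincide. Comparing two samples would then amount to comparing their 16-value profiles with some distance measure, the choice of which is not specified in the abstract.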
Abstract:
In the yeast Saccharomyces cerevisiae, meiotic recombination is initiated by transient DNA double-strand breaks (DSBs) that are repaired by interaction of the broken chromosome with its homologue. To identify a large number of DSB sites and gain insight into the control of DSB formation at both the local and the whole chromosomal levels, we have determined at high resolution the distribution of meiotic DSBs along the 340 kb of chromosome III. We have found 76 DSB regions, mostly located in intergenic promoter-containing intervals. The frequency of DSBs varies at least 50-fold from one region to another. The global distribution of DSB regions along chromosome III is nonrandom, defining large (39–105 kb) chromosomal domains, both hot and cold. The distribution of these localized DSBs indicates that they are likely to initiate most crossovers along chromosome III, but some discrepancies remain to be explained.
Abstract:
Rat basophilic leukemia (RBL-2H3) cells predominantly express the type II receptor for inositol 1,4,5-trisphosphate (InsP3), which operates as an InsP3-gated calcium channel. In these cells, cross-linking the high-affinity immunoglobulin E receptor (FcεR1) leads to activation of phospholipase C γ isoforms via tyrosine kinase- and phosphatidylinositol 3-kinase-dependent pathways, release of InsP3-sensitive intracellular Ca2+ stores, and a sustained phase of Ca2+ influx. These events are accompanied by a redistribution of type II InsP3 receptors within the endoplasmic reticulum and nuclear envelope, from a diffuse pattern with a few small aggregates in resting cells to large isolated clusters after antigen stimulation. Redistribution of type II InsP3 receptors is also seen after treatment of RBL-2H3 cells with ionomycin or thapsigargin. InsP3 receptor clustering occurs within 5–10 min of stimulus and persists for up to 1 h in the presence of antigen. Receptor clustering is independent of endoplasmic reticulum vesiculation, which occurs only at ionomycin concentrations >1 μM, and maximal clustering responses are dependent on the presence of extracellular calcium. InsP3 receptor aggregation may be a characteristic cellular response to Ca2+-mobilizing ligands, because similar results are seen after activation of phospholipase C-linked G-protein-coupled receptors; cholecystokinin causes type II receptor redistribution in rat pancreatoma AR4–2J cells, and carbachol causes type III receptor redistribution in muscarinic receptor-expressing hamster lung fibroblast E36M3R cells. Stimulation of these three cell types leads to a reduction in InsP3 receptor levels only in AR4–2J cells, indicating that receptor clustering does not correlate with receptor down-regulation. 
The calcium-dependent aggregation of InsP3 receptors may contribute to the previously observed changes in affinity for InsP3 in the presence of elevated Ca2+ and/or may establish discrete regions within refilled stores with varying capacity to release Ca2+ when a subsequent stimulus results in production of InsP3.
Abstract:
We have undertaken an extensive screen to identify Saccharomyces cerevisiae genes whose products are involved in cell cycle progression. We report the identification of 113 genes, including 19 hypothetical ORFs, which confer arrest or delay in specific compartments of the cell cycle when overexpressed. The collection of genes identified by this screen overlaps with those identified in loss-of-function cdc screens but also includes genes whose products have not previously been implicated in cell cycle control. Through analysis of strains lacking these hypothetical ORFs, we have identified a variety of new CDC and checkpoint genes.
Abstract:
Among fruit-fly species of the genus Drosophila there is remarkable variation in sperm length, with some species producing gigantic sperm (e.g., > 10 times total male body length). These flies are also unusual in that males of some species exhibit a prolonged adult nonreproductive phase. We document sperm length, body size, and sex-specific ages of reproductive maturity for 42 species of Drosophila and, after controlling for phylogeny, test hypotheses to explain the variation in rates of sexual maturation. Results suggest that delayed male maturity is a cost of producing long sperm. A possible physiological mechanism to explain the observed relationship is discussed.