3 resultados para Gc Content
em National Center for Biotechnology Information - NCBI
Resumo:
A quantitative model of interphase chromosome higher-order structure is presented based on the isochore model of the genome and results obtained in the field of copolymer research. G1 chromosomes are approximated in the model as multiblock copolymers of the 30-nm chromatin fiber, which alternately contain two types of 0.5- to 1-Mbp blocks (R and G minibands) differing in GC content and DNA-bound proteins. A G1 chromosome forms a single-chain string of loop clusters (micelles), with each loop ∼1–2 Mbp in size. The number of ∼20 loops per micelle was estimated from the dependence of geometrical versus genomic distances between two points on a G1 chromosome. The greater degree of chromatin extension in R versus G minibands and a difference in the replication time for these minibands (early S phase for R versus late S phase for G) are explained in this model as a result of the location of R minibands at micelle cores and G minibands at loop apices. The estimated number of micelles per nucleus is close to the observed number of replication clusters at the onset of S phase. A relationship between chromosomal and nuclear sizes for several types of higher eukaryotic cells (insects, plants, and mammals) is well described through the micelle structure of interphase chromosomes. For yeast cells, this relationship is described by a linear coil configuration of chromosomes.
Resumo:
One challenge presented by large-scale genome sequencing efforts is effective display of uniform information to the scientific community. The Comprehensive Microbial Resource (CMR) contains robust annotation of all complete microbial genomes and allows for a wide variety of data retrievals. The bacterial information has been placed on the Web at http://www.tigr.org/CMR for retrieval using standard web browsing technology. Retrievals can be based on protein properties such as molecular weight or hydrophobicity, GC-content, functional role assignments and taxonomy. The CMR also has special web-based tools to allow data mining using pre-run homology searches, whole genome dot-plots, batch downloading and traversal across genomes using a variety of datatypes.
Resumo:
The global amino acid compositions as deduced from the complete genomic sequences of six thermophilic archaea, two thermophilic bacteria, 17 mesophilic bacteria and two eukaryotic species were analysed by hierarchical clustering and principal components analysis. Both methods showed an influence of several factors on amino acid composition. Although GC content has a dominant effect, thermophilic species can be identified by their global amino acid compositions alone. This study presents a careful statistical analysis of factors that affect amino acid composition and also yielded specific features of the average amino acid composition of thermophilic species. Moreover, we introduce the first example of a ‘compositional tree’ of species that takes into account not only homologous proteins, but also proteins unique to particular species. We expect this simple yet novel approach to be a useful additional tool for the study of phylogeny at the genome level.