950 resultados para Complete Equipartite Graphs
Resumo:
The nucleocapsid of hepatitis B virus (HBV), or HBcAg, is a highly symmetric structure formed by multiple dimers of a single core protein that contains potent T helper epitopes in its 183-aa sequence. Both factors make HBcAg an unusually strong immunogen and an attractive candidate as a carrier for foreign epitopes. The immunodominant c/e1 epitope on the capsid has been suggested as a superior location to convey high immunogenicity to a heterologous sequence. Because of its central position, however, any c/e1 insert disrupts the core protein’s primary sequence; hence, only peptides, or rather small protein fragments seemed to be compatible with particle formation. According to recent structural data, the epitope is located at the tips of prominent surface spikes formed by the very stable dimer interfaces. We therefore reasoned that much larger inserts might be tolerated, provided the individual parts of a corresponding fusion protein could fold independently. Using the green fluorescent protein (GFP) as a model insert, we show that the chimeric protein efficiently forms fluorescent particles; hence, all of its structurally important parts must be properly folded. We also demonstrate that the GFP domains are surface-exposed and that the chimeric particles elicit a potent humoral response against native GFP. Hence, proteins of at least up to 238 aa can be natively displayed on the surface of HBV core particles. Such chimeras may not only be useful as vaccines but may also open the way for high resolution structural analyses of nonassembling proteins by electron microscopy.
Resumo:
Current global phylogenies are built predominantly on rRNA sequences. However, an experimental system for studying the evolution of rRNA is not readily available, mainly because the rRNA genes are highly repeated in most experimental organisms. We have constructed an Escherichia coli strain in which all seven chromosomal rRNA operons are inactivated by deletions spanning the 16S and 23S coding regions. A single E. coli rRNA operon carried by a multicopy plasmid supplies 16S and 23S rRNA to the cell. By using this strain we have succeeded in creating microorganisms that contain only a foreign rRNA operon derived from either Salmonella typhimurium or Proteus vulgaris, microorganisms that have diverged from E. coli about 120–350 million years ago. We also were able to replace the E. coli rRNA operon with an E. coli/yeast hybrid one in which the GTPase center of E. coli 23S rRNA had been substituted by the corresponding domain from Saccharomyces cerevisiae. These results suggest that, contrary to common belief, coevolution of rRNA with many other components in the translational machinery may not completely preclude the horizontal transfer of rRNA genes.
Resumo:
The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt on a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available, in which proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and shared with bacteria and/or archaea were included. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis.
Resumo:
This paper describes the design of a parallel algorithm that uses moving fluids in a three-dimensional microfluidic system to solve a nondeterministically polynomial complete problem (the maximal clique problem) in polynomial time. This algorithm relies on (i) parallel fabrication of the microfluidic system, (ii) parallel searching of all potential solutions by using fluid flow, and (iii) parallel optical readout of all solutions. This algorithm was implemented to solve the maximal clique problem for a simple graph with six vertices. The successful implementation of this algorithm to compute solutions for small-size graphs with fluids in microchannels is not useful, per se, but does suggest broader application for microfluidics in computation and control.
Resumo:
Candida albicans is a diploid fungus that has become a medically important opportunistic pathogen in immunocompromised individuals. We have sequenced the C. albicans genome to 10.4-fold coverage and performed a comparative genomic analysis between C. albicans and Saccharomyces cerevisiae with the objective of assessing whether Candida possesses a genetic repertoire that could support a complete sexual cycle. Analyzing over 500 genes important for sexual differentiation in S. cerevisiae, we find many homologues of genes that are implicated in the initiation of meiosis, chromosome recombination, and the formation of synaptonemal complexes. However, others are striking in their absence. C. albicans seems to have homologues of all of the elements of a functional pheromone response pathway involved in mating in S. cerevisiae but lacks many homologues of S. cerevisiae genes for meiosis. Other meiotic gene homologues in organisms ranging from filamentous fungi to Drosophila melanogaster and Caenorhabditis elegans were also found in the C. albicans genome, suggesting potential alternative mechanisms of genetic exchange.
Resumo:
We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.
Resumo:
The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.
Resumo:
The 1,852,442-bp sequence of an M1 strain of Streptococcus pyogenes, a Gram-positive pathogen, has been determined and contains 1,752 predicted protein-encoding genes. Approximately one-third of these genes have no identifiable function, with the remainder falling into previously characterized categories of known microbial function. Consistent with the observation that S. pyogenes is responsible for a wider variety of human disease than any other bacterial species, more than 40 putative virulence-associated genes have been identified. Additional genes have been identified that encode proteins likely associated with microbial “molecular mimicry” of host characteristics and involved in rheumatic fever or acute glomerulonephritis. The complete or partial sequence of four different bacteriophage genomes is also present, with each containing genes for one or more previously undiscovered superantigen-like proteins. These prophage-associated genes encode at least six potential virulence factors, emphasizing the importance of bacteriophages in horizontal gene transfer and a possible mechanism for generating new strains with increased pathogenic potential.
Resumo:
Understanding the factors responsible for variations in mutation patterns and selection efficacy along chromosomes is a prerequisite for deciphering genome sequences. Population genetics models predict a positive correlation between the efficacy of selection at a given locus and the local rate of recombination because of Hill–Robertson effects. Codon usage is considered one of the most striking examples that support this prediction at the molecular level. In a wide range of species including Caenorhabditis elegans and Drosophila melanogaster, codon usage is essentially shaped by selection acting for translational efficiency. Codon usage bias correlates positively with recombination rate in Drosophila, apparently supporting the hypothesis that selection on codon usage is improved by recombination. Here we present an exhaustive analysis of codon usage in C. elegans and D. melanogaster complete genomes. We show that in both genomes there is a positive correlation between recombination rate and the frequency of optimal codons. However, we demonstrate that in both species, this effect is due to a mutational bias toward G and C bases in regions of high recombination rate, possibly as a direct consequence of the recombination process. The correlation between codon usage bias and recombination rate in these species appears to be essentially determined by recombination-dependent mutational patterns, rather than selective effects. This result highlights that it is necessary to take into account the mutagenic effect of recombination to understand the evolutionary role and impact of recombination.
Resumo:
The genome of the crenarchaeon Sulfolobus solfataricus P2 contains 2,992,245 bp on a single chromosome and encodes 2,977 proteins and many RNAs. One-third of the encoded proteins have no detectable homologs in other sequenced genomes. Moreover, 40% appear to be archaeal-specific, and only 12% and 2.3% are shared exclusively with bacteria and eukarya, respectively. The genome shows a high level of plasticity with 200 diverse insertion sequence elements, many putative nonautonomous mobile elements, and evidence of integrase-mediated insertion events. There are also long clusters of regularly spaced tandem repeats. Different transfer systems are used for the uptake of inorganic and organic solutes, and a wealth of intracellular and extracellular proteases, sugar, and sulfur metabolizing enzymes are encoded, as well as enzymes of the central metabolic pathways and motility proteins. The major metabolic electron carrier is not NADH as in bacteria and eukarya but probably ferredoxin. The essential components required for DNA replication, DNA repair and recombination, the cell cycle, transcriptional initiation and translation, but not DNA folding, show a strong eukaryal character with many archaeal-specific features. The results illustrate major differences between crenarchaea and euryarchaea, especially for their DNA replication mechanism and cell cycle processes and their translational apparatus.
Resumo:
In human patients, a wide range of mutations in keratin (K) 5 or K14 lead to the blistering skin disorder epidermolysis bullosa simplex. Given that K14 deficiency does not lead to the ablation of a basal cell cytoskeleton because of a compensatory role of K15, we have investigated the requirement for the keratin cytoskeleton in basal cells by inactivating the K5 gene in mice. We report that the K5−/− mice die shortly after birth, lack keratin filaments in the basal epidermis, and are more severely affected than K14−/− mice. In contrast to the K14−/− mice, we detected a strong induction of the wound-healing keratin K6 in the suprabasal epidermis of cytolyzed areas of postnatal K5−/− mice. In addition, K5 and K14 mice differed with respect to tongue lesions. Moreover, we show that in the absence of K5 and other type II keratins, residual K14 and K15 aggregated along hemidesmosomes, demonstrating that individual keratins without a partner are stable in vivo. Our data indicate that K5 may be the natural partner of K15 and K17. We suggest that K5 null mutations may be lethal in human epidermolysis bullosa simplex patients.
Resumo:
The recently sequenced genome of the parasitic bacterium Mycoplasma genitalium contains only 468 identified protein-coding genes that have been dubbed a minimal gene complement [Fraser, C.M., Gocayne, J.D., White, O., Adams, M.D., Clayton, R.A., et al. (1995) Science 270, 397-403]. Although the M. genitalium gene complement is indeed the smallest among known cellular life forms, there is no evidence that it is the minimal self-sufficient gene set. To derive such a set, we compared the 468 predicted M. genitalium protein sequences with the 1703 protein sequences encoded by the other completely sequenced small bacterial genome, that of Haemophilus influenzae. M. genitalium and H. influenzae belong to two ancient bacterial lineages, i.e., Gram-positive and Gram-negative bacteria, respectively. Therefore, the genes that are conserved in these two bacteria are almost certainly essential for cellular function. It is this category of genes that is most likely to approximate the minimal gene set. We found that 240 M. genitalium genes have orthologs among the genes of H. influenzae. This collection of genes falls short of comprising the minimal set as some enzymes responsible for intermediate steps in essential pathways are missing. The apparent reason for this is the phenomenon that we call nonorthologous gene displacement when the same function is fulfilled by nonorthologous proteins in two organisms. We identified 22 nonorthologous displacements and supplemented the set of orthologs with the respective M. genitalium genes. After examining the resulting list of 262 genes for possible functional redundancy and for the presence of apparently parasite-specific genes, 6 genes were removed. We suggest that the remaining 256 genes are close to the minimal gene set that is necessary and sufficient to sustain the existence of a modern-type cell. Most of the proteins encoded by the genes from the minimal set have eukaryotic or archaeal homologs but seven key proteins of DNA replication do not. We speculate that the last common ancestor of the three primary kingdoms had an RNA genome. Possibilities are explored to further reduce the minimal set to model a primitive cell that might have existed at a very early stage of life evolution.
Resumo:
The rearrangement of antibody and T-cell receptor gene segments is indispensable to the vertebrate immune response. All extant jawed vertebrates can rearrange these gene segments. This ability is conferred by the recombination activating genes I and II (RAG I and RAG II). To elucidate their origin and function, the cDNA encoding RAG I from a member of the most ancient class of extant gnathostomes, the Carcharhine sharks, was characterized. Homology domains identified within shark RAG I prompted sequence comparison analyses that suggested similarity of the RAG I and II genes, respectively, to the integrase family genes and integration host factor genes of the bacterial site-specific recombination system. Thus, the apparent explosive evolution (or "big bang") of the ancestral immune system may have been initiated by a transfer of microbial site-specific recombinases.