980 resultados para genome structure


Relevância:

30.00% 30.00%

Publicador:

Resumo:

GOBASE (http://megasun.bch.umontreal.ca/gobase/) is a network-accessible biological database, which is unique in bringing together diverse biological data on organelles with taxonomically broad coverage, and in furnishing data that have been exhaustively verified and completed by experts. So far, we have focused on mitochondrial data: GOBASE contains all published nucleotide and protein sequences encoded by mitochondrial genomes, selected RNA secondary structures of mitochondria-encoded molecules, genetic maps of completely sequenced genomes, taxonomic information for all species whose sequences are present in the database and organismal descriptions of key protistan eukaryotes. All of these data have been integrated and organized in a formal database structure to allow sophisticated biological queries using terms that are inherent in biological concepts. Most importantly, data have been validated, completed, corrected and standardized, a prerequisite of meaningful analysis. In addition, where critical data are lacking, such as genetic maps and RNA secondary structures, they are generated by the GOBASE team and collaborators, and added to the database. The database is implemented in a relational database management system, but features an object-oriented view of the biological data through a Web/Genera-generated World Wide Web interface. Finally, we have developed software for database curation (i.e. data updates, validation and correction), which will be described in some detail in this paper.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We report the genetic organisation of six prophages present in the genome of Lactococcus lactis IL1403. The three larger prophages (36–42 kb), belong to the already described P335 group of temperate phages, whereas the three smaller ones (13–15 kb) are most probably satellites relying on helper phage(s) for multiplication. These data give a new insight into the genetic structure of lactococcal phage populations. P335 temperate phages have variable genomes, sharing homology over only 10–33% of their length. In contrast, virulent phages have highly similar genomes sharing homology over >90% of their length. Further analysis of genetic structure in all known groups of phages active on other bacterial hosts such as Escherichia coli, Bacillus subtilis, Mycobacterium and Streptococcus thermophilus confirmed the existence of two types of genetic structure related to the phage way of life. This might reflect different intensities of horizontal DNA exchange: low among purely virulent phages and high among temperate phages and their lytic homologues. We suggest that the constraints on genetic exchange among purely virulent phages reflect their optimal genetic organisation, adapted to a more specialised and extreme form of parasitism than temperate/lytic phages.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing ‘global views’ of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g. it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) protein–protein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g. most structurally variable folds). The integration of whole-genome expression and protein–protein interaction data with structural information is a particularly novel feature of our system. We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g. expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e. according to V–b, for attribute value V and constant exponent b), with a few folds having large values and most having small values.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The poly(A)-binding protein (PABP) recognizes the 3′ mRNA poly(A) tail and plays an essential role in eukaryotic translation initiation and mRNA stabilization/degradation. PABP is a modular protein, with four N-terminal RNA-binding domains and an extensive C terminus. The C-terminal region of PABP is essential for normal growth in yeast and has been implicated in mediating PABP homo-oligomerization and protein–protein interactions. A small, proteolytically stable, highly conserved domain has been identified within this C-terminal segment. Remarkably, this domain is also present in the hyperplastic discs protein (HYD) family of ubiquitin ligases. To better understand the function of this conserved region, an x-ray structure of the PABP-like segment of the human HYD protein has been determined at 1.04-Å resolution. The conserved domain adopts a novel fold resembling a right-handed supercoil of four α-helices. Sequence profile searches and comparative protein structure modeling identified a small ORF from the Arabidopsis thaliana genome that encodes a structurally similar but distantly related PABP/HYD domain. Phylogenetic analysis of the experimentally determined (HYD) and homology modeled (PABP) protein surfaces revealed a conserved feature that may be responsible for binding to a PABP interacting protein, Paip1, and other shared interaction partners.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gene targeting in mammalian cells has proven invaluable in biotechnology, in studies of gene structure and function, and in understanding chromosome dynamics. It also offers a potential tool for gene-therapeutic applications. Two limitations constrain the current technology: the low rate of homologous recombination in mammalian cells and the high rate of random (nontargeted) integration of the vector DNA. Here we consider possible ways to overcome these limitations within the framework of our present understanding of recombination mechanisms and machinery. Several studies suggest that transient alteration of the levels of recombination proteins, by overexpression or interference with expression, may be able to increase homologous recombination or decrease random integration, and we present a list of candidate genes. We consider potentially beneficial modifications to the vector DNA and discuss the effects of methods of DNA delivery on targeting efficiency. Finally, we present work showing that gene-specific DNA damage can stimulate local homologous recombination, and we discuss recent results with two general methodologies—chimeric nucleases and triplex-forming oligonucleotides—for stimulating recombination in cells.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We have analyzed the developmental molecular programs of the mouse hippocampus, a cortical structure critical for learning and memory, by means of large-scale DNA microarray techniques. Of 11,000 genes and expressed sequence tags examined, 1,926 showed dynamic changes during hippocampal development from embryonic day 16 to postnatal day 30. Gene-cluster analysis was used to group these genes into 16 distinct clusters with striking patterns that appear to correlate with major developmental hallmarks and cellular events. These include genes involved in neuronal proliferation, differentiation, and synapse formation. A complete list of the transcriptional changes has been compiled into a comprehensive gene profile database (http://BrainGenomics.Princeton.edu), which should prove valuable in advancing our understanding of the molecular and genetic programs underlying both the development and the functions of the mammalian brain.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The full sequence of the genome-linked viral protein (VPg) cistron located in the central part of potato virus Y (common strain) genome has been identified. The VPg gene codes for a protein of 188 amino acids, with significant homology to other known potyviral VPg polypeptides. A three-dimensional model structure of VPg is proposed on the basis of similarity of hydrophobic-hydrophilic residue distribution to the sequence of malate dehydrogenase of known crystal structure. The 5' end of the viral RNA can be fitted to interact with the protein through the exposed hydroxyl group of Tyr-64, in agreement with experimental data. The complex favors stereochemically the formation of a phosphodiester bond [5'-(O4-tyrosylphospho)adenylate] typical for representatives of picornavirus-like viruses. The chemical mechanisms of viral RNA binding to VPg are discussed on the basis of the model structure of protein-RNA complex.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The nucleotide sequence of the human alpha-albumin gene, including 887 bp of the 5'-flanking region and 1311 bp of the 3-flanking region (24,454 in total), was determined from three overlapping lambda phage clones. The sequence spans 22,256 bp from the cap site to the polyadenylylation site, revealing a gene structure of 15 exons separated by 14 introns. The methionine initiation codon ATG is within exon 1; the termination codon TGA is within exon 14. Exon 15 is entirely untranslated and contains the polyadenylylation signal AATAAA. The deduced polypeptide chain is composed of a 21-amino-acid leader peptide, followed by 578 amino acids of the mature protein. There are seven repetitive DNA elements (Alu and Kpn) in the introns and 3-flanking region. The sizes of the 15 alpha-albumin exons match closely those of the albumin, alpha-fetoprotein, and vitamin D-binding protein genes. The exons are symmetrically placed within the three domains of the individual proteins, and they share a characteristic codon splitting pattern that is conserved among members of the gene family. The results provide strong evidence that alpha-albumin belongs to, and most likely completes with, the serum albumin gene family. Based on structural similarity, alpha-albumin appears to be most closely related to alpha-fetoprotein. The complete structure of this family of four tandemly linked genes provides a well-characterized approximately 200 kb locus in the 4q subcentromeric region of the human genome.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Chlorarachniophyte algae contain a complex, multi-membraned chloroplast derived from the endosymbiosis of a eukaryotic alga. The vestigial nucleus of the endosymbiont, called the nucleomorph, contains only three small linear chromosomes with a haploid genome size of 380 kb and is the smallest known eukaryotic genome. Nucleotide sequence data from a subtelomeric fragment of chromosome III were analyzed as a preliminary investigation of the coding capacity of this vestigial genome. Several housekeeping genes including U6 small nuclear RNA (snRNA), ribosomal proteins S4 and S13, a core protein of the spliceosome [small nuclear ribonucleoprotein (snRNP) E], and a cip-like protease (clpP) were identified. Expression of these genes was confirmed by combinations of Northern blot analysis, in situ hybridization, immunocytochemistry, and cDNA analysis. The protein-encoding genes are typically eukaryotic in overall structure and their messenger RNAs are polyadenylylated. A novel feature is the abundance of 18-, 19-, or 20-nucleotide introns; the smallest spliceosomal introns known. Two of the genes, U6 and S13, overlap while another two genes, snRNP E and clpP, are cotranscribed in a single mRNA. The overall gene organization is extraordinarily compact, making the nucleomorph a unique model for eukaryotic genomics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The mouse is the best model system for the study of mammalian genetics and physiology. Because of the feasibility and importance of studying genetic crosses, the mouse genetic map has received tremendous attention in recent years. It currently contains over 14,000 genetically mapped markers, including 700 mutant loci, 3500 genes, and 6500 simple sequence length polymorphisms (SSLPs). The mutant loci and genes allow insights and correlations concerning physiology and development. The SSLPs provide highly polymorphic anchor points that allow inheritance to be traced in any cross and provide a scaffold for assembling physical maps. Adequate physical mapping resources--notably large-insert yeast artificial chromosome (YAC) libraries--are available to support positional cloning projects based on the genetic map, but a comprehensive physical map is still a few years away. Large-scale sequencing efforts have not yet begun in mouse, but comparative sequence analysis between mouse and human is likely to provide tremendous information about gene structure and regulation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

High-resolution physical maps of the genomes of three Rhodobacter capsulatus strains, derived from ordered cosmid libraries, were aligned. The 1.2-Mb segment of the SB1003 genome studied here is adjacent to a 1-Mb region analyzed previously [Fonstein, M., Nikolskaya, T. & Haselkorn, H. (1995) J. Bacteriol. 177, 2368-2372]. Probes derived from the ordered cosmid set of R. capsulatus SB1003 were used to link cosmids from the St. Louis and 2.3.1 strain libraries. Cosmids selected this way did not merge into a single contig but formed several unlinked groups. EcoRV restriction maps of the ordered cosmids were then constructed using lambda terminase and fused to derive fragments of the chromosomal map. In order to link these fragments, their ends were transcribed to produce secondary probes for hybridization to gridded cosmid libraries of the same strains. This linking reduced the number of subcontigs to three for the St. Louis strain and one for the 2.3.1 strain. Hybridization of the same probes back to the ordered cosmid set of SB1003 positioned the subcontigs on the high-resolution physical map of SB1003. The final alignment of the restriction maps shows numerous large and small translocations in this 1.2-Mb chromosomal region of the three Rhodobacter strains. In addition, the chromosomes of the three strains, whose fine-structure maps can now be compared over 2.2 Mb, are seen to contain regions of 15-80 kb in which restriction sites are highly polymorphic, interspersed among regions in which the positions of restriction sites are highly conserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Using allozymes and mtDNA sequences from the cytochrome b gene, we report that the brown kiwi has the highest levels of genetic structuring observed in birds. Moreover, the mtDNA sequences are, with two minor exceptions, diagnostic genetic markers for each population investigated, even though they are among the more slowly evolving coding regions in this genome. A major unexpected finding was the concordant split in molecular phylogenies between brown kiwis in the southern South Island and elsewhere in New Zealand. This basic phylogeographic boundary halfway down the South Island coincides with a fixed allele difference in the Hb nuclear locus and strongly suggests that two morphologically cryptic species are currently merged under one polytypic species. This is another striking example of how molecular genetic assays can detect phylogenetic discontinuities that are not reflected in traditional morphologically based taxonomies. However, reanalysis of the morphological characters by using phylogenetic methods revealed that the reason for this discordance is that most are primitive and thus are phylogenetically uninformative. Shared-derived morphological characters support the same relationships evident in the molecular phylogenies and, in concert with the molecular data, suggest that as brown kiwis colonized northward from the southern South Island, they retained many primitive characters that confounded earlier systematists. Strong subdivided population structure and cryptic species in brown kiwis seem to have evolved relatively recently as a consequence of Pleistocene range disjunctions, low dispersal power, and genetic drift in small populations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

PR-39 is a porcine 39-aa peptide antibiotic composed of 49% proline and 24% arginine, with an activity against Gram-negative bacteria comparable to that of tetracycline. In Escherichia coli, it inhibits DNA and protein synthesis. PR-39 was originally isolated from pig small intestine, but subsequent cDNA cloning showed that the gene is expressed in the bone marrow. The open reading frame of the clone showed that PR-39 is made as 173-aa precursor whose proregion belongs to the cathelin family. The PR39 gene, which is rather compact and spans only 1784 bp has now been sequenced. The coding information is split into four exons. The first exon contains the signal sequence of 29 residues and the first 37 residues of the cathelin propart. Exons 2 and 3 contain only cathelin information, while exon 4 codes for the four C-terminal cathelin residues and the mature PR-39 peptide extended by three residues. The sequenced upstream region (1183 bp) contains four potential recognition sites for NF-IL6 and three for APRF, transcription factors known to regulate genes for both cytokines and acute phase response factors. Genomic hybridizations revealed a fairly high level of restriction fragment length polymorphism and indicated that there are at least two copies of the PR39 gene in the pig genome. PR39 was mapped to pig chromosome 13 by linkage and in situ hybridization mapping. The gene for the human peptide antibiotic FALL-39 (also a member of the cathelin family) was mapped to human chromosome 3, which is homologous to pig chromosome 13.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Elongated particles of simple RNA viruses of plants are composed of an RNA molecule coated with numerous identical capsid protein subunits to form a regular helical structure, of which tobacco mosaic virus is the archetype. Filamentous particles of the closterovirus beet yellow virus (BYV) reportedly contain approximately 4000 identical 22-kDa (p22) capsid protein subunits. The BYV genome encodes a 24-kDa protein (p24) that is structurally related to the p22. We searched for the p24 in BYV particles by using immunoelectron microscopy with specific antibodies against the recombinant p24 protein and its N-terminal peptide. A 75-nm segment at one end of the 1370-nm filamentous viral particle was found to be consistently labeled with both types of antibodies, thus indicating that p24 is indeed the second capsid protein and that the closterovirus particle, unlike those of other plant viruses with helical symmetry, has a "rattlesnake" rather than uniform structure.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We sampled leaves from 678 individuals in 21 natural populations (30-36 individuals per population), covering the entire distribution of Euptelea pleiospermum in China.Total DNA was isolated from about 50 mg powdered leaf tissue following the protocol of a DNA extraction kit (Tiangen Biotech Co., LTD., Beijing, China). We used seven fluorescence-labeled microsatellite loci (EP036, EP059, EP081, EP087, EP091, EP278 and EP294; Zhang et al., 2008) to genotype our 678 DNA samples.