987 results for sequence analysis


Relevance: 60.00%

Abstract:

In order to support the structural genomic initiatives, both by rapidly classifying newly determined structures and by suggesting suitable targets for structure determination, we have recently developed several new protocols for classifying structures in the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath). These aim to increase the speed of classification of new structures using fast algorithms for structure comparison (GRATH) and to improve the sensitivity in recognising distant structural relatives by incorporating sequence information from relatives in the genomes (DomainFinder). In order to ensure the integrity of the database given the expected increase in data, the CATH Protein Family Database (CATH-PFDB), which currently includes 25 320 structural domains and a further 160 000 sequence relatives, has now been installed in a relational ORACLE database. This was essential for developing more rigorous validation procedures and for allowing efficient querying of the database, particularly for genome analysis. The associated Dictionary of Homologous Superfamilies [Bray,J.E., Todd,A.E., Pearl,F.M.G., Thornton,J.M. and Orengo,C.A. (2000) Protein Eng., 13, 153–165], which provides multiple structural alignments and functional information to assist in assigning new relatives, has also been expanded recently and now includes information for 903 homologous superfamilies. In order to improve coverage of known structures, preliminary classification levels are now provided for new structures at interim stages in the classification protocol. Since a large proportion of new structures can be rapidly classified using profile-based sequence analysis [e.g. PSI-BLAST: Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389–3402], this provides preliminary classification for easily recognisable homologues, which in the latest release of CATH (version 1.7) represented nearly three-quarters of the non-identical structures.

Relevance: 60.00%

Abstract:

A database (SpliceDB) of known mammalian splice site sequences has been developed. We extracted 43 337 splice pairs from mammalian divisions of the gene-centered Infogene database, including sites from incomplete or alternatively spliced genes. Known EST sequences supported 22 815 of them. After discarding sequences with putative errors and ambiguous location of splice junctions, the verified dataset includes 22 489 entries. Of these, 98.71% contain canonical GT–AG junctions (22 199 entries) and 0.56% have non-canonical GC–AG splice site pairs. The remainder (0.73%) is spread over many small groups (with a maximum size of 0.05%). We especially studied non-canonical splice sites, which comprise 3.73% of GenBank annotated splice pairs. EST alignments allowed us to verify only the exonic part of splice sites. To check the conserved dinucleotides we compared sequences of human non-canonical splice sites with sequences from the high throughput genome sequencing project (HTG). Out of 171 human non-canonical and EST-supported splice pairs, 156 (91.23%) had a clear match in the human HTG. They can be classified after sequence analysis as: 79 GC–AG pairs (of which one was an error that was corrected to GC–AG), 61 errors corrected to GT–AG canonical pairs, six AT–AC pairs (of which two were errors corrected to AT–AC), one case that was produced from a non-existent intron, seven cases that were found in HTG and were deposited to GenBank, and finally only two other cases of supported non-canonical splice pairs. The information about verified splice site sequences for canonical and non-canonical sites is presented in SpliceDB with the supporting evidence. We also built weight matrices for the major splice groups, which can be incorporated into gene prediction programs. SpliceDB is available at the computational genomic Web server of the Sanger Centre: http://genomic.sanger.ac.uk/spldb/SpliceDB.html and at http://www.softberry.com/spldb/SpliceDB.html.
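The grouping of splice pairs by their terminal dinucleotides can be illustrated with a minimal sketch. It is not the SpliceDB pipeline: the function names `classify_intron` and `tally` and the toy intron sequences are hypothetical, and the sketch assumes each input is a complete intron read 5′ to 3′ on the coding strand.

```python
from collections import Counter

# Donor/acceptor dinucleotide pairs named in the abstract.
KNOWN_PAIRS = {
    ("GT", "AG"): "GT-AG",  # canonical
    ("GC", "AG"): "GC-AG",  # non-canonical
    ("AT", "AC"): "AT-AC",  # non-canonical
}


def classify_intron(intron: str) -> str:
    """Label an intron by its first two (donor) and last two (acceptor) bases."""
    seq = intron.upper()
    if len(seq) < 4:
        raise ValueError("intron too short to contain both dinucleotides")
    return KNOWN_PAIRS.get((seq[:2], seq[-2:]), "other")


def tally(introns):
    """Return {class label: percentage} over a collection of intron sequences."""
    counts = Counter(classify_intron(s) for s in introns)
    total = sum(counts.values())
    return {label: 100.0 * n / total for label, n in counts.items()}


# Hypothetical toy introns for illustration only; these are not SpliceDB entries.
examples = ["GTAAGTCTTTCAG", "GCAAGTTTTTTAG", "ATATCCTTTTCAC", "GTGAGTCCCACAG"]
print(tally(examples))
```

On a real dataset the same tally would yield the kind of percentages reported above, provided the intron boundaries have already been verified against EST or genomic evidence.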

Relevance: 60.00%

Abstract:

The multispanning membrane protein Ste6, a member of the ABC-transporter family, is transported to the yeast vacuole for degradation. To identify functions involved in the intracellular trafficking of polytopic membrane proteins, we looked for functions that block Ste6 transport to the vacuole upon overproduction. In our screen, we identified several known vacuolar protein sorting (VPS) genes (SNF7/VPS32, VPS4, and VPS35) and a previously uncharacterized open reading frame, which we named MOS10 (more of Ste6). Sequence analysis showed that Mos10 is a member of a small family of coiled-coil–forming proteins, which includes Snf7 and Vps20. Deletion mutants of all three genes stabilize Ste6 and show a “class E vps phenotype.” Maturation of the vacuolar hydrolase carboxypeptidase Y was affected in the mutants, and the endocytic tracer FM4-64 and Ste6 accumulated in a dot or ring-like structure next to the vacuole. Differential centrifugation experiments demonstrated that about half of each of the hydrophilic proteins Mos10 and Vps20 was membrane associated. The intracellular distribution was further analyzed for Mos10. On sucrose gradients, membrane-associated Mos10 cofractionated with the endosomal t-SNARE Pep12, pointing to an endosomal localization of Mos10. The growth phenotypes of the mutants suggest that the “Snf7-family” members are involved in a cargo-specific event.

Relevance: 60.00%

Abstract:

Efficient motility of the eukaryotic flagellum requires precise temporal and spatial control of its constituent dynein motors. The central pair and its associated structures have been implicated as important members of a signal transduction cascade that ultimately regulates dynein arm activity. To identify central pair components involved in this process, we characterized a Chlamydomonas motility mutant (pf6-2) obtained by insertional mutagenesis. pf6-2 flagella twitch ineffectively and lack the 1a projection on the C1 microtubule of the central pair. Transformation with constructs containing a full-length, wild-type copy of the PF6 gene rescues the functional, structural, and biochemical defects associated with the pf6 mutation. Sequence analysis indicates that the PF6 gene encodes a large polypeptide that contains numerous alanine-rich, proline-rich, and basic domains and has limited homology to an expressed sequence tag derived from a human testis cDNA library. Biochemical analysis of an epitope-tagged PF6 construct demonstrates that the PF6 polypeptide is an axonemal component that cosediments at 12.6S with several other polypeptides. The PF6 protein appears to be an essential component required for assembly of some of these polypeptides into the C1-1a projection.

Relevance: 60.00%

Abstract:

Detection of similarity is particularly difficult for small proteins and thus connections between many of them remain unnoticed. Structure and sequence analysis of several metal-binding proteins reveals unexpected similarities in structural domains classified as different protein folds in SCOP and suggests unification of seven folds that belong to two protein classes. The common motif, termed treble clef finger in this study, forms the protein structural core and is 25–45 residues long. The treble clef motif is assembled around the central zinc ion and consists of a zinc knuckle, loop, β-hairpin and an α-helix. The knuckle and the first turn of the helix each incorporate two zinc ligands. Treble clef domains constitute the core of many structures such as ribosomal proteins L24E and S14, RING fingers, protein kinase cysteine-rich domains, nuclear receptor-like fingers, LIM domains, phosphatidylinositol-3-phosphate-binding domains and His-Me finger endonucleases. The treble clef finger is a uniquely versatile motif adaptable for various functions. This small domain with a 25 residue structural core can accommodate eight different metal-binding sites and can have many types of functions from binding of nucleic acids, proteins and small molecules, to catalysis of phosphodiester bond hydrolysis. Treble clef motifs are frequently incorporated in larger structures or occur in doublets. Present analysis suggests that the treble clef motif defines a distinct structural fold found in proteins with diverse functional properties and forms one of the major zinc finger groups.

Relevance: 60.00%

Abstract:

We have cloned, expressed and purified a hexameric human DNA helicase (hHcsA) from HeLa cells. Sequence analysis demonstrated that hHcsA has strong sequence homology with DNA helicase genes from Saccharomyces cerevisiae and Caenorhabditis elegans, indicating that this gene is well conserved from yeast to human. The hHcsA gene was cloned and expressed in Escherichia coli and purified to homogeneity. The expressed protein had a subunit molecular mass of 116 kDa and analysis of its native molecular mass by size exclusion chromatography suggested that hHcsA is a hexameric protein. The hHcsA protein had a strong DNA-dependent ATPase activity that was stimulated ≥5-fold by single-stranded DNA (ssDNA). Human hHcsA unwinds duplex DNA and analysis of the polarity of translocation demonstrated that the polarity of DNA unwinding was in a 5′→3′ direction. The helicase activity was stimulated by human and yeast replication protein A, but not significantly by E. coli ssDNA-binding protein. We have analyzed expression levels of the hHcsA gene in HeLa cells during various phases of the cell cycle using in situ hybridization analysis. Our results indicated that the expression of the hHcsA gene, as evidenced from the mRNA levels, is cell cycle-dependent. The maximal level of hHcsA expression was observed in late G1/early S phase, suggesting a possible role for this protein during S phase and in DNA synthesis.

Relevance: 60.00%

Abstract:

We have determined the solution structure of the C-terminal quarter of human poly(A)-binding protein (hPABP). The protein fragment contains a protein domain, PABC [for poly(A)-binding protein C-terminal domain], which is also found associated with the HECT family of ubiquitin ligases. By using peptides derived from PABP interacting protein (Paip) 1, Paip2, and eRF3, we show that PABC functions as a peptide binding domain. We use chemical shift perturbation analysis to identify the peptide binding site in PABC and the major elements involved in peptide recognition. From comparative sequence analysis of PABC-binding peptides, we formulate a preliminary PABC consensus sequence and identify human ataxin-2, the protein responsible for type 2 spinocerebellar ataxia (SCA2), as a potential PABC ligand.

Relevance: 60.00%

Abstract:

Arabidopsis ERD1 is a ClpC-like protein that sequence analysis suggests may interact with the chloroplast-localized ClpP protease to facilitate proteolysis. The mRNA encoded by the ERD1 gene has previously been shown to accumulate in response to senescence and to a variety of stresses and hormones. Here we show that the ERD1 protein, in contrast to the ERD1 mRNA, strongly declines in abundance with age, becoming undetectable in fully expanded leaves. Sequence analysis also suggests that ERD1 is chloroplast targeted, and we show in an in vitro system that the native protein is properly imported, processed, and present within the soluble fraction of the chloroplast, presumably the stroma. We show that ClpP protein, which is also present in the stroma, declines with age in parallel with ERD1. These results are consistent with the interaction of ERD1 and ClpP, but they suggest that it is unlikely that either plays a major role during senescence. Certain other chloroplast proteins decline with age coordinately with ERD1 and ClpP, suggesting that these declines are markers of an early age-mediated change that occurs within the chloroplast.

Relevance: 60.00%

Abstract:

Fusicoccin (FC) is a fungal toxin that activates the plant plasma membrane H+-ATPase by binding with 14-3-3 proteins, causing membrane hyperpolarization. Here we report on the effect of FC on a gene-for-gene pathogen-resistance response and show that FC application induces the expression of several genes involved in plant responses to pathogens. Ten members of the FC-binding 14-3-3 protein gene family were isolated from tomato (Lycopersicon esculentum) to characterize their role in defense responses. Sequence analysis is suggestive of common biochemical functions for these tomato 14-3-3 proteins, but their genes showed different expression patterns in leaves after challenges. Different specific subsets of 14-3-3 genes were induced after treatment with FC and during a gene-for-gene resistance response. Possible roles for the H+-ATPase and 14-3-3 proteins in responses to pathogens are discussed.

Relevance: 60.00%

Abstract:

The existence in higher plants of an additional β-oxidation system in mitochondria, besides the well-characterized peroxisomal system, is often considered controversial. Unequivocal demonstration of β-oxidation activity in mitochondria should rely on identification of the enzymes specific to mitochondrial β-oxidation. Acyl-coenzyme A dehydrogenase (ACAD) (EC 1.3.99.2,3) activity was detected in purified mitochondria from maize (Zea mays L.) root tips and from embryonic axes of early-germinating sunflower (Helianthus annuus L.) seeds, using as the enzyme assay the reduction of 2,6-dichlorophenolindophenol, with phenazine methosulfate as the intermediate electron carrier. Subcellular fractionation showed that this ACAD activity was associated with mitochondrial fractions. Comparison of ACAD activity in mitochondria and acyl-coenzyme A oxidase activity in peroxisomes showed differences of substrate specificities. Embryonic axes of sunflower seeds were used as starting material for the purification of ACADs. Two distinct ACADs, with medium-chain and long-chain substrate specificities, respectively, were separated by their chromatographic behavior, which was similar to that of mammalian ACADs. The characterization of these ACADs is discussed in relation to the identification of expressed sequence tags corresponding to ACADs in cDNA sequence analysis projects and to the potential roles of mitochondrial β-oxidation in higher plants.

Relevance: 60.00%

Abstract:

Four cDNAs encoding phosphoribosyl diphosphate (PRPP) synthase were isolated from a spinach (Spinacia oleracea) cDNA library by complementation of an Escherichia coli Δprs mutation. The four gene products produced PRPP in vitro from ATP and ribose-5-phosphate. Two of the enzymes (isozymes 1 and 2) required inorganic phosphate for activity, whereas the others were phosphate independent. PRPP synthase isozymes 2 and 3 contained 76 and 87 amino acid extensions, respectively, at their N-terminal ends in comparison with other PRPP synthases. Isozyme 2 was synthesized in vitro and shown to be imported and processed by pea (Pisum sativum) chloroplasts. Amino acid sequence analysis indicated that isozyme 3 may be transported to mitochondria and that isozyme 4 may be located in the cytosol. The deduced amino acid sequences of isozymes 1 and 2 and isozymes 3 and 4 were 88% and 75% identical, respectively. In contrast, the amino acid identity of PRPP synthase isozyme 1 or 2 with 3 or 4 was modest (22%–25%), but the sequence motifs for binding of PRPP and divalent cation-nucleotide were identified in all four sequences. The results indicate that PRPP synthase isozymes 3 and 4 belong to a new class of PRPP synthases that may be specific to plants.
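Percent identity figures such as those quoted above are computed from a pairwise alignment. The abstract does not say which convention the authors used, so the sketch below assumes one common choice: identical residues divided by the number of aligned columns in which neither sequence has a gap. The function name `percent_identity` and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identical residues over ungapped columns of a pairwise alignment.

    Assumption: columns containing a gap in either sequence are ignored.
    Other conventions (e.g. dividing by the shorter sequence length) exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only the columns where both sequences have a residue.
    pairs = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != gap and y != gap
    ]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


# Hypothetical toy alignment: 5 ungapped columns, 4 of them identical.
print(percent_identity("MKT-LV", "MKSALV"))
```

Because the denominator convention changes the result, identity values from different studies are comparable only when the same convention and alignment method are used.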

Relevance: 60.00%

Abstract:

Previously conducted sequence analysis of Arabidopsis thaliana (ecotype Columbia-0) reported an insertion of 270-kb mtDNA into the pericentric region on the short arm of chromosome 2. DNA fiber-based fluorescence in situ hybridization analyses reveal that the mtDNA insert is 618 ± 42 kb, ≈2.3 times greater than that determined by contig assembly and sequencing analysis. Portions of the mitochondrial genome previously believed to be absent were identified within the insert. Sections of the mtDNA are repeated throughout the insert. The cytological data illustrate that DNA contig assembly by using bacterial artificial chromosomes tends to produce a minimal clone path by skipping over duplicated regions, thereby resulting in sequencing errors. We demonstrate that fiber-fluorescence in situ hybridization is a powerful technique to analyze large repetitive regions in the higher eukaryotic genomes and is a valuable complement to ongoing large genome sequencing projects.
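The "≈2.3 times" figure follows directly from the two size estimates in the abstract. The short check below reproduces it; treating the ±42 kb as a simple lower and upper bound is an assumption made here for illustration, since the abstract does not state what the uncertainty represents.

```python
# Sizes taken from the abstract, in kilobases.
fish_kb = 618.0      # fiber-FISH estimate of the mtDNA insert
fish_err_kb = 42.0   # reported uncertainty; treated here as a plain +/- bound (assumption)
contig_kb = 270.0    # size from contig assembly and sequencing

ratio = fish_kb / contig_kb
low = (fish_kb - fish_err_kb) / contig_kb
high = (fish_kb + fish_err_kb) / contig_kb

print(f"ratio = {ratio:.2f} (range {low:.2f} to {high:.2f})")
```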

Relevance: 60.00%

Abstract:

Filamentous fungi are a large group of diverse and economically important microorganisms. Large-scale gene disruption strategies developed in budding yeast are not applicable to these organisms because of their larger genomes and lower rate of targeted integration (TI) during transformation. We developed transposon-arrayed gene knockouts (TAGKO) to discover genes and simultaneously create gene disruption cassettes for subsequent transformation and mutant analysis. Transposons carrying a bacterial and fungal drug resistance marker are used to mutagenize individual cosmids or entire libraries in vitro. Cosmids are annotated by DNA sequence analysis at the transposon insertion sites, and cosmid inserts are liberated to direct insertional mutagenesis events in the genome. Based on saturation analysis of a cosmid insert and insertions in a fungal cosmid library, we show that TAGKO can be used to rapidly identify and mutate genes. We further show that insertions can create alterations in gene expression, and we have used this approach to investigate an amino acid oxidation pathway in two important fungal phytopathogens.

Relevance: 60.00%

Abstract:

Dystrobrevin is a component of the dystrophin-associated protein complex and has been shown to interact directly with dystrophin, α1-syntrophin, and the sarcoglycan complex. The precise role of α-dystrobrevin in skeletal muscle has not yet been determined. To study α-dystrobrevin's function in skeletal muscle, we used the yeast two-hybrid approach to look for interacting proteins. Three overlapping clones were identified that encoded an intermediate filament protein we subsequently named desmuslin (DMN). Sequence analysis revealed that DMN has a short N-terminal domain, a conserved rod domain, and a long C-terminal domain, all common features of type 6 intermediate filament proteins. A positive interaction between DMN and α-dystrobrevin was confirmed with an in vitro coimmunoprecipitation assay. By Northern blot analysis, we find that DMN is expressed mainly in heart and skeletal muscle, although there is some expression in brain. Western blotting detected a 160-kDa protein in heart and skeletal muscle. Immunofluorescent microscopy localizes DMN in a stripe-like pattern in longitudinal sections and in a mosaic pattern in cross sections of skeletal muscle. Electron microscopic analysis shows DMN colocalized with desmin at the Z-lines. Subsequent coimmunoprecipitation experiments confirmed an interaction with desmin. Our findings suggest that DMN may serve as a direct linkage between the extracellular matrix and the Z-discs (through plectin) and may play an important role in maintaining muscle cell integrity.

Relevance: 60.00%

Abstract:

Genetic mapping of wheat, maize, rice, and other grass species with common DNA probes has revealed remarkable conservation of gene content and gene order over the 60 million years of radiation of Poaceae. The linear organization of genes in some nine different genomes, differing in basic chromosome number from 5 to 12 and nuclear DNA amount from 400 to 6,000 Mb, can be described in terms of only 25 “rice linkage blocks.” The extent to which this intergenomic colinearity is confounded at the micro level by gene duplication and micro-rearrangements is still an open question. Nevertheless, it is clear that the elucidation of the organization of the economically important grasses with larger genomes, such as maize (2n = 10, 4,500 Mb DNA), will, to a greater or lesser extent, be predicted from sequence analysis of smaller genomes such as rice, with only 400 Mb, which in turn may be greatly aided by knowledge of the entire sequence of Arabidopsis, which may be available as soon as the turn of the century. Comparative genetics will provide the key to unlock the genomic secrets of crop plants with bigger genomes than Homo sapiens.