968 resultados para Protein Sequence Analysis
Resumo:
The MyHits web site (http://myhits.isb-sib.ch) is an integrated service dedicated to the analysis of protein sequences. Since its first description in 2004, both the user interface and the back end of the server were improved. A number of tools (e.g. MAFFT, Jacop, Dotlet, Jalview, ESTScan) were added or updated to improve the usability of the service. The MySQL schema and its associated API were revamped and the database engine (HitKeeper) was separated from the web interface. This paper summarizes the current status of the server, with an emphasis on the new services.
Resumo:
The sequence profile method (Gribskov M, McLachlan AD, Eisenberg D, 1987, Proc Natl Acad Sci USA 84:4355-4358) is a powerful tool to detect distant relationships between amino acid sequences. A profile is a table of position-specific scores and gap penalties, providing a generalized description of a protein motif, which can be used for sequence alignments and database searches instead of an individual sequence. A sequence profile is derived from a multiple sequence alignment. We have found 2 ways to improve the sensitivity of sequence profiles: (1) Sequence weights: Usage of individual weights for each sequence avoids bias toward closely related sequences. These weights are automatically assigned based on the distance of the sequences using a published procedure (Sibbald PR, Argos P, 1990, J Mol Biol 216:813-818). (2) Amino acid substitution table: In addition to the alignment, the construction of a profile also needs an amino acid substitution table. We have found that in some cases a new table, the BLOSUM45 table (Henikoff S, Henikoff JG, 1992, Proc Natl Acad Sci USA 89:10915-10919), is more sensitive than the original Dayhoff table or the modified Dayhoff table used in the current implementation. Profiles derived by the improved method are more sensitive and selective in a number of cases where previous methods have failed to completely separate true members from false positives.
Resumo:
Signal search analysis is a general method to discover and characterize sequence motifs that are positionally correlated with a functional site (e.g. a transcription or translation start site). The method has played an instrumental role in the analysis of eukaryotic promoter elements. The signal search analysis server provides access to four different computer programs as well as to a large number of precompiled functional site collections. The programs offered allow: (i) the identification of non-random sequence regions under evolutionary constraint; (ii) the detection of consensus sequence-based motifs that are over- or under-represented at a particular distance from a functional site; (iii) the analysis of the positional distribution of a consensus sequence- or weight matrix-based sequence motif around a functional site; and (iv) the optimization of a weight matrix description of a locally over-represented sequence motif. These programs can be accessed at: http://www.isrec.isb-sib.ch/ssa/.
Resumo:
BACKGROUND: The exceptionally diverse species flocks of cichlid fishes in East Africa are prime examples of parallel adaptive radiations. About 80% of East Africa's more than 1 800 endemic cichlid species, and all species of the flocks of Lakes Victoria and Malawi, belong to a particularly rapidly evolving lineage, the haplochromines. One characteristic feature of the haplochromines is their possession of egg-dummies on the males' anal fins. These egg-spots mimic real eggs and play an important role in the mating system of these maternal mouthbrooding fish. RESULTS: Here, we show that the egg-spots of haplochromines are made up of yellow pigment cells, xanthophores, and that a gene coding for a type III receptor tyrosine kinase, colony-stimulating factor 1 receptor a (csf1ra), is expressed in egg-spot tissue. Molecular evolutionary analyses reveal that the extracellular ligand-binding and receptor-interacting domain of csf1ra underwent adaptive sequence evolution in the ancestral lineage of the haplochromines, coinciding with the emergence of egg-dummies. We also find that csf1ra is expressed in the egg-dummies of a distantly related cichlid species, the ectodine cichlid Ophthalmotilapia ventralis, in which markings with similar functions evolved on the pelvic fin in convergence to those of the haplochromines. CONCLUSION: We conclude that modifications of existing signal transduction mechanisms might have evolved in the haplochromine lineage in association with the origination of anal fin egg-dummies. That positive selection has acted during the evolution of a color gene that seems to be involved in the morphogenesis of a sexually selected trait, the egg-dummies, highlights the importance of further investigations of the comparative genomic basis of the phenotypic diversification of cichlid fishes.
Resumo:
Parasites of the Leishmania Viannia subgenus are major causative agents of mucocutaneous leishmaniasis (MCL), a disease characterised by parasite dissemination (metastasis) from the original cutaneous lesion to form debilitating secondary lesions in the nasopharyngeal mucosa. We employed a protein profiling approach to identify potential metastasis factors in laboratory clones of L. (V.) guyanensis with stable phenotypes ranging from highly metastatic (M+) through infrequently metastatic (M+/M-) to non-metastatic (M-). Comparison of the soluble proteomes of promastigotes by two-dimensional electrophoresis revealed two abundant protein spots specifically associated with M+ and M+/M- clones (Met2 and Met3) and two others exclusively expressed in M- parasites (Met1 and Met4). The association between clinical disease phenotype and differential expression of Met1-Met4 was less clear in L. Viannia strains from mucosal (M+) or cutaneous (M-) lesions of patients. Identification of Met1-Met4 by biological mass spectrometry (LC-ES-MS/MS) and bioinformatics revealed that M+ and M- clones express distinct acidic and neutral isoforms of both elongation factor-1 subunit beta (EF-1beta) and cytosolic tryparedoxin peroxidase (TXNPx). This interchange of isoforms may relate to the mechanisms by which the activities of EF-1beta and TXNPx are modulated, and/or differential post-translational modification of the gene product(s). The multiple metabolic functions of EF-1 and TXNPx support the plausibility of their participation in parasite survival and persistence and thereby, metastatic disease. Both polypeptides are active in resistance to chemical and oxidant stress, providing a basis for further elucidation of the importance of antioxidant defence in the pathogenesis underlying MCL.
Resumo:
Transcripts with ESTs derived exclusively or predominantly from testis, and not from other normal tissues, are likely to be products of genes with testis-restricted expression, and are thus potential cancer/testis (CT) antigen genes. A list of 371 genes with such characteristics was compiled by analyzing publicly available EST databases. RT-PCR analysis of normal and tumor tissues was performed to validate an initial selection of 20 of these genes. Several new CT and CT-like genes were identified. One of these, CT46/HORMAD1, is expressed strongly in testis and weakly in placenta; the highest level of expression in other tissues is <1% of testicular expression. The CT46/HORMAD1 gene was expressed in 31% (34/109) of the carcinomas examined, with 11% (12/109) showing expression levels >10% of the testicular level of expression. CT46/HORMAD1 is a single-copy gene on chromosome 1q21.3, encoding a putative protein of 394 aa. Conserved protein domain analysis identified a HORMA domain involved in chromatin binding. The CT46/HORMAD1 protein was found to be homologous to the prototype HORMA domain-containing protein, Hop1, a yeast meiosis-specific protein, as well as to asy1, a meiotic synaptic mutant protein in Arabidopsis thaliana.
Resumo:
Plaque formation in vaccinia virus is inhibited by the compound N1-isonicotinoyl-N2-3-methyl-4-chlorobenzoylhydrazine (IMCBH). We have isolated a mutant virus that forms wild-type plaques in the presence of the drug. Comparison of wild-type and mutant virus showed that both viruses produced similar amounts of infectious intracellular naked virus in the presence of the drug. In contrast to the mutant, no extracellular enveloped virus was obtained from IMCBH-treated cells infected with wild-type virus. Marker rescue experiments were used to map the mutation conferring IMCBH resistance to the mutant virus. The map position coincided with that of the gene encoding the viral envelope antigen of M(r) 37,000. Sequence analysis of both wild-type and mutant genes showed a single nucleotide change (G to T) in the mutant gene. In the deduced amino acid sequence, the mutation changes the codon for an acidic Asp residue in the wild-type gene to one for a polar noncharged Tyr residue in the mutant.
Resumo:
Proteins secreted from adipose tissue are increasingly recognized to play an important role in the regulation of glucose metabolism. However, much less is known about their effect on lipid metabolism. The fasting-induced adipose factor (FIAF/angiopoietin-like protein 4/peroxisome proliferator-activated receptor gamma angiopoietin-related protein) was previously identified as a target of hypolipidemic fibrate drugs and insulin-sensitizing thiazolidinediones. Using transgenic mice that mildly overexpress FIAF in peripheral tissues we show that FIAF is an extremely powerful regulator of lipid metabolism and adiposity. FIAF overexpression caused a 50% reduction in adipose tissue weight, partly by stimulating fatty acid oxidation and uncoupling in fat. In addition, FIAF overexpression increased plasma levels of triglycerides, free fatty acids, glycerol, total cholesterol, and high density lipoprotein (HDL)-cholesterol. Functional tests indicated that FIAF overexpression severely impaired plasma triglyceride clearance but had no effect on very low density lipoprotein production. The effects of FIAF overexpression were amplified by a high fat diet, resulting in markedly elevated plasma and liver triglycerides, plasma free fatty acids, and plasma glycerol levels, and impaired glucose tolerance in FIAF transgenic mice fed a high fat diet. Remarkably, in mice the full-length form of FIAF was physically associated with HDL, whereas truncated FIAF was associated with low density lipoprotein. In human both full-length and truncated FIAF were associated with HDL. The composite data suggest that via physical association with plasma lipoproteins, FIAF acts as a powerful signal from fat and other tissues to prevent fat storage and stimulate fat mobilization. Our data indicate that disturbances in FIAF signaling might be involved in dyslipidemia.
Resumo:
In the vast majority of bottom-up proteomics studies, protein digestion is performed using only mammalian trypsin. Although it is clearly the best enzyme available, the sole use of trypsin rarely leads to complete sequence coverage, even for abundant proteins. It is commonly assumed that this is because many tryptic peptides are either too short or too long to be identified by RPLC-MS/MS. We show through in silico analysis that 20-30% of the total sequence of three proteomes (Schizosaccharomyces pombe, Saccharomyces cerevisiae, and Homo sapiens) is expected to be covered by Large post-Trypsin Peptides (LpTPs) with M(r) above 3000 Da. We then established size exclusion chromatography to fractionate complex yeast tryptic digests into pools of peptides based on size. We found that secondary digestion of LpTPs followed by LC-MS/MS analysis leads to a significant increase in identified proteins and a 32-50% relative increase in average sequence coverage compared to trypsin digestion alone. Application of the developed strategy to analyze the phosphoproteomes of S. pombe and of a human cell line identified a significant fraction of novel phosphosites. Overall our data indicate that specific targeting of LpTPs can complement standard bottom-up workflows to reveal a largely neglected portion of the proteome.
Resumo:
A new type of high avidity binding molecule, termed "peptabody" was created by harnessing the effect of multivalent interaction. A short peptide ligand was fused via a semi-rigid hinge region with the coiled-coil assembly domain of the cartilage oligomeric matrix protein, resulting in a pentameric multivalent binding molecule. In the first peptabody (Pab-S) described here, a peptide (S) specific for the mouse B-cell lymphoma BCL1 surface Ig idiotype, was selected from a phage display library. A fusion gene was constructed encoding peptide S, followed by the 24 aa hinge region from camel IgG and a modified 55 aa cartilage oligomeric matrix protein pentamerization domain. The Pab-S fusion protein was expressed in Escherichia coli in a soluble form at high levels and purified in a single step by metal-affinity chromatography. Pab-S specifically bound the BCL1 surface idiotype with an avidity of about 1 nM, which corresponds to a 2 x 10(5)-fold increase compared with the affinity of the synthetic peptide S itself. Biochemical characterization showed that Pab-S is a stable homopentamer of about 85 kDa, with interchain disulfide bonds. Pab-S can be dissociated under denaturing and reducing conditions and reassociated as a pentamer with full-binding activity. This intrinsic feature provides an easy way to combine Pab molecules with two different peptide specificities, thus producing heteropentamers with bispecific and/or chelating properties.
Resumo:
With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website ( http://www.uniprot.org/ ). It also evokes precautions that are necessary for successful predictions and extrapolations.
Resumo:
Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are "genomic fossils" valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome's structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction ( approximately 80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues.
Resumo:
Thyroid hormones are involved in the regulation of growth and metabolism in all vertebrates. Transthyretin is one of the extracellular proteins with high affinity for thyroid hormones which determine the partitioning of these hormones between extracellular compartments and intracellular lipids. During vertebrate evolution, both the tissue pattern of expression and the structure of the gene for transthyretin underwent characteristic changes. The purpose of this study was to characterize the position of Insectivora in the evolution of transthyretin in eutherians, a subclass of Mammalia. Transthyretin was identified by thyroxine binding and Western analysis in the blood of adult shrews, hedgehogs, and moles. Transthyretin is synthesized in the liver and secreted into the bloodstream, similar to the situation for other adult eutherians, birds, and diprotodont marsupials, but different from that for adult fish, amphibians, reptiles, monotremes, and Australian polyprotodont marsupials. For the characterization of the structure of the gene and the processing of mRNA for transthyretin, cDNA libraries were prepared from RNA from hedgehog and shrew livers, and full-length cDNA clones were isolated and sequenced. Sections of genomic DNA in the regions coding for the splice sites between exons 1 and 2 were synthesized by polymerase chain reaction and sequenced. The location of splicing was deduced from comparison of genomic with cDNA nucleotide sequences. Changes in the nucleotide sequence of the transthyretin gene during evolution are most pronounced in the region coding for the N-terminal region of the protein. Both the derived overall amino sequences and the N-terminal regions of the transthyretins in Insectivora were found to be very similar to those in other eutherians but differed from those found in marsupials, birds, reptiles, amphibians, and fish. Also, the pattern of transthyretin precursor mRNA splicing in Insectivora was more similar to that in other eutherians than to that in marsupials, reptiles, and birds. Thus, in contrast to the marsupials, with a different pattern of transthyretin gene expression in the evolutionarily "older" polyprotodonts compared with the evolutionarily "younger" diprotodonts, no separate lineages of transthyretin evolution could be identified in eutherians. We conclude that transthyretin gene expression in the liver of adult eutherians probably appeared before the branching of the lineages leading to modern eutherian species.
Resumo:
Pseudomonas fluorescens CHA0 and the related strain Pf-5 are well-characterized representatives of rhizosphere bacteria that have the capacity to protect crop plants from fungal root diseases, mainly by releasing a variety of exoproducts that are toxic to plant pathogenic fungi. Here, we report that the two plant-beneficial pseudomonads also exhibit potent insecticidal activity. Anti-insect activity is linked to a novel genomic locus encoding a large protein toxin termed Fit (for P. fluorescensinsecticidal toxin) that is related to the insect toxin Mcf (Makes caterpillars floppy) of the entomopathogen Photorhabdus luminescens, a mutualist of insect-invading nematodes. When injected into the haemocoel, even low doses of P. fluorescens CHA0 or Pf-5 killed larvae of the tobacco hornworm Manduca sexta and the greater wax moth Galleria mellonella. In contrast, mutants of CHA0 or Pf-5 with deletions in the Fit toxin gene were significantly less virulent to the larvae. When expressed from an inducible promoter in a non-toxic Escherichia coli host, the Fit toxin gene was sufficient to render the bacterium toxic to both insect hosts. Our findings establish the Fit gene products of P. fluorescens CHA0 and Pf-5 as potent insect toxins that define previously unappreciated anti-insect properties of these plant-colonizing bacteria
Resumo:
The interleukin-6 cytokines, acting via gp130 receptor pathways, play a pivotal role in the reduction of cardiac injury induced by mechanical stress or ischemia and in promoting subsequent adaptive remodeling of the heart. We have now identified the small proline-rich repeat proteins (SPRR) 1A and 2A as downstream targets of gp130 signaling that are strongly induced in cardiomyocytes responding to biomechanical/ischemic stress. Upregulation of SPRR1A and 2A was markedly reduced in the gp130 cardiomyocyte-restricted knockout mice. In cardiomyocytes, MEK1/2 inhibitors prevented SPRR1A upregulation by gp130 cytokines. Furthermore, binding of NF-IL6 (C/EBPbeta) and c-Jun to the SPRR1A promoter was observed after CT-1 stimulation. Histological analysis revealed that SPRR1A induction after mechanical stress of pressure overload was restricted to myocytes surrounding piecemeal necrotic lesions. A similar expression pattern was found in postinfarcted rat hearts. Both in vitro and in vivo ectopic overexpression of SPRR1A protected cardiomyocytes against ischemic injury. Thus, this study identifies SPRR1A as a novel stress-inducible downstream mediator of gp130 cytokines in cardiomyocytes and documents its cardioprotective effect against ischemic stress.