929 results for protein sequence classification


Relevance:

90.00%

Publisher:

Abstract:

There is increasing evidence that heterotrimeric G-proteins (G-proteins) are involved in many plant processes including phytohormone response, pathogen defence and stomatal control. In animal systems, each of the three G-protein subunits belongs to a large multigene family; however, few subunits have been isolated from plants. Here we report the cloning of a second plant G-protein γ-subunit (AGG2) from Arabidopsis thaliana. The predicted AGG2 protein sequence shows 48% identity to the first identified Arabidopsis Gγ-subunit, AGG1. Furthermore, AGG2 contains all of the conserved characteristics of γ-subunits, including a small size (100 amino acids, 11.1 kDa), a C-terminal CAAX box and an N-terminal α-helix region capable of forming a coiled-coil interaction with the β-subunit. A strong interaction between AGG2 and both the tobacco (TGB1) and Arabidopsis (AGB1) β-subunits was observed in vivo using the yeast two-hybrid system. The strong association between AGG2 and AGB1 was confirmed in vitro. Southern and Northern analyses showed that AGG2 is a single copy gene in Arabidopsis producing two transcripts that are present in all tissues tested. The isolation of a second γ-subunit from A. thaliana indicates that plant G-proteins, like their mammalian counterparts, may form different heterotrimer combinations that presumably regulate multiple signal transduction pathways.

Relevance:

90.00%

Publisher:

Abstract:

The 101 residue protein early pregnancy factor (EPF), also known as human chaperonin 10, was synthesized from four functionalized, but unprotected, peptide segments by a sequential thioether ligation strategy. The approach exploits the differential reactivity of a peptide-NHCH2CH2SH thiolate with XCH2CO-peptides, where X = Cl or I/Br. Initial model studies with short functionalized (but unprotected) peptides showed a significantly faster reaction of a peptide-NHCH2CH2SH thiolate with a BrCH2CO-peptide than with a ClCH2CO-peptide, where thiolate displacement of the halide leads to chemoselective formation of a thioether surrogate for the Gly-Gly peptide bond. This rate difference was used as the basis of a novel sequential ligation approach to the synthesis of large polypeptide chains. Thus, ligation of a model bifunctional N-alpha-chloroacetyl, C-terminal thiolated peptide with a second N-alpha-bromoacetyl peptide demonstrated chemoselective bromide displacement by the thiol group. Further investigations showed that the relatively unreactive N-alpha-chloroacetyl peptides could be activated by halide exchange using saturated KI solutions to yield the highly reactive N-alpha-iodoacetyl peptides. These findings were used to formulate a sequential thioether ligation strategy for the synthesis of EPF, a 101 amino acid protein containing three Gly-Gly sites approximately equidistantly spaced within the peptide chain. Four peptide segments or cassettes comprising the EPF protein sequence (BrAc-[EPF 78-101] 12, ClAc-[EPF 58-75]-[NHCH2CH2SH] 13, ClAc-[EPF 30-55]-[NHCH2CH2SH] 14, and Ac-[EPF 1-27]-[NHCH2CH2SH] 15) were synthesized in high yield and purity using Boc SPPS chemistry. In the stepwise sequential ligation strategy, reaction of peptides 12 and 13 was followed by conversion of the N-terminal chloroacetyl functional group to an iodoacetyl, thus activating the product peptide for further ligation with peptide 14. The process of ligation followed by iodoacetyl activation was repeated to yield an analogue of EPF (EPF psi(CH2S)(28-29,56-57,76-77)) 19 in 19% overall yield.

Relevance:

90.00%

Publisher:

Abstract:

A general overview of the protein sequence set for the mouse transcriptome produced during the FANTOM2 sequencing project is presented here. We applied different algorithms to characterize protein sequences derived from a nonredundant representative protein set (RPS) and a variant protein set (VPS) of the mouse transcriptome. The functional characterization and assignment of Gene Ontology terms was done by analysis of the proteome using InterPro. The Superfamily database analyses gave a detailed structural classification according to SCOP and provided additional evidence for the functional characterization of the proteome data. The MDS database analysis revealed new domains which are not present in existing protein domain databases. Thus the transcriptome gives us a unique source of data for the detection of new functional groups. The data obtained for the RPS and VPS sets facilitated the comparison of different patterns of protein expression. A comparison with other existing mouse and human protein sequence sets (e.g., the International Protein Index) demonstrates the common patterns in mammalian proteomes. The analysis of the membrane organization within the transcriptome of multiple eukaryotes provides valuable statistics about the distribution of secretory and transmembrane proteins.

Relevance:

90.00%

Publisher:

Abstract:

Work presented within the scope of the European Master in Computational Logics, as a partial requirement for obtaining the degree of Master in Computational Logics.

Relevance:

90.00%

Publisher:

Abstract:

Duchenne muscular dystrophy is an X-linked genetic disease caused by the absence of functional dystrophin. Pharmacological upregulation of utrophin, the autosomal homologue of dystrophin, offers a potential therapeutic approach to treat Duchenne patients. Full-length utrophin mRNA is transcribed from two alternative promoters, called A and B. In contrast to the utrophin promoter A, little is known about the factors regulating the activity of the utrophin promoter B. Computer analysis of this second promoter revealed the presence of several conserved binding motifs for Ets-transcription factors. Using electrotransfer of cDNA into mouse muscles, we demonstrate that a genetically modified beta-subunit of the Ets-transcription factor GA-binding protein potently activates a utrophin promoter B reporter construct in innervated muscle fibers in vivo. These results make the GA-binding protein and the signaling cascade regulating its activity in muscle cells potential targets for the pharmacological modulation of utrophin expression in Duchenne patients.

Relevance:

90.00%

Publisher:

Abstract:

High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits.isb-sib.ch).

Relevance:

90.00%

Publisher:

Abstract:

The MyHits web server (http://myhits.isb-sib.ch) is a new integrated service dedicated to the annotation of protein sequences and to the analysis of their domains and signatures. Guest users can use the system anonymously, with full access to (i) standard bioinformatics programs (e.g. PSI-BLAST, ClustalW, T-Coffee, Jalview); (ii) a large number of protein sequence databases, including standard (Swiss-Prot, TrEMBL) and locally developed databases (splice variants); (iii) databases of protein motifs (Prosite, Interpro); (iv) a precomputed list of matches ('hits') between the sequence and motif databases. All databases are updated on a weekly basis and the hit list is kept up to date incrementally. The MyHits server also includes a new collection of tools to generate graphical representations of pairwise and multiple sequence alignments including their annotated features. Free registration enables users to upload their own sequences and motifs to private databases. These are then made available through the same web interface and the same set of analytical tools. Registered users can manage their own sequences and annotations using only web tools and freeze their data in their private database for publication purposes.

Relevance:

90.00%

Publisher:

Abstract:

The primary mission of UniProt is to support biological research by maintaining a stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces freely accessible to the scientific community. UniProt is produced by the UniProt Consortium which consists of groups from the European Bioinformatics Institute (EBI), the Swiss Institute of Bioinformatics (SIB) and the Protein Information Resource (PIR). UniProt is comprised of four major components, each optimized for different uses: the UniProt Archive, the UniProt Knowledgebase, the UniProt Reference Clusters and the UniProt Metagenomic and Environmental Sequence Database. UniProt is updated and distributed every 3 weeks and can be accessed online for searches or download at http://www.uniprot.org.

Relevance:

90.00%

Publisher:

Abstract:

Early immunological data, obtained by immunodiffusion and immunoelectrophoresis, on the whole-cell antigenicity of kinetoplastid protozoa were retrieved and used to construct a dendrogram of antigenic distances. Remarkably, they supported the same taxonomic conclusions as analyses based on DNA and protein sequence data.

Relevance:

90.00%

Publisher:

Abstract:

The primary mission of Universal Protein Resource (UniProt) is to support biological research by maintaining a stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces freely accessible to the scientific community. UniProt is produced by the UniProt Consortium which consists of groups from the European Bioinformatics Institute (EBI), the Swiss Institute of Bioinformatics (SIB) and the Protein Information Resource (PIR). UniProt is comprised of four major components, each optimized for different uses: the UniProt Archive, the UniProt Knowledgebase, the UniProt Reference Clusters and the UniProt Metagenomic and Environmental Sequence Database. UniProt is updated and distributed every 4 weeks and can be accessed online for searches or download at http://www.uniprot.org.

Relevance:

90.00%

Publisher:

Abstract:

Background: Single nucleotide polymorphisms (SNPs) are the most frequent type of sequence variation between individuals, and represent a promising tool for finding genetic determinants of complex diseases and understanding the differences in drug response. In this regard, it is of particular interest to study the effect of non-synonymous SNPs in the context of biological networks such as cell signalling pathways. UniProt provides curated information about the functional and phenotypic effects of sequence variation, including SNPs, as well as on mutations of protein sequences. However, no strategy has been developed to integrate this information with biological networks, with the ultimate goal of studying the impact of the functional effect of SNPs in the structure and dynamics of biological networks. Results: First, we identified the different challenges posed by the integration of the phenotypic effect of sequence variants and mutations with biological networks. Second, we developed a strategy for the combination of data extracted from public resources, such as UniProt, NCBI dbSNP, Reactome and BioModels. We generated attribute files containing phenotypic and genotypic annotations to the nodes of biological networks, which can be imported into network visualization tools such as Cytoscape. These resources allow the mapping and visualization of mutations and natural variations of human proteins and their phenotypic effect on biological networks (e.g. signalling pathways, protein-protein interaction networks, dynamic models). Finally, an example on the use of the sequence variation data in the dynamics of a network model is presented. Conclusion: In this paper we present a general strategy for the integration of pathway and sequence variation data for visualization, analysis and modelling purposes, including the study of the functional impact of protein sequence variations on the dynamics of signalling pathways. 
This is of particular interest when the SNP or mutation is known to be associated with disease. We expect that this approach will help in the study of the functional impact of disease-associated SNPs on the behaviour of cell signalling pathways, which ultimately will lead to a better understanding of the mechanisms underlying complex diseases.

Relevance:

90.00%

Publisher:

Abstract:

Thyroid hormones are involved in the regulation of growth and metabolism in all vertebrates. Transthyretin is one of the extracellular proteins with high affinity for thyroid hormones which determine the partitioning of these hormones between extracellular compartments and intracellular lipids. During vertebrate evolution, both the tissue pattern of expression and the structure of the gene for transthyretin underwent characteristic changes. The purpose of this study was to characterize the position of Insectivora in the evolution of transthyretin in eutherians, a subclass of Mammalia. Transthyretin was identified by thyroxine binding and Western analysis in the blood of adult shrews, hedgehogs, and moles. Transthyretin is synthesized in the liver and secreted into the bloodstream, similar to the situation for other adult eutherians, birds, and diprotodont marsupials, but different from that for adult fish, amphibians, reptiles, monotremes, and Australian polyprotodont marsupials. For the characterization of the structure of the gene and the processing of mRNA for transthyretin, cDNA libraries were prepared from RNA from hedgehog and shrew livers, and full-length cDNA clones were isolated and sequenced. Sections of genomic DNA in the regions coding for the splice sites between exons 1 and 2 were synthesized by polymerase chain reaction and sequenced. The location of splicing was deduced from comparison of genomic with cDNA nucleotide sequences. Changes in the nucleotide sequence of the transthyretin gene during evolution are most pronounced in the region coding for the N-terminal region of the protein. Both the derived overall amino acid sequences and the N-terminal regions of the transthyretins in Insectivora were found to be very similar to those in other eutherians but differed from those found in marsupials, birds, reptiles, amphibians, and fish. 
Also, the pattern of transthyretin precursor mRNA splicing in Insectivora was more similar to that in other eutherians than to that in marsupials, reptiles, and birds. Thus, in contrast to the marsupials, with a different pattern of transthyretin gene expression in the evolutionarily "older" polyprotodonts compared with the evolutionarily "younger" diprotodonts, no separate lineages of transthyretin evolution could be identified in eutherians. We conclude that transthyretin gene expression in the liver of adult eutherians probably appeared before the branching of the lineages leading to modern eutherian species.

Relevance:

90.00%

Publisher:

Abstract:

The increase of publicly available sequencing data has allowed for rapid progress in our understanding of genome composition. As new information becomes available we should constantly be updating and reanalyzing existing and newly acquired data. In this report we focus on transposable elements (TEs) which make up a significant portion of nearly all sequenced genomes. Our ability to accurately identify and classify these sequences is critical to understanding their impact on host genomes. At the same time, as we demonstrate in this report, problems with existing classification schemes have led to significant misunderstandings of the evolution of both TE sequences and their host genomes. In a pioneering publication Finnegan (1989) proposed classifying all TE sequences into two classes based on transposition mechanisms and structural features: the retrotransposons (class I) and the DNA transposons (class II). We have retraced how ideas regarding TE classification and annotation in both prokaryotic and eukaryotic scientific communities have changed over time. This has led us to observe that: (1) a number of TEs have convergent structural features and/or transposition mechanisms that have led to misleading conclusions regarding their classification, (2) the evolution of TEs is similar to that of viruses by having several unrelated origins, (3) there might be at least 8 classes and 12 orders of TEs including 10 novel orders. In an effort to address these classification issues we propose: (1) the outline of a universal TE classification, (2) a set of methods and classification rules that could be used by all scientific communities involved in the study of TEs, and (3) a 5-year schedule for the establishment of an International Committee for Taxonomy of Transposable Elements (ICTTE).

Relevance:

90.00%

Publisher:

Abstract:

Leafroll is an economically important disease affecting grapevines (Vitis spp.). Nine serologically distinct viruses, Grapevine leafroll-associated virus-1 through 9, are associated with this disease. The present study describes the coat protein gene sequence of four GLRaV-3 isolates occurring in the São Francisco River basin, Northeastern Brazil. The viral RNA was extracted from GLRaV-3 ELISA-positive plants and the complete coat protein gene was amplified by RT-PCR. Sequences were generated automatically and compared to the complete coat protein sequence from North American (NY1) and Chinese (Dawanhong No.2 and SL10) GLRaV-3 isolates. The four studied isolates, named Pet-1 through 4, showed deduced amino acid identities of 98-100% (Pet-1 through 3) and 95% (Pet-4) with North American and Chinese isolates. A total of seventeen amino acid substitutions were detected among the four characterized isolates in comparison to the NY1, Dawanhong No.2 and SL10 sequences. The results indicated the existence of natural variation among GLRaV-3 isolates from grapevines, also demonstrating a lack of correlation between sequence data and geographic origin. This variability should be considered when selecting regions of the viral genome targeted for reliable and consistent virus molecular detection.

Relevance:

90.00%

Publisher:

Abstract:

Lignocellulosic biomass is probably the best alternative resource for biofuel production and is composed mainly of cellulose, hemicelluloses and lignin. Cellulose is the most abundant of the three, and its conversion to glucose is catalyzed by the enzyme cellulase. Cellulases are groups of enzymes that act synergistically on cellulose to produce glucose, and they comprise endoglucanase, cellobiohydrolase and β-glucosidase. β-Glucosidase assumes great importance because it is the rate-limiting enzyme. Endoglucanases (EG) produce nicks in the cellulose polymer, exposing reducing and non-reducing ends; cellobiohydrolases (CBH) act on the reducing or non-reducing ends to liberate cellobiose units; and β-glucosidases (BGL) cleave the cellobiose to liberate glucose, completing the hydrolysis. β-Glucosidases undergo feedback inhibition by their own product, β-glucose, and by cellobiose, which is their substrate. Few filamentous fungi produce glucose-tolerant β-glucosidases, which can overcome this inhibition by tolerating the product concentration up to a particular threshold. The present study targeted a filamentous fungus producing glucose-tolerant β-glucosidase, which was identified by morphological as well as molecular methods. The fungus showed 99% similarity to an Aspergillus unguis strain, which comes under the Aspergillus nidulans group, to which most producers of glucose-tolerant β-glucosidase belong. The culture was designated strain number NII 08123 and was deposited in the NII culture collection at CSIR-NIIST. β-Glucosidase multiplicity is a common occurrence in the fungal world, and in A. unguis it was demonstrated using zymogram analysis. A total of 5 extracellular isoforms were detected in the fungus, and the expression levels of these five isoforms varied with the carbon source available in the medium. 
Three of these 5 isoforms were expressed at higher levels, as identified by the increased fluorescence (due to larger amounts of MUG breakdown by enzyme action), and were speculated to contribute significantly to the total β-glucosidase activity. These isoforms were named BGL1, BGL3 and BGL5. Among the three, BGL5 was demonstrated to be the glucose-tolerant β-glucosidase, and it was a low molecular weight protein. The major fraction was a high molecular weight protein but with lower tolerance to glucose. BGL3 was between the two in both activity and glucose tolerance. The glucose-tolerant β-glucosidase was purified and characterized, and kinetic analysis showed that the glucose inhibition constant (Ki) of the protein is 800 mM, while the Km and Vmax of the enzyme were found to be 4.854 mM and 2.946 mol min-1 mg protein-1, respectively. The optimum temperature was 60 °C and the optimum pH 6.0. The molecular weight of the purified protein was ~10 kDa in both SDS and Native PAGE, indicating that the glucose-tolerant BGL is a monomeric protein. The major β-glucosidase, BGL1, had pH and temperature optima of 5.0 and 60 °C, respectively. The apparent molecular weight of the native protein is 240 kDa. The Vmax and Km were 78.8 mol min-1 mg protein-1 and 0.326 mM, respectively. Degenerate primers were designed for glycosyl hydrolase families 1, 3 and 5, and the BGL genes were amplified from genomic DNA of Aspergillus unguis. Sequence analyses of the amplicons confirmed the presence of all three genes. An amplicon of ~500 bp was sequenced and matched a GH1 BGL from Aspergillus oryzae. Amplicons produced with the GH3 degenerate primers were sequenced, and the sequences matched GH3-family β-glucosidases from Aspergillus nidulans and Aspergillus aculeatus. 
GH5 degenerate primers also gave amplification, and sequencing results indicated the presence of a GH5 family BGL gene in the Aspergillus unguis genomic DNA. From the partial gene sequencing results, specific as well as degenerate primers were designed for TAIL PCR. Sequencing results of the 1.0 kb amplicon matched the Aspergillus nidulans β-glucosidase gene, which belongs to the GH1 family. The sequence mainly covered the N-terminal region of the matching peptide. All three BGL proteins, i.e. BGL1, BGL3 and BGL5, were purified by chromatography and electroelution from Native PAGE gels and were subjected to MALDI-TOF mass spectrometric analysis. The results showed that the BGL1 peptide mass matched β-glucosidase-I of Aspergillus flavus, which is a 92 kDa protein, with 69% protein coverage. The glucose-tolerant β-glucosidase BGL5 mass matched the catalytic C-terminal domain of β-glucosidase-F from Emericella nidulans, but the protein coverage was very low compared to the size of the Emericella nidulans protein. Relative to the size of BGL5 from Aspergillus unguis, the protein sequence coverage is more than 80%. BGL F is a glycosyl hydrolase family 3 protein. The properties of BGL5 seem to be unique, in that it is a GH3 β-glucosidase with a very low molecular weight of ~10 kDa that nevertheless has catalytic activity and glucose tolerance, a combination as yet undescribed in GH β-glucosidases. The occurrence of a fully functional 10 kDa protein with glucose-tolerant BGL activity has tremendous implications both for understanding structure-function relationships and for applications of BGL enzymes. BGL3 showed similarity to BGL1 of Aspergillus aculeatus, which is another GH3 β-glucosidase. It may be noted that though PCR could detect GH1, GH3 and GH5 β-glucosidases in the fungus, the major isoforms BGL1, BGL3 and BGL5 were all GH3 family enzymes. 
This would imply that β-glucosidases belonging to other families may also co-exist in the fungus, and the other minor isoforms detected in zymograms may account for them. In biomass hydrolysis, a BGL enzyme preparation containing the glucose-tolerant BGL (GT-BGL) was supplemented to cellulase, and the performance of the blends was compared with a cocktail in which commercial β-glucosidase was supplemented to the biomass hydrolyzing enzyme preparation. The cocktail supplemented with the A. unguis BGL preparation yielded 555 mg/g sugar in 12 h, compared to the commercial enzyme preparation, which gave only 333 mg/g in the same period, and the maximum sugar yield of 858 mg/g was attained in 36 h by the cocktail containing A. unguis BGL. While the commercial enzyme achieved an almost similar sugar yield in 24 h, there was a rapid drop in sugar concentration after that, probably indicating the conversion of glucose back to di- or oligosaccharides by the transglycosylation activity of the BGL in that preparation. Compared with this, the preparation containing the A. unguis enzyme supported peak yields for a longer duration (up to 48 h), which is important for biomass conversion to other products, since the hydrolysate has to undergo certain unit operations before it goes into the next stage, i.e. fermentation, in any bioprocess for production of either fuels or chemicals. Most importantly, the Aspergillus unguis BGL preparation yields an approximately 1.6-fold increase in sugar release compared to the commercial BGL within a 12 h interval and a 2.25-fold increase in sugar release compared to the control, i.e. cellulase without BGL supplementation. The current study therefore leads to the identification of a potent new isolate producing glucose-tolerant β-glucosidase. 
The organism, identified as Aspergillus unguis, comes under the Aspergillus nidulans group, to which most of the GT-BGL producers belong, and the detailed studies showed that the glucose-tolerant β-glucosidase was a very low molecular weight protein which probably belongs to glycosyl hydrolase family 3. Inhibition kinetic studies determined the Ki, which is the second highest among the nidulans group of Aspergilli. This has prompted us to undertake a detailed study of the mechanism of glucose tolerance. The proteomic analyses clearly indicate the presence of a GH3 catalytic domain in the protein. Since the protein is very small yet still active and glucose tolerant, it is speculated that this could be an entirely new protein or a modification of the existing β-glucosidase with only the catalytic domain present in it. Hydrolysis experiments also qualify this BGL as a suitable candidate for enzyme cocktail development for biomass hydrolysis.
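
The figures quoted in this abstract can be checked with a short calculation. The Python sketch below is illustrative only and is not taken from the thesis. It recomputes the 12 h fold increase from the reported yields (555 and 333 mg/g) and evaluates a Michaelis-Menten rate with glucose inhibition using the reported Km, Vmax and Ki for the glucose-tolerant enzyme. The abstract does not state the inhibition mechanism, so the competitive form used here is an assumption, and the function names are hypothetical. Vmax is kept in the units quoted in the abstract.

```python
# Illustrative sketch; the competitive-inhibition model is an assumption,
# because the abstract reports Ki without naming the inhibition mechanism.

def fold_increase(treated: float, reference: float) -> float:
    """Ratio of two yields given in the same units (here mg sugar per g)."""
    return treated / reference


def rate_competitive(s_mm: float, i_mm: float, vmax: float,
                     km_mm: float, ki_mm: float) -> float:
    """Michaelis-Menten rate with a competitive inhibitor.

    v = Vmax * S / (Km * (1 + I / Ki) + S)
    """
    return vmax * s_mm / (km_mm * (1.0 + i_mm / ki_mm) + s_mm)


# Values quoted in the abstract for the glucose-tolerant enzyme
KM_MM = 4.854    # mM
KI_MM = 800.0    # mM glucose
VMAX = 2.946     # units as quoted in the abstract

# Sugar yields at 12 h: A. unguis BGL cocktail vs commercial BGL cocktail
ratio_12h = fold_increase(555.0, 333.0)

# Control yield implied by the reported 2.25-fold increase over cellulase alone
implied_control = 555.0 / 2.25

# Rate at S = Km, without glucose and with glucose at a concentration equal to Ki
v_no_glucose = rate_competitive(KM_MM, 0.0, VMAX, KM_MM, KI_MM)
v_at_ki = rate_competitive(KM_MM, KI_MM, VMAX, KM_MM, KI_MM)

print(f"12 h fold increase: {ratio_12h:.2f}")
print(f"implied control yield: {implied_control:.1f} mg/g")
print(f"v at S = Km, no glucose: {v_no_glucose:.3f}")
print(f"v at S = Km, glucose = Ki: {v_at_ki:.3f}")
```

Under this assumed model, glucose at a concentration equal to Ki doubles the apparent Km, so the rate at S = Km falls from Vmax/2 to Vmax/3. The yield ratio of about 1.67 is in line with the "approximately 1.6-fold" figure in the abstract.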