929 resultados para protein sequence classification


Relevância:

40.00% 40.00%

Publicador:

Resumo:

Human N-acetyltransferase Type I (NAT1) catalyses the acetylation of many aromatic amine and hydrazine compounds and it has been implicated in the catabolism of folic acid. The enzyme is widely expressed in the body, although there are considerable differences in the level of activity between tissues. A search of the mRNA databases revealed the presence of several NAT1 transcripts in human tissue that appear to be derived from different promoters. Because little is known about NAT1 gene regulation, the present study was undertaken to characterize one of the putative promoter sequences of the NAT1 gene located just upstream of the coding region. We show with reverse-transcriptase PCR that mRNA transcribed from this promoter (Promoter 1) is present in a variety of human cell-lines, but not in quiescent peripheral blood mononuclear cells. Using deletion mutant constructs, we identified a 20 bp sequence located 245 bases upstream of the translation start site which was sufficient for basal NAT1 expression. It comprised an AP-1 (activator protein 1)-binding site, flanked on either side by a TCATT motif. Mutational analysis showed that the AP-1 site and the 3' TCATT sequence were necessary for gene expression, whereas the 5' TCATT appeared to attenuate promoter activity. Electromobility shift assays revealed two specific bands made up by complexes of c-Fos/Fra, c-Jun, YY-1 (Yin and Yang 1) and possibly Oct-1. PMA treatment enhanced expression from the NAT1 promoter via the AP-1-binding site. Furthermore, in peripheral blood mononuclear cells, PMA increased endogenous NAT1 activity and induced mRNA expression from Promoter I, suggesting that it is functional in vivo.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

CysView is a web-based application tool that identifies and classifies proteins according to their disulfide connectivity patterns. It accepts a dataset of annotated protein sequences in various formats and returns a graphical representation of cysteine pairing patterns. CysView displays cysteine patterns for those records in the data with disulfide annotations. It allows the viewing of records grouped by connectivity patterns. CysView's utility as an analysis tool was demonstrated by the rapid and correct classification of scorpion toxin entries from GenPept on the basis of their disulfide pairing patterns. It has proved useful for rapid detection of irrelevant and partial records, or those with incomplete annotations. CysView can be used to support distant homology between proteins. CysView is publicly available at http://research.i2r.a-star.edu.sg/CysView/.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

As a consequence of selective pressure exerted by the immune response during hepatitis C virus (HCV) infection, a high rate of nucleotide mutations in the viral genome is observed which leads to the emergence of viral escape mutants. The aim of this study was to evaluate the evolution of the amino acid (aa) sequence of the HCV nonstructural protein 3 (NS3) in viral isolates after liver transplantation. Six patients with HCV-induced liver disease undergoing liver transplantation (LT) were followed up for sequence analysis. Hepatitis C recurrence was observed in all patients after LT. The rate of synonymous (dS) nucleotide substitutions was much higher than that of nonsynonymous (dN) ones in the NS3 encoding region. The high values of the dS/dN ratios suggest no sustained adaptive evolution selection pressure and, therefore, absence of specific NS3 viral populations. Clinical genotype assignments were supported by phylogenetic analysis. Serial samples from each patient showed lower mean nucleotide genetic distance when compared with samples of the same HCV genotype and subtype. The NS3 samples studied had an N-terminal aa sequence with several differences as compared with reference ones, mainly in genotype 1b-infected patients. After LT, as compared with the sequences before, a few reverted aa substitutions and several established aa substitutions were observed at the N-terminal of NS3. Sites described to be involved in important functions of NS3, notably those of the catalytic triad and zinc binding, remained unaltered in terms of aa sequence. Rare or frequent aa substitutions occurred indiscriminately in different positions. Several cytotoxic T lymphocyte epitopes described for HCV were present in our 1b samples. Nevertheless, the deduced secondary structure of the NS3 protease showed a few alterations in samples from genotype 3a patients, but none were seen in 1b cases. Our data, obtained from patients under important selective pressure during LT, show that the NS3 protease remains well conserved, mainly in HCV 3a patients. It reinforces its potential use as an antigenic candidate for further studies aiming at the development of a protective immune response.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

At the end of 2002 and throughout 2003, there was a severe outbreak of infectious laryngotracheitis (ILT) in an intensive production area of commercial hens in the Sao Paulo State of Brazil. ILT virus was isolated from 28 flocks, and 21 isolates were genotyped by polymerase chain reaction and restriction fragment length polymorphism (PCR-RFLP) using four genes and eight restriction enzymes, and by partial sequencing of the infected cell protein 4 (ICP4) and thymidine kinase (TK) genes. Three groups resulted from the combinations of PCR-RFLP patterns: 19 field isolates formed Group I, and the remaining two isolates together with the chicken embryo origin (CEO) vaccine strains formed Group II. Group III comprised the tissue-culture origin (TCO) vaccine strain by itself. The PCR-RFLP results agreed with the sequencing results of two ICP4 gene fragments. The ICP4 gene sequence analysis showed that the 19 field isolates classified into Group I by RFLP-PCR were identical among themselves, but were different to the TCO and CEO vaccines. The two Group II isolates could not be distinguished from one of the CEO vaccines. The nucleotide and amino acid sequence analyses discriminated between the Brazilian and non-Brazilian isolates, as well as between the TCO and CEO vaccines. Sequence analysis of the TK gene enabled classification of the field isolates (Group I) as virulent and non-vaccine. This work shows that the severe ILT outbreak was caused by a highly virulent, non-vaccine strain.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We have developed a computational strategy to identify the set of soluble proteins secreted into the extracellular environment of a cell. Within the protein sequences predominantly derived from the RIKEN representative transcript and protein set, we identified 2033 unique soluble proteins that are potentially secreted from the cell. These proteins contain a signal peptide required for entry into the secretory pathway and lack any transmembrane domains or intracellular localization signals. This class of proteins, which we have termed the mouse secretome, included >500 novel proteins and 92 proteins

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Cytotoxic T cells (CTL) recognize short peptides that are derived from the proteolysis of endogenous cellular proteins and presented on the cell surface as a complex with MHC class I molecules. CTL can recognize single amino acid substitutions in proteins, including those involved in malignant transformation. The mutated sequence of an oncogene may be presented on the cell surface as a peptide, and thus represents a potential target antigen for tumour therapy. The p21ras gene is mutated in a wide variety of tumours and since the transforming mutations result in amino acid substitutions at positions 12, 13 and 61 of the protein, a limited number of ras peptides could potentially be used in the treatment of a wide variety of malignancies. A common substitution is Val for Gly at position 12 of p21ras. In this study, we show that the peptide sequence from position 5 to position 14 with Val at position 12-ras p5-14 (Val-12)-has a motif which allows it to bind to HLA-A2.1. HLA-A2.1-restricted ras p5-14 (Val-12)-specific CTL were induced in mice transgenic for both HLA-A2.1 and human beta2-microglobulin after in vivo priming with the peptide. The murine CTL could recognize the ras p5-14 (Val-12) peptide when they were presented on both murine and human target cells bearing HLA-A2.1. No cross-reactivity was observed with the native peptide ras p5-14 (Gly-12), and this peptide was not immunogenic in HLA-A2.1 transgenic mice. This represents an interesting model for the study of an HLA-restricted CD8 cytotoxic T cell response to a defined tumour antigen in vivo.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The development of targeted treatment strategies adapted to individual patients requires identification of the different tumor classes according to their biology and prognosis. We focus here on the molecular aspects underlying these differences, in terms of sets of genes that control pathogenesis of the different subtypes of astrocytic glioma. By performing cDNA-array analysis of 53 patient biopsies, comprising low-grade astrocytoma, secondary glioblastoma (respective recurrent high-grade tumors), and newly diagnosed primary glioblastoma, we demonstrate that human gliomas can be differentiated according to their gene expression. We found that low-grade astrocytoma have the most specific and similar expression profiles, whereas primary glioblastoma exhibit much larger variation between tumors. Secondary glioblastoma display features of both other groups. We identified several sets of genes with relatively highly correlated expression within groups that: (a). can be associated with specific biological functions; and (b). effectively differentiate tumor class. One prominent gene cluster discriminating primary versus nonprimary glioblastoma comprises mostly genes involved in angiogenesis, including VEGF fms-related tyrosine kinase 1 but also IGFBP2, that has not yet been directly linked to angiogenesis. In situ hybridization demonstrating coexpression of IGFBP2 and VEGF in pseudopalisading cells surrounding tumor necrosis provided further evidence for a possible involvement of IGFBP2 in angiogenesis. The separating groups of genes were found by the unsupervised coupled two-way clustering method, and their classification power was validated by a supervised construction of a nearly perfect glioma classifier.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a ‘coding statistic’ is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data and application is made to the yeast chromosome III sequence and to C.elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: A number of studies have used protein interaction data alone for protein function prediction. Here, we introduce a computational approach for annotation of enzymes, based on the observation that similar protein sequences are more likely to perform the same function if they share similar interacting partners. Results: The method has been tested against the PSI-BLAST program using a set of 3,890 protein sequences from which interaction data was available. For protein sequences that align with at least 40% sequence identity to a known enzyme, the specificity of our method in predicting the first three EC digits increased from 80% to 90% at 80% coverage when compared to PSI-BLAST. Conclusion: Our method can also be used in proteins for which homologous sequences with known interacting partners can be detected. Thus, our method could increase 10% the specificity of genome-wide enzyme predictions based on sequence matching by PSI-BLAST alone.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

During the last 2 years, several novel genes that encode glucose transporter-like proteins have been identified and characterized. Because of their sequence similarity with GLUT1, these genes appear to belong to the family of solute carriers 2A (SLC2A, protein symbol GLUT). Sequence comparisons of all 13 family members allow the definition of characteristic sugar/polyol transporter signatures: (1) the presence of 12 membrane-spanning helices, (2) seven conserved glycine residues in the helices, (3) several basic and acidic residues at the intracellular surface of the proteins, (4) two conserved tryptophan residues, and (5) two conserved tyrosine residues. On the basis of sequence similarities and characteristic elements, the extended GLUT family can be divided into three subfamilies, namely class I (the previously known glucose transporters GLUT1-4), class II (the previously known fructose transporter GLUT5, the GLUT7, GLUT9 and GLUT11), and class III (GLUT6, 8, 10, 12, and the myo-inositol transporter HMIT1). Functional characteristics have been reported for some of the novel GLUTs. Like GLUT1-4, they exhibit a tissue/cell-specific expression (GLUT6, leukocytes, brain; GLUT8, testis, blastocysts, brain, muscle, adipocytes; GLUT9, liver, kidney; GLUT10, liver, pancreas; GLUT11, heart, skeletal muscle). GLUT6 and GLUT8 appear to be regulated by sub-cellular redistribution, because they are targeted to intra-cellular compartments by dileucine motifs in a dynamin dependent manner. Sugar transport has been reported for GLUT6, 8, and 11; HMIT1 has been shown to be a H+/myo-inositol co-transporter. Thus, the members of the extended GLUT family exhibit a surprisingly diverse substrate specificity, and the definition of sequence elements determining this substrate specificity will require a full functional characterization of all members.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Association studies have revealed expression quantitative trait loci (eQTLs) for a large number of genes. However, the causative variants that regulate gene expression levels are generally unknown. We hypothesized that copy-number variation of sequence repeats contribute to the expression variation of some genes. Our laboratory has previously identified that the rare expansion of a repeat c.-174CGGGGCGGGGCG in the promoter region of the CSTB gene causes a silencing of the gene, resulting in progressive myoclonus epilepsy. Here, we genotyped the repeat length and quantified CSTB expression by quantitative real-time polymerase chain reaction in 173 lymphoblastoid cell lines (LCLs) and fibroblast samples from the GenCord collection. The majority of alleles contain either two or three copies of this repeat. Independent analysis revealed that the c.-174CGGGGCGGGGCG repeat length is strongly associated with CSTB expression (P = 3.14 × 10(-11)) in LCLs only. Examination of both genotyped and imputed single-nucleotide polymorphisms (SNPs) within 2 Mb of CSTB revealed that the dodecamer repeat represents the strongest cis-eQTL for CSTB in LCLs. We conclude that the common two or three copy variation is likely the causative cis-eQTL for CSTB expression variation. More broadly, we propose that polymorphic tandem repeats may represent the causative variation of a fraction of cis-eQTLs in the genome.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

RasGAP is a multifunctional protein that controls Ras activity and that is found in chromosomal passenger complexes. It also negatively or positively regulates apoptosis depending on the extent of its cleavage by caspase-3. RasGAP has been reported to bind to G3BP1 (RasGAP SH3-domain-binding protein 1), a protein regulating mRNA stability and stress granule formation. The region of RasGAP (amino acids 317-326) thought to bind to G3BP1 corresponds exactly to the sequence within fragment N2, a caspase-3-generated fragment of RasGAP, that mediates sensitization of tumor cells to genotoxins. While assessing the contribution of G3BP1 in the anti-cancer function of a cell-permeable peptide containing the 317-326 sequence of RasGAP (TAT-RasGAP₃₁₇₋₃₂₆), we found that, in conditions where G3BP1 and RasGAP bind to known partners, no interaction between G3BP1 and RasGAP could be detected. TAT-RasGAP₃₁₇₋₃₂₆ did not modulate binding of G3BP1 to USP10, stress granule formation or c-myc mRNA levels. Finally, TAT-RasGAP₃₁₇₋₃₂₆ was able to sensitize G3BP1 knock-out cells to cisplatin-induced apoptosis. Collectively these results indicate that G3BP1 and its putative RasGAP binding region have no functional influence on each other. Importantly, our data provide arguments against G3BP1 being a genuine RasGAP-binding partner. Hence, G3BP1-mediated signaling may not involve RasGAP.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Lassa virus (LASV) causing hemorrhagic Lassa fever in West Africa, Mopeia virus (MOPV) from East Africa, and lymphocytic choriomeningitis virus (LCMV) are the main representatives of the Old World arenaviruses. Little is known about how the components of the arenavirus replication machinery, i.e., the genome, nucleoprotein (NP), and L protein, interact. In addition, it is unknown whether these components can function across species boundaries. We established minireplicon systems for MOPV and LCMV in analogy to the existing LASV system and exchanged the components among the three systems. The functional and physical integrity of the resulting complexes was tested by reporter gene assay, Northern blotting, and coimmunoprecipitation studies. The minigenomes, NPs, and L proteins of LASV and MOPV could be exchanged without loss of function. LASV and MOPV L protein was also active in conjunction with LCMV NP, while the LCMV L protein required homologous NP for activity. Analysis of LASV/LCMV NP chimeras identified a single LCMV-specific NP residue (Ile-53) and the C terminus of NP (residues 340 to 558) as being essential for LCMV L protein function. The defect of LASV and MOPV NP in supporting transcriptional activity of LCMV L protein was not caused by a defect in physical NP-L protein interaction. In conclusion, components of the replication complex of Old World arenaviruses have the potential to functionally and physically interact across species boundaries. Residue 53 and the C-terminal domain of NP are important for function of L protein during genome replication and transcription.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The Zein-2 component named Zc 1 corresponds to a storage protein of an apparent M.W. of 16 kDa present in maize endosperm.