929 resultados para protein sequence classification
Resumo:
Parkinson disease is mainly characterized by the degeneration of dopaminergic neurons in the central nervous system, including the retina. Different interrelated molecular mechanisms underlying Parkinson disease-associated neuronal death have been put forward in the brain, including oxidative stress and mitochondrial dysfunction. Systemic injection of the proneurotoxin 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) to monkeys elicits the appearance of a parkinsonian syndrome, including morphological and functional impairments in the retina. However, the intracellular events leading to derangement of dopaminergic and other retinal neurons in MPTP-treated animal models have not been so far investigated. Here we have used a comparative proteomics approach to identify proteins differentially expressed in the retina of MPTP-treated monkeys. Proteins were solubilized from the neural retinas of control and MPTP-treated animals, labelled separately with two different cyanine fluorophores and run pairwise on 2D DIGE gels. Out of >700 protein spots resolved and quantified, 36 were found to exhibit statistically significant differences in their expression levels, of at least ±1.4-fold, in the parkinsonian monkey retina compared with controls. Most of these spots were excised from preparative 2D gels, trypsinized and subjected to MALDI-TOF MS and LC-MS/MS analyses. Data obtained were used for protein sequence database interrogation, and 15 different proteins were successfully identified, of which 13 were underexpressed and 2 overexpressed. These proteins were involved in key cellular functional pathways such as glycolysis and mitochondrial electron transport, neuronal protection against stress and survival, and phototransduction processes. These functional categories underscore that alterations in energy metabolism, neuroprotective mechanisms and signal transduction are involved in MPTPinduced neuronal degeneration in the retina, in similarity to mechanisms thought to underlie neuronal death in the Parkinson’s diseased brain and neurodegenerative diseases of the retina proper.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-06
Resumo:
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA)-RNA molecules that are not translated to protein products-has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest-candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
Resumo:
Sulfadoxine is predominantly used in combination with pyrimethamine, commonly known as Fansidar, for the treatment of Plasmodium falciparum. This combination is usually less effective against Plasmodium vivax, probably due to the innate refractoriness of parasites to the sulfadoxine component. To investigate this mechanism of resistance by P. vivax to sulfadoxine, we cloned and sequenced the P. vivax dhps (pvdhps) gene. The protein sequence was determined, and three-dimensional homology models of dihydropteroate synthase (DHPS) from P. vivax as well as P. falciparum were created. The docking of sulfadoxine to the two DHPS models allowed us to compare contact residues in the putative sulfadoxine-binding site in both species. The predicted sulfadoxine-binding sites between the species differ by one residue, V585 in P. vivax, equivalent to A613 in P. falciparum. V585 in P. vivax is predicted by energy minimization to cause a reduction in binding of sulfadoxine to DHPS in P. vivax compared to P. falciparum. Sequencing dhps genes from a limited set of geographically different P. vivax isolates revealed that V585 was present in all of the samples, suggesting that V585 may be responsible for innate resistance of P. vivax to sulfadoxine. Additionally, amino acid mutations were observed in some P. vivax isolates in positions known to cause resistance in P. falciparum, suggesting that, as in P. falciparum, these mutations are responsible for acquired increases in resistance of P. vivax to sulfadoxine.
Resumo:
A key component of the venom of many Australian snakes belonging to the elapid family is a toxin that is structurally and functionally similar to that of the mammalian prothrombinase complex. In mammals, this complex is responsible for the cleavage of prothrombin to thrombin and is composed of factor Xa in association with its cofactors calcium, phospholipids, and factor Va. The snake prothrombin activators have been classified on the basis of their requirement for cofactors for activity. The two major subgroups described in Australian elapid snakes, groups C and D, are differentiated by their requirement for mammalian coagulation factor Va. In this study, we describe the cloning, characterization, and comparative analysis of the factor X- and factor V-like components of the prothrombin activators from the venom glands of snakes possessing either group C or D prothrombin activators. The overall domain arrangement in these proteins was highly conserved between all elapids and with the corresponding mammalian clotting factors. The deduced protein sequence for the factor X-like protease precursor, identified in elapids containing either group C or D prothrombin activators, demonstrated a remarkable degree of relatedness to each other (80%-97%). The factor V-like component of the prothrombin activator, present only in snakes containing group C complexes, also showed a very high degree of homology (96%-98%). Expression of both the factor X- and factor V-like proteins determined by immunoblotting provided an additional means of separating these two groups at the molecular level. The molecular phylogenetic analysis described here represents a new approach for distinguishing group C and D snake prothrombin activators and correlates well with previous classifications.
Resumo:
Orphan nuclear receptors: therapeutic opportunities in skeletal muscle. Am J Physiol Cell Physiol 291: C203-C217, 2006; doi: 10.1152/ajpcell. 00476.2005.-Nuclear hormone receptors (NRs) are ligand-dependent transcription factors that bind DNA and translate physiological signals into gene regulation. The therapeutic utility of NRs is underscored by the diversity of drugs created to manage dysfunctional hormone signaling in the context of reproductive biology, inflammation, dermatology, cancer, and metabolic disease. For example, drugs that target nuclear receptors generate over $10 billion in annual sales. Almost two decades ago, gene products were identified that belonged to the NR superfamily on the basis of DNA and protein sequence identity. However, the endogenous and synthetic small molecules that modulate their action were not known, and they were denoted orphan NRs. Many of the remaining orphan NRs are highly enriched in energy-demanding major mass tissues, including skeletal muscle, brown and white adipose, brain, liver, and kidney. This review focuses on recently adopted and orphan NR function in skeletal muscle, a tissue that accounts for similar to 35% of the total body mass and energy expenditure, and is a major site of fatty acid and glucose utilization. Moreover, this lean tissue is involved in cholesterol efflux and secretes that control energy expenditure and adiposity. Consequently, muscle has a significant role in insulin sensitivity, the blood lipid profile, and energy balance. Accordingly, skeletal muscle plays a considerable role in the progression of dyslipidemia, diabetes, and obesity. These are risk factors for cardiovascular disease, which is the the foremost cause of global mortality (> 16.7 million deaths in 2003). Therefore, it is not surprising that orphan NRs and skeletal muscle are emerging as therapeutic candidates in the battle against dyslipidemia, diabetes, obesity, and cardiovascular disease.
Resumo:
Large-scale gene discovery has been performed for the grass fungal endophytes Neotyphodium coenophialum, Neotyphodium lolii, and Epichloe festucae. The resulting sequences have been annotated by comparison with public DNA and protein sequence databases and using intermediate gene ontology annotation tools. Endophyte sequences have also been analysed for the presence of simple sequence repeat and single nucleotide polymorphism molecular genetic markers. Sequences and annotation are maintained within a MySQL database that may be queried using a custom web interface. Two cDNA-based microarrays have been generated from this genome resource, They permit the interrogation of 3806 Neotyphodium genes (Nchip (TM) rnicroarray), and 4195 Neotyphodium and 920 Epichloe genes (EndoChip (TM) microarray), respectively. These microarrays provide tools for high-throughput transcriptome analysis, including genome-specific gene expression studies, profiling of novel endophyte genes, and investigation of the host grass-symbiont interaction. Comparative transcriptome analysis in Neotyphodium and Epichloe was performed. (c) 2006 Elsevier
Resumo:
Human CD81 (hCD81) protein has been recombinantly produced in the methylotrophic yeast Pichia pastoris. The purified protein, produced at a yield of 1.75 mg/L of culture, was shown to interact with Hepatitis C virus E2 glycoprotein. Immunofluorescent and flow cytometric staining of P. pastoris protoplasts with monoclonal antibodies specific for the second extracellular loop (EC2) of hCD81 confirmed the antigenicity of the recombinant molecule. Full-length hCD81 was solubilized with an array of detergents and subsequently characterized using circular dichroism (CD) and analytical ultracentrifugation. These biophysical techniques confirmed that the protein solution comprises a homogenous species possessing a highly-defined alpha-helical secondary structure. The predicted alpha-helical content of the protein from CD analysis (77.1%) fits remarkably well with what would be expected (75.2%) from knowledge of the protein sequence together with the data from the crystal structure of the second extracellular loop. This study represents the first biophysical characterization of a full-length recombinant tetraspanin, and opens the way for structure-activity analyses of this ubiquitous family of transmembrane proteins.
Resumo:
Cachexia is a wasting phenomenon that often accompanies malignant disease. Its manifestation is associated with shortened survival and reduced responsiveness to anti-tumour therapy and as yet there is no established, effective amelioratory treatment. The MAC 16 model of cancer cachexia has been shown by many studies to closely mirror the human condition. Thus, cachexia is mediated by the presence of a small, slow-growing solid tumour that is mainly resistant to chemotherapy. In addition, the condition is largely attributable to aberrations in metabolic processes, while weight loss due to anorexia is negligible. Cachexia induced by the MAC 16 tumour, has been shown to be mediated by the production of tumour-derived circulatory catabolic factors, and the further elucidation of the structure of these molecules contributes towards the main content of this report. Thus, a factor with in vitro lipid-mobilising activity has been purified from the MAC 16 tumour, and has been found to have similarities to tumour-derived lipolytic factors published to date. Further work demonstrated that this factor was also purifiable from the urine of a patient with pancreatic cancer, and that it was capable of inducing weight loss in non tumour-bearing mice. Sequence analysis of the homogeneous material revealed an identity to Zn-α-2-glycoprotein, the significance of which is discussed. An additional factor, first detected as a result of its specific reactivity with a monoclonal antibody produced by fusion of splenocytes from MAC 16 tumour-bearing mice with mouse BALB/c myeloma cells, was identified as a co-purificant during studies to isolate the lipolytic factor. Subsequent purification of this material to homogeneity resulted in the determination of 18 of the N-terminal amino acids and revealed the highly glycosylated nature of its structure. Thus, this material (P24) was found to have an apparent molecular mass of 24kD of which 2kD was due to protein, while the remainder (92%) was due to the presence of carbohydrate groups. Sequence analysis of the protein core of P24 revealed an identity with Streptococcal pre-absorbing antigen (PA-Ag) in 11 of the amino acids, and the significance of this is discussed. P24 was shown to induce muscle protein breakdown in vitro and to induce cachexia in vivo, as measured by the depletion of fat (29%) and muscle (14%) tissue in the absence of a reduction of food and water intake. Further studies revealed that the same material was purifiable from the urine of patients with pancreatic cancer and was found to be detectable in the urine of cancer patients with weight loss greater than l.Skg/month. Thus, cachexia induced by the MAC 16 tumour in mice and by malignant disease in humans may be induced by similar mediators. Attempts to isolate the gene for P24 using information provided by the N-terminal protein sequence were unsuccessful. This was probably due to the low abundance o[ the material, as determined by protein purification studies; and the nature of the amino acids of the N-terminal sequence, which conferred a high degree o[ degeneracy to the oligonucleotides designed for the polymerase chain reaction.
Resumo:
In this work we propose the hypothesis that replacing the current system of representing the chemical entities known as amino acids using Latin letters with one of several possible alternative symbolic representations will bring significant benefits to the human construction, modification, and analysis of multiple protein sequence alignments. We propose ways in which this might be done without prescribing the choice of actual scripts used. Specifically we propose and explore three ways to encode amino acid texts using novel symbolic alphabets free from precedents. Primary orthographic encoding is the direct substitution of a new alphabet for the standard, Latin-based amino acid code. Secondary encoding imposes static residue groupings onto the orthography of the alphabet by manipulating the shape and/or orientation of amino acid symbols. Tertiary encoding renders each residue as a composite symbol; each such symbol thus representing several alternative amino acid groupings simultaneously. We also propose that the use of a new group-focussed alphabet will free the colouring of amino acid residues often used as a tool to facilitate the representation or construction of multiple alignments for other purposes, possibly to indicate dynamic properties of an alignment such as position-wise residue conservation.
Resumo:
Background - The main processing pathway for MHC class I ligands involves degradation of proteins by the proteasome, followed by transport of products by the transporter associated with antigen processing (TAP) to the endoplasmic reticulum (ER), where peptides are bound by MHC class I molecules, and then presented on the cell surface by MHCs. The whole process is modeled here using an integrated approach, which we call EpiJen. EpiJen is based on quantitative matrices, derived by the additive method, and applied successively to select epitopes. EpiJen is available free online. Results - To identify epitopes, a source protein is passed through four steps: proteasome cleavage, TAP transport, MHC binding and epitope selection. At each stage, different proportions of non-epitopes are eliminated. The final set of peptides represents no more than 5% of the whole protein sequence and will contain 85% of the true epitopes, as indicated by external validation. Compared to other integrated methods (NetCTL, WAPP and SMM), EpiJen performs best, predicting 61 of the 99 HIV epitopes used in this study. Conclusion - EpiJen is a reliable multi-step algorithm for T cell epitope prediction, which belongs to the next generation of in silico T cell epitope identification methods. These methods aim to reduce subsequent experimental work by improving the success rate of epitope prediction.
Resumo:
Background: DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. Identification of DNA-binding proteins is one of the major challenges in the field of genome annotation. There have been several computational methods proposed in the literature to deal with the DNA-binding protein identification. However, most of them can't provide an invaluable knowledge base for our understanding of DNA-protein interactions. Results: We firstly presented a new protein sequence encoding method called PSSM Distance Transformation, and then constructed a DNA-binding protein identification method (SVM-PSSM-DT) by combining PSSM Distance Transformation with support vector machine (SVM). First, the PSSM profiles are generated by using the PSI-BLAST program to search the non-redundant (NR) database. Next, the PSSM profiles are transformed into uniform numeric representations appropriately by distance transformation scheme. Lastly, the resulting uniform numeric representations are inputted into a SVM classifier for prediction. Thus whether a sequence can bind to DNA or not can be determined. In benchmark test on 525 DNA-binding and 550 non DNA-binding proteins using jackknife validation, the present model achieved an ACC of 79.96%, MCC of 0.622 and AUC of 86.50%. This performance is considerably better than most of the existing state-of-the-art predictive methods. When tested on a recently constructed independent dataset PDB186, SVM-PSSM-DT also achieved the best performance with ACC of 80.00%, MCC of 0.647 and AUC of 87.40%, and outperformed some existing state-of-the-art methods. Conclusions: The experiment results demonstrate that PSSM Distance Transformation is an available protein sequence encoding method and SVM-PSSM-DT is a useful tool for identifying the DNA-binding proteins. A user-friendly web-server of SVM-PSSM-DT was constructed, which is freely accessible to the public at the web-site on http://bioinformatics.hitsz.edu.cn/PSSM-DT/.
Resumo:
The presenilins are the catalytic component of the gamma-secretase protease complex, involved in the regulated intramembrane proteolysis of numerous type-1 transmembrane proteins, including Amyloid precursor protein (APP) and Notch. In addition to their role in the γ-secretase complex the presenilins are involved in a number of γ-secretase independent functions such as calcium homeostasis, apoptosis, inflammation and protein trafficking. Presenilin function is known to be regulated through posttranslational modifications like endoproteolysis, phosphorylation and ubiquitination. Using a bioinformatics and protein sequence analysis approach this lab has identified a putative ubiquitin binding CUE domain in the presenilins. The aim of this project was to characterise the function of the presenilin CUE domains. Firstly, the presenilins are shown to contain a functional ubiquitin-binding CUE domain that preferentially binds to K63-linked polyubiquitin chains. The PS1 CUE domain is shown to be dispensable for PS1 endoproteolysis and γ-secretase mediated cleavage of APP, Notch and IL-1R1. This suggests the PS1 CUE domain is involved in a γ-secretase independent PS1 function. Our hypothesis is that the PS1 CUE domain is involved in regulating PS1’s intermolecular protein-protein interactions or intramolecular PS1:PS1 interactions. Here the PS1 CUE domain is shown to be dispensable for the interaction of PS1 and the K63-linked polyubiquitinated PS1 interacting proteins P75NTR, IL-1R1, TRAF6, TRAF2 and RIP1. To further investigate PS1 CUE domain function a mass spectrometry proteomics based approach is used to identify PS1 CUE domain interacting proteins. This proteomics approach demonstrated that the PS1 CUE domain is not required for PS1 dimerization. Instead a number of proteins thatinteract with the PS1 CUE domain are identified as well as proteins whose interaction with PS1 is downregulated by the presence of the PS1 CUE domain. Bioinformatic analysis of these proteins suggests possible roles for the PS1 CUE domain in regulating cell signalling, ubiquitination or cellular trafficking.
Resumo:
Background: The Nme gene family is involved in multiple physiological and pathological processes such as cellular differentiation, development, metastatic dissemination, and cilia functions. Despite the known importance of Nme genes and their use as clinical markers of tumor aggressiveness, the associated cellular mechanisms remain poorly understood. Over the last 20 years, several non-vertebrate model species have been used to investigate Nme functions. However, the evolutionary history of the family remains poorly understood outside the vertebrate lineage. The aim of the study was thus to elucidate the evolutionary history of the Nme gene family in Metazoans. Methodology/Principal Findings: Using a total of 21 eukaryote species including 14 metazoans, the evolutionary history of Nme genes was reconstructed in the metazoan lineage. We demonstrated that the complexity of the Nme gene family, initially thought to be restricted to chordates, was also shared by the metazoan ancestor. We also provide evidence suggesting that the complexity of the family is mainly a eukaryotic innovation, with the exception of Nme8 that is likely to be a choanoflagellate/metazoan innovation. Highly conserved gene structure, genomic linkage, and protein domains were identified among metazoans, some features being also conserved in eukaryotes. When considering the entire Nme family, the starlet sea anemone is the studied metazoan species exhibiting the most conserved gene and protein sequence features with humans. In addition, we were able to show that most of the proteins known to interact with human NME proteins were also found in starlet sea anemone. Conclusion/Significance: Together, our observations further support the association of Nme genes with key cellular functions that have been conserved throughout metazoan evolution. Future investigations of evolutionarily conserved Nme gene functions using the starlet sea anemone could shed new light on a wide variety of key developmental and cellular processes.
Resumo:
Circulating low density lipoproteins (LDL) are thought to play a crucial role in the onset and development of atherosclerosis, though the detailed molecular mechanisms responsible for their biological effects remain controversial. The complexity of biomolecules (lipids, glycans and protein) and structural features (isoforms and chemical modifications) found in LDL particles hampers the complete understanding of the mechanism underlying its atherogenicity. For this reason the screening of LDL for features discriminative of a particular pathology in search of biomarkers is of high importance. Three major biomolecule classes (lipids, protein and glycans) in LDL particles were screened using mass spectrometry coupled to liquid chromatography. Dual-polarity screening resulted in good lipidome coverage, identifying over 300 lipid species from 12 lipid sub-classes. Multivariate analysis was used to investigate potential discriminators in the individual lipid sub-classes for different study groups (age, gender, pathology). Additionally, the high protein sequence coverage of ApoB-100 routinely achieved (≥70%) assisted in the search for protein modifications correlating to aging and pathology. The large size and complexity of the datasets required the use of chemometric methods (Partial Least Square-Discriminant Analysis, PLS-DA) for their analysis and for the identification of ions that discriminate between study groups. The peptide profile from enzymatically digested ApoB-100 can be correlated with the high structural complexity of lipids associated with ApoB-100 using exploratory data analysis. In addition, using targeted scanning modes, glycosylation sites within neutral and acidic sugar residues in ApoB-100 are also being explored. Together or individually, knowledge of the profiles and modifications of the major biomolecules in LDL particles will contribute towards an in-depth understanding, will help to map the structural features that contribute to the atherogenicity of LDL, and may allow identification of reliable, pathology-specific biomarkers. This research was supported by a Marie Curie Intra-European Fellowship within the 7th European Community Framework Program (IEF 255076). Work of A. Rudnitskaya was supported by Portuguese Science and Technology Foundation, through the European Social Fund (ESF) and "Programa Operacional Potencial Humano - POPH".