996 resultados para Storage proteins
Resumo:
Background: The function of a protein can be deciphered with higher accuracy from its structure than from its amino acid sequence. Due to the huge gap in the available protein sequence and structural space, tools that can generate functionally homogeneous clusters using only the sequence information, hold great importance. For this, traditional alignment-based tools work well in most cases and clustering is performed on the basis of sequence similarity. But, in the case of multi-domain proteins, the alignment quality might be poor due to varied lengths of the proteins, domain shuffling or circular permutations. Multi-domain proteins are ubiquitous in nature, hence alignment-free tools, which overcome the shortcomings of alignment-based protein comparison methods, are required. Further, existing tools classify proteins using only domain-level information and hence miss out on the information encoded in the tethered regions or accessory domains. Our method, on the other hand, takes into account the full-length sequence of a protein, consolidating the complete sequence information to understand a given protein better. Results: Our web-server, CLAP (Classification of Proteins), is one such alignment-free software for automatic classification of protein sequences. It utilizes a pattern-matching algorithm that assigns local matching scores (LMS) to residues that are a part of the matched patterns between two sequences being compared. CLAP works on full-length sequences and does not require prior domain definitions. Pilot studies undertaken previously on protein kinases and immunoglobulins have shown that CLAP yields clusters, which have high functional and domain architectural similarity. Moreover, parsing at a statistically determined cut-off resulted in clusters that corroborated with the sub-family level classification of that particular domain family. Conclusions: CLAP is a useful protein-clustering tool, independent of domain assignment, domain order, sequence length and domain diversity. Our method can be used for any set of protein sequences, yielding functionally relevant clusters with high domain architectural homogeneity. The CLAP web server is freely available for academic use at http://nslab.mbu.iisc.ernet.in/clap/.
Resumo:
-helices are amongst the most common secondary structural elements seen in membrane proteins and are packed in the form of helix bundles. These -helices encounter varying external environments (hydrophobic, hydrophilic) that may influence the sequence preferences at their N and C-termini. The role of the external environment in stabilization of the helix termini in membrane proteins is still unknown. Here we analyze -helices in a high-resolution dataset of integral -helical membrane proteins and establish that their sequence and conformational preferences differ from those in globular proteins. We specifically examine these preferences at the N and C-termini in helices initiating/terminating inside the membrane core as well as in linkers connecting these transmembrane helices. We find that the sequence preferences and structural motifs at capping (Ncap and Ccap) and near-helical (N' and C') positions are influenced by a combination of features including the membrane environment and the innate helix initiation and termination property of residues forming structural motifs. We also find that a large number of helix termini which do not form any particular capping motif are stabilized by formation of hydrogen bonds and hydrophobic interactions contributed from the neighboring helices in the membrane protein. We further validate the sequence preferences obtained from our analysis with data from an ultradeep sequencing study that identifies evolutionarily conserved amino acids in the rat neurotensin receptor. The results from our analysis provide insights for the secondary structure prediction, modeling and design of membrane proteins. Proteins 2014; 82:3420-3436. (c) 2014 Wiley Periodicals, Inc.
Resumo:
Terrestrial water storage (TWS) plays a key role in the global water cycle and is highly influenced by climate variability and human activities. In this study, monthly TWS, rainfall and Ganga-Brahmaputra river discharge (GBRD) are analysed over India for the period of 2003-12 using remote sensing satellite data. The spatial pattern of mean TWS shows a decrease over a large and populous region of Northern India comprising the foothills of the Himalayas, the Indo-Gangetic Plains and North East India. Over this region, the mean monthly TWS exhibits a pronounced seasonal cycle and a large interannual variability, highly correlated with rainfall and GBRD variations (r > 0.8) with a lag time of 2 months and 1 month respectively. The time series of monthly TWS shows a consistent and statistically significant decrease of about 1 cm year(-1) over Northern India, which is not associated with changes in rainfall and GBRD. This recent change in TWS suggests a possible impact of rapid industrialization, urbanization and increase in population on land water resources. Our analysis highlights the potential of the Earth-observation satellite data for hydrological applications.
Resumo:
Cis-peptide embedded segments are rare in proteins but often highlight their important role in molecular function when they do occur. The high evolutionary conservation of these segments illustrates this observation almost universally, although no attempt has been made to systematically use this information for the purpose of function annotation. In the present study, we demonstrate how geometric clustering and level-specific Gene Ontology molecular-function terms (also known as annotations) can be used in a statistically significant manner to identify cis-embedded segments in a protein linked to its molecular function. The present study identifies novel cis-peptide fragments, which are subsequently used for fragment-based function annotation. Annotation recall benchmarks interpreted using the receiver-operator characteristic plot returned an area-under-curve >0.9, corroborating the utility of the annotation method. In addition, we identified cis-peptide fragments occurring in conjunction with functionally important trans-peptide fragments, providing additional insights into molecular function. We further illustrate the applicability of our method in function annotation where homology-based annotation transfer is not possible. The findings of the present study add to the repertoire of function annotation approaches and also facilitate engineering, design and allied studies around the cis-peptide neighborhood of proteins.
Resumo:
Streptococcus pneumoniae causes pneumonia, septicemia and meningitis. S. pneumoniae is responsible for significant mortality both in children and in the elderly. In recent years, the whole genome sequencing of various S. pneumoniae strains have increased manifold and there is an urgent need to provide organism specific annotations to the scientific community. This prompted us to develop the Streptococcus pneumoniae Genome Database (SPGDB) to integrate and analyze the completely sequenced and available S. pneumoniae genome sequences. Further, links to several tools are provided to compare the pool of gene and protein sequences, and proteins structure across different strains of S. pneumoniae. SPGDB aids in the analysis of phenotypic variations as well as to perform extensive genomics and evolutionary studies with reference to S. pneumoniae. (C) 2014 Elsevier Inc. All rights reserved.
Resumo:
The cyclic AMP receptor protein (CRP) family of transcription factors consists of global regulators of bacterial gene expression. Here, we identify two paralogous CRPs in the genome of Mycobacterium smegmatis that have 78% identical sequences and characterize them biochemically and functionally. The two proteins (MSMEG_0539 and MSMEG_6189) show differences in cAMP binding affinity, trypsin sensitivity, and binding to a CRP site that we have identified upstream of the msmeg_3781 gene. MSMEG_6189 binds to the CRP site readily in the absence of cAMP, while MSMEG_0539 binds in the presence of cAMP, albeit weakly. msmeg_6189 appears to be an essential gene, while the ?msmeg_0539 strain was readily obtained. Using promoter-reporter constructs, we show that msmeg_3781 is regulated by CRP binding, and its transcription is repressed by MSMEG_6189. Our results are the first to characterize two paralogous and functional CRPs in a single bacterial genome. This gene duplication event has subsequently led to the evolution of two proteins whose biochemical differences translate to differential gene regulation, thus catering to the specific needs of the organism.
Resumo:
Variations in surface water extent and storage are poorly characterized from regional to global scales. In this study, a multi-satellite approach is proposed to estimate the water stored in the floodplains of the Orinoco Basin at a monthly time-scale using remotely-sensed observations of surface water from the Global Inundation Extent Multi-Satellite (GIEMS) and stages from Envisat radar altimetry. Surface water storage variations over 2003-2007 exhibit large interannual variability and a strong seasonal signal, peaking during summer, and associated with the flood pulse. The volume of surface water storage in the Orinoco Basin was highly correlated with the river discharge at Ciudad Bolivar (R = 0.95), the closest station to the mouth where discharge was estimated, although discharge lagged one month behind storage. The correlation remained high (R = 0.73) after removing seasonal effects. Mean annual variations in surface water volume represented similar to 170 km(3), contributing to similar to 45% of the Gravity Recovery and Climate Experiment (GRACE)-derived total water storage variations and representing similar to 13% of the total volume of water that flowed out of the Orinoco Basin to the Atlantic Ocean.
Resumo:
While the tradeoff between the amount of data stored and the repair bandwidth of an (n, k, d) regenerating code has been characterized under functional repair (FR), the case of exact repair (ER) remains unresolved. It is known that there do not exist ER codes which lie on the FR tradeoff at most of the points. The question as to whether one can asymptotically approach the FR tradeoff was settled recently by Tian who showed that in the (4, 3, 3) case, the ER region is bounded away from the FR region. The FR tradeoff serves as a trivial outer bound on the ER tradeoff. In this paper, we extend Tian's results by establishing an improved outer bound on the ER tradeoff which shows that the ER region is bounded away from the FR region, for any (n; k; d). Our approach is analytical and builds upon the framework introduced earlier by Shah et. al. Interestingly, a recently-constructed, layered regenerating code is shown to achieve a point on this outer bound for the (5, 4, 4) case. This represents the first-known instance of an optimal ER code that does not correspond to a point on the FR tradeoff.
Resumo:
Hydrogen storage capacity of Tin-1B (n = 3-7) clusters is studied and compared with that of the pristine Ti-n (n = 3-7), using density functional theory (DFT) based calculations. Among these clusters, Ti3B shows the most significant enhancement in the storage capacity by adsorbing 12 H-2, out of which three are dissociated and the other nine are stored as dihydrogen via Kubas-interaction. The best storage in Ti3B is owed to a large charge transfer from Ti to B along with the largest distance of Ti empty d-states above the Fermi level, which is a distinct feature of this particular cluster. Furthermore, the effect of substrates on the storage capacity of Ti3B was assessed by calculating the number of adsorbed H-2 on Ti-3 cluster anchored onto B atoms in the B-doped graphene, BC3, and BN substrates. Similar to free-standing Ti3B, Ti-3 anchored onto boron atom in BC3, stores nine di-hydrogen via Kubas interaction, at the same time eliminating the total number of non-useful dissociated hydrogen. Gibbs energy of adsorption as a function of H-2 partial pressure, indicated that at 250 K and 300 K the di-hydrogens on Ti-3@BC3 adsorb and desorb at ambient pressures. Importantly, Ti-3@BC3 avoids the clustering, hence meeting the criteria for efficient and reversible hydrogen storage media. Copyright (C) 2014, Hydrogen Energy Publications, LLC. Published by Elsevier Ltd. All rights reserved.
Resumo:
A simple methodology has been developed for the synthesis of functional nanoporous carbon (NPC) materials using a metal-organic framework (IRMOF-3) that can act as a template for external carbon precursor (viz, sucrose) and also a self-sacrificing carbon source. The resultant graphitic NPC samples (abbreviated as NPC-0, NPC-150, NPC-300, NPC-500 and NPC-1000 based on sucrose loading) obtained through loading different amounts of sucrose exhibit tunable textural parameters. Among these, NPC-300 shows very high surface area (BET approximate to 3119 m(2)/g, Langmuir approximate to 4031 m(2)/g) with a large pore volume of 1.93 cm(3)/g. High degree of porosity coupled with polar surface functional groups, make NPC-300 remarkable candidate for the uptake of H-2 (2.54 wt% at 1 bar, and 5.1 wt% at 50 bar, 77 K) and CO2 (64 wt% at 1 bar, 195 K and 16.9 wt% at 30 bar, 298 K). As a working electrode in a supercapacitor cell, NPC-300 shows excellent reversible charge storage thus, demonstrating multifunctional usage of the carbon materials. (C) 2015 Elsevier Inc. All rights reserved.
Resumo:
Storage of water within a river basin is often estimated by analyzing recession flow curves as it cannot be `instantly' estimated with the aid of available technologies. In this study we explicitly deal with the issue of estimation of `drainable' storage, which is equal to the area under the `complete' recession flow curve (i.e. a discharge vs. time curve where discharge continuously decreases till it approaches zero). But a major challenge in this regard is that recession curves are rarely `complete' due to short inter-storm time intervals. Therefore, it is essential to analyze and model recession flows meaningfully. We adopt the wellknown Brutsaert and Nieber analytical method that expresses time derivative of discharge (dQ/dt) as a power law function of Q : -dQ/dt = kQ(alpha). However, the problem with dQ/dt-Q analysis is that it is not suitable for late recession flows. Traditional studies often compute alpha considering early recession flows and assume that its value is constant for the whole recession event. But this approach gives unrealistic results when alpha >= 2, a common case. We address this issue here by using the recently proposed geomorphological recession flow model (GRFM) that exploits the dynamics of active drainage networks. According to the model, alpha is close to 2 for early recession flows and 0 for late recession flows. We then derive a simple expression for drainable storage in terms the power law coefficient k, obtained by considering early recession flows only, and basin area. Using 121 complete recession curves from 27 USGS basins we show that predicted drainable storage matches well with observed drainable storage, indicating that the model can also reliably estimate drainable storage for `incomplete' recession events to address many challenges related to water resources. (C) 2014 Elsevier Ltd. All rights reserved.
Resumo:
In many organisms ``Universal Stress Proteins'' CUSPS) are induced in response to a variety of environmental stresses. Here we report the structures of two USPs, YnaF and YdaA from Salmonella typhimurium determined at 1.8 angstrom and 2.4 angstrom resolutions, respectively. YnaF consists of a single USP domain and forms a tetrameric organization stabilized by interactions mediated through chloride ions. YdaA is a larger protein consisting of two tandem USP domains. Two protomers of YdaA associate to form a structure similar to the YnaF tetramer. YdaA showed ATPase activity and an ATP binding motif G-2X-G-9X-G(S/T/N) was found in its C-terminal domain. The residues corresponding to this motif were not conserved in YnaF although YnaF could bind ATP. However, unlike YdaA, YnaF did not hydrolyse ATP in vitro. Disruption of interactions mediated through chloride ions by selected mutations converted YnaF into an ATPase. Residues that might be important for ATP hydrolysis could be identified by comparing the active sites of native and mutant structures. Only the C-terminal domain of YdaA appears to be involved in ATP hydrolysis. The structurally similar N-terminal domain was found to bind a zinc ion near the segment equivalent to the phosphate binding loop of the C-terminal domain. Mass spectrometric analysis showed that YdaA might bind a ligand of approximate molecular weight 800 daltons. Structural comparisons suggest that the ligand, probably related to an intermediate in lipid A biosynthesis, might bind at a site close to the zinc ion. Therefore, the N-terminal domain of YdaA binds zinc and might play a role in lipid metabolism. Thus, USPs appear to perform several distinct functions such as ATP hydrolysis, altering membrane properties and chloride sensing. (C) 2015 Elsevier Inc. All rights reserved.
Resumo:
Small heat shock proteins (sHSPs) are a family of ATP-independent molecular chaperones which prevent cellular protein aggregation by binding to misfolded proteins. sHSPs form large oligomers that undergo drastic rearrangement/dissociation in order to execute their chaperone activity in protecting substrates from stress. Substrate-binding sites on sHSPs have been predominantly mapped on their intrinsically disordered N-terminal arms. This region is highly variable in sequence and length across species, and has been implicated in both oligomer formation and in mediating chaperone activity. Here, we present our results on the functional and structural characterization of five sHSPs in rice, each differing in their subcellular localisation, viz., cytoplasm, nucleus, chloroplast, mitochondria and peroxisome. We performed activity assays and dynamic light scattering studies to highlight differences in the chaperone activity and quaternary assembly of sHSPs targeted to various organelles. By cloning constructs that differ in the length and sequence of the tag in the N-terminal region, we have probed the sensitivity of sHSP oligomer assembly and chaperone activity to the length and amino acid composition of the N-terminus. In particular, we have shown that the incorporation of an N-terminal tag has significant consequences on sHSP quaternary structure.
Resumo:
Secondary-structure elements (SSEs) play an important role in the folding of proteins. Identification of SSEs in proteins is a common problem in structural biology. A new method, ASSP (Assignment of Secondary Structure in Proteins), using only the path traversed by the C atoms has been developed. The algorithm is based on the premise that the protein structure can be divided into continuous or uniform stretches, which can be defined in terms of helical parameters, and depending on their values the stretches can be classified into different SSEs, namely -helices, 3(10)-helices, -helices, extended -strands and polyproline II (PPII) and other left-handed helices. The methodology was validated using an unbiased clustering of these parameters for a protein data set consisting of 1008 protein chains, which suggested that there are seven well defined clusters associated with different SSEs. Apart from -helices and extended -strands, 3(10)-helices and -helices were also found to occur in substantial numbers. ASSP was able to discriminate non--helical segments from flanking -helices, which were often identified as part of -helices by other algorithms. ASSP can also lead to the identification of novel SSEs. It is believed that ASSP could provide a better understanding of the finer nuances of protein secondary structure and could make an important contribution to the better understanding of comparatively less frequently occurring structural motifs. At the same time, it can contribute to the identification of novel SSEs. A standalone version of the program for the Linux as well as the Windows operating systems is freely downloadable and a web-server version is also available at .
Resumo:
Rapid and high wing-beat frequencies achieved during insect flight are powered by the indirect flight muscles, the largest group of muscles present in the thorax. Any anomaly during the assembly and/or structural impairment of the indirect flight muscles gives rise to a flightless phenotype. Multiple mutagenesis screens in Drosophila melanogaster for defective flight behavior have led to the isolation and characterization of mutations that have been instrumental in the identification of many proteins and residues that are important for muscle assembly, function, and disease. In this article, we present a molecular-genetic characterization of a flightless mutation, flightless-H (fliH), originally designated as heldup-a (hdp-a). We show that fliH is a cis-regulatory mutation of the wings up A (wupA) gene, which codes for the troponin-I protein, one of the troponin complex proteins, involved in regulation of muscle contraction. The mutation leads to reduced levels of troponin-I transcript and protein. In addition to this, there is also coordinated reduction in transcript and protein levels of other structural protein isoforms that are part of the troponin complex. The altered transcript and protein stoichiometry ultimately culminates in unregulated acto-myosin interactions and a hypercontraction muscle phenotype. Our results shed new insights into the importance of maintaining the stoichiometry of structural proteins during muscle assembly for proper function with implications for the identification of mutations and disease phenotypes in other species, including humans.