470 resultados para epicanthic folds
Resumo:
The gene for the maturation protein of the single-stranded RNA coliphage MS2 is preceded by an untranslated leader of 130 nt, which folds into a cloverleaf, i.e., three stem–loop structures enclosed by a long distance interaction (LDI). This LDI prevents translation because its 3′ moiety contains the Shine–Dalgarno sequence of the maturation gene. Previously, several observations suggested that folding of the cloverleaf is kinetically delayed, providing a time window for ribosomes to access the RNA. Here we present direct evidence for this model. In vitro experiments show that ribosome binding to the maturation gene is faster than refolding of the denatured cloverleaf. This folding delay appears related to special properties of the leader sequence. We have replaced the three stem–loop structures by a single five nt loop. This change does not affect the equilibrium structure of the LDI. Nevertheless, in this construct, the folding delay has virtually disappeared, suggesting that now the RNA folds faster than ribosomes can bind. Perturbation of the cloverleaf by an insertion makes the maturation start permanently accessible. A pseudorevertant that evolved from an infectious clone carrying the insertion had overcome this defect. It showed a wild-type folding delay before closing down the maturation gene. This experiment reveals the biological significance of retarded cloverleaf formation.
Resumo:
We examine the occurrence of the ≈300 known protein folds in different groups of organisms. To do this, we characterize a large fraction of the currently known protein sequences (≈140,000) in structural terms, by matching them to known structures via sequence comparison (or by secondary-structure class prediction for those without structural homologues). Overall, we find that an appreciable fraction of the known folds are present in each of the major groups of organisms (e.g., bacteria and eukaryotes share 156 of 275 folds), and most of the common folds are associated with many families of nonhomologous sequences (i.e., >10 sequence families for each common fold). However, different groups of organisms have characteristically distinct distributions of folds. So, for instance, some of the most common folds in vertebrates, such as globins or zinc fingers, are rare or absent in bacteria. Many of these differences in fold usage are biologically reasonable, such as the folds of metabolic enzymes being common in bacteria and those associated with extracellular transport and communication being common in animals. They also have important implications for database-based methods for fold recognition, suggesting that an unknown sequence from a plant is more likely to have a certain fold (e.g., a TIM barrel) than an unknown sequence from an animal.
Resumo:
Histones H3 and H4 have a well defined structural role in the nucleosome and an established role in the regulation of transcription. We have made use of a microinjection strategy using Xenopus embryos to define the minimal structural components of H3 and H4 necessary for nucleosome assembly into metazoan chromosomes in vivo. We find that both the N-terminal tail of H4, including all sites of acetylation, and the C-terminal α-helix of the H4 histone fold domain are dispensable for chromatin assembly. The N-terminal tail and an N-terminal α-helix of H3 are also dispensable for chromatin assembly. However, the remainder of the H3 and H4 histone folds are essential for incorporation of these proteins into chromatin. We suggest that elements of the histone fold domain maintain both nucleosomal integrity and have distinct functions essential for cell viability.
Resumo:
An increasing number of proteins with weak sequence similarity have been found to assume similar three-dimensional fold and often have similar or related biochemical or biophysical functions. We propose a method for detecting the fold similarity between two proteins with low sequence similarity based on their amino acid properties alone. The method, the proximity correlation matrix (PCM) method, is built on the observation that the physical properties of neighboring amino acid residues in sequence at structurally equivalent positions of two proteins of similar fold are often correlated even when amino acid sequences are different. The hydrophobicity is shown to be the most strongly correlated property for all protein fold classes. The PCM method was tested on 420 proteins belonging to 64 different known folds, each having at least three proteins with little sequence similarity. The method was able to detect fold similarities for 40% of the 420 sequences. Compared with sequence comparison and several fold-recognition methods, the method demonstrates good performance in detecting fold similarities among the proteins with low sequence identity. Applied to the complete genome of Methanococcus jannaschii, the method recognized the folds for 22 hypothetical proteins.
Resumo:
Vpu is an 81-residue membrane protein encoded by the HIV-1 genome. NMR experiments show that the protein folds into two distinct domains, a transmembrane hydrophobic helix and a cytoplasmic domain with two in-plane amphipathic α-helices separated by a linker region. Resonances in one-dimensional solid-state NMR spectra of uniformly 15N labeled Vpu are clearly segregated into two bands at chemical shift frequencies associated with NH bonds in a transmembrane α-helix, perpendicular to the membrane surface, and with NH bonds in the cytoplasmic helices parallel to the membrane surface. Solid-state NMR spectra of truncated Vpu2–51 (residues 2–51), which contains the transmembrane α-helix and the first amphipathic helix of the cytoplasmic domain, and of a construct Vpu28–81 (residues 28–81), which contains only the cytoplasmic domain, support this structural model of Vpu in the membrane. Full-length Vpu (residues 2–81) forms discrete ion-conducting channels of heterogeneous conductance in lipid bilayers. The most frequent conductances were 22 ± 3 pS and 12 ± 3 pS in 0.5 M KCl and 29 ± 3 pS and 12 ± 3 pS in 0.5 M NaCl. In agreement with the structural model, truncated Vpu2–51, which has the transmembrane helix, forms discrete channels in lipid bilayers, whereas the cytoplasmic domain Vpu28–81, which lacks the transmembrane helix, does not. This finding shows that the channel activity is associated with the transmembrane helical domain. The pattern of channel activity is characteristic of the self-assembly of conductive oligomers in the membrane and is compatible with the structural and functional findings.
Resumo:
The end of a telomeric DNA sequence isolated from a polytene chromosome of a hypotrichous ciliate folds back and hybridizes with downstream telomeric sequence to form a t loop that is stable in the absence of protein and DNA cross-linking. The single-stranded, telomeric DNA sequence at the end of a macronuclear molecule does not form a t loop but, instead, is complexed with a heterodimeric, telomere-binding protein. Thus, two mechanisms for capping the ends of DNA molecules are used in the same cell.
Resumo:
Transcriptional activation domains share little sequence homology and generally lack folded structures in the absence of their targets, aspects that have rendered activation domains difficult to characterize. Here, a combination of biochemical and nuclear magnetic resonance experiments demonstrates that the activation domain of the tumor suppressor p53 has an FXXΦΦ motif (F, Phe; X, any amino acids; Φ, hydrophobic residues) that folds into an α-helix upon binding to one of its targets, hTAFII31 (a human TFIID TATA box-binding protein-associated factor). MDM2, the cellular attenuator of p53, discriminates the FXXΦΦ motif of p53 from those of NF-κB p65 and VP16 and specifically inhibits p53 activity. Our studies support the notion that the FXXΦΦ sequence is a general α-helical recognition motif for hTAFII31 and provide insights into the mechanistic basis for regulation of p53 function.
Resumo:
The structure of a 29-nucleotide RNA containing the sarcin/ricin loop (SRL) of rat 28 S rRNA has been determined at 2.1 Å resolution. Recognition of the SRL by elongation factors and by the ribotoxins, sarcin and ricin, requires a nearly universal dodecamer sequence that folds into a G-bulged cross-strand A stack and a GAGA tetraloop. The juxtaposition of these two motifs forms a distorted hairpin structure that allows direct recognition of bases in both grooves as well as recognition of nonhelical backbone geometry and two 5′-unstacked purines. Comparisons with other RNA crystal structures establish the cross-strand A stack and the GNRA tetraloop as defined and modular RNA structural elements. The conserved region at the top is connected to the base of the domain by a region presumed to be flexible because of the sparsity of stabilizing contacts. Although the conformation of the SRL RNA previously determined by NMR spectroscopy is similar to the structure determined by x-ray crystallography, significant differences are observed in the “flexible” region and to a lesser extent in the G-bulged cross-strand A stack.
Resumo:
The immunoglobulin (Ig) molecule is composed of two identical heavy chains and two identical light chains (H2L2). Transport of this heteromeric complex is dependent on the correct assembly of the component parts, which is controlled, in part, by the association of incompletely assembled Ig heavy chains with the endoplasmic reticulum (ER) chaperone, BiP. Although other heavy chain-constant domains interact transiently with BiP, in the absence of light chain synthesis, BiP binds stably to the first constant domain (CH1) of the heavy chain, causing it to be retained in the ER. Using a simplified two-domain Ig heavy chain (VH-CH1), we have determined why BiP remains bound to free heavy chains and how light chains facilitate their transport. We found that in the absence of light chain expression, the CH1 domain neither folds nor forms its intradomain disulfide bond and therefore remains a substrate for BiP. In vivo, light chains are required to facilitate both the folding of the CH1 domain and the release of BiP. In contrast, the addition of ATP to isolated BiP–heavy chain complexes in vitro causes the release of BiP and allows the CH1 domain to fold in the absence of light chains. Therefore, light chains are not intrinsically essential for CH1 domain folding, but play a critical role in removing BiP from the CH1 domain, thereby allowing it to fold and Ig assembly to proceed. These data suggest that the assembly of multimeric protein complexes in the ER is not strictly dependent on the proper folding of individual subunits; rather, assembly can drive the complete folding of protein subunits.
Resumo:
The endosperm of a sorghum mutant cultivar, with high in vitro uncooked and cooked protein digestibilities, was examined by transmission electron microscopy and α-, β-, and γ-kafirins (storage proteins) were localized within its protein bodies. Transmission electron microscopy micrographs revealed that these protein bodies had a unique microstructure related to high protein digestibility. They were irregular in shape and had numerous invaginations, often reaching to the central area of the protein body. Protein bodies from normal cultivars, such as P721N studied here, with much lower uncooked and cooked digestibilities are spherical and contain no invaginations. Immunocytochemistry results showed that the relative location of α- and β-kafirins within the protein bodies of the highly digestible genotype were similar to the normal cultivar, P721N. γ-Kafirin, however, was concentrated in dark-staining regions at the base of the folds instead of at the protein body periphery, as is typical of normal cultivars. The resulting easy accessibility of digestive enzymes to α-kafirin, the major storage protein, in addition to the increased surface area of the protein bodies of the highly digestible cultivar appear to account for its high in vitro protein digestibility.
Resumo:
Structural genomics aims to solve a large number of protein structures that represent the protein space. Currently an exhaustive solution for all structures seems prohibitively expensive, so the challenge is to define a relatively small set of proteins with new, currently unknown folds. This paper presents a method that assigns each protein with a probability of having an unsolved fold. The method makes extensive use of protomap, a sequence-based classification, and scop, a structure-based classification. According to protomap, the protein space encodes the relationship among proteins as a graph whose vertices correspond to 13,354 clusters of proteins. A representative fold for a cluster with at least one solved protein is determined after superposition of all scop (release 1.37) folds onto protomap clusters. Distances within the protomap graph are computed from each representative fold to the neighboring folds. The distribution of these distances is used to create a statistical model for distances among those folds that are already known and those that have yet to be discovered. The distribution of distances for solved/unsolved proteins is significantly different. This difference makes it possible to use Bayes' rule to derive a statistical estimate that any protein has a yet undetermined fold. Proteins that score the highest probability to represent a new fold constitute the target list for structural determination. Our predicted probabilities for unsolved proteins correlate very well with the proportion of new folds among recently solved structures (new scop 1.39 records) that are disjoint from our original training set.
Resumo:
Mammalian electron transfer flavoproteins (ETF) are heterodimers containing a single equivalent of flavin adenine dinucleotide (FAD). They function as electron shuttles between primary flavoprotein dehydrogenases involved in mitochondrial fatty acid and amino acid catabolism and the membrane-bound electron transfer flavoprotein ubiquinone oxidoreductase. The structure of human ETF solved to 2.1-Å resolution reveals that the ETF molecule is comprised of three distinct domains: two domains are contributed by the α subunit and the third domain is made up entirely by the β subunit. The N-terminal portion of the α subunit and the majority of the β subunit have identical polypeptide folds, in the absence of any sequence homology. FAD lies in a cleft between the two subunits, with most of the FAD molecule residing in the C-terminal portion of the α subunit. Alignment of all the known sequences for the ETF α subunits together with the putative FixB gene product shows that the residues directly involved in FAD binding are conserved. A hydrogen bond is formed between the N5 of the FAD isoalloxazine ring and the hydroxyl side chain of αT266, suggesting why the pathogenic mutation, αT266M, affects ETF activity in patients with glutaric acidemia type II. Hydrogen bonds between the 4′-hydroxyl of the ribityl chain of FAD and N1 of the isoalloxazine ring, and between αH286 and the C2-carbonyl oxygen of the isoalloxazine ring, may play a role in the stabilization of the anionic semiquinone. With the known structure of medium chain acyl-CoA dehydrogenase, we hypothesize a possible structure for docking the two proteins.
Resumo:
The small all-β protein tendamistat folds and unfolds with two-state kinetics. We determined the volume changes associated with the folding process by performing kinetic and equilibrium measurements at variable pressure between 0.1 and 100 MPa (1 to 1,000 bar). GdmCl-induced equilibrium unfolding transitions reveal that the volume of the native state is increased by 41.4 ± 2.0 cm3/mol relative to the unfolded state. This value is virtually independent of denaturant concentration. The use of a high-pressure stopped-flow instrument enabled us to measure the activation volumes for the refolding (ΔVf0‡) and unfolding reaction (ΔVu0‡) over a broad range of GdmCl concentrations. The volume of the transition state is 60% native-like (ΔVf0‡ = 25.0 ± 1.2 cm3/mol) in the absence of denaturant, indicating partial solvent accessibility of the core residues. The volume of the transition state increases linearly with denaturant concentration and exceeds the volume of the native state above 6 M GdmCl. This result argues for a largely desolvated transition state with packing deficiencies at high denaturant concentrations and shows that the structure of the transition state depends strongly on the experimental conditions.
Resumo:
The structure of the extracellular, three-domain poliovirus receptor (CD155) complexed with poliovirus (serotype 1) has been determined to 22-Å resolution by means of cryo-electron microscopy and three-dimensional image-reconstruction techniques. Density corresponding to the receptor was isolated in a difference electron density map and fitted with known structures, homologous to those of the three individual CD155 Ig-like domains. The fit was confirmed by the location of carbohydrate moieties in the CD155 glycoprotein, the conserved properties of elbow angles in the structures of cell surface molecules with Ig-like folds, and the concordance with prior results of CD155 and poliovirus mutagenesis. CD155 binds in the poliovirus “canyon” and has a footprint similar to that of the intercellular adhesion molecule-1 receptor on human rhinoviruses. However, the orientation of the long, slender CD155 molecule relative to the poliovirus surface is quite different from the orientation of intercellular adhesion molecule-1 on rhinoviruses. In addition, the residues that provide specificity of recognition differ for the two receptors. The principal feature of receptor binding common to these two picornaviruses is the site in the canyon at which binding occurs. This site may be a trigger for initiation of the subsequent uncoating step required for viral infection.
Resumo:
The conformational space annealing (CSA) method for global optimization has been applied to the 10-55 fragment of the B-domain of staphylococcal protein A (protein A) and to a 75-residue protein, apo calbindin D9K (PDB ID code 1CLB), by using the UNRES off-lattice united-residue force field. Although the potential was not calibrated with these two proteins, the native-like structures were found among the low-energy conformations, without the use of threading or secondary-structure predictions. This is because the CSA method can find many distinct families of low-energy conformations. Starting from random conformations, the CSA method found that there are two families of low-energy conformations for each of the two proteins, the native-like fold and its mirror image. The CSA method converged to the same low-energy folds in all cases studied, as opposed to other optimization methods. It appears that the CSA method with the UNRES force field, which is based on the thermodynamic hypothesis, can be used in prediction of protein structures in real time.