984 results for Base sequence
Abstract:
Molecular and fragment ion data of intact 8- to 43-kDa proteins from electrospray Fourier-transform tandem mass spectrometry are matched against the corresponding data in sequence data bases. Extending the sequence tag concept of Mann and Wilm for matching peptides, a partial amino acid sequence in the unknown is first identified from the mass differences of a series of fragment ions, and the mass position of this sequence is defined from molecular weight and the fragment ion masses. For three studied proteins, a single sequence tag retrieved only the correct protein from the data base; a fourth protein required the input of two sequence tags. However, three of the data base proteins differed by having an extra methionine or by missing an acetyl or heme substitution. The positions of these modifications in the protein examined were greatly restricted by the mass differences of its molecular and fragment ions versus those of the data base. To characterize the primary structure of an unknown represented in the data base, this method is fast and specific and does not require prior enzymatic or chemical degradation.
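The tag-finding step described above can be sketched in Python. This is only an illustration and not the authors' software: the function name `sequence_tag`, the tolerance, and the reduced residue table are assumptions made for the example. The table uses standard monoisotopic residue masses, whereas work on intact proteins may rely on average masses instead.

```python
# Monoisotopic residue masses in Da (subset; "L" stands for Leu or Ile,
# which have identical masses and cannot be told apart this way).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "L": 113.08406, "D": 115.02694,
    "E": 129.04259, "F": 147.06841,
}

def sequence_tag(fragment_masses, tol=0.01):
    """Read a partial sequence from differences between consecutive fragment ions.

    Hypothetical helper: each mass difference is matched against the residue
    table; a difference with no unique match within `tol` is reported as "?".
    """
    masses = sorted(fragment_masses)
    tag = []
    for lower, upper in zip(masses, masses[1:]):
        diff = upper - lower
        hits = [aa for aa, m in RESIDUE_MASS.items() if abs(diff - m) <= tol]
        tag.append(hits[0] if len(hits) == 1 else "?")
    return "".join(tag)
```

The lowest fragment mass in the series, together with the molecular weight, would then fix where the tag sits within the protein, as the abstract describes.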
Abstract:
Hairpin polyamides are synthetic ligands for sequence-specific recognition in the minor groove of double-helical DNA. A thermodynamic characterization of the DNA-binding properties exhibited by a six-ring hairpin polyamide, ImPyPy-gamma-PyPyPy-beta-Dp (where Im = imidazole, Py = pyrrole, gamma = gamma-aminobutyric acid, beta = beta-alanine, and Dp = dimethylaminopropylamide), reveals an approximately 1-2 kcal/mol greater affinity for the designated match site, 5'-TGTTA-3', relative to the single base pair mismatch sites, 5'-TGGTA-3' and 5'-TATTA-3'. The enthalpy and entropy data at 20 degrees C reveal this sequence specificity to be entirely enthalpic in origin. Correlations between the thermodynamic driving forces underlying the sequence specificity exhibited by ImPyPy-gamma-PyPyPy-beta-Dp and the structural properties of the heterodimeric complex of PyPyPy and ImPyPy bound to the minor groove of DNA provide insight into the molecular forces that govern the affinity and specificity of pyrrole-imidazole polyamides.
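To put the 1-2 kcal/mol free-energy difference in terms of binding constants, the standard relation ratio = exp(ΔΔG / RT) can be evaluated. The sketch below assumes the affinities refer to 20 degrees C, the temperature quoted for the enthalpy and entropy data; the function name `affinity_ratio` is illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def affinity_ratio(ddg_kcal_per_mol, temp_celsius=20.0):
    """Ratio of match to mismatch binding constants for a given free-energy gap.

    Assumes ddG = RT * ln(K_match / K_mismatch); the temperature default is an
    assumption taken from the 20 degrees C quoted in the abstract.
    """
    temp_kelvin = temp_celsius + 273.15
    return math.exp(ddg_kcal_per_mol / (R_KCAL * temp_kelvin))
```

Under these assumptions, a gap of 1 kcal/mol corresponds to roughly a 5.6-fold preference for the match site, and 2 kcal/mol to roughly a 31-fold preference.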
Abstract:
DNA methyltransferases modify specific cytosines and adenines within 2-6 bp recognition sequences. We used scanning force microscopy and gel shift analysis to show that M.HhaI, a cytosine C-5 DNA methyltransferase, causes only a 2 degree bend upon binding its recognition site. Our results are consistent with prior crystallographic analysis showing that the enzyme stabilizes an extrahelical base while leaving the DNA duplex otherwise unperturbed. In contrast, similar analysis of M.EcoRI, an adenine N6 DNA methyltransferase, shows an average bend angle of approximately 52 degrees. This distortion of DNA conformation by M.EcoRI is shown to be important for sequence-specific binding.
Abstract:
Bacterial and mammalian mismatch repair systems have been implicated in the cellular response to certain types of DNA damage, and genetic defects in this pathway are known to confer resistance to the cytotoxic effects of DNA-methylating agents. Such observations suggest that in addition to their ability to recognize DNA base-pairing errors, members of the MutS family may also respond to genetic lesions produced by DNA damage. We show that the human mismatch recognition activity MutSalpha recognizes several types of DNA lesion including the 1,2-intrastrand d(GpG) crosslink produced by cis-diamminedichloroplatinum(II), as well as base pairs between O6-methylguanine and thymine or cytosine, or between O4-methylthymine and adenine. However, the protein fails to recognize the 1,3-intrastrand adduct produced by trans-diamminedichloroplatinum(II) at a d(GpTpG) sequence. These observations imply direct involvement of the mismatch repair system in the cytotoxic effects of DNA-methylating agents and suggest that recognition of 1,2-intrastrand cis-diamminedichloroplatinum(II) adducts by MutSalpha may be involved in the cytotoxic action of this chemotherapeutic agent.
Abstract:
Fluorescence spectroscopy and isothermal titration calorimetry were used to study the thermodynamics of binding of the glucocorticoid receptor DNA-binding domain to four different, but similar, DNA-binding sites. The binding sites are two naturally occurring sites that differ in the composition of one base pair, i.e., an A-T to G-C mutation, and two sites containing chemical intermediates of these base pairs. The calorimetrically determined heat capacity change (Delta C(p)o(obs)) for glucocorticoid receptor DNA-binding domain binding agrees with that calculated for dehydration of solvent-accessible surface areas. A dominating effect of dehydration or solvent reorganization on the thermodynamics is also consistent with an observed linear relationship between observed enthalpy change (Delta Ho(obs)) and observed entropy change (Delta So(obs)) with a slope close to the experimental temperature. Comparisons with structural data allow us to rationalize individual differences between Delta Ho(obs) (and Delta So(obs)) for the four complexes. For instance, we find that the removal of a methyl group at the DNA-protein interface is enthalpically favorable but entropically unfavorable, which is consistent with a replacement by an ordered water molecule.
Abstract:
We have devised a combinatorial method, restriction endonuclease protection selection and amplification (REPSA), to identify consensus ligand binding sequences in DNA. In this technique, cleavage by a type IIS restriction endonuclease (an enzyme that cleaves DNA at a site distal from its recognition sequence) is prevented by a bound ligand while unbound DNA is cleaved. Since the selection step of REPSA is performed in solution under mild conditions, this approach is amenable to the investigation of ligand-DNA complexes that are either insufficiently stable or not readily separable by other methods. Here we report the use of REPSA to identify the consensus duplex DNA sequence recognized by a G/T-rich oligodeoxyribonucleotide under conditions favoring purine-motif triple-helix formation. Analysis of 47 sequences indicated that recognition between 13 bases on the oligonucleotide 3' end and the duplex DNA was sufficient for triplex formation and indicated the possible existence of a new base triplet, G.AT. This information should help identify appropriate target sequences for purine-motif triplex formation and demonstrates the power of REPSA for investigating ligand-DNA interactions.
Mapping nucleosome position at single base-pair resolution by using site-directed hydroxyl radicals.
Abstract:
A base-pair resolution method for determining nucleosome position in vitro has been developed to complement existing, less accurate methods. Cysteaminyl EDTA was tethered to a recombinant histone octamer via a mutant histone H4 with serine 47 replaced by cysteine. When assembled into nucleosome core particles, the DNA could be cut site specifically by hydroxyl radical-catalyzed chain scission by using the Fenton reaction. Strand cleavage occurs mainly at a single nucleotide close to the dyad axis of the core particle, and assignment of this location via the symmetry of the nucleosome allows base-pair resolution mapping of the histone octamer position on the DNA. The positions of the histone octamer and H3H4 tetramer were mapped on a 146-bp Lytechinus variegatus 5S rRNA sequence and a twofold-symmetric derivative. The weakness of translational determinants of nucleosome positioning relative to the overall affinity of the histone proteins for this DNA is clearly demonstrated. The predominant location of both histone octamer and H3H4 tetramer assembled on the 5S rDNA is off center. Shifting the nucleosome core particle position along DNA within a conserved rotational phase could be induced under physiologically relevant conditions. Since nucleosome shifting has important consequences for chromatin structure and gene regulation, an approach to the thermodynamic characterization of this movement is proposed. This mapping method is potentially adaptable for determining nucleosome position in chromatin in vivo.
Abstract:
Amino acid sequencing by recombinant DNA technology, although dramatically useful, is subject to base reading errors, is indirect, and is insensitive to posttranslational processing. Mass spectrometry techniques can provide molecular weight data from even relatively large proteins for such cDNA sequences and can serve as a check of an enzyme's purity and sequence integrity. Multiply-charged ions from electrospray ionization can be dissociated to yield structural information by tandem mass spectrometry, providing a second method for gaining additional confidence in primary sequence confirmation. Here, accurate (+/- 1 Da) molecular weight and molecular ion dissociation information for human muscle and brain creatine kinases has been obtained by electrospray ionization coupled with Fourier-transform mass spectrometry to help distinguish which of several published amino acid sequences for both enzymes are correct. The results herein are consistent with one published sequence for each isozyme, and with the heterogeneity indicated by isoelectric focusing being due to 1-Da deamidation changes. This approach appears generally useful for detailed sequence verification of recombinant proteins.
Abstract:
Methods of structural and statistical analysis of the relation between the sequence and secondary and three-dimensional structures are developed. About 5000 secondary structures of immunoglobulin molecules from the Kabat data base were predicted. Two statistical analyses of amino acids reveal 47 universal positions in strands and loops. Eight universally conservative positions out of the 47 are singled out because they contain the same amino acid in > 90% of all chains. The remaining 39 positions, which we term universally alternative positions, were divided into five groups: hydrophobic, charged and polar, aromatic, hydrophilic, and Gly-Ala, corresponding to the residues that occupied them in almost all chains. The analysis of residue-residue contacts shows that the 47 universal positions can be distinguished by the number and types of contacts. The calculations of contact maps in the 29 antibody structures revealed that residues in 24 of these 47 positions have contacts only with residues of antiparallel beta-strands in the same beta-sheet and residues in the remaining 23 positions always have far-away contacts with residues from other beta-sheets as well. In addition, residues in 6 of the 47 universal positions are also involved in interactions with residues of the other variable or constant domains.
Abstract:
A new method for computing evolutionary distances between DNA sequences is proposed. Contrasting with classical methods, the underlying model does not assume that sequence base compositions (A, C, G, and T contents) are at equilibrium, thus allowing unequal base compositions among compared sequences. This makes the method more efficient than the usual ones in recovering phylogenetic trees from sequence data when base composition is heterogeneous within the data set, as we show by using both simulated and empirical data. When applied to small-subunit ribosomal RNA sequences from several prokaryotic or eukaryotic organisms, this method provides evidence for an early divergence of the microsporidian Vairimorpha necatrix in the eukaryotic lineage.
Abstract:
We have previously reported an enhanced version of sequencing by hybridization (SBH), termed positional SBH (PSBH). PSBH uses partially duplex probes containing single-stranded 3' overhangs, instead of simple single-stranded probes. Stacking interactions between the duplex probe and a single-stranded target allow us to reduce the probe sizes required to 5-base single-stranded overhangs. Here we demonstrate the use of PSBH to capture relatively long single-stranded DNA targets and perform standard solid-state Sanger sequencing on these primer-template complexes without ligation. Our results indicate that only 5 bases of known terminal sequence are required for priming. In addition, the partially duplex probes have the ability to capture their specific target from a mixture of five single-stranded targets with different 3'-terminal sequences. This indicates the potential utility of the PSBH approach to sequence mixtures of DNA targets without prior purification.
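The capture step can be pictured as a simple complementarity check between the probe's 5-base overhang and the 3' terminus of each target. The sketch below is a toy model under that assumption only; it ignores stacking energetics and hybridization conditions, and the names `reverse_complement` and `captured_targets` are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def captured_targets(overhang, targets):
    """Select targets whose 3'-terminal bases pair with the probe overhang.

    All sequences are written 5'->3'. A target is considered captured when its
    3' end equals the reverse complement of the overhang (antiparallel pairing).
    """
    site = reverse_complement(overhang)
    return [t for t in targets if t.endswith(site)]
```

For example, a probe with the overhang `ACGTT` would, in this model, select only targets ending in `AACGT` from a mixture.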
Abstract:
An extensive sequence comparison of the chloroplast ndhF gene from all major clades of the largest flowering plant family (Asteraceae) shows that this gene provides approximately 3 times more phylogenetic information than rbcL. This is because it is substantially longer and evolves twice as fast. The 5' region (1380 bp) of ndhF is very different from the 3' region (855 bp) and is similar to rbcL in both the rate and the pattern of sequence change. The 3' region is more A+T-rich, has higher levels of nonsynonymous base substitution, and shows greater transversion bias at all codon positions. These differences probably reflect different functional constraints on the 5' and 3' regions of ndhF. The two patterns of base substitutions of ndhF are particularly advantageous for phylogenetic reconstruction because the conserved and variable segments can be used for older and recent groups, respectively. Phylogenetic analyses of 94 ndhF sequences provided much better resolution of relationships than previous molecular and morphological phylogenies of the Asteraceae. The ndhF tree identified five major clades: (i) the Calyceraceae is the sister family of Asteraceae; (ii) the Barnadesioideae is monophyletic and is the sister group to the rest of the family; (iii) the Cichorioideae and its two basal tribes Mutisieae and Cardueae are paraphyletic; (iv) four tribes of Cichorioideae (Lactuceae, Arctoteae, Liabeae, and Vernonieae) form a monophyletic group, and these are the sister clade of the Asteroideae; and (v) the Asteroideae is monophyletic and includes three major clades.
Abstract:
The correspondence between the transversion/transition ratio and the neighboring base composition in chloroplast DNA is examined. For 18 noncoding regions of the chloroplast genome, alignments between rice (Oryza sativa) and maize (Zea mays) were generated by two different methods. Difficulties of aligning noncoding DNA are discussed, and the alignments are analyzed in a manner that reduces alignment artifacts. Sequence divergence is < 10%, so multiple substitutions at a site are assumed to be rare. Observed substitutions were analyzed with respect to the A+T content of the two immediately flanking bases. It is shown that as this content increases, the proportion of transversions also increases. When both the 5'- and 3'-flanking nucleotides are G or C (A+T content of 0), only 25% of the observed substitutions are transversions. However, when both the 5'- and 3'-flanking nucleotides are A or T (A+T content of 2), 57% of the observed substitutions are transversions. Therefore, the influence of flanking base composition on substitutions, previously reported for a single noncoding region, is a general feature of the chloroplast genome.
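The tabulation described above, the proportion of transversions as a function of the A+T content of the two flanking bases, can be sketched as follows. This is an illustrative reconstruction, not the authors' procedure: it assumes a gap-aware pairwise alignment given as two equal-length strings, and it counts a substitution only when both flanking bases are unambiguous and identical in the two sequences. The function names are hypothetical.

```python
from collections import defaultdict

PURINES = {"A", "G"}

def is_transversion(a, b):
    """A substitution is a transversion when it swaps purine and pyrimidine."""
    return (a in PURINES) != (b in PURINES)

def transversion_by_flank(seq1, seq2):
    """Map flanking A+T content (0, 1, or 2) to the proportion of transversions."""
    counts = defaultdict(lambda: [0, 0])  # at_content -> [transversions, total]
    for i in range(1, len(seq1) - 1):
        a, b = seq1[i], seq2[i]
        if a == b or a not in "ACGT" or b not in "ACGT":
            continue  # not a substitution, or a gap/ambiguous base
        flanks1 = seq1[i - 1] + seq1[i + 1]
        flanks2 = seq2[i - 1] + seq2[i + 1]
        if flanks1 != flanks2 or any(c not in "ACGT" for c in flanks1):
            continue  # assumption: require conserved, unambiguous flanks
        at_content = sum(c in "AT" for c in flanks1)
        counts[at_content][1] += 1
        if is_transversion(a, b):
            counts[at_content][0] += 1
    return {at: tv / total for at, (tv, total) in counts.items()}
```

Requiring conserved flanks is one way to reduce alignment artifacts; other filtering rules are possible, and the abstract does not specify which was used.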
Abstract:
We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was performed on 15 of the largest classes. The test shows that proteins were assigned to the correct class (correct positive prediction) with an average accuracy of 71.7%, whereas the inverse prediction of proteins as not belonging to a particular class (correct negative prediction) was 90-95% accurate. When tested on 254 structures used in this study, the top two predictions contained the correct class in 91% of the cases.
Abstract:
Background: This paper describes SeqDoC, a simple, web-based tool to carry out direct comparison of ABI sequence chromatograms. This allows the rapid identification of single nucleotide polymorphisms (SNPs) and point mutations without the need to install or learn more complicated analysis software. Results: SeqDoC produces a subtracted trace showing differences between a reference and test chromatogram, and is optimised to emphasise those characteristic of single base changes. It automatically aligns sequences, and produces straightforward graphical output. The use of direct comparison of the sequence chromatograms means that artefacts introduced by automatic base-calling software are avoided. Homozygous and heterozygous substitutions and insertion/deletion events are all readily identified. SeqDoC successfully highlights nucleotide changes missed by the Staden package 'tracediff' program. Conclusion: SeqDoC is ideal for small-scale SNP identification, for identification of changes in random mutagenesis screens, and for verification of PCR amplification fidelity. Differences are highlighted, not interpreted, allowing the investigator to make the ultimate decision on the nature of the change.
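The core idea of a subtracted trace can be illustrated with a few lines of Python. This sketch is not SeqDoC's actual algorithm: it assumes the reference and test chromatograms have already been aligned and normalized, and it omits the processing SeqDoC applies to emphasise single-base changes. The data layout (a dict of per-base intensity lists) and the function name `difference_trace` are assumptions for the example.

```python
def difference_trace(reference, test):
    """Subtract test from reference intensities, channel by channel.

    Both arguments map each base ("A", "C", "G", "T") to a list of intensities
    sampled at the same, pre-aligned positions (an assumption of this sketch).
    """
    return {
        base: [r - t for r, t in zip(reference[base], test[base])]
        for base in "ACGT"
    }
```

In such a trace, a substitution would appear as a positive deflection in one channel paired with a negative deflection in another at the same position, leaving interpretation to the investigator.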