933 resultados para Low Autocorrelation Binary Sequence Problem


Relevância:

30.00% 30.00%

Publicador:

Resumo:

The insulin-like growth factor (IGF) binding proteins (IGFBPs) modulate the actions of the insulin-like growth factors in endocrine, paracrine, and autocrine settings. Additionally, some IGFBPs appear to exhibit biological effects that are IGF independent. The six high-affinity IGFBPs that have been characterized to date exhibit 40–60% amino acid sequence identity overall, with the most conserved sequences in their NH2 and COOH termini. We have recently demonstrated that the product of the mac25/IGFBP-7 gene, which shows significant conservation in the NH2 terminus, including an “IGFBP motif” (GCGCCXXC), exhibits low-affinity IGF binding. The closely related mammalian genes connective tissue growth factor (CTGF) gene, nov, and cyr61 encode secreted proteins that also contain the conserved sequences and IGFBP motifs in their NH2 termini. To ascertain if these genes, along with mac25/IGFBP-7, encode a family of low-affinity IGFBPs, we assessed the IGF binding characteristics of recombinant human CTGF (rhCTGF). The ability of baculovirus-synthesized rhCTGF to bind IGFs was demonstrated by Western ligand blotting, affinity cross-linking, and competitive affinity binding assays using 125I-labeled IGF-I or IGF-II and unlabeled IGFs. CTGF, like mac25/IGFBP-7, specifically binds IGFs, although with relatively low affinity. On the basis of these data, we propose that CTGF represents another member of the IGFBP family (IGFBP-8) and that the CTGF gene, mac25/IGFBP-7, nov, and cyr61 are members of a family of low-affinity IGFBP genes. These genes, along with those encoding the high-affinity IGFBPs 1–6, together constitute an IGFBP superfamily whose products function in IGF-dependent or IGF-independent modes to regulate normal and neoplastic cell growth.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Human basic fibroblast growth factor (FGF-2) occurs in four isoforms: a low molecular weight (LMW FGF-2, 18 kDa) and three high molecular weight (HMW FGF-2, 22, 22.5, and 24 kDa) forms. LMW FGF-2 is primarily cytoplasmic and functions in an autocrine manner, whereas HMW FGF-2s are nuclear and exert activities through an intracrine, perhaps nuclear, pathway. Selective overexpression of HMW FGF-2 forms in fibroblasts promotes growth in low serum, whereas overexpression of LMW FGF-2 does not. The HMW FGF-2 forms have two functional domains: an amino-terminal extension and a common 18-kDa amino acid sequence. To investigate the role of these regions in the intracrine signaling of HMW FGF-2, we produced stable transfectants of NIH 3T3 fibroblasts overexpressing either individual HMW FGF-2 forms or artificially nuclear-targeted LMW FGF-2. All of these forms of FGF-2 localize to the nucleus/nucleolus and induce growth in low serum. The nuclear forms of FGF-2 trigger a mitogenic stimulus under serum starvation conditions and do not specifically protect the cells from apoptosis. These data indicate the existence of a specific role for nuclear FGF-2 and suggest that LMW FGF-2 represents the biological messenger in both the autocrine/paracrine and intracrine FGF-2 pathways.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Pairwise sequence comparison methods have been assessed using proteins whose relationships are known reliably from their structures and functions, as described in the scop database [Murzin, A. G., Brenner, S. E., Hubbard, T. & Chothia C. (1995) J. Mol. Biol. 247, 536–540]. The evaluation tested the programs blast [Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). J. Mol. Biol. 215, 403–410], wu-blast2 [Altschul, S. F. & Gish, W. (1996) Methods Enzymol. 266, 460–480], fasta [Pearson, W. R. & Lipman, D. J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444–2448], and ssearch [Smith, T. F. & Waterman, M. S. (1981) J. Mol. Biol. 147, 195–197] and their scoring schemes. The error rate of all algorithms is greatly reduced by using statistical scores to evaluate matches rather than percentage identity or raw scores. The E-value statistical scores of ssearch and fasta are reliable: the number of false positives found in our tests agrees well with the scores reported. However, the P-values reported by blast and wu-blast2 exaggerate significance by orders of magnitude. ssearch, fasta ktup = 1, and wu-blast2 perform best, and they are capable of detecting almost all relationships between proteins whose sequence identities are >30%. For more distantly related proteins, they do much less well; only one-half of the relationships between proteins with 20–30% identity are found. Because many homologs have low sequence similarity, most distant relationships cannot be detected by any pairwise comparison method; however, those which are identified may be used with confidence.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The DNA binding activity of p53 is crucial for its tumor suppressor function and is subject to tight regulation. Previous studies revealed that the inhibitory function of the p53 C terminus is implicated in the latent, low affinity sequence-specific DNA binding activity of p53 in the uninduced state. Sequence-specific DNA binding of p53 has been shown to be activated by several posttranslational modifications and interacting proteins that target predominantly the C terminus. Moreover, several authors have shown that synthetic peptides corresponding to p53 C-terminal sequences activate p53 sequence-specific DNA binding. In an effort to identify the interaction site of p53 with these activating peptides we assessed complex formation between p53 deletion constructs and C-terminal activating peptides by peptide affinity precipitation. This study revealed that two distal regions of the p53 molecule contribute synergistically to the interaction with activating C-terminal peptides: amino acids 80–93 and 364–393. The C-terminal residues 364–393 are already well characterized as having negative regulatory function. DNA binding analyses with these deletion constructs reveal a comparable negative regulatory activity for residues 80–93, defining this region as a previously unidentified negative regulatory domain of p53. Furthermore, synthetic peptides spanning this newly identified proline-rich negative regulatory region (residues 80–93) are able to activate p53 sequence-specific DNA binding in vitro. We suggest that both negative regulatory regions, residues 80–93 and 364–393, contribute cooperatively to the maintenance of the latent, low-affinity DNA binding conformation of p53.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The double helix is a ubiquitous feature of RNA molecules and provides a target for nucleases involved in RNA maturation and decay. Escherichia coli ribonuclease III participates in maturation and decay pathways by site-specifically cleaving double-helical structures in cellular and viral RNAs. The site of cleavage can determine RNA functional activity and half-life and is specified in part by local tertiary structure elements such as internal loops. The involvement of base pair sequence in determining cleavage sites is unclear, because RNase III can efficiently degrade polymeric double-stranded RNAs of low sequence complexity. An alignment of RNase III substrates revealed an exclusion of specific Watson–Crick bp sequences at defined positions relative to the cleavage site. Inclusion of these “disfavored” sequences in a model substrate strongly inhibited cleavage in vitro by interfering with RNase III binding. Substrate cleavage also was inhibited by a 3-bp sequence from the selenocysteine-accepting tRNASec, which acts as an antideterminant of EF-Tu binding to tRNASec. The inhibitory bp sequences, together with local tertiary structure, can confer site specificity to cleavage of cellular and viral substrates without constraining the degradative action of RNase III on polymeric double-stranded RNA. Base pair antideterminants also may protect double-helical elements in other RNA molecules with essential functions.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An additivity-based sequence to reactivity algorithm for the interaction of members of the Kazal family of protein inhibitors with six selected serine proteinases is described. Ten consensus variable contact positions in the inhibitor were identified, and the 19 possible variants at each of these positions were expressed. The free energies of interaction of these variants and the wild type were measured. For an additive system, this data set allows for the calculation of all possible sequences, subject to some restrictions. The algorithm was extensively tested. It is exceptionally fast so that all possible sequences can be predicted. The strongest, the most specific possible, and the least specific inhibitors were designed, and an evolutionary problem was solved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The human prion gene contains five copies of a 24 nt repeat that is highly conserved among species. An analysis of folding free energies of the human prion mRNA, in particular in the repeat region, suggested biased codon selection and the presence of RNA patterns. In particular, pseudoknots, similar to the one predicted by Wills in the human prion mRNA, were identified in the repeat region of all available prion mRNAs available in GenBank, but not those of birds and the red slider turtle. An alignment of these mRNAs, which share low sequence homology, shows several co-variations that maintain the pseudoknot pattern. The presence of pseudoknots in yeast Sup35p and Rnq1 suggests acquisition in the prokaryotic era. Computer generated three-dimensional structures of the human prion pseudoknot highlight protein and RNA interaction domains, which suggest a possible effect in prion protein translation. The role of pseudoknots in prion diseases is discussed as individuals with extra copies of the 24 nt repeat develop the familial form of Creutzfeldt–Jakob disease.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We describe a technique, sequence-tagged microsatellite profiling (STMP), to rapidly generate large numbers of simple sequence repeat (SSR) markers from genomic or cDNA. This technique eliminates the need for library screening to identify SSR-containing clones and provides an ∼25-fold increase in sequencing throughput compared to traditional methods. STMP generates short but characteristic nucleotide sequence tags for fragments that are present within a pool of SSR amplicons. These tags are then ligated together to form concatemers for cloning and sequencing. The analysis of thousands of tags gives rise to a representational profile of the abundance and frequency of SSRs within the DNA pool, from which low copy sequences can be identified. As each tag contains sufficient nucleotide sequence for primer design, their conversion into PCR primers allows the amplification of corresponding full-length fragments from the pool of SSR amplicons. These fragments permit the full characterisation of a SSR locus and provide flanking sequence for the development of a microsatellite marker. Alternatively, sequence tag primers can be used to directly amplify corresponding SSR loci from genomic DNA, thereby reducing the cost of developing a microsatellite marker to the synthesis of just one sequence-specific primer. We demonstrate the utility of STMP by the development of SSR markers in bread wheat.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Trehalose (α-d-glucopyranosyl-1,1-α-d-glucopyranoside), a disaccharide widespread among microbes and lower invertebrates, is generally believed to be nonexistent in higher plants. However, the recent discovery of Arabidopsis genes whose products are involved in trehalose synthesis has renewed interest in the possibility of a function of trehalose in higher plants. We previously showed that trehalase, the enzyme that degrades trehalose, is present in nodules of soybean (Glycine max [L.] Merr.), and we characterized the enzyme as an apoplastic glycoprotein. Here we describe the purification of this trehalase to homogeneity and the cloning of a full-length cDNA encoding this enzyme, named GMTRE1 (G. max trehalase 1). The amino acid sequence derived from the open reading frame of GMTRE1 shows strong homology to known trehalases from bacteria, fungi, and animals. GMTRE1 is a single-copy gene and is expressed at a low but constant level in many tissues.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A whole genome cattle-hamster radiation hybrid cell panel was used to construct a map of 54 markers located on bovine chromosome 5 (BTA5). Of the 54 markers, 34 are microsatellites selected from the cattle linkage map and 20 are genes. Among the 20 mapped genes, 10 are new assignments that were made by using the comparative mapping by annotation and sequence similarity strategy. A LOD-3 radiation hybrid framework map consisting of 21 markers was constructed. The relatively low retention frequency of markers on this chromosome (19%) prevented unambiguous ordering of the other 33 markers. The length of the map is 398.7 cR, corresponding to a ratio of ≈2.8 cR5,000/cM. Type I genes were binned for comparison of gene order among cattle, humans, and mice. Multiple internal rearrangements within conserved syntenic groups were apparent upon comparison of gene order on BTA5 and HSA12 and HSA22. A similarly high number of rearrangements were observed between BTA5 and MMU6, MMU10, and MMU15. The detailed comparative map of BTA5 should facilitate identification of genes affecting economically important traits that have been mapped to this chromosome and should contribute to our understanding of mammalian chromosome evolution.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Both high- and low-molecular-weight glutenin subunits (LMW-GS) play the major role in determining the viscoelastic properties of wheat (Triticum aestivum L.) flour. To date there has been no clear correspondence between the amino acid sequences of LMW-GS derived from DNA sequencing and those of actual LMW-GS present in the endosperm. We have characterized a particular LMW-GS from hexaploid bread wheat, a major component of the glutenin polymer, which we call the 42K LMW-GS, and have isolated and sequenced the putative corresponding gene. Extensive amino acid sequences obtained directly for this 42K LMW-GS indicate correspondence between this protein and the putative corresponding gene. This subunit did not show a cysteine (Cys) at position 5, in contrast to what has frequently been reported for nucleotide-based sequences of LMW-GS. This Cys has been replaced by one occurring in the repeated-sequence domain, leaving the total number of Cys residues in the molecule the same as in various other LMW-GS. On the basis of the deduced amino acid sequence and literature-based assignment of disulfide linkages, a computer-generated molecular model of the 42K subunit was constructed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We previously reported that short exposure of tomato (Lycopersicon esculentum L.) fruits to high temperature protects them from chilling injury. To study the involvement of heat-shock proteins (HSPs) in the acquisition of low-temperature tolerance, we cloned two heat-shock-induced genes that are also expressed at low temperatures. The cloned cDNAs belong to the small HSP group. Sequence analyses of the clones showed perfect homology to the tomato-ripening gene tom66 and to the tomato chloroplastic HSP21 gene tom111. The expression of both genes was induced by high temperature in fruits, flowers, leaves, and stems, but not by low or ambient temperatures or by other stresses such as drought and anaerobic conditions. When the heated fruits were transferred to low temperature, tom66 and tom111 mRNA levels first decreased but were then reinduced. Induction was not observed in nonheated fruits at low temperature. Immunodetection of tom111-encoded protein indicated that this protein is present at low temperatures in the heated fruits. The results of this study show that the expression of tom66 and tom111 is correlated with protection against some, but not all, symptoms of chilling injury.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Phospholipase A2 (PLA2) was purified about 180,000 times compared with the starting soluble-protein extract from developing elm (Ulmus glabra) seeds. On sodium dodecyl sulfate-polyacrylamide gel electrophoresis the purified fraction showed a single protein band with a mobility that corresponded to 15 kD, from which activity could be recovered. When analyzed by matrix-assisted laser-desorption ionization-time-of-flight mass spectrometry, the enzyme had a deduced mass of 13,900 D. A 53-amino acid-long N-terminal sequence was determined and aligned with other sequences, giving 62% identity to the deduced amino acid sequence of some rice (Oryza sativa) expressed sequence tag clones. The purified enzyme had an alkaline pH optimum and required Ca2+ for activity. It was unusually stable with regard to heat, acidity, and organic solvents but was sensitive to disulfide bond-reducing agents. The enzyme is a true PLA2, neither hydrolyzing the sn-1 position of phosphatidylcholine nor having any activity toward lysophosphatidylcholine or diacylglycerol. The biochemical data and amino acid sequence alignments indicate that the enzyme is related to the well-characterized family of animal secretory PLA2s and, to our knowledge, is the first plant enzyme of this type to be described.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments to either align single residues or to introduce gaps by defining an alignment as a path running right from the source up to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and which (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicity, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all of the 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gene recognition is one of the most important problems in computational molecular biology. Previous attempts to solve this problem were based on statistics, and applications of combinatorial methods for gene recognition were almost unexplored. Recent advances in large-scale cDNA sequencing open a way toward a new approach to gene recognition that uses previously sequenced genes as a clue for recognition of newly sequenced genes. This paper describes a spliced alignment algorithm and software tool that explores all possible exon assemblies in polynomial time and finds the multiexon structure with the best fit to a related protein. Unlike other existing methods, the algorithm successfully recognizes genes even in the case of short exons or exons with unusual codon usage; we also report correct assemblies for genes with more than 10 exons. On a test sample of human genes with known mammalian relatives, the average correlation between the predicted and actual proteins was 99%. The algorithm correctly reconstructed 87% of genes and the rare discrepancies between the predicted and real exon-intron structures were caused either by short (less than 5 amino acids) initial/terminal exons or by alternative splicing. Moreover, the algorithm predicts human genes reasonably well when the homologous protein is nonvertebrate or even prokaryotic. The surprisingly good performance of the method was confirmed by extensive simulations: in particular, with target proteins at 160 accepted point mutations (PAM) (25% similarity), the correlation between the predicted and actual genes was still as high as 95%.