911 resultados para Sequence Motif
Resumo:
Recombinant antibodies capable of sequence-specific interactions with nucleic acids represent a class of DNA- and RNA-binding proteins with potential for broad application in basic research and medicine. We describe the rational design of a DNA-binding antibody, Fab-Ebox, by replacing a variable segment of the immunoglobulin heavy chain with a 17-amino acid domain derived from TFEB, a class B basic helix-loop-helix protein. DNA-binding activity was studied by electrophoretic mobility-shift assays in which Fab-Ebox was shown to form a specific complex with DNA containing the TFEB recognition motif (CACGTG). Similarities were found in the abilities of TFEB and Fab-Ebox to discriminate between oligodeoxyribonucleotides containing altered recognition sequences. Comparable interference of binding by methylation of cytosine residues indicated that Fab-Ebox and TFEB both contact DNA through interactions along the major groove of double-stranded DNA. The results of this study indicate that DNA-binding antibodies of high specificity can be developed by using the modular nature of both immunoglobulins and transcription factors.
Resumo:
The cyclotides are the largest family of naturally occurring circular proteins. The mechanism by which the termini of these gene-encoded proteins are linked seamlessly with a peptide bond to form a circular backbone is unknown. Here we report cyclotide-encoding cDNA sequences from the plant Viola odorata and compare them with those from an evolutionarily distinct species, Oldenlandia affinis. Individual members of this multigene family encode one to three mature cyclotide domains. These domains are preceded by N-terminal repeat regions (NTRs) that are conserved within a plant species but not between species. We have structurally characterized peptides corresponding to these NTRs and show that, despite them having no sequence homology, they form a structurally conserved alpha-helical motif. This structural conservation suggests a vital role for the NTR in the in vivo folding, processing, or detoxification of cyclotide domains from the precursor protein.
Resumo:
Chemokine (C-C motif) ligand-2 (CCL2) is a chemoattractant and activator of macrophages and is a key determinant of the macrophage infiltrate into tumours. We demonstrate here that CCL2 is expressed in normal human ovarian surface epithelium ( HOSE) cells and is silenced in most ovarian cancer cell lines, and silenced or downregulated in the majority of primary ovarian adenocarcinomas. Analysis of the CCL2 locus at 17q11.2-q12 showed loss of heterozygosity (LOH) in 70% of primary tumours, and this was significantly more common in tumours of advanced stage or grade. However, we did not detect any mutations in the CCL2 coding sequence in 94 primary ovarian adenocarcinomas. These data support the hypothesis that CCL2 may play a role in the pathobiology of ovarian cancers, but additional studies will be required to evaluate this possibility.
Resumo:
This project identified a novel family of six 66-68 residue peptides from the venom of two Australian funnel-web spiders, Hadronyche sp. 20 and H. infensa: Orchid Beach (Hexathelidae: Atracinae), that appear to undergo N- and/or C-terminal post-translational modifications and conform to an ancestral protein fold. These peptides all show significant amino acid sequence homology to atracotoxin-Hvf17 (ACTX-Hvf17), a non-toxic peptide isolated from the venom of H. versuta, and a variety of AVIT family proteins including mamba intestinal toxin 1 (MIT1) and its mammalian and piscine orthologs prokineticin 1 (PK1) and prokineticin 2 PK2). These AVIT family proteins target prokineticin receptors involved in the sensitization of nociceptors and gastrointestinal smooth muscle activation. Given their sequence homology to MITI, we have named these spider venom peptides the MIT-like atracotoxin (ACTX) family. Using isolated rat stomach fundus or guinea-pia ileum organ bath preparations we have shown that the prototypical ACTX-Hvf17, at concentrations up to 1 mu M, did not stimulate smooth muscle contractility, nor did it inhibit contractions induced by human PK1 (hPK1). The peptide also lacked activity on other isolated smooth muscle preparations including rat aorta. Furthermore, a FLIPR Ca2+ flux assay using HEK293 cells expressing prokineticin receptors showed that ACTX-Hvf17 fails to activate or block hPK1 or hPK2 receptors. Therefore, while the MIT-like ACTX family appears to adopt the ancestral disulfide-directed beta-hairpin protein fold of MIT1, a motif believed to be shared by other AVIT family peptides, variations in the amino acid sequence and surface charge result in a loss of activity on prokineticin receptors. (c) 2005 Elsevier Inc. All rights reserved.
Resumo:
Selection of machine learning techniques requires a certain sensitivity to the requirements of the problem. In particular, the problem can be made more tractable by deliberately using algorithms that are biased toward solutions of the requisite kind. In this paper, we argue that recurrent neural networks have a natural bias toward a problem domain of which biological sequence analysis tasks are a subset. We use experiments with synthetic data to illustrate this bias. We then demonstrate that this bias can be exploitable using a data set of protein sequences containing several classes of subcellular localization targeting peptides. The results show that, compared with feed forward, recurrent neural networks will generally perform better on sequence analysis tasks. Furthermore, as the patterns within the sequence become more ambiguous, the choice of specific recurrent architecture becomes more critical.
Resumo:
We completed the genome sequence of Lettuce necrotic yellows virus (LNYV) by determining the nucleotide sequences of the 4a (putative phosphoprotein), 4b, M (matrix protein), G (glycoprotein) and L (polymerase) genes. The genome consists of 12,807 nucleotides and encodes six genes in the order 3' leader-N-4a(P)-4b-M-G-L-5' trailer. Sequences were derived from clones of a cDNA library from LNYV genomic RNA and from fragments amplified using reverse transcription-polymerase chain reaction. The 4a protein has a low isoelectric point characteristic for rhabdovirus phosphoproteins. The 4b protein has significant sequence similarities with the movement proteins of capillo- and trichoviruses and may be involved in cell-to-cell movement. The putative G protein sequence contains a predicted 25 amino acids signal peptide and endopeptidase cleavage site, three predicted glycosylation sites and a putative transmembrane domain. The deduced L protein sequence shows similarities with the L proteins of other plant rhabdoviruses and contains polymerase module motifs characteristic for RNA-dependent RNA polymerases of negative-strand RNA viruses. Phylogenetic analysis of this motif among rhabdoviruses placed LNYV in a group with other sequenced cytorhabdoviruses, most closely related to Strawberry crinkle virus. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
The cyclotides are a family of circular proteins with a range of biological activities and potential pharmaceutical and agricultural applications. The biosynthetic mechanism of cyclization is unknown and the discovery of novel sequences may assist in achieving this goal. In the present study, we have isolated a new cyclotide from Oldenlandia affinis, kalata B8, which appears to be a hybrid of the two major subfamilies (Mobius and bracelet) of currently known cyclotides. We have determined the three-dimensional structure of kalata B8 and observed broadening of resonances directly involved in the cystine knot motif, suggesting flexibility in this region despite it being the core structural element of the cyclotides. The cystine knot motif is widespread throughout Nature and inherently stable, making this apparent flexibility a surprising result. Further-more, there appears to be isomerization of the peptide backbone at an Asp-Gly sequence in the region involved in the cyclization process. Interestingly, such isomerization has been previously characterized in related cyclic knottins from Momordica cochinchinensis that have no sequence similarity to kalata B8 apart from the six conserved cysteine residues and may result from a common mechanism of cyclization. Kalata B8 also provides insight into the structure-activity relationships of cyclotides as it displays anti-HIV activity but lacks haemolytic activity. The 'uncoupling' of these two activities has not previously been observed for the cyclotides and may be related to the unusual hydrophilic nature of the peptide.
Resumo:
Cyclotides are a fascinating family of plant-derived peptides characterized by their head-to-tail cyclized backbone and knotted arrangement of three disulfide bonds. This conserved structural architecture, termed the CCK (cyclic cystine knot), is responsible for their exceptional resistance to thermal, chemical and enzymatic degradation. Cyclotides have a variety of biological activities, but their insecticidal activities suggest that their primary function is in plant defence. In the present study, we determined the cyclotide content of the sweet violet Viola odorata, a member of the Violaceae family. We identified 30 cyclotides from the aerial parts and roots of this plant, 13 of which are novel sequences. The new sequences provide information about the natural diversity of cyclotides and the role of particular residues in defining structure and function. As many of the biological activities of cyclotides appear to be associated with membrane interactions, we used haemolytic activity as a marker of bioactivity for a selection of the new cyclotides. The new cyclotides were tested for their ability to resist proteolysis by a range of enzymes and, in common with other cyclotides, were completely resistant to trypsin, pepsin and thermolysin. The results show that while biological activity varies with the sequence, the proteolytic stability of the framework does not, and appears to be an inherent feature of the cyclotide framework. The structure of one of the new cyclotides, cycloviolacin O14, was determined and shown to contain the CCK motif. This study confirms that cyclotides may be regarded as a natural combinatorial template that displays a variety of peptide epitopes most likely targeted to a range of plant pests and pathogens.
Resumo:
Prediction of peroxisomal matrix proteins generally depends on the presence of one of two distinct motifs at the end of the amino acid sequence. PTS1 peroxisomal proteins have a well conserved tripeptide at the C-terminal end. However, the preceding residues in the sequence arguably play a crucial role in targeting the protein to the peroxisome. Previous work in applying machine learning to the prediction of peroxisomal matrix proteins has failed W capitalize on the full extent of these dependencies. We benchmark a range of machine learning algorithms, and show that a classifier - based on the Support Vector Machine - produces more accurate results when dependencies between the conserved motif and the preceding section are exploited. We publish an updated and rigorously curated data set that results in increased prediction accuracy of most tested models.
Resumo:
Bacteriophage T7 DNA primase recognizes 5'-GTC-3' in single-stranded DNA. The primase contains a single Cys4 zinc-binding motif that is essential for recognition. Biochemical and mutagenic analyses suggest that the Cys4 motif contacts cytosine of 5'-GTC-3' and may also contribute to thymine recognition. Residues His33 and Asp31 are critical for these interactions. Biochemical analysis also reveals that T7 primase selectively binds CTP in the absence of DNA. We propose that bound CTP selects the remaining base G, of 5'-GTC-3', by base pairing. Our deduced mechanism for recognition of ssDNA by Cys4 motifs bears little resemblance to the recognition of trinucleotides of double-stranded DNA by Cys2His2 zinc fingers.
Resumo:
The G-protein coupled receptors--or GPCRs--comprise simultaneously one of the largest and one of the most multi-functional protein families known to modern-day molecular bioscience. From a drug discovery and pharmaceutical industry perspective, the GPCRs constitute one of the most commercially and economically important groups of proteins known. The GPCRs undertake numerous vital metabolic functions and interact with a hugely diverse range of small and large ligands. Many different methodologies have been developed to efficiently and accurately classify the GPCRs. These range from motif-based techniques to machine learning as well as a variety of alignment-free techniques based on the physiochemical properties of sequences. We review here the available methodologies for the classification of GPCRs. Part of this work focuses on how we have tried to build the intrinsically hierarchical nature of sequence relations, implicit within the family, into an adaptive approach to classification. Importantly, we also allude to some of the key innate problems in developing an effective approach to classifying the GPCRs: the lack of sequence similarity between the six classes that comprise the GPCR family and the low sequence similarity to other family members evinced by many newly revealed members of the family.
Resumo:
We have previously identified a phosphorothioate oligonucleotide (PS-ODN) that inhibited epidermal growth factor receptor tyrosine kinase (TK) activity both in cell fractions and in intact A431 cells. Since ODN-based TK inhibitors may have anti-cancer applications and may also help understand the non-antisense mediated effects of PS-ODNs, we have further studied the sequence and chemistry requirements of the parent PS-ODN (sequence: 5′-GGA GGG TCG CAT CGC-3′) as a sequence-dependent TK inhibitor. Sequence deletion and substitution studies revealed that the 5′-terminal GGA GGG hexamer sequence in the parent compound was essential for anti-TK activity in A431 cells. Site-specific substitution of any G with a T in this 5′-terminal motif within the parent compound caused a significant loss in anti-TK activity. The fully PS-modified hexameric motif alone exhibited equipotent activity as the parent 15-mer whereas phosphodiester (PO) or 2′-O-methyl-modified versions of this motif had significantly reduced anti-TK activity. Further, T substitutions within the two 5′-terminal G residues of the hexameric PS-ODN to produce a sequence, TTA GGG, representing the telomeric repeats in human chromosomes, also did not exhibit a significant anti-TK activity. Multiple repeats of the active hexameric motif in PS-ODNs resulted in more potent inhibitors of TK activity than the parent ODN. These results suggested that PS-ODNs, but not PO or 2′-O-methyl modified ODNs, containing the GGA GGG motif can exert potent anti-TK activity which may be desirable in some anti-tumor applications. Additionally, the presence of this previously unidentified motif in antisense PS-ODN constructs may contribute to their biological effects in vitro and in vivo and should be accounted for in the design of the PS-modified antisense ODNs. © 2002 Published by Elsevier Science Inc.
Resumo:
The actinobacterium Streptomyces wadayamensis A23 is an endophyte of Citrus reticulata that produces the antimycin and mannopeptimycin antibiotics, among others. The strain has the capability to inhibit Xylella fastidiosa growth. The draft genome of S. wadayamensis A23 has ~7.0 Mb and 6,006 protein-coding sequences, with a 73.5% G+C content.
Resumo:
Bacillus safensis is a microorganism recognized for its biotechnological and industrial potential due to its interesting enzymatic portfolio. Here, as a means of gathering information about the importance of this species in oil biodegradation, we report a draft genome sequence of a strain isolated from petroleum.
Resumo:
Avian pathogenic Escherichia coli (APEC) strains belong to a category that is associated with colibacillosis, a serious illness in the poultry industry worldwide. Additionally, some APEC groups have recently been described as potential zoonotic agents. In this work, we compared APEC strains with extraintestinal pathogenic E. coli (ExPEC) strains isolated from clinical cases of humans with extra-intestinal diseases such as urinary tract infections (UTI) and bacteremia. PCR results showed that genes usually found in the ColV plasmid (tsh, iucA, iss, and hlyF) were associated with APEC strains while fyuA, irp-2, fepC sitDchrom, fimH, crl, csgA, afa, iha, sat, hlyA, hra, cnf1, kpsMTII, clpVSakai and malX were associated with human ExPEC. Both categories shared nine serogroups (O2, O6, O7, O8, O11, O19, O25, O73 and O153) and seven sequence types (ST10, ST88, ST93, ST117, ST131, ST155, ST359, ST648 and ST1011). Interestingly, ST95, which is associated with the zoonotic potential of APEC and is spread in avian E. coli of North America and Europe, was not detected among 76 APEC strains. When the strains were clustered based on the presence of virulence genes, most ExPEC strains (71.7%) were contained in one cluster while most APEC strains (63.2%) segregated to another. In general, the strains showed distinct genetic and fingerprint patterns, but avian and human strains of ST359, or ST23 clonal complex (CC), presented more than 70% of similarity by PFGE. The results demonstrate that some zoonotic-related STs (ST117, ST131, ST10CC, ST23CC) are present in Brazil. Also, the presence of moderate fingerprint similarities between ST359 E. coli of avian and human origin indicates that strains of this ST are candidates for having zoonotic potential.