989 resultados para sequence similarity searches


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Parasiten der Apicomplexa umfassen sowohl humanpathogene, als auch tierpathogene Protozoen. Beispiele für wichtige Vertreter human- und tierpathogener Parasiten sind Plasmodium falciparum und Eimeria tenella. E. tenella verursacht die Kokzidiose des Hühnchens, eine Darmerkrankung die weltweit für Verluste in einer geschätzten Höhe von bis zu 3 Milliarden US$ verantwortlich zeichnet. Eine prophylaktische Vakzinierung gegen diese Krankheit ist ökonomisch meist ineffizient, und eine Behandlung mit Kokzidiostatika wird durch häufige Resistenzbildung gegen bekannte Wirkstoffe erschwert. Diese Situation erfordert die Entwicklung neuer kostengünstiger Alternativen. Geeignete Zielproteine für die Entwicklung neuartiger Arzneistoffe zur Behandlung der Kokzidiose sind die Zyklin-abhängigen Kinasen (CDKs), zu denen auch die CDK-related Kinase 2 (EtCRK2) aus E. tenella gehört. Diese Proteine sind maßgeblich an der Regulation des Zellzyklus beteiligt. Durch chemische Validierung mit dem CDK Inhibitor Flavopiridol konnte nachgewiesen werden, dass ein Funktionsverlust von CDKs in E. tenella die Vermehrung des Parasiten in Zellkultur inhibiert. E. tenella CDKs sind daher als Zielproteine für die Entwicklung einer Chemotherapie der Kokzidiose geeignet. Mittels bioinformatischer Tiefenanalysen sollten CDK Proteine im Parasiten E. tenella identifiziert werden. Das Genom von E. tenella liegt in Rohfassung vor [ftp://ftp.sanger.ac.uk]. Jedoch waren zum Zeitpunkt dieser Arbeiten viele Sequenzen des Genoms noch nicht annotiert. Homologe CDK Proteine von E. tenella konnten durch den Vergleich von Sequenzinformationen mit anderen Organismen der Apicomplexa identifiziert und analysiert werden. Durch diese Analysen konnten neben der bereits bekannten EtCRK2, drei weitere, bislang nicht annotierte CDKs in E. tenella identifiziert werden (EtCRK1, EtCRK3 sowie EtMRK). Darüber hinaus wurde eine Analyse der entsprechenden Zykline – der Aktivatoren der CDKs – bezüglich Funktion und Struktur, sowie eine Datenbanksuche nach bisher nicht beschriebenen Zyklinen in E. tenella durchgeführt. Diese Suchen ergaben vier neue potentielle Zykline für E. tenella, wovon EtCYC3a als Aktivator der EtCRK2 von María L. Suárez Fernández (Intervet Innovation GmbH, Schwabenheim) bestätigt werden konnte. Sequenzvergleiche lassen vermuten, dass auch EtCYC1 und EtCYC3b in der Lage sind, EtCRK2 zu aktivieren. Außerdem ist anzunehmen, dass EtCYC4 als Aktivator der EtCRK1 fungiert. Ein weiterer Schwerpunkt der vorliegenden Arbeit war die Suche und Optimierung nach neuen Inhibitoren von CDKs aus E. tenella. In vorangegangenen Arbeiten konnten bereits Inhibitoren der EtCRK2 gefunden werden [BEYER, 2007]. Mittels Substruktur- und Ähnlichkeitssuchen konnten im Rahmen dieser Arbeit weitere Inhibitoren der EtCRK2 identifiziert werden. Vier dieser Strukturklassen erfüllen die Kriterien einer Leitstruktur. Eine dieser Leitstrukturen gehört zur Strukturklasse der Benzimidazol-Carbonitrile und ist bislang nicht als Inhibitor anderer Kinasen beschrieben. Diese neu identifizierte Leitstruktur konnte in silico weiter optimiert werden. Im Rahmen dieser Arbeit wurden Bindungsenergien von Vertretern dieser Strukturklasse berechnet, um einen wahrscheinlichen Bindemodus vorherzusagen. Für die weiterführende in silico Optimierung wurde eine virtuelle kombinatorische Substanzbibliothek dieser Klasse erstellt. Die Auswahl geeigneter Verbindungen für eine chemische Synthese erfolgte durch molekulares Docking unter Nutzung von Homologiemodellen der EtCRK2. Darüber hinaus wurde ein in silico Screening nach potentiellen Inhibitoren der PfMRK und EtMRK durchgeführt. Dabei konnten weitere interessante virtuelle Hit-Strukturen aus einer Substanzdatenbank kommerziell erhältlicher Verbindungen gefunden werden. Durch dieses virtuelle Screening konnten jeweils sieben Verbindungen als virtuelle Hits der PfMRK sowie der EtMRK identifiziert werden. Die Häufung von Strukturklassen mit bekannter CDK Aktivität deutet darauf hin, dass während des virtuellen Screenings eine Anreicherung von CDK Inhibitoren stattgefunden hat. Diese Ergebnisse lassen auf eine Weiterentwicklung neuer Wirkstoffe gegen Kokzidiose und Malaria hoffen.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Background Parasitic wasps constitute one of the largest group of venomous animals. Although some physiological effects of their venoms are well documented, relatively little is known at the molecular level on the protein composition of these secretions. To identify the majority of the venom proteins of the endoparasitoid wasp Chelonus inanitus (Hymenoptera: Braconidae), we have randomly sequenced 2111 expressed sequence tags (ESTs) from a cDNA library of venom gland. In parallel, proteins from pure venom were separated by gel electrophoresis and individually submitted to a nano-LC-MS/MS analysis allowing comparison of peptides and ESTs sequences. Results About 60% of sequenced ESTs encoded proteins whose presence in venom was attested by mass spectrometry. Most of the remaining ESTs corresponded to gene products likely involved in the transcriptional and translational machinery of venom gland cells. In addition, a small number of transcripts were found to encode proteins that share sequence similarity with well-known venom constituents of social hymenopteran species, such as hyaluronidase-like proteins and an Allergen-5 protein. An overall number of 29 venom proteins could be identified through the combination of ESTs sequencing and proteomic analyses. The most highly redundant set of ESTs encoded a protein that shared sequence similarity with a venom protein of unknown function potentially specific of the Chelonus lineage. Venom components specific to C. inanitus included a C-type lectin domain containing protein, a chemosensory protein-like protein, a protein related to yellow-e3 and ten new proteins which shared no significant sequence similarity with known sequences. In addition, several venom proteins potentially able to interact with chitin were also identified including a chitinase, an imaginal disc growth factor-like protein and two putative mucin-like peritrophins. Conclusions The use of the combined approaches has allowed to discriminate between cellular and truly venom proteins. The venom of C. inanitus appears as a mixture of conserved venom components and of potentially lineage-specific proteins. These new molecular data enrich our knowledge on parasitoid venoms and more generally, might contribute to a better understanding of the evolution and functional diversity of venom proteins within Hymenoptera.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

An increasing number of proteins with weak sequence similarity have been found to assume similar three-dimensional fold and often have similar or related biochemical or biophysical functions. We propose a method for detecting the fold similarity between two proteins with low sequence similarity based on their amino acid properties alone. The method, the proximity correlation matrix (PCM) method, is built on the observation that the physical properties of neighboring amino acid residues in sequence at structurally equivalent positions of two proteins of similar fold are often correlated even when amino acid sequences are different. The hydrophobicity is shown to be the most strongly correlated property for all protein fold classes. The PCM method was tested on 420 proteins belonging to 64 different known folds, each having at least three proteins with little sequence similarity. The method was able to detect fold similarities for 40% of the 420 sequences. Compared with sequence comparison and several fold-recognition methods, the method demonstrates good performance in detecting fold similarities among the proteins with low sequence identity. Applied to the complete genome of Methanococcus jannaschii, the method recognized the folds for 22 hypothetical proteins.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Pairwise sequence comparison methods have been assessed using proteins whose relationships are known reliably from their structures and functions, as described in the scop database [Murzin, A. G., Brenner, S. E., Hubbard, T. & Chothia C. (1995) J. Mol. Biol. 247, 536–540]. The evaluation tested the programs blast [Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). J. Mol. Biol. 215, 403–410], wu-blast2 [Altschul, S. F. & Gish, W. (1996) Methods Enzymol. 266, 460–480], fasta [Pearson, W. R. & Lipman, D. J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444–2448], and ssearch [Smith, T. F. & Waterman, M. S. (1981) J. Mol. Biol. 147, 195–197] and their scoring schemes. The error rate of all algorithms is greatly reduced by using statistical scores to evaluate matches rather than percentage identity or raw scores. The E-value statistical scores of ssearch and fasta are reliable: the number of false positives found in our tests agrees well with the scores reported. However, the P-values reported by blast and wu-blast2 exaggerate significance by orders of magnitude. ssearch, fasta ktup = 1, and wu-blast2 perform best, and they are capable of detecting almost all relationships between proteins whose sequence identities are >30%. For more distantly related proteins, they do much less well; only one-half of the relationships between proteins with 20–30% identity are found. Because many homologs have low sequence similarity, most distant relationships cannot be detected by any pairwise comparison method; however, those which are identified may be used with confidence.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is maintained at the European Bioinformatics Institute (EBI) in an international collaboration with the DNA Data Bank of Japan (DDBJ) and GenBank at the NCBI (USA). Data is exchanged amongst the collaborating databases on a daily basis. The major contributors to the EMBL database are individual authors and genome project groups. Webin is the preferred web-based submission system for individual submitters, whilst automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via ftp, email and World Wide Web interfaces. EBI’s Sequence Retrieval System (SRS), a network browser for databanks in molecular biology, integrates and links the main nucleotide and protein databases plus many specialized databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The terminal regions (last 20 kb) of Saccharomyces cerevisiae chromosomes universally contain blocks of precise sequence similarity to other chromosome terminal regions. The left and right terminal regions are distinct in the sense that the sequence similarities between them are reverse complements. Direct sequence similarity occurs between the left terminal regions and also between the right terminal regions, but not between any left ends and right ends. With minor exceptions the relationships range from 80% to 100% match within blocks. The regions of similarity are composites of familiar and unfamiliar repeated sequences as well as what could be considered “single-copy” (or better “two-copy”) sequences. All terminal regions were compared with all other chromosomes, forward and reverse complement, and 768 comparisons are diagrammed. It appears there has been an extensive history of sequence exchange or copying between terminal regions. The subtelomeric sequences fall into two classes. Seventeen of the chromosome ends terminate with the Y′ repeat, while 15 end with the 800-nt “X2” repeats just adjacent to the telomerase simple repeats. The just-subterminal repeats are very similar to each other except that chromosome 1 right end is more divergent.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Several recent reports indicate that mobile elements are frequently found in and flanking many wild-type plant genes. To determine the extent of this association, we performed computer-based systematic searches to identify mobile elements in the genes of two "model" plants, Oryza sativa (domesticated rice) and Arabidopsis thaliana. Whereas 32 common sequences belonging to nine putative mobile element families were found in the noncoding regions of rice genes, none were found in Arabidopsis genes. Five of the nine families (Gaijin, Castaway, Ditto, Wanderer, and Explorer) are first described in this report, while the other four were described previously (Tourist, Stowaway, p-SINE1, and Amy/LTP). Sequence similarity, structural similarity, and documentation of past mobility strongly suggests that many of the rice common sequences are bona fide mobile elements. Members of four of the new rice mobile element families are similar in some respects to members of the previously identified inverted-repeat element families, Tourist and Stowaway. Together these elements are the most prevalent type of transposons found in the rice genes surveyed and form a unique collection of inverted-repeat transposons we refer to as miniature inverted-repeat transposable elements or MITEs. The sequence and structure of MITEs are clearly distinct from short or long interspersed nuclear elements (SINEs or LINEs), the most common transposable elements associated with mammalian nuclear genes. Mobile elements, therefore, are associated with both animal and plant genes, but the identity of these elements is strikingly different.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Molecular mimicry, normally defined by the level of primary-sequence similarities between self and foreign antigens, has been considered a key element in the pathogenesis of autoimmunity. Here we describe an example of molecular mimicry between two overlapping peptides within a single self-antigen, both of which are recognized by the same human self-reactive T-cell clone. Two intervening peptides did not stimulate the T-cell clone, even though they share nine amino acids with the stimulatory peptides. Molecular modeling of major histocompatibility complex class II-peptide complexes suggests that both of the recognized peptides generate similar antigenic surfaces, although these are composed of different sets of amino acids. The molecular modeling of a peptide shifted one residue from the stimulatory peptide, which was recognized in the context of the same HLA molecule by another T-cell clone, generated a completely different antigenic surface. Functional studies using truncated peptides confirmed that the anchor residues of the two "mimicking" epitopes in the HLA groove differ. Our results show, for two natural epitopes, how molecular mimicry can occur and suggest that studies of potential antigenic surfaces, rather than sequence similarity, are necessary for analyzing suspected peptide mimicry.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We have identified and further characterized a Caenorhabditis elegans gene, CEZF, that encodes a protein with substantial homology to the zinc finger and leucine zipper motifs of the human gene products AF10, MLLT6, and BR140. The first part of the zinc finger region of CEZF has strong similarity to the corresponding regions of AF10 (66%) and MLLT6 (64%) at the cDNA level. As this region is structurally different from previously described zinc finger motifs, sequence homology searches were done. Twenty-five other proteins with a similar motif were identified. Because the functional domain of this motif is potentially disrupted in leukemia-associated chromosomal translocations, we propose the name of leukemia-associated protein (LAP) finger. On the basis of these comparisons, the LAP domain consensus sequence is Cys1-Xaa1-2-Cys2-Xaa9-21-Cys3-Xaa2-4 -Cys4-Xaa4-5-His5-Xaa2-Cys6-Xaa12-46 - Cys7-Xaa2-Cys8, where subscripted numbers represent the number of amino acid residues. We review the evidence that this motif binds zinc, is the important DNA-binding domain in this group of regulatory proteins, and may be involved in leukemogenesis.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We present homologies between archaeal and eucaryal DNA-dependent RNA polymerase (RNAP) subunits and transcription factors. The sequences of the Sulfolobus acidocaldarius subunits D, E, and N and alignments with eucaryal homologs are presented here. The similarities between archaeal transcription factors and their eucaryal homologs TFIIB and TBP have been established in other laboratories. The archaeal RNAP subunits H, K, and N, respectively, show high sequence similarity to ABC27, ABC23, and ABC10 beta (found in all three eucaryal RNAPs); subunit D, to AC40 (common to polymerase II and polymerase III) and B44 (polymerase II); and subunit L, to AC19 and B12.5. The similarity of subunit D and its eucaryal homologs to bacterial alpha is limited to the "alpha-motif," which is also present in subunit L and its eucaryal homologs. Genes encoding homologs of the related eucaryal RNAP subunits A12.2/B12.6 and also homologs of eucaryal transcription elongation factors of the TFIIS family have been detected in Sulfolobus acidocaldarius and Thermococcus celer. In archaea, the protein is not an RNAP subunit. Together with the sequence similarities between archaeal box A-containing and eucaryal TATA box-containing promoters, this shows that the archaeal and eucaryal transcription systems are truly homologous and that they differ structurally and functionally from the bacterial transcription machinery. In contrast, however, a number of genes for the archaeal transcription apparatus are organized in clusters resembling the clusters of transcription-associated genes in Bacteria.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Analysis of an Aeromonas salmonicida A layer-deficient/O polysaccharide-deficient mutant carrying a Tn5 insertion in the structural gene for A protein (vapA) showed that the abcA gene immediately downstream of vapA had been interrupted by the endogenous insertion sequence element ISAS1. Immunoelectron microscopy showed that O polysaccharides did not accumulate at the inner membrane-cytoplasm interface of this mutant. abcA encodes an unusual protein; it carries both an amino-terminal ATP-binding cassette (ABC) domain showing high sequence similarity to ABC proteins implicated in the transport of certain capsular and O polysaccharides and a carboxyl-terminal potential DNA-binding domain, which distinguishes AbcA from other polysaccharide transport proteins in structural and evolutionary terms. The smooth lipopolysaccharide phenotype was restored by complementation with abcA but not by abcA carrying site-directed mutations in the sequence encoding the ATP-binding site of the protein. The genetic organization of the A. salmonicida ABC polysaccharide system differs from other bacteria. abcA also differs in apparently being required for both O-polysaccharide synthesis and in energizing the transport of O polysaccharides to the cell surface.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Members of the IRF family mediate transcriptional responses to interferons (IFNs) and to virus infection. So far, proteins of this family have been studied only among mammalian species. Here we report the isolation of cDNA clones encoding two members of this family from chicken, interferon consensus sequence-binding protein (ICSBP) and IRF-1. The predicted chicken ICSBP and IRF-1 proteins show high levels of sequence similarity to their corresponding human and mouse counterparts. Sequence identities in the putative DNA-binding domains of chicken and human ICSBP and IRF-1 were 97% and 89%, respectively, whereas the C-terminal regions showed identities of 64% and 51%; sequence relationships with mouse ICSBP and IRF-1 are very similar. Chicken ICSBP was found to be expressed in several embryonic tissues, and both chicken IRF-1 and ICSBP were strongly induced in chicken fibroblasts by IFN treatment, supporting the involvement of these factors in IFN-regulated gene expression. The presence of proteins homologous to mammalian IRF family members, together with earlier observations on the occurrence of functionally homologous IFN-responsive elements in chicken and mammalian genes, highlights the conservation of transcriptional mechanisms in the IFN system, a finding that contrasts with the extensive sequence and functional divergence of the IFNs.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The Xenopus DG42 gene is expressed only between the late midblastula and neurulation stages of embryonic development. Recent database searches show that DG42 has striking sequence similarity to the Rhizobium NodC protein. NodC catalyzes the synthesis of chitin oligosaccharides which subsequently are transformed into bacterium-plant root signaling molecules. We find that the DG42 protein made in an in vitro coupled transcription-translation system catalyzes the synthesis of an array of chitin oligosaccharides. The result suggests the intriguing possibility that a bacterium-plant type of "Nod" signaling system may operate during early stages of vertebrate embryonic development and raises issues about the use of chitin synthase inhibitors as fungal-specific drugs.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Scorpion toxins are important experimental tools for characterization of vast array of ion channels and serve as scaffolds for drug design. General public database entries contain limited annotation whereby rich structure-function information from mutation studies is typically not available. SCORPION2 contains more than 800 records of native and mutant toxin sequences enriched with binding affinity and toxicity information, 624 three-dimensional structures and some 500 references. SCORPION2 has a set of search and prediction tools that allow users to extract and perform specific queries: text searches of scorpion toxin records, sequence similarity search, extraction of sequences, visualization of scorpion toxin structures, analysis of toxic activity, and functional annotation of previously uncharacterized scorpion toxins. The SCORPION2 database is available at http://sdmc.i2r.a-star.edu.sg/scorpion/. (c) 2006 Elsevier Ltd. All rights reserved.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The genome of Salmonella enterica serovar Enteritidis was shown to possess three IS3-like insertion elements, designated IS1230A, B and C, and each was cloned and their respective deoxynucleotide sequences determined. Mutations in elements IS1230A and B resulted in frameshifts in the open reading frames that encoded a putative transposase to be inactive. IS1230C was truncated at nucleotide 774 relative to IS1230B and therefore did not possess the 3' terminal inverted repeat. The three IS1230 derivatives were closely related to each other based on nucleotide sequence similarity. IS1230A was located adjacent to the sef operon encoding SEF14 fimbriae located at minute 97 of the genome of S. Enteritidis. IS1230B was located adjacent to the umuDC operon at minute 42.5 on the genome, itself located near to one terminus of an 815-kb genome inversion of S. Enteritidis relative to S. Typhimurium. IS1230C was located next to attB, the bacteriophage P22 attachment site, and proB, encoding gamma-glutamyl phosphate reductase. A truncated 3' remnant of IS1230, designated IS1230T, was identified in a clinical isolate of S. Typhimurium DT193 strain 2391. This element was located next to attB adjacent to which were bacteriophage P22-like sequences. Southern hybridisation of total genomic DNA from eighteen phage types of S. Enteritidis and eighteen definitive types of S. Typhimurium showed similar, if not identical, restriction fragment profiles in the respective serovars when probed with IS1230A.