992 results for Motif analysis


Relevance:

30.00%

Publisher:

Abstract:

cERMIT is a computationally efficient motif discovery tool based on analyzing genome-wide quantitative regulatory evidence. Instead of pre-selecting promising candidate sequences, it uses information across all sequence regions to search for high-scoring motifs. We apply cERMIT to a range of direct binding and overexpression datasets; it substantially outperforms state-of-the-art approaches on curated ChIP-chip datasets, and easily scales to current mammalian ChIP-seq experiments with data on thousands of non-coding regions.

Relevance:

30.00%

Publisher:

Abstract:

Cathepsin L proteases secreted by the helminth pathogen Fasciola hepatica have functions in parasite virulence including tissue invasion and suppression of host immune responses. Using proteomics methods alongside phylogenetic studies we characterized the profile of cathepsin L proteases secreted by adult F. hepatica and hence identified those involved in host-pathogen interaction. Phylogenetic analyses showed that the Fasciola cathepsin L gene family expanded by a series of gene duplications followed by divergence that gave rise to three clades associated with mature adult worms (Clades 1, 2, and 5) and two clades specific to infective juvenile stages (Clades 3 and 4). Consistent with these observations our proteomics studies identified representatives from Clades 1, 2, and 5 but not from Clades 3 and 4 in adult F. hepatica secretory products. Clades 1 and 2 account for 67.39 and 27.63% of total secreted cathepsin Ls, respectively, suggesting that their expansion was positively driven and that these proteases are most critical for parasite survival and adaptation. Sequence comparison studies revealed that the expansion of cathepsin Ls by gene duplication was followed by residue changes in the S2 pocket of the active site. Our biochemical studies showed that these changes result in alterations in substrate binding and suggested that the divergence of the cathepsin L family produced a repertoire of enzymes with overlapping and complementary substrate specificities that could cleave host macromolecules more efficiently. Although the cathepsin Ls are produced as zymogens containing a prosegment and mature domain, all secreted enzymes identified by MS were processed to mature active enzymes. The prosegment region was highly conserved between the clades except at the boundary of prosegment and mature enzyme. 
Despite the lack of conservation at this section, sites for exogenous cleavage by asparaginyl endopeptidases and a Leu-Ser↓His motif for autocatalytic cleavage by cathepsin Ls were preserved.

Relevance:

30.00%

Publisher:

Abstract:

Phosphate is transferred from the roots to the shoot via the xylem. The AtPHO1 protein has previously been shown to be required for the transfer of phosphate to the xylem vessels of the root in the model plant Arabidopsis thaliana. The sequencing and annotation of the Arabidopsis genome allowed the identification of ten sequences that show a significant level of similarity with the AtPHO1 gene. These ten genes, of unknown function, constitute a new gene family called the AtPHO1 gene family. Based on a molecular and genetic study, this thesis provides information needed to understand the role of the AtPHO1 family members in Arabidopsis. First, a bioinformatics analysis revealed that the AtPHO protein sequences contain, in the N-terminal hydrophilic region, a domain called SPX that is conserved among multiple proteins involved in phosphate homeostasis in yeast. This finding reinforces the hypothesis that all AtPHO1 family members have, like AtPHO1, a role in phosphate homeostasis. In parallel, we identified the expression pattern of the AtPHO genes in Arabidopsis by analysing transgenic plants expressing the uidA reporter gene under the control of the respective AtPHO promoter regions. The results show predominant expression of AtPHO genes in the vascular tissues of roots, leaves, stems and flowers, implying that these genes could have redundant functions in the transfer of phosphate to the vascular cylinder of these organs. GUS expression driven by several AtPHO promoter regions was also detected in non-vascular tissue, indicating a putative role of AtPHO genes in the acquisition or recycling of phosphate in the plant.
Second, analysis of the expression of AtPHO genes during phosphate starvation established that only the expression of the AtPHO1, AtPHO1;H1 and AtPHO1;H10 genes is regulated by Pi starvation. Interestingly, different signalling pathways appeared to regulate these three genes during various treatments affecting Pi homeostasis in the plant. Third, a detailed analysis of the signalling pathways regulating the expression of the AtPHO1;H10 gene in Arabidopsis leaves under wounding and dehydration stress showed that, surprisingly, its expression is regulated by OPDA (the precursor of JA) but not by JA itself, and that this regulation depends on the COI1 protein (the central regulator of the JA signalling pathway). This makes AtPHO1;H10 the first marker gene of a new signalling pathway induced by OPDA. These results demonstrated for the first time that OPDA and JA can activate distinct genes via COI1-dependent pathways.
Finally, this thesis identifies a novel role of the AtPHO1 protein in the regulation of ABA action in Arabidopsis guard cells (stomatal closure) and during seed germination. Although the exact role and function of AtPHO1 still need to be determined, these last findings suggest that AtPHO1, and by extension other AtPHO proteins, could mediate the propagation of various signals in the plant by modulating the membrane potential and/or by affecting cellular ion composition, as is the case for many ion transporters or regulators of ion transport.

Relevance:

30.00%

Publisher:

Abstract:

This thesis consists of a quantitative analysis of the regional prevalence of certain artistic motifs as they appear in Minoan wall painting of the Neopalatial period. This will help to establish the relative degree of artistic autonomy exercised by each of the sites included in this study. The results show that the argument for itinerant artists during this time period is a strong one, but the assumption that these travelling artists were being controlled by any one palace-centre is erroneous. Rather, the similarities and differences seen suggest that the choices were predicated either by the specific patrons, or by the function of the associated building or room. Thus, the motifs found within this study should be understood as constituting a cultural identity, with greater or lesser degrees of regional homogeneity, which act as one facet of a number of cultural indicators that can be used to better understand the role of artists and regional dynamics on the island during the Bronze Age.

Relevance:

30.00%

Publisher:

Abstract:

This master's thesis examines the controversy that took place in Quebec between March 2006 and December 2007 over practices of "reasonable accommodation" on religious grounds. Using an interpretive approach and a theoretical framework drawn from the sociology of ethnic relations, it offers a qualitative analysis of opinion letters published in Quebec daily newspapers. A first, thematic analysis identified the argumentative registers on which participants in the public debate on "reasonable accommodation" drew in their opinion letters. A second, comparative analysis constructed profiles of participants in the public debate that reflect not only the ideological forces that clashed in the debate, but also their positioning at the intersection of the salient axes of social differentiation in this controversy. The results of these analyses suggest, first, that the controversy stems from a conflict between the identity markers used to position ethnic boundaries and, second, that the "reasonable accommodation" controversy led to a reconfiguration of ethnic relations in Quebec, attributable to the dissociation of the conflict between two nations from the conflict over the criteria for inclusion in the nation.

Relevance:

30.00%

Publisher:

Abstract:

Most RNA molecules must fold into a complex tertiary structure in order to perform their biological functions. However, the determinants within a polynucleotide chain that are necessary for its folding and for its interactions with other elements are essentially unknown. Establishing structure-function relationships in large RNA molecules inevitably requires analysing each element of their structure individually and in the context of other elements. Like a building, an RNA structure is composed of repetitive units assembled in a specific way. Recurrent RNA motifs are arrangements of nucleotides found at different places in a tertiary structure that have identical or very similar conformations. Thus, one of the steps necessary for understanding the structure and function of RNA molecules is to identify recurrent motifs systematically and to analyse them comparatively in order to establish the consensus sequence. Analysis of all cases of double-helix packing in the ribosome structure led to the identification of a new arrangement named the along-groove packing motif (AGPM). This motif is found at 14 places in the ribosome structure as well as between 23S ribosomal RNA and the transfer RNA molecules bound to the ribosomal P and E sites. The motif forms through the packing of two double helices via their minor grooves. The sugar-phosphate backbone of one helix runs along the minor groove of the other helix, and vice versa. In each helix, the contact region comprises four base pairs. The tightest packing is found at the centre of the arrangement, where a GU base pair in one helix often interacts with a Watson-Crick (WC) base pair in the other helix.
Although the presence of the central GU and WC base pairs at the centre of the motif increases its stability, alternatives exist in different instances of the motif. Comparative analysis of three combinatorial gene libraries of AGPM, in which the central base pairs were fully randomized, showed that the structural context influences the extent of variability of the nucleotide sequences forming the central base pairs. The fact that the identity of the central base pairs can vary suggested the presence of other determinants responsible for maintaining the integrity of the motif. Analysis of all contacts between the helices revealed that, outside the centre of the motif, the interactions between the sugar-phosphate backbones occur via three ribose-ribose contacts. For each of these contacts, the riboses of the interacting nucleotides must adopt particular positions to avoid colliding. We show that the position of these riboses is modulated by specific conformations of the base pairs to which they belong. Finally, another recurrent motif, identified within the structure of three instances of AGPM, was named the "adenosine-wedge". Its analysis revealed that it is itself composed of another arrangement, named the NAG-triangle motif. We show that the adenosine-wedge motif is a complex RNA arrangement composed of four repetitive elements, namely the AGPM, hook-turn, A-minor and NAG-triangle motifs. This clearly illustrates the hierarchical arrangement of RNA structures, which can also be observed for other RNA motifs. More globally, my results enrich our general understanding of the role of the different types of tertiary interactions in the formation of complex RNA molecules.

Relevance:

30.00%

Publisher:

Abstract:

The three major paradigms that guide the analysis of the veil in Pascal Quignard's Sordidissimes are clothing and nudity, the "sordes" and the shroud, and the canvas and the gaze. Thematic analysis and psychoanalysis are used to interpret the relation of the veil to the body, to death and to art. The aim is to show that the veil stands for ambivalence. It is perpetually held taut by the will of the subject, who uses it by turns to cover or to reveal the object of their desires or fears. The veil thus embodies the boundary from which fascination originates, whether morbid or sexual.

Relevance:

30.00%

Publisher:

Abstract:

The recently described cupin superfamily of proteins includes the germin and germin-like proteins, of which the cereal oxalate oxidase is the best characterized. This superfamily also includes seed storage proteins, in addition to several microbial enzymes and proteins with unknown function. All these proteins are characterized by the conservation of two central motifs, usually containing two or three histidine residues presumed to be involved with metal binding in the catalytic active site. The present study on the coding regions of Synechocystis PCC6803 identifies a previously unknown group of 12 related cupins, each containing the characteristic two-motif signature. This group comprises 11 single-domain proteins, ranging in length from 104 to 289 residues, and includes two phosphomannose isomerases and two epimerases involved in cell wall synthesis, a member of the pirin group of nuclear proteins, a possible transcriptional regulator, and a close relative of a cytochrome c551 from Rhodococcus. Additionally, there is a duplicated, two-domain protein that has close similarity to an oxalate decarboxylase from the fungus Collybia velutipes and that is a putative progenitor of the storage proteins of land plants.

Relevance:

30.00%

Publisher:

Abstract:

Background: The hepatitis C virus (HCV) non-structural 5A protein (NS5A) contains a highly conserved C-terminal polyproline motif with the consensus sequence Pro-X-X-Pro-X-Arg that is able to interact with the Src-homology 3 (SH3) domains of a variety of cellular proteins. Results: To understand this interaction in more detail we have expressed two N-terminally truncated forms of NS5A in E. coli and examined their interactions with the SH3 domain of the Src-family tyrosine kinase, Fyn. Surface plasmon resonance analysis revealed that NS5A binds to the Fyn SH3 domain with what can be considered a high-affinity SH3 domain-ligand interaction (629 nM), and this binding did not require the presence of domain I of NS5A (amino acid residues 32-250). Mutagenic analysis of the Fyn SH3 domain demonstrated the requirement for an acidic cluster at the C-terminus of the RT-Src loop of the SH3 domain, as well as several highly conserved residues previously shown to participate in SH3 domain peptide binding. Conclusion: We conclude that the NS5A:Fyn SH3 domain interaction occurs via a canonical SH3 domain binding site, and the high affinity of the interaction suggests that NS5A would be able to compete with cognate Fyn ligands within the infected cell.
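The consensus Pro-X-X-Pro-X-Arg quoted in this abstract maps directly onto a regular expression. The following is a minimal sketch, not a method from the study: the function name and the example peptide are hypothetical, and the peptide is invented for illustration rather than taken from NS5A.

```python
import re

# Consensus from the abstract: Pro-X-X-Pro-X-Arg, where X is any residue.
# The lookahead lets overlapping occurrences be reported as separate hits.
PXXPXR = re.compile(r"(?=(P[A-Z]{2}P[A-Z]R))")

def find_polyproline_motifs(sequence):
    """Return (0-based start, matched text) for every PxxPxR occurrence."""
    return [(m.start(), m.group(1)) for m in PXXPXR.finditer(sequence.upper())]

# Hypothetical peptide, invented for illustration only (not the NS5A sequence).
example = "MSAPPIPPPRRKGT"
for start, text in find_polyproline_motifs(example):
    print(start, text)
```

A scan like this only locates sequence matches; whether a given site actually binds an SH3 domain is an experimental question of the kind the abstract addresses with surface plasmon resonance.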

Relevance:

30.00%

Publisher:

Abstract:

Snakebite is a major neglected tropical disease responsible for as many as 95,000 deaths every year worldwide. Viper venom serine proteases disrupt haemostasis of prey and victims by affecting various stages of the blood coagulation system. A better understanding of their sequence, structure, function and phylogenetic relationships will improve knowledge of the pathological conditions and aid in the development of novel therapeutics for treating snakebites. A large dataset of all available viper venom serine proteases was developed and analysed to study various features of these enzymes. Despite the large number of venom serine protease sequences available, only a small proportion of these have been functionally characterised. Although they share some common features, such as a C-terminal extension, a GWG motif and disulphide linkages, they vary widely in features such as isoelectric points, potential N-glycosylation sites and functional characteristics. Some of the serine proteases contain substitutions for one or more of the critical residues in the catalytic triad or primary specificity pockets. Phylogenetic analysis clustered all the sequences into three major groups. The sequences with substitutions in the catalytic triad or specificity pocket clustered together in separate groups. Our study provides the most complete information on viper venom serine proteases to date and improves current knowledge of the sequence, structure, function and phylogenetic relationships of these enzymes. This collective analysis of venom serine proteases will help in understanding the complexity of envenomation and potential therapeutic avenues.

Relevance:

30.00%

Publisher:

Abstract:

The design of therapeutic compounds targeting transthyretin (TTR) is challenging due to the low specificity of interaction in the hormone binding site. This feature is highlighted by the interactions of TTR with diclofenac, a compound with high affinity for TTR, in two dissimilar modes, as evidenced by the crystal structure of the complex. We report here a structural analysis of the interactions of TTR with two small molecules, 1-amino-5-naphthalene sulfonate (1,5-AmNS) and 1-anilino-8-naphthalene sulfonate (1,8-ANS). The crystal structure of the TTR:1,8-ANS complex reveals a peculiar interaction, through the stacking of the naphthalene ring between the side chains of Lys15 and Leu17. The sulfonate moiety provides an additional interaction with Lys15′ and a water-mediated hydrogen bond with Thr119′. The uniqueness of this mode of ligand recognition is corroborated by the crystal structure of TTR in complex with the weak analogue 1,5-AmNS, the binding of which is driven mainly by hydrophobic partition and one electrostatic interaction between the sulfonate group and Lys15. The ligand binding motif revealed by 1,8-ANS may open new possibilities to treat TTR amyloid diseases by the elucidation of novel candidates for a more specific pharmacophoric pattern. © 2009 Published by Elsevier Ltd.

Relevance:

30.00%

Publisher:

Abstract:

The main goal of our research was to search for SSRs in the Eucalyptus EST FORESTs database using software for mining SSR motifs. With this objective, we created a database for cataloging Eucalyptus EST-derived SSRs, and developed a bioinformatics tool, named Satellyptus, for finding and analyzing microsatellites in the Eucalyptus EST database. The search for microsatellites in the FORESTs database containing 71,115 Eucalyptus EST sequences (52.09 Mb) revealed 20,530 SSRs in 15,621 ESTs. The SSR abundance detected in the Eucalyptus EST database (29%, or one microsatellite every four sequences) is considered very high for plants. Amongst the categories of SSR motifs, the dimeric (37%) and trimeric ones (33%) predominated. The AG/CT motif was the most frequent (35.15%), followed by the trimeric CCG/CGG (12.81%). From a random sample of 1,217 sequences, 343 microsatellites in 265 SSR-containing sequences were identified. Approximately 48% of these ESTs containing microsatellites were homologous to proteins with known biological function. Most of the microsatellites detected in Eucalyptus ESTs were positioned at either the 5′ or 3′ end. Our next priority involves the design of flanking primers for codominant SSR loci, which could lead to the development of a set of microsatellite-based markers suitable for marker-assisted Eucalyptus breeding programs.
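The kind of SSR search described in this abstract can be illustrated with a small regular-expression scan for perfect dimeric and trimeric repeats. This is a sketch under stated assumptions and not the Satellyptus implementation, whose internals the abstract does not describe: the function name, the minimum repeat counts and the example sequence are all hypothetical.

```python
import re

# Assumed thresholds (not from the abstract): minimum number of tandem
# copies required for each motif length.
MIN_REPEATS = {2: 6, 3: 5}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (0-based start, motif, copy number) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)

# Hypothetical EST fragment, invented for illustration.
est = "TTGC" + "AG" * 8 + "CATT" + "CCG" * 5 + "TA"
print(find_ssrs(est))
```

Because the scan reads one strand only, a motif and its reverse complement (for example AG and CT) would need to be pooled afterwards to reproduce motif classes such as AG/CT.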

Relevance:

30.00%

Publisher:

Abstract:

To understand the regulatory dynamics of transcription factors (TFs) and their interplay with other cellular components we have integrated transcriptional, protein-protein and the allosteric or equivalent interactions which mediate the physiological activity of TFs in Escherichia coli. To study this integrated network we computed a set of network measurements followed by principal component analysis (PCA), investigated the correlations between network structure and dynamics, and carried out a procedure for motif detection. In particular, we show that outliers identified in the integrated network based on their network properties correspond to previously characterized global transcriptional regulators. Furthermore, outliers are highly and widely expressed across conditions, thus supporting their global nature in controlling many genes in the cell. Motifs revealed that TFs not only interact physically with each other but also obtain feedback from signals delivered by signaling proteins supporting the extensive cross-talk between different types of networks. Our analysis can lead to the development of a general framework for detecting and understanding global regulatory factors in regulatory networks and reinforces the importance of integrating multiple types of interactions in underpinning the interrelationships between them.
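The abstract does not say which motif-detection procedure was used. As an illustration of what network motif detection involves, the sketch below counts one classic pattern, the feed-forward loop (x regulates y, y regulates z, and x also regulates z), in a directed graph. The function name and node labels are hypothetical. A full analysis would also compare the counts against randomized networks, which is omitted here.

```python
def feed_forward_loops(edges):
    """Return the set of (x, y, z) with directed edges x->y, y->z and x->z."""
    successors = {}
    for source, target in edges:
        if source != target:  # ignore self-loops (autoregulation)
            successors.setdefault(source, set()).add(target)

    loops = set()
    for x, targets in successors.items():
        for y in targets:
            for z in successors.get(y, ()):
                # z must be a direct target of x as well, and distinct from x.
                if z != x and z in targets:
                    loops.add((x, y, z))
    return loops

# Hypothetical toy network; the labels are placeholders, not E. coli genes.
toy_edges = [("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")]
print(feed_forward_loops(toy_edges))
```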

Relevance:

30.00%

Publisher:

Abstract:

The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. 
Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. 
A commonly adopted procedure for annotating protein sequences relies on "inheritance through homology", based on the notion that similar sequences share similar functions and structures. This procedure consists of assigning sequences to a specific group of functionally related sequences that have been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity arise from multi-domain proteins, from proteins that share common domains but do not necessarily share the same function, and from the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validated a system that contributes to sequence annotation by taking advantage of a validated transfer-through-inheritance procedure for molecular functions and structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicitly addresses the problem of annotating multi-domain proteins and allows a fine-grained division of the whole set of proteomes used, which ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates over the length of protein sequences within clusters ensures that multi-domain proteins, when present, can be templates for sequences of similar length. This annotation procedure makes it possible to reliably transfer statistically validated functions and structures to sequences, using the information available in current databases of molecular functions and structures.
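The two network measures named in this abstract, the clustering coefficient and the characteristic path length of a protein contact network, can be computed from residue coordinates as sketched below. This is an illustrative sketch and not the thesis's code: the 8 Å distance cutoff, the one-point-per-residue representation, the function names and the toy coordinates are all assumptions.

```python
import math
from collections import deque
from itertools import combinations

def contact_graph(coords, cutoff=8.0):
    """Link residues i and j when their distance is below cutoff (assumed 8 Å)."""
    adj = {i: set() for i in range(len(coords))}
    for i, j in combinations(range(len(coords)), 2):
        if math.dist(coords[i], coords[j]) < cutoff:
            adj[i].add(j)
            adj[j].add(i)
    return adj

def clustering_coefficient(adj):
    """Mean over nodes of (links among neighbours) / (possible such links)."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue  # nodes with fewer than two neighbours contribute 0
        links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)

def characteristic_path_length(adj):
    """Mean shortest-path length over all connected node pairs (BFS)."""
    total, pairs = 0, 0
    for src in adj:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0

# Hypothetical coordinates: four points 3.8 Å apart on a straight line.
toy = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
graph = contact_graph(toy)
print(clustering_coefficient(graph), characteristic_path_length(graph))
```

Small-world behaviour, as mentioned in the abstract, is usually judged by comparing these two values with those of random graphs of the same size and density; that comparison is not shown here.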

Relevance:

30.00%

Publisher:

Abstract:

Numerous bacterial pathogens subvert cellular functions of eukaryotic host cells by the injection of effector proteins via dedicated secretion systems. The type IV secretion system (T4SS) effector protein BepA from Bartonella henselae is composed of an N-terminal Fic domain and a C-terminal Bartonella intracellular delivery domain, the latter being responsible for T4SS-mediated translocation into host cells. A proteolysis-resistant fragment (residues 10-302) that includes the Fic domain shows autoadenylylation activity and adenylyl transfer onto HeLa cell extract proteins, as demonstrated by autoradiography on incubation with α-[³²P]-ATP. Its crystal structure, determined to 2.9-Å resolution by the SeMet-SAD method, exhibits the canonical Fic fold including the HPFxxGNGRxxR signature motif, with several elaborations in loop regions and an additional β-rich domain at the C-terminus. On crystal soaking with ATP/Mg²⁺, additional electron density indicated the presence of a PPi/Mg²⁺ moiety, the side product of the adenylylation reaction, in the anion binding nest of the signature motif. On the basis of this information and that of the recent structure of IbpA(Fic2) in complex with the eukaryotic target protein Cdc42, we present a detailed model for the ternary complex of Fic with the two substrates, ATP/Mg²⁺ and target tyrosine. The model is consistent with an in-line nucleophilic attack of the deprotonated side-chain hydroxyl group onto the α-phosphorus of the nucleotide to accomplish AMP transfer. Furthermore, a general, sequence-independent mechanism of target positioning through antiparallel β-strand interactions between enzyme and target is suggested.