The nucleotide sequence of a genomic DNA fragment previously thought, on genetic criteria, to contain the dihydrofolate reductase gene (DFR1) of Saccharomyces cerevisiae was determined. This 1784-basepair DNA fragment contains a large open reading frame from position 800 to 1432, which encodes an enzyme with a predicted molecular weight of 24,229.8 Daltons. Analysis of the amino acid sequence of this protein revealed that the yeast polypeptide contains 211 amino acids, compared to the 186 residues commonly found in the polypeptides of other eukaryotes. The difference in size of the gene product can be attributed mainly to an insert in the yeast gene. Within this region, several consensus sequences required for processing of yeast nuclear and class II mitochondrial introns were identified, but they appear not to be sufficient for RNA splicing. The primary structure of the yeast DHFR protein has considerable sequence homology with analogous polypeptides from other organisms, especially in the consensus residues involved in cofactor and/or inhibitor binding. Analysis of the nucleotide sequence also revealed the presence of a number of canonical sequences identified in yeast as having some function in the regulation of gene expression. These include UAS elements (TGACTC) required for the amino acid general control response and "TATA" boxes, as well as several consensus sequences thought to be required for transcriptional termination and polyadenylation. Analysis of the codon usage of the yeast DFR1 coding region revealed a codon bias index of 0.0083. This value, very close to zero, suggests that the gene is expressed at a relatively low level under normal physiological conditions. The information concerning the organization of DFR1 was used to construct a variety of fusions of its 5' regulatory region with the coding region of the lacZ gene of E. coli. 
Some of these fused genes encoded a fusion product that was expressed in E. coli and/or in yeast under the control of the 5' regulatory elements of DFR1. Further studies with these fusion constructions revealed that the beta-galactosidase activity encoded on multicopy plasmids was stimulated transiently by prior exposure of yeast host cells to UV light. This suggests that the yeast DFR1 gene is induced by UV light and may imply a novel function of the DHFR protein in the cellular responses to DNA damage. Another novel feature of yeast DHFR was revealed during preliminary studies of a diploid strain containing a heterozygous DFR1 null allele. The strain was constructed by insertion of a URA3 gene within the coding region of DFR1. Sporulation of this diploid revealed that meiotic products segregated 2:0 for uracil prototrophy when spore clones were germinated on medium supplemented with 5-formyltetrahydrofolate (folinic acid). This finding suggests that, in addition to its catalytic activity, the DFR1 gene product may play some role in the anabolism of folinic acid. Alternatively, this result may indicate that Ura+ haploid segregants were inviable and suggest that the enzyme has an essential cellular function in this species.
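
The reading-frame figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that the coordinates 800 and 1432 are 1-based and inclusive, which the abstract does not state, and the function name `orf_summary` is hypothetical.

```python
def orf_summary(start, end, residues, mass_da):
    """Basic consistency checks for an open reading frame.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the abstract).
    """
    length_nt = end - start + 1
    codons, remainder = divmod(length_nt, 3)
    return {
        "length_nt": length_nt,           # nucleotides spanned by the frame
        "codons": codons,                 # complete triplets in the frame
        "in_frame": remainder == 0,       # True if the length is a multiple of 3
        "mean_residue_mass_da": mass_da / residues,
    }


# Values taken from the abstract: positions 800-1432, 211 residues, 24,229.8 Da.
summary = orf_summary(800, 1432, residues=211, mass_da=24229.8)
print(summary)
```

Under that assumption the frame spans 633 nucleotides, or 211 triplets, which agrees with the 211 residues reported, and the mean residue mass comes out near 115 Da.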


The characteristic "foxy" aroma of Vitis labrusca Concord grapes is due in large part to methyl anthranilate, a volatile ester formed by the enzyme anthraniloyl-CoA:methanol anthraniloyltransferase (VlAMAT) of the BAHD superfamily of acyltransferases. The publication of the genome of the closely related wine grape Vitis vinifera, which does not accumulate methyl anthranilate, made it possible to search for putative VlAMAT-like genes. Five highly homologous candidates were found, one of which shares 95% identity with VlAMAT. Probing gene expression in ripe berries of 18 different V. vinifera cultivars by RT-PCR showed that many varieties do indeed express VlAMAT-like genes. Subsequent cloning of the full-length open reading frame of one of these genes from cDNA prepared from the cultivar Sauvignon Blanc permitted preliminary biochemical characterization of the enzyme after heterologous expression in E. coli. This alcohol acyltransferase (named VvsbAAT1) was found to catalyze the formation of cis-3-hexenyl acetate, a "green-leaf" volatile. Although the gene cloned from Sauvignon Blanc had 95% identity at the amino acid level to VlAMAT, it displayed an altered substrate specificity and expression pattern. These results highlight the difficulty of predicting the substrate specificity and function of enzymes on the basis of sequence homology, a common finding in the study of BAHD acyltransferases. In addition, once the function of VvsbAAT1 and other BAHD acyltransferases in V. vinifera has been determined, these genes could be used as genetic markers for certain aroma characteristics in grape breeding programs.


Ubiquitin-fold modifier 1 (UFM1) belongs to class 1 of the ubiquitin-like (Ubl) protein family. UFM1 and ubiquitin (Ub) share very little sequence homology but show remarkable similarities in their tertiary structure. Like Ub and most other Ubls, UFM1 is covalently attached to its substrates through an enzymatic cascade. Ubl proteins are increasingly reported to be involved in human diseases. The Ufm1 gene is overexpressed in MCP-type mice that develop myocardial ischemia and in the islets of Langerhans of patients with type 2 diabetes. UFM1 and its specific enzymes, UBA5, UFL1 and UFC1, are conserved in metazoans and plants, suggesting an important role in multicellular organisms. Caenorhabditis elegans is the simplest animal model used in biology. Its morphology, its visible phenotypes and its cell lineages have been described in detail. In addition, its short life cycle makes it possible to observe quickly the effects of particular genes on longevity. This model allows us to manipulate the expression of the Ufm1 gene easily and to learn more about its functions. When we reduced expression of the ufm-1 gene in C. elegans by RNA interference by feeding, we observed no serious morphological defect. The worms resembled wild-type worms and produced a normal number of progeny. However, wild-type worms exposed to ufm-1 RNAi live significantly shorter lives than controls, and this effect is independent of the insulin/IGF signaling pathway. In C. elegans, longevity and resistance to cellular stress are closely linked. We observed no effect of ufm-1 on thermal, osmotic or oxidative stress, but it is required for protection against proteotoxic stress. 
It is also necessary for maintaining neuronal integrity as the animals age. Taken together, our data shed light on the putative functions of the Ufm1 gene.


Genes are the parts of the genome that code for proteins. The genes of one or several species can be grouped into "families" according to their sequence similarity. However, sequence similarity alone is not enough to determine the functional relationships between these gene copies. It is therefore important to study the evolution of a family through duplications and losses, in order to distinguish orthologous genes, copies that arose by speciation and are likely to have retained a common function, from paralogous genes, copies that arose by duplication and have probably acquired new functions. Given a gene family present in n different species, a gene tree (obtained by a classical phylogenetic method), and a phylogenetic tree for the n species, "reconciliation" is the most common approach for inferring an evolutionary history of the family through duplications, speciations and losses. The degree of confidence in the inferred history is directly tied to the degree of confidence in the gene tree itself. It is therefore important to have a preliminary method for correcting gene trees. This work introduces a methodology for "correcting" a gene tree: removing the minimum number of "misplaced" leaves so as to obtain a tree whose duplication vertices (inferred by reconciliation) are all "apparent duplication" vertices, and thus a gene tree in "agreement" with the species phylogeny. I introduce an exact algorithm for trees of a certain class, and a heuristic for the general case.
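
The reconciliation step described in this abstract can be illustrated with the usual lowest-common-ancestor (LCA) mapping. The sketch below is not the correction algorithm of this work. It only labels the internal vertices of a gene tree, and it makes several assumptions: trees are encoded as nested 2-tuples of species names, each gene-tree leaf is named after the species that carries the gene, and a duplication is called "apparent" when its two subtrees share at least one species. The function names are hypothetical.

```python
def species_under(tree):
    """Set of species labels found at the leaves of a nested-tuple tree."""
    if isinstance(tree, str):
        return frozenset([tree])
    left, right = tree
    return species_under(left) | species_under(right)


def lca(species_tree, targets):
    """Smallest clade of the species tree that contains every target species."""
    if isinstance(species_tree, str):
        return species_tree
    for child in species_tree:
        if targets <= species_under(child):
            return lca(child, targets)
    return species_tree


def classify_vertices(gene_tree, species_tree, out=None):
    """Label internal gene-tree vertices in post-order via LCA mapping."""
    if out is None:
        out = []
    if isinstance(gene_tree, str):
        return out
    left, right = gene_tree
    classify_vertices(left, species_tree, out)
    classify_vertices(right, species_tree, out)
    s_left, s_right = species_under(left), species_under(right)
    here = lca(species_tree, s_left | s_right)
    # A vertex is a duplication when it maps to the same species-tree
    # clade as one of its children; otherwise it is a speciation.
    if here in (lca(species_tree, s_left), lca(species_tree, s_right)):
        kind = ("apparent duplication" if s_left & s_right
                else "non-apparent duplication")
    else:
        kind = "speciation"
    out.append((gene_tree, kind))
    return out


# Hypothetical example: three species and a gene tree with four genes.
species_tree = (("a", "b"), "c")
gene_tree = ((("a", "c"), "b"), "c")
labels = [kind for _, kind in classify_vertices(gene_tree, species_tree)]
print(labels)
```

In this toy example the vertex joining the (a, c) subtree with b is a non-apparent duplication. Vertices of that kind are what a leaf-removal correction of the sort described here would aim to eliminate.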


Classical MHC class II molecules are responsible for the presentation of exogenous peptides by antigen-presenting cells to CD4+ T lymphocytes. This antigen presentation is essential for establishing an adaptive immune response. However, the recognition of self-antigens and the elimination of the body's own cells underlie many autoimmune diseases, notably diabetes and multiple sclerosis. Possible treatments for these diseases could involve manipulating antigen presentation in the cells whose recognition and elimination cause them. It is therefore essential to deepen our knowledge of the mechanisms that regulate antigen presentation. Antigen presentation is regulated at both the transcriptional and post-translational levels. At the post-translational level, various cytokines affect the process. Among them, IL-10, an anti-inflammatory cytokine, causes intracellular retention of MHC II molecules. Its mechanism of action involves ubiquitination of the cytoplasmic tail of the beta chain of MHC II molecules. This protein modification is carried out by MARCH1, an E3 ubiquitin ligase whose expression is restricted to secondary lymphoid organs. Until very recently, very little was known about the structure and targets of MARCH1. Given its major impact on antigen presentation, we investigated the structure-function relationship of this molecule in order to better characterize its regulation and the various conditions required for its function. In a first article, we studied the regulation of MARCH1 expression at the protein level. Our results revealed that the molecule regulates itself through dimer formation and autoubiquitination. 
We also demonstrated the importance of the transmembrane domains of MARCH1 for dimer formation and for interaction with MHC II. In a second article, we investigated the importance of MARCH1 localization for its function. The results show that the localization motifs in the C-terminal portion of MARCH1 are functional and that other localization elements are present in the N-terminal portion of the protein. The many mutants used in this project allowed us to identify a "VQNC" motif, located in the C-terminal cytoplasmic portion of MARCH1, whose valine is required for optimal function of the molecule. Indeed, mutation of the valine reduces the function of the molecule, and BRET experiments demonstrated a change in the spatial orientation of the cytoplasmic tails. In addition, a sequence homology search revealed the presence of the same motif in other ubiquitin ligases, including Parkin. Parkin is strongly expressed in the brain and is thought to act, among other things, on the degradation of protein aggregates. Parkin dysfunction causes the accumulation of these aggregates, called Lewy bodies, which lead to the impaired neural function observed in patients with Parkinson's disease. The valine within the "VQNC" motif has, moreover, been identified as mutated in a family in which this disease is genetically transmitted. We believe that the importance of this motif is not restricted to MARCH1 but extends to other E3 ligases. This research project characterized mechanisms that regulate MARCH1 and uncovered several structural elements required for its function. Our work has led to a better understanding of the mechanisms that control antigen presentation by MHC II molecules.


A potential fungal strain producing extracellular β-glucosidase was isolated from sea water and identified as Aspergillus sydowii BTMFS 55 by a molecular approach based on 28S rDNA sequence homology, which showed 93% identity with previously reported sequences of Aspergillus sydowii in GenBank. A sequential optimization strategy was used to enhance the production of β-glucosidase under solid state fermentation (SSF) with wheat bran (WB) as the growth medium. A two-level Plackett-Burman (PB) design was implemented to screen medium components that influence β-glucosidase production; among the 11 variables, moisture content, inoculum, and peptone were identified as the most significant factors. The enzyme was purified by (NH4)2SO4 precipitation followed by ion exchange chromatography on DEAE sepharose. The enzyme was a monomeric protein with a molecular weight of ~95 kDa as determined by SDS-PAGE. It was optimally active at pH 5.0 and 50°C. It showed high affinity towards pNPG, with a Km and Vmax of 0.67 mM and 83.3 U/mL, respectively. The enzyme was tolerant to glucose inhibition, with a Ki of 17 mM. A low concentration of alcohols (10%), especially ethanol, could activate the enzyme. A considerable level of ethanol could be produced from wheat bran and rice straw after 48 and 24 h, respectively, with the help of Saccharomyces cerevisiae in the presence of cellulase and the purified β-glucosidase of Aspergillus sydowii BTMFS 55.
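
The kinetic constants reported in this abstract can be put to work in a small rate calculation. The sketch below uses the standard Michaelis-Menten equation with the reported Km (0.67 mM), Vmax (83.3 U/mL) and Ki (17 mM). The abstract does not state the mode of glucose inhibition, so the competitive model used here is an assumption, and the function name `mm_rate` is hypothetical.

```python
def mm_rate(s_mM, vmax=83.3, km_mM=0.67, inhibitor_mM=0.0, ki_mM=17.0):
    """Michaelis-Menten rate, optionally with a competitive inhibitor.

    Competitive inhibition is assumed for illustration only; it raises
    the apparent Km by a factor (1 + [I]/Ki) and leaves Vmax unchanged.
    """
    km_apparent = km_mM * (1.0 + inhibitor_mM / ki_mM)
    return vmax * s_mM / (km_apparent + s_mM)


# At [S] = Km the rate is half of Vmax.
print(mm_rate(0.67))
# With glucose at [I] = Ki, the apparent Km doubles under the assumed model.
print(mm_rate(0.67, inhibitor_mM=17.0))
```

With substrate at Km the rate is Vmax/2, and adding inhibitor at a concentration equal to Ki lowers it to Vmax/3 under the competitive assumption.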


Motivation: Intrinsic protein disorder is functionally implicated in numerous biological roles and is, therefore, ubiquitous in proteins from all three kingdoms of life. Determining the disordered regions in proteins presents a challenge for experimental methods, and so recently there has been much focus on the development of improved predictive methods. In this article, a novel technique for disorder prediction, called DISOclust, is described, which is based on the analysis of multiple protein fold recognition models. The DISOclust method is rigorously benchmarked against the top five methods from the CASP7 experiment. In addition, the optimal consensus of the tested methods is determined and the added value from each method is quantified. Results: The DISOclust method is shown to add the most value to a simple consensus of methods, even in the absence of target sequence homology to known structures. A simple consensus of methods that includes DISOclust can significantly outperform all of the previous individual methods tested.
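
One way to picture a "simple consensus of methods" is a per-residue average of the disorder scores returned by several predictors. The sketch below is only an illustration under that assumption. The abstract does not say how the consensus was formed, and the function name, the 0.5 threshold and the example scores are all hypothetical.

```python
def consensus_disorder(predictions, threshold=0.5):
    """Average per-residue disorder scores across several predictors.

    `predictions` is a list of equal-length score lists, one per method.
    Returns the mean score per residue and a disordered/ordered call.
    """
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictors must score the same residues")
    n_methods = len(predictions)
    mean_scores = [sum(p[i] for p in predictions) / n_methods
                   for i in range(length)]
    calls = [score >= threshold for score in mean_scores]
    return mean_scores, calls


# Hypothetical scores from three methods for a three-residue fragment.
scores = [
    [0.1, 0.6, 0.9],
    [0.3, 0.4, 0.7],
    [0.2, 0.8, 0.8],
]
mean_scores, calls = consensus_disorder(scores)
print(mean_scores, calls)
```

Averaging treats every method equally. Measuring the added value of each method, as the abstract describes, would amount to comparing such a consensus with and without that method.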


BACKGROUND: Mealybugs (Hemiptera: Coccoidea: Pseudococcidae) are key vectors of badnaviruses, including Cacao Swollen Shoot Virus (CSSV), the most damaging virus affecting cacao (Theobroma cacao L.). The effectiveness of mealybugs as virus vectors is species dependent, and it is therefore vital that CSSV resistance breeding programmes in cacao incorporate accurate mealybug identification. In this work, the efficacy of a CO1-based DNA barcoding approach to species identification was evaluated by screening a range of mealybugs collected from cacao in seven countries. RESULTS: Morphologically similar adult females were characterised by scanning electron microscopy and then, following DNA extraction, were screened with CO1 barcoding markers. A high degree of CO1 sequence homology was observed for all 11 individual haplotypes, including those accessions from distinct geographical regions. This allowed the design of a High Resolution Melt (HRM) assay capable of rapid identification of the commonly encountered mealybug pests of cacao. CONCLUSIONS: HRM Analysis (HRMA) readily differentiated between mealybug pests of cacao that cannot necessarily be identified by conventional morphological analysis. This new approach therefore has the potential to facilitate breeding for resistance to CSSV and other mealybug-transmitted diseases.


XACb0070 is an uncharacterized protein encoded by the two large plasmids isolated from Xanthomonas axonopodis pv. citri, the agent of citrus canker, which is responsible for important economic losses in world citrus production. XACb0070 shows sequence homology only with other hypothetical proteins from plant pathogens, none of which has a determined structure. The NMR-derived solution structure reveals that this protein is a homodimer in which each monomer has two domains with different structural and dynamic properties: a folded N-terminal domain with beta-alpha-alpha topology, which mediates dimerization, and a long disordered C-terminal tail. The folded domain shows high structural similarity to the ribbon-helix-helix transcriptional repressors, a family of DNA-binding proteins with a conserved 3D fold but low sequence homology; indeed, XACb0070 binds DNA. Comparison of the primary sequence and fold of XACb0070 with other proteins of the ribbon-helix-helix family, together with examination of the genes in the vicinity of xacb0070, suggests that the protein might be a component of a toxin-antitoxin system. (C) 2010 Elsevier Inc. All rights reserved.


Atrial natriuretic peptide (ANP) and B-type NP (BNP) are hormones involved in homeostatic control of body fluid and cardiovascular regulation. Both ANP and BNP have been cloned from the heart of mammals, amphibians, and teleost fishes, while an additional cardiac peptide, ventricular NP, has been found in selected species of teleost fish. However, in chicken, BNP is the only cardiac peptide identified thus far. In contrast, the types of NP present in the reptilian heart are unknown, representing a considerable gap in our understanding of NP evolution. In the present study, we cloned and sequenced a BNP cDNA from the atria of representative species of reptile, including crocodile, lizard, snake, and tortoise. In addition, we cloned BNP from pigeon atria. The reptilian and pigeon BNP cDNAs had ATTTA repeats in the 3′ untranslated region, as observed in all vertebrate BNP mRNAs. A high sequence homology was evident when comparing reptile and pigeon preproBNP with the previously identified chicken preproBNP. In particular, the predicted mature BNP-29 was identical between crocodile, tortoise, and chicken, with pigeon having a single amino acid substitution; lizard and snake BNP had seven and nine substitutions, respectively. Furthermore, an ANP cDNA could only be cloned from tortoise atria. Since ANP was not isolated from the heart of any non-chelonian reptile and appears to be absent in birds, we propose that the ANP gene was lost after the branching of the turtles in the amniote line. These data provide new avenues for research on NP function in reptiles.


Functions have yet to be defined for the majority of genes of Plasmodium falciparum, the agent responsible for the most serious form of human malaria. Here we report changes in P. falciparum gene expression induced by 20 compounds that inhibit growth of the schizont stage of the intraerythrocytic development cycle. In contrast with previous studies, which reported only minimal changes in response to chemically induced perturbations of P. falciparum growth, we find that ~59% of its coding genes display over three-fold changes in expression in response to at least one of the chemicals we tested. We use this compendium for guilt-by-association prediction of protein function using an interaction network constructed from gene co-expression, sequence homology, domain-domain and yeast two-hybrid data. The subcellular localizations of 31 of 42 proteins linked with merozoite invasion are consistent with their role in this process, a key target for malaria control. Our network may facilitate identification of novel antimalarial drugs and vaccines.


In this work we isolated a novel crotamine-like protein from Crotalus durissus cascavella venom by a combination of molecular exclusion and analytical reverse phase HPLC. Its primary structure was: YKRCHKKGGHCFPKEKICLPPSSDLGKMDCRWKRK-CCKKGS GK. This protein had a molecular mass of 4892.89 Da, as determined by Matrix Assisted Laser Desorption Ionization Time-of-flight (MALDI-TOF) mass spectrometry. Its approximate pI, determined by two-dimensional electrophoresis, was 9.9. This crotamine-like protein, named Cro 2, produced skeletal muscle spasm and spastic paralysis in mice, similarly to other crotamine-like proteins. Cro 2 did not modify insulin secretion at low glucose concentrations (2.8 and 5.6 mM), but at a high glucose concentration (16.7 mM) we observed a 2.7- to 3.0-fold increase in insulin secretion relative to control. The Na+ channel antagonist tetrodotoxin (6 mM) decreased glucose- and Cro 2-induced insulin secretion. These results suggest that Na+ channels are involved in the insulin secretion. In this article, we also purified several peptide fragments obtained by treating reduced and carboxymethylated Cro 2 (RC-Cro 2) with cyanogen bromide and protease V8 from Staphylococcus aureus. Isolated pancreatic beta-cells were then treated with the peptides at high glucose concentration (16.7 mM) only; under this condition, only two peptides induced insulin secretion. Amino acid sequence homology analysis of whole crotamine and of the biologically active peptides indicated that the consensus regions of biologically active crotamine responsible for insulin secretion are KGGHCFPKE and DCRWKWKCCKKGSG.


In this article we investigated the platelet aggregating activity of whole crotoxin and its subunits isolated from Crotalus durissus cascavella venom. During purification of the venom by molecular exclusion HPLC, we detected two different serine protease activities in the gyroxin fraction and another in the crotoxin fraction, which induced strong and irreversible platelet aggregation in addition to blood coagulation. From crotoxin, we isolated PLA(2) and crotapotin (the two fractions together corresponding to approximately 85% of whole crotoxin) and another, minor fraction (F20) that exhibited serine protease activity. After further fractionation by reverse phase HPLC, we obtained three other fractions, named F201, F202 and F203. F202 was obtained with a high degree of molecular homogeneity, with a molecular mass of approximately 28 kDa and a high content of acidic amino acid residues, such as aspartic acid and glutamic acid. Other important amino acids were histidine, cysteine and lysine. This protein exhibited high specificity for BApNA and Michaelis-Menten behavior, with an estimated Vmax of 5.64 μM/min and a Km of 0.58 mM for this substrate. We also investigated the ability of F202 to degrade fibrinogen and observed cleavage of the alpha and beta chains. Both the enzymatic and the platelet aggregation activities were strongly inhibited by incubation with TLCK and PMSF, specific inhibitors of serine proteases. In addition, F202 induced platelet aggregation in washed platelets and in platelet-rich plasma, and in both cases TLCK inhibited its activity. The N-terminal amino acid sequence of F202 showed high sequence homology with other thrombin-like proteins, but it was significantly different from gyroxin. These results show that crotoxin is a highly heterogeneous protein composed of PLA(2), thrombin-like and other fractions, which might explain the diversity of its physiological and pharmacological activities.


The shikimate pathway is an attractive target for the development of herbicides and antimicrobial agents because it is essential in algae, higher plants, bacteria, and fungi, but absent from mammals. Homologues of enzymes in the shikimate pathway have been identified in the genome sequence of Mycobacterium tuberculosis. Among them, EPSP synthase was proposed to be present on the basis of sequence homology. Accordingly, in order to pave the way for structural and functional efforts towards anti-mycobacterial agent development, here we describe the molecular modeling of 5-enolpyruvylshikimate-3-phosphate (EPSP) synthase from M. tuberculosis, which should provide a structural framework on which the design of specific inhibitors may be based. Significant differences in the relative orientation of the domains in the two models result in open and closed conformations. The possible relevance of this structural transition to ligand binding is discussed. (C) 2003 Elsevier B.V. All rights reserved.


Bothrombin, a snake-venom serine protease, specifically cleaves fibrinogen, releasing fibrinopeptide A to form non-crosslinked soft clots, aggregates platelets in the presence of exogenous fibrinogen and activates blood coagulation factor VIII. Bothrombin shares high sequence homology with other snake-venom proteases such as batroxobin (94% identity), but only 30 and 34% identity with human alpha-thrombin and trypsin, respectively. Single crystals of bothrombin have been obtained and X-ray diffraction data have been collected at the Laboratorio Nacional de Luz Sincrotron to a resolution of 2.8 Å. The crystals belong to the space group P2(1)2(1)2(1), with unit-cell parameters a = 94.81, b = 115.68, c = 155.97 Å.
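
The unit-cell parameters quoted in this abstract fix the cell volume directly, because the axes of an orthorhombic cell are mutually perpendicular. The sketch below is a simple illustration with hypothetical function and variable names. It also divides by the four general positions of space group P2(1)2(1)2(1) to obtain the volume of one asymmetric unit.

```python
def orthorhombic_cell_volume(a, b, c):
    """Volume of an orthorhombic unit cell (all angles 90 degrees), in Å^3."""
    return a * b * c


# Unit-cell parameters from the abstract, in Å.
volume = orthorhombic_cell_volume(94.81, 115.68, 155.97)

# P2(1)2(1)2(1) has four symmetry-equivalent general positions,
# so the asymmetric unit occupies one quarter of the cell.
asymmetric_unit_volume = volume / 4

print(round(volume), round(asymmetric_unit_volume))
```

The cell volume comes out at roughly 1.71 × 10^6 Å^3. Estimating the number of molecules per asymmetric unit would also require the molecular mass of bothrombin, which this abstract does not give.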