948 results for local sequence alignment problem
Abstract:
Metacaspases are cysteine peptidases that could play a role similar to caspases in the cell death programme of plants, fungi and protozoa. The human protozoan parasite Leishmania major expresses a single metacaspase (LmjMCA) harbouring a central domain with the catalytic dyad histidine and cysteine as found in caspases. In this study, we investigated the processing sites important for the maturation of the LmjMCA catalytic domain, the cellular localization of LmjMCA polypeptides, and the functional role of the catalytic domain in the cell death pathway of Leishmania parasites. Although the LmjMCA polypeptide precursor form harbours a functional mitochondrial localization signal (MLS), we determined that LmjMCA polypeptides are mainly localized in the cytoplasm. In stress conditions, LmjMCA precursor forms were extensively processed into soluble forms containing the catalytic domain. This domain was sufficient to enhance sensitivity of parasites to hydrogen peroxide by impairing the mitochondrion. These data provide experimental evidence of the importance of LmjMCA processing into an active catalytic domain and of its role in disrupting mitochondria, which could be relevant in the design of new drugs to fight leishmaniasis and likely other protozoan parasitic diseases.
Abstract:
The anaerobic transcriptional regulator ANR induces the arginine deiminase and denitrification pathways in Pseudomonas aeruginosa during oxygen limitation. The homologous activator FNR of Escherichia coli, when introduced into an anr mutant of P. aeruginosa, could functionally replace ANR for anaerobic growth on nitrate but not for anaerobic induction of arginine deiminase. In an FNR-positive E. coli strain, the ANR-dependent promoter of the arcDABC operon, which encodes the enzymes of the arginine deiminase pathway, was not expressed. To analyse these distinct induction patterns systematically, a lacZ promoter-probe, broad-host-range plasmid containing various -40 regions (the ANR/FNR recognition sequences) and -10 promoter sequences was constructed. These constructs were tested in P. aeruginosa and in E. coli expressing either ANR or FNR. In conjunction with the consensus -10 hexamer of E. coli sigma 70 RNA polymerase (TATAAT), the consensus FNR site (TTGAT ..... ATCAA) was recognized efficiently by ANR and FNR in both hosts. By contrast, when promoters contained the Arc box (TTGAC .... ATCAG), which is found in the arcDABC promoter, or a symmetrical mutant FNR site (CTGAT .... ATCAG), ANR was a more effective activator than was FNR. Conversely, an extended 22 bp, fully symmetrical FNR site allowed better activation with FNR than with ANR. Combination of the arc promoter -10 sequence (CCTAAT) with the Arc box or the consensus FNR site resulted in good ANR-dependent expression in P. aeruginosa but gave practically no expression in E. coli, suggesting that RNA polymerase of P. aeruginosa differs from the E. coli enzyme in -10 recognition specificity. In conclusion, ANR and FNR are able to activate the RNA polymerases of P. aeruginosa and E. coli when the -40 and -10 promoter elements are identical or close to the E. coli consensus sequences.
Abstract:
Nonstructural protein 4B (NS4B) is a key organizer of hepatitis C virus (HCV) replication complex formation. In concert with other nonstructural proteins, it induces a specific membrane rearrangement, designated as membranous web, which serves as a scaffold for the HCV replicase. The N-terminal part of NS4B comprises a predicted and a structurally resolved amphipathic α-helix, designated as AH1 and AH2, respectively. Here, we report a detailed structure-function analysis of NS4B AH1. Circular dichroism and nuclear magnetic resonance structural analyses revealed that AH1 folds into an amphipathic α-helix extending from NS4B amino acid 4 to 32, with positively charged residues flanking the helix. These residues are conserved among hepaciviruses. Mutagenesis and selection of pseudorevertants revealed an important role of these residues in RNA replication by affecting the biogenesis of double-membrane vesicles making up the membranous web. Moreover, alanine substitution of conserved acidic residues on the hydrophilic side of the helix reduced infectivity without significantly affecting RNA replication, indicating that AH1 is also involved in virus production. Selective membrane permeabilization and immunofluorescence microscopy analyses of a functional replicon harboring an epitope tag between NS4B AH1 and AH2 revealed a dual membrane topology of the N-terminal part of NS4B during HCV RNA replication. Luminal translocation was unaffected by the mutations introduced into AH1, but was abrogated by mutations introduced into AH2. In conclusion, our study reports the three-dimensional structure of AH1 from HCV NS4B, and highlights the importance of positively charged amino acid residues flanking this amphipathic α-helix in membranous web formation and RNA replication. In addition, we demonstrate that AH1 possesses a dual role in RNA replication and virus production, potentially governed by different topologies of the N-terminal part of NS4B.
Abstract:
The bacterial siderophore pyochelin is composed of salicylate and two cysteine-derived heterocycles, the second of which is modified by reduction and N-methylation during biosynthesis. In Pseudomonas aeruginosa, the first cysteine residue is converted to its D-isoform during thiazoline ring formation, whereas the second cysteine remains in its L-configuration. Stereochemistry is opposite in the Pseudomonas fluorescens siderophore enantio-pyochelin, in which the first ring originates from L-cysteine and the second ring from D-cysteine. Both siderophores promote growth of the producer organism during iron limitation and induce the expression of their biosynthesis genes by activating the transcriptional AraC-type regulator PchR. However, neither siderophore is functional as an iron carrier or as a transcriptional inducer in the other species, demonstrating that both processes are highly stereospecific. Stereospecificity of pyochelin/enantio-pyochelin-mediated iron uptake is ensured at two levels: (i) by the outer membrane siderophore receptors and (ii) by the cytosolic PchR regulators.
Abstract:
We describe the odorant binding proteins (OBPs) of the red imported fire ant, Solenopsis invicta, obtained from analyses of an EST library and separate 454 sequencing runs of two normalized cDNA libraries. We identified a total of 18 putative functional OBPs in this ant. A third of the fire ant OBPs are orthologs of honey bee OBPs. Another third of the OBPs belong to a lineage-specific expansion, which is a common feature of insect OBP evolution. Like other OBPs, the different fire ant OBPs share little sequence similarity (∼ 20%), rendering evolutionary analyses difficult. We discuss the resulting problems with sequence alignment, phylogenetic analysis, and tests of selection. As previously suggested, our results underscore the importance of carefully exploring how sensitive the results are to the choice of alignment method for data comprising widely divergent sequences.
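As a rough illustration of what a figure such as "∼ 20% sequence similarity" can mean in practice, the sketch below computes percent identity between two already-aligned sequences. It is only one common convention (identical residues divided by the number of columns in which neither sequence has a gap); the abstract does not say which definition the authors used, and the function name and the gap character `-` are assumptions made for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither sequence has a gap.

    Both inputs must come from the same alignment, so they must have equal length.
    The choice of denominator (ungapped columns) is an assumption; other
    conventions divide by the alignment length or the shorter sequence length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    compared = 0  # columns with a residue in both sequences
    matches = 0   # compared columns where the residues are identical
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned fragments (made up for illustration, not real OBP sequences).
print(percent_identity("ACDE-", "ACDFG"))
```

Because the denominator changes with the gaps that an aligner introduces, the same pair of sequences can yield different identity values under different alignment methods, which is one concrete way the sensitivity discussed in the abstract can show up.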
Abstract:
Embryonic development in nonmammalian vertebrates depends entirely on nutritional reserves that are predominantly derived from vitellogenin proteins and stored in egg yolk. Mammals have evolved new resources, such as lactation and placentation, to nourish their developing and early offspring. However, the evolutionary timing and molecular events associated with this major phenotypic transition are not known. By means of sensitive comparative genomics analyses and evolutionary simulations, we here show that the three ancestral vitellogenin-encoding genes were progressively lost during mammalian evolution (until around 30-70 million years ago, Mya) in all but the egg-laying monotremes, which have retained a functional vitellogenin gene. Our analyses also provide evidence that the major milk resource genes, caseins, which have functional properties similar to those of vitellogenins, appeared in the common mammalian ancestor approximately 200-310 Mya. Together, our data are compatible with the hypothesis that the emergence of lactation in the common mammalian ancestor and the development of placentation in eutherian and marsupial mammals allowed for the gradual loss of yolk-dependent nourishment during mammalian evolution.
Abstract:
A variety of cellular proteins has the ability to recognize DNA lesions induced by the anti-cancer drug cisplatin, with diverse consequences on their repair and on the therapeutic effectiveness of this drug. We report a novel gene involved in the cell response to cisplatin in vertebrates. The RDM1 gene (for RAD52 Motif 1) was identified while searching databases for sequences showing similarities to RAD52, a protein involved in homologous recombination and DNA double-strand break repair. Ablation of RDM1 in the chicken B cell line DT40 led to a more than 3-fold increase in sensitivity to cisplatin. However, RDM1-/- cells were not hypersensitive to DNA damage caused by ionizing radiation, UV irradiation, or the alkylating agent methylmethane sulfonate. The RDM1 protein displays a nucleic acid binding domain of the RNA recognition motif (RRM) type. By using gel-shift assays and electron microscopy, we show that purified, recombinant chicken RDM1 protein interacts with single-stranded DNA as well as double-stranded DNA, on which it assembles filament-like structures. Notably, RDM1 recognizes DNA distortions induced by cisplatin-DNA adducts in vitro. Finally, human RDM1 transcripts are abundant in the testis, suggesting a possible role during spermatogenesis.
Abstract:
The death-inducing receptor Fas is activated when cross-linked by the type II membrane protein Fas ligand (FasL). When human soluble FasL (sFasL, containing the extracellular portion) was expressed in human embryo kidney 293 cells, the three N-linked glycans of each FasL monomer were found to be essential for efficient secretion. Based on the structure of the closely related lymphotoxin alpha-tumor necrosis factor receptor I complex, a molecular model of the FasL homotrimer bound to three Fas molecules was generated using knowledge-based protein modeling methods. Point mutations of amino acid residues predicted to affect the receptor-ligand interaction were introduced at three sites. The F275L mutant, mimicking the loss of function murine gld mutation, exhibited a high propensity for aggregation and was unable to bind to Fas. Mutants P206R, P206D, and P206F displayed reduced cytotoxicity toward Fas-positive cells with a concomitant decrease in the binding affinity for the recombinant Fas-immunoglobulin Fc fusion proteins. Although the cytotoxic activity of mutant Y218D was unaltered, mutant Y218R was inactive, correlating with the prediction that Tyr-218 of FasL interacts with a cluster of three basic amino acid side chains of Fas. Interestingly, mutant Y218F could induce apoptosis in murine, but not human cells.
Abstract:
Background: Parallel T-Coffee (PTC) was the first parallel implementation of the T-Coffee multiple sequence alignment tool. It is based on MPI and RMA mechanisms. Its purpose is to reduce the execution time of large-scale sequence alignments. It can be run on distributed-memory clusters, allowing users to align data sets consisting of hundreds of proteins within a reasonable time. However, most of the potential users of this tool are not familiar with the use of grids or supercomputers. Results: In this paper we show how PTC can be easily deployed and controlled on a supercomputer architecture through a web portal developed with Rapid. Rapid is a tool for efficiently generating standardized portlets for a wide range of applications, and the approach described here is generic enough to be applied to other applications or to deploy PTC on different HPC environments. Conclusions: The PTC portal allows users to upload and align, with the parallel version of T-Coffee, large numbers of sequences that cannot be aligned on a single machine because of memory and execution time constraints. The web portal provides a user-friendly solution.
Abstract:
Chromatin state variation at gene regulatory elements is abundant across individuals, yet we understand little about the genetic basis of this variability. Here, we profiled several histone modifications, the transcription factor (TF) PU.1, RNA polymerase II, and gene expression in lymphoblastoid cell lines from 47 whole-genome sequenced individuals. We observed that distinct cis-regulatory elements exhibit coordinated chromatin variation across individuals in the form of variable chromatin modules (VCMs) at sub-Mb scale. VCMs were associated with thousands of genes and preferentially cluster within chromosomal contact domains. We mapped strong proximal and weak, yet more ubiquitous, distal-acting chromatin quantitative trait loci (cQTL) that frequently explain this variation. cQTLs were associated with molecular activity at clusters of cis-regulatory elements and mapped preferentially within TF-bound regions. We propose that local, sequence-independent chromatin variation emerges as a result of genetic perturbations in cooperative interactions between cis-regulatory elements that are located within the same genomic domain.
Abstract:
Mitochondrial DNA (mtDNA), a maternally inherited 16.6-kb molecule crucial for energy production, is implicated in numerous human traits and disorders. It has been hypothesized that mutations in the mtDNA may contribute to the complex genetic basis of schizophrenia, given the evidence of maternal inheritance and the presence of schizophrenia symptoms in patients affected by a mitochondrial disorder related to an mtDNA mutation. The present project aims to study the association between mtDNA variants and an increased risk of schizophrenia in a cohort of patients and controls from the same population. The entire mtDNA of 55 schizophrenia patients with an apparent maternal transmission of the disease and 38 controls was sequenced by Next Generation Sequencing (Ion Torrent PGM, Life Technologies) and compared to the reference sequence. The current method for establishing mtDNA haplotypes is Sanger sequencing, which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing technologies, this sequencing process can be performed much more quickly and cost-efficiently. We have identified 14 variants that have not been previously reported. Two of them were missense variants, MTATP6 p.V113M and MTND5 p.F334L; there were also three variants in rRNA-encoding regions and one in a tRNA-encoding region. No significant differences were found in the number of variants between the two groups. We found that the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of the bioinformatics analysis and annotation step would be desirable to facilitate the application of NGS in mtDNA analysis.
Abstract:
It has long been known that amino acids are the building blocks for proteins and govern their folding into specific three-dimensional structures. However, the details of this process are still unknown and represent one of the main problems in structural bioinformatics, which is a highly active research area with the focus on the prediction of three-dimensional structure and its relationship to protein function. The protein structure prediction procedure encompasses several different steps from searches and analyses of sequences and structures, through sequence alignment to the creation of the structural model. Careful evaluation and analysis ultimately result in a hypothetical structure, which can be used to study biological phenomena in, for example, research at the molecular level, biotechnology and especially in drug discovery and development. In this thesis, the structures of five proteins were modeled with template-based methods, which use proteins with known structures (templates) to model related or structurally similar proteins. The resulting models were an important asset for the interpretation and explanation of biological phenomena, such as amino acids and interaction networks that are essential for the function and/or ligand specificity of the studied proteins. The five proteins represent different case studies with their own challenges, such as varying template availability, which resulted in a different structure prediction process for each. This thesis presents the techniques and considerations that should be taken into account in the modeling procedure to overcome limitations and produce a hypothetical yet reliable three-dimensional structure.
As each project shows, the reliability is highly dependent on the extensive incorporation of experimental data or known literature and, although experimental verification of in silico results is always desirable to increase the reliability, the presented projects show that experimental studies can also benefit greatly from structural models. With the help of in silico studies, the experiments can be targeted and precisely designed, thereby saving both money and time. As the programs used in structural bioinformatics are constantly improved and the range of templates increases through structural genomics efforts, the mutual benefits between in silico and experimental studies become even more prominent. Hence, reliable models for protein three-dimensional structures achieved through careful planning and thoughtful execution are, and will continue to be, valuable and indispensable sources of structural information to be combined with functional data.
Abstract:
Avidins (Avds) are homotetrameric or homodimeric glycoproteins with typically less than 130 amino acid residues per monomer. They form a highly stable, non-covalent complex with biotin (vitamin H) with Kd = 10^-15 M (for chicken Avd). The best-studied Avds are the chicken Avd from Gallus gallus and streptavidin from Streptomyces avidinii, although other Avd studies have also included Avds from various origins, e.g., from frogs, fishes, mushrooms and from many different bacteria. Several engineered Avds have been reported as well, e.g., dual-chain Avds (dcAvds) and single-chain Avds (scAvds), circular permutants with up to four simultaneously modifiable ligand-binding sites. These engineered Avds along with the many native Avds have potential to be used in various nanobiotechnological applications. In this study, we made a structure-based alignment representing all currently available sequences of Avds and studied the evolutionary relationship of Avds using phylogenetic analysis. First, we created an initial multiple sequence alignment of Avds using 42 closely related sequences, guided by the known Avd crystal structures. Next, we searched for non-redundant Avd sequences from various online databases, including the National Center for Biotechnology Information and the Universal Protein Resource; the identified sequences were added to the initial alignment to expand it to a final alignment of 242 Avd sequences. The MEGA software package was used to create distance matrices and a phylogenetic tree.
Bootstrap reproducibility of the tree was poor at multiple nodes and may reflect several possible issues with the data: the sequence length compared is relatively short and, whereas some positions are highly conserved and functional, others can vary without impinging on the structure or the function, so there are few informative sites; it may be that periods of rapid duplication have led to paralogs and that the differences among them are within the error limit of the data; and there may be other yet unknown reasons. Principal component analysis applied to alternative distance data did segregate the major groups, and success is likely due to the multivariate consideration of all the information. Furthermore, based on our extensive alignment and phylogenetic analysis, we expressed two novel Avds, lacavidin from Latrodectus hesperus, a western black widow spider, and hoefavidin from Hoeflea phototrophica, an aerobic marine bacterium, the ultimate aim being to determine their X-ray structures. These Avds were selected because of their unique sequences: lacavidin has an N-terminal Avd-like domain but a long C-terminal overhang, whereas hoefavidin was thought to be a dimeric Avd. Both these Avds could be used as novel scaffolds in biotechnological applications.
Abstract:
Viewed naively, the process of evolution is a succession of duplication events and gradual mutations in the genome that lead to changes in the functions and interactions of the proteome. The family of Ras-like guanosine triphosphate hydrolases (GTPases) is a good working model for understanding this fundamental phenomenon, because this protein family contains a limited number of members that differ in function and interactions. Overall, we wish to understand how single mutations in GTPases affect cell morphology, as well as the extent of their impact on asynchronous populations. My master's work aims to classify, in a meaningful way, different phenotypes of the yeast Saccharomyces cerevisiae through the analysis of several morphological criteria of strains expressing mutated and native GTPases. Our approach, based on microscopy and bioinformatic analysis of DIC (differential interference contrast microscopy) images, distinguishes the phenotypes specific to native cells and to mutants. Using this method enabled automated detection and characterization of the mutant phenotypes associated with overexpression of constitutively active GTPases. The constitutively active GTPase mutants Cdc42 Q61L, Rho5 Q91H, Ras1 Q68L and Rsr1 G12V were analysed successfully. Indeed, implementing different clustering algorithms makes it possible to analyse data that combine morphological measurements of native and mutant populations. Our results show that the Fuzzy C-Means algorithm partitions native and mutant cells effectively, with the different cell types classified according to several cell shape factors obtained from the DIC images.
This analysis shows that the mutations Cdc42 Q61L, Rho5 Q91H, Ras1 Q68L and Rsr1 G12V induce, respectively, amorphous, elongated, round and large phenotypes, which are represented by distinct shape-factor vectors. These distinctions are observed in different proportions (mutant morphology / native morphology) in the mutant populations. The development of new automated methods for morphological analysis of native and mutant cells proves extremely useful for studying the GTPase family as well as the specific residues that dictate their functions and interaction network. We can now consider producing GTPase mutants that reverse their function by targeting divergent residues. The functional substitution is then detected at the morphological level using our new quantitative strategy. This type of analysis can also be transposed to other protein families and contribute significantly to the field of evolutionary biology.
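To make the clustering step more concrete, here is a minimal sketch of the standard Fuzzy C-Means update rules applied to small feature vectors. It is not the thesis's implementation: the function name, the fuzzifier value m = 2, the random initialisation, and the two-dimensional toy "shape factor" vectors are all assumptions chosen for illustration.

```python
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Minimal Fuzzy C-Means sketch (standard textbook updates, pure Python).

    points     : list of equal-length numeric tuples (e.g. shape-factor vectors)
    n_clusters : number of fuzzy clusters
    m          : fuzzifier, must be > 1 (m = 2 is a common default, assumed here)
    Returns (centers, memberships); memberships[i][k] is the degree to which
    point i belongs to cluster k, and each row sums to 1.
    """
    if m <= 1.0:
        raise ValueError("fuzzifier m must be greater than 1")
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(n_iter):
        # Centre update: mean of all points weighted by membership ** m.
        centers = []
        for k in range(n_clusters):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / total for d in range(dim)]
            )

        # Membership update: inversely related to distance from each centre.
        max_change = 0.0
        for i in range(n):
            dists = [
                sum((points[i][d] - c[d]) ** 2 for d in range(dim)) ** 0.5
                for c in centers
            ]
            if any(dist == 0.0 for dist in dists):
                # Point coincides with a centre: give it full membership there.
                raw = [1.0 if dist == 0.0 else 0.0 for dist in dists]
            else:
                raw = [dist ** (-2.0 / (m - 1.0)) for dist in dists]
            total = sum(raw)
            new_row = [v / total for v in raw]
            max_change = max(max_change, max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row

        if max_change < tol:  # memberships have stopped changing
            break

    return centers, u


# Toy feature vectors (made-up values, not measurements from the thesis).
cells = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
centers, memberships = fuzzy_c_means(cells, n_clusters=2)
hard_labels = [max(range(2), key=lambda k: row[k]) for row in memberships]
print(hard_labels)
```

Unlike hard clustering, each cell keeps a graded membership in every cluster, which fits a population in which a mutant strain contains a mixture of mutant-like and native-like morphologies.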