968 results for Protein Sequence Analysis
Abstract:
Aims: The adaptive immune response against hepatitis C virus (HCV) is significantly shaped by the host's composition of HLA alleles. Thus, the HLA phenotype is a critical determinant of viral evolution during adaptive immune pressure. Potential associations of HLA class I alleles with polymorphisms of HCV immune escape variants are largely unknown. Methods: Direct sequence analysis of the genes encoding the HCV proteins E2, NS3 and NS5B was performed in a cohort of 159 patients with chronic HCV genotype 1 infection who were treated with pegylated interferon-alfa 2b and ribavirin for 48 weeks in a prospective controlled trial. HLA class I genotyping was performed by strand-specific reverse hybridization with the INNO-LiPA line probe assays for HLA-A and HLA-B and by strand-specific PCR-SSP. We analyzed each amino acid position of HCV proteins using an extension of Fisher's exact test for associations with HLA alleles. In addition, associations of specific HLA alleles with inflammatory activity, liver fibrosis, HCV RNA viral load and virologic treatment outcome were investigated. Results: Separate analyses of HCV subtype 1a and 1b isolates revealed substantially different patterns of HLA-restricted polymorphisms between subtypes. Only one polymorphism within NS5B (V2758x) was significantly associated with HLA B*15 in HCV genotype 1b infected patients (adjusted p=0.048). However, a number of HLA class I-restricted polymorphisms within novel putative HCV CD8+ T cell epitopes (genotype 1a: HLA-A*11 GTRTIASPK1086-1094 [NS3], HLA-B*07 WPAPQGARSL1111-1120 [NS3]; genotype 1b: HLA-A*24 HYAPRPCGI488-496 [E2], HLA-B*44 GENETDVLL530-538 [E2], HLA-B*15 RVFTEAMTRY2757-2766 [NS5B]) were observed with high predicted epitope binding scores assessed by the web-based software SYFPEITHI (>21). Most of the identified putative epitopes overlapped with previously published epitopes, indicating high immunogenicity of the corresponding HCV protein region.
In addition, certain HLA class I alleles were associated with inflammatory activity, stage of liver fibrosis, and sustained virologic response to antiviral therapy. Conclusions: HLA class I restricted HCV sequence polymorphisms are rare. HCV polymorphisms identified within putative HCV CD8+ T cell epitopes in the present study differ in their genomic distribution between genotype 1a and 1b isolates, implying divergent adaptation to the host's immune pressure on the HCV subtype level.
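The per-position association testing described above can be illustrated with a plain two-sided Fisher's exact test on a 2x2 table (allele carrier or not, variant residue or not). This is only a minimal sketch: the abstract refers to an unspecified extension of the test and to adjusted p-values, neither of which is reproduced here, and the function name and the counts in the usage note are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Hypothetical layout: rows = carriers / non-carriers of one HLA allele,
    columns = variant / consensus residue at one HCV amino acid position.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one
    # (small tolerance guards against floating-point ties).
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )
    return min(1.0, p_value)
```

With made-up counts, `fisher_exact_two_sided(8, 2, 1, 9)` would test whether a variant seen in 8 of 10 carriers and 1 of 10 non-carriers is associated with the allele. In a real analysis the p-values from all positions would still need correction for multiple testing.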
Abstract:
Little is known about the relation between genome organization and gene expression in Leishmania. Bioinformatic analysis can be used to predict genes and find homologies with known proteins. A model was proposed in which genes are organized into large clusters and transcribed from only one strand, in the form of large polycistronic primary transcripts. To verify the validity of this model, we studied gene expression at the transcriptional, post-transcriptional and translational levels in a unique locus of 34 kb located on chr27 and represented by cosmid L979. Sequence analysis revealed 115 ORFs on either DNA strand. Using computer programs developed for Leishmania genes, only nine of these ORFs, localized on the same strand, were predicted to code for proteins, some of which show homologies with known proteins. Additionally, one pseudogene was identified. We verified the biological relevance of these predictions. mRNAs from nine predicted genes and proteins from seven were detected. Nuclear run-on analyses confirmed that the top strand is transcribed by RNA polymerase II and suggested that there is no polymerase entry site. Low levels of transcription were detected in regions of the bottom strand, and stable transcripts were identified for four ORFs on this strand not predicted to be protein-coding. In conclusion, the transcriptional organization of the Leishmania genome is complex, raising the possibility that computer predictions may not be comprehensive.
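To make the first step concrete (listing ORFs on both strands before any gene-specific prediction), here is a naive six-frame ORF scan. It is a sketch under simple assumptions: ORFs run from an ATG to the next in-frame stop codon, and the function names and the `min_codons` threshold are illustrative. It is not the Leishmania-specific software the abstract mentions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 3) -> list:
    """List ORFs as (strand, start, end) tuples.

    Coordinates are 0-based and half-open on the strand being scanned, so
    for the "-" strand they refer to the reverse-complemented sequence.
    The codon count includes the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for strand, template in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(template) - 2, 3):
                codon = template[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first ATG opens a candidate ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, start, i + 3))
                    start = None  # resume the search after the stop codon
    return orfs
```

A scan like this over-reports heavily, which is in line with the abstract's finding that only nine of 115 ORFs were predicted to be coding. Distinguishing real genes needs additional signals such as codon usage or homology.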
Abstract:
HAMAP (High-quality Automated and Manual Annotation of Proteins, available at http://hamap.expasy.org/) is a system for the automatic classification and annotation of protein sequences. HAMAP provides annotation of the same quality and detail as UniProtKB/Swiss-Prot, using manually curated profiles for protein sequence family classification and expert curated rules for functional annotation of family members. HAMAP data and tools are made available through our website and as part of the UniRule pipeline of UniProt, providing annotation for millions of unreviewed sequences of UniProtKB/TrEMBL. Here we report on the growth of HAMAP and updates to the HAMAP system since our last report in the NAR Database Issue of 2013. We continue to augment HAMAP with new family profiles and annotation rules as new protein families are characterized and annotated in UniProtKB/Swiss-Prot; the latest version of HAMAP (as of 3 September 2014) contains 1983 family classification profiles and 1998 annotation rules (up from 1780 and 1720). We demonstrate how the complex logic of HAMAP rules allows for precise annotation of individual functional variants within large homologous protein families. We also describe improvements to our web-based tool HAMAP-Scan which simplify the classification and annotation of sequences, and the incorporation of an improved sequence-profile search algorithm.
Abstract:
Background: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a ‘reference set’ of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. Results: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. Conclusions: This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.
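The nucleotide-level figures quoted above can be made concrete with a short sketch. It assumes the convention common in gene-prediction benchmarks, where sensitivity is the fraction of annotated coding bases that were predicted and "specificity" is the fraction of predicted coding bases that are annotated. The abstract does not state its exact definitions, and the function names and interval format here are hypothetical.

```python
def coding_positions(exons):
    """Expand half-open (start, end) exon intervals into a set of base positions."""
    positions = set()
    for start, end in exons:
        positions.update(range(start, end))
    return positions


def nucleotide_accuracy(annotated_exons, predicted_exons):
    """Return (sensitivity, specificity) at the coding-nucleotide level.

    sensitivity = TP / (TP + FN): annotated coding bases that were predicted.
    specificity = TP / (TP + FP): predicted coding bases that are annotated
    (assumed gene-prediction convention, not the usual TN-based definition).
    """
    annotated = coding_positions(annotated_exons)
    predicted = coding_positions(predicted_exons)
    true_positives = len(annotated & predicted)
    sensitivity = true_positives / len(annotated) if annotated else 0.0
    specificity = true_positives / len(predicted) if predicted else 0.0
    return sensitivity, specificity
```

Expanding intervals into sets is convenient for a region-sized example. For whole chromosomes an interval-overlap computation would use far less memory.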
Abstract:
Powdery mildew is an important disease of wheat caused by the obligate biotrophic fungus Blumeria graminis f. sp. tritici. This pathogen invades exclusively epidermal cells after penetrating directly through the cell wall. Because powdery mildew colonizes exclusively epidermal cells, it is important not only to identify genes which are activated, but also to monitor the tissue specificity of gene activation. Acquired resistance of wheat to powdery mildew can be induced by a previous inoculation with the non-host pathogen B. graminis f. sp. hordei, the causal agent of barley powdery mildew. The establishment of the resistant state is accompanied by the activation of genes. Here we report the tissue-specific cDNA-AFLP analysis and cloning of transcripts accumulating 6 and 24 h after the resistance-inducing inoculation with B. graminis f. sp. hordei. A total of 25,000 fragments estimated to represent about 17,000 transcripts were displayed. Of these, 141 transcripts were found to accumulate after Bgh inoculation using microarray hybridization analysis. Forty-four accumulated predominantly in the epidermis, whereas 76 transcripts accumulated mostly in mesophyll tissue.
Abstract:
A pool of oligonucleotides encoding a start methionine and nine random amino acids was inserted at the 5'-end of the gene for the yeast cytochrome oxidase subunit IV lacking its own mitochondrial targeting sequence. Approximately one-quarter of the randomly generated sequences targeted subunit IV to its correct intramitochondrial location in vivo. Sequence analysis of 89 randomly generated sequences showed that their efficiencies as mitochondrial targeting signals correlated with the potential to fold into an amphiphilic alpha-helix. Functional targeting sequences were enriched in arginine and isoleucine residues but contained few aspartate, glutamate, and proline residues. Nonfunctional sequences predicted to have significant helical amphiphilicity often had at least one acidic residue or multiple helix-breaking residues that would be expected to interfere with targeting function. These results support the hypothesis that the signal for targeting a protein into the mitochondrial matrix is usually a positively charged amphiphilic helix.
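One common way to quantify the "potential to fold into an amphiphilic alpha-helix" is the helical hydrophobic moment, which sums per-residue hydrophobicities as vectors rotated by 100 degrees per residue. The sketch below assumes the Kyte-Doolittle hydropathy scale and this moment definition purely for illustration. The abstract does not say which scale or amphiphilicity measure the authors used, and the function name is hypothetical.

```python
import math

# Kyte-Doolittle hydropathy values (assumed scale for this illustration).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophobic_moment(peptide: str, angle_deg: float = 100.0) -> float:
    """Mean helical hydrophobic moment of a peptide.

    Each residue contributes its hydropathy as a vector pointing at
    i * angle_deg around the helix axis; 100 degrees per residue is the
    ideal alpha-helix. A large value means hydrophobic and hydrophilic
    residues segregate onto opposite faces of the helix.
    """
    angle = math.radians(angle_deg)
    sin_sum = sum(
        KYTE_DOOLITTLE[aa] * math.sin(i * angle) for i, aa in enumerate(peptide)
    )
    cos_sum = sum(
        KYTE_DOOLITTLE[aa] * math.cos(i * angle) for i, aa in enumerate(peptide)
    )
    return math.hypot(sin_sum, cos_sum) / len(peptide)
```

A uniform peptide such as 18 alanines spans exactly five helical turns, so its moment cancels to zero, while an alternating leucine/lysine pattern such as `LKKLLKKLLKKLLKKLLK` scores well above it. The moment ignores charge and helix-breaking residues, which the abstract identifies as separate determinants of targeting function.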
Abstract:
Previous microarray studies on breast cancer identified multiple tumour classes, of which the most prominent, named luminal and basal, differ in expression of the oestrogen receptor alpha gene (ER). We report here the identification of a group of breast tumours with increased androgen signalling and a 'molecular apocrine' gene expression profile. Tumour samples from 49 patients with large operable or locally advanced breast cancers were tested on Affymetrix U133A gene expression microarrays. Principal components analysis and hierarchical clustering split the tumours into three groups: basal, luminal and a group we call molecular apocrine. All of the molecular apocrine tumours have strong apocrine features on histological examination (P=0.0002). The molecular apocrine group is androgen receptor (AR) positive and contains all of the ER-negative tumours outside the basal group. Kolmogorov-Smirnov testing indicates that oestrogen signalling is most active in the luminal group, and androgen signalling is most active in the molecular apocrine group. ERBB2 amplification is more common in the molecular apocrine group than in the other groups. Genes that best split the three groups were identified by Wilcoxon test. Correlation of the average expression profile of these genes in our data with the expression profile of individual tumours in four published breast cancer studies suggests that molecular apocrine tumours represent 8-14% of tumours in these studies. Our data show that it is possible with microarray data to divide mammary tumour cells into three groups based on steroid receptor activity: luminal (ER+ AR+), basal (ER- AR-) and molecular apocrine (ER- AR+).
Abstract:
PHO1 has been recently identified as a protein involved in the loading of inorganic phosphate into the xylem of roots in Arabidopsis. The genome of Arabidopsis contains 11 members of the PHO1 gene family. The cDNAs of all PHO1 homologs have been cloned and sequenced. All proteins have the same topology and harbor a SPX tripartite domain in the N-terminal hydrophilic portion and an EXS domain in the C-terminal hydrophobic portion. The SPX and EXS domains have been identified in yeast (Saccharomyces cerevisiae) proteins involved in either phosphate transport or sensing or in sorting proteins to endomembranes. The Arabidopsis genome contains additional proteins of unknown function containing either a SPX or an EXS domain. Phylogenetic analysis indicated that the PHO1 family is subdivided into at least three clusters. Reverse transcription-PCR revealed a broad pattern of expression in leaves, roots, stems, and flowers for most genes, although two genes are expressed exclusively in flowers. Analysis of the activity of the promoter of all PHO1 homologs using promoter-beta-glucuronidase fusions revealed a predominant expression in the vascular tissues of roots, leaves, stems, or flowers. beta-Glucuronidase expression is also detected for several promoters in nonvascular tissue, including hydathodes, trichomes, root tip, root cortical/epidermal cells, and pollen grains. The expression pattern of PHO1 homologs indicates a likely role of the PHO1 proteins not only in the transfer of phosphate to the vascular cylinder of various tissues but also in the acquisition of phosphate into cells, such as pollen or root epidermal/cortical cells.
Abstract:
The human TPTE (Transmembrane Phosphatase with TEnsin homology) gene family encodes a PTEN-related tyrosine phosphatase with four potential transmembrane domains. Chromosomal mapping revealed multiple copies of the TPTE gene on chromosomes 13, 15, 21, 22 and Y. The copies on human chromosomes 13 and 21 encode two functional proteins, TPIP (TPTE and PTEN homologous Inositol lipid Phosphatase) and TPTE, respectively, whereas only one copy of the gene exists in the mouse genome. In the present study, we show that TPTE and TPIP proteins are expressed in secondary spermatocytes and/or prespermatids. In addition, we report the existence of several novel alternatively spliced isoforms of these two proteins with a variable number of transmembrane domains. This variation has no influence on the subcellular localization of these different peptides, as shown by co-immunofluorescence experiments. Finally, we identify another expressed TPTE copy, mapping to human chromosome 22, whose transcription appears to be under the control of the LTR of human endogenous retrovirus RTVL-H3.
Abstract:
Purpose. To investigate the role of the myocyte enhancer factor 2 (Mef2) transcription factor family in retinal diseases, Mef2c expression was assessed during retinal degeneration in the Rpe65(-/-) mouse model of Leber's congenital amaurosis (LCA). Mef2c-dependent expression of photoreceptor-specific genes was further addressed. Methods. Expression of Mef2 members was analyzed by oligonucleotide microarray, quantitative PCR (qPCR) and in situ hybridization. Mef2c-dependent transcriptional activity was assayed by luciferase assay in HEK293T cells. Results. Mef2c was the only Mef2 member markedly downregulated during retinal degeneration in Rpe65(-/-) mice. Mef2c mRNA level was decreased by more than 2 fold at 2 and 4 months and by 3.5 fold at 6 months in retinas of Rpe65(-/-) mice. Downregulation of Mef2c at the protein level was confirmed in Rpe65(-/-) retinas. The decrease in Mef2c mRNA levels in the developing Rpe65(-/-) retinas, from post-natal day (P)13 onward, was concomitant with the decreased expression of the rod-specific transcription factors Nrl and Nr2e3. Nrl was further shown to drive Mef2c transcriptional activity, supporting a physiological role for Mef2c in the retina. In addition, Mef2c appeared to act as a transcriptional repressor of its own expression, as well as those of the retina-specific retinal G-protein coupled receptor (Rgr), rhodopsin and M-opsin genes. Conclusions. These findings highlight the early altered regulation of the rod-specific transcriptional network in Rpe65-related disease. They further indicate that Mef2c may act as a novel transcription factor involved in the development and the maintenance of photoreceptor cells.
Abstract:
Mutations in LACERATA (LCR), FIDDLEHEAD (FDH), and BODYGUARD (BDG) cause a complex developmental syndrome that is consistent with an important role for these Arabidopsis genes in cuticle biogenesis. The genesis of their pleiotropic phenotypes is, however, poorly understood. We provide evidence that neither distorted depositions of cutin, nor deficiencies in the chemical composition of cuticular lipids, account for these features, instead suggesting that the mutants alleviate the functional disorder of the cuticle by reinforcing their defenses. To better understand how plants adapt to these mutations, we performed a genome-wide gene expression analysis. We found that apparent compensatory transcriptional responses in these mutants involve the induction of wax, cutin, cell wall, and defense genes. To gain greater insight into the mechanism by which cuticular mutations trigger this response in the plants, we performed an overlap meta-analysis, which is termed MASTA (MicroArray overlap Search Tool and Analysis), of differentially expressed genes. This suggested that different cell integrity pathways are recruited in cesA cellulose synthase and cuticular mutants. Using MASTA for an in silico suppressor/enhancer screen, we identified SERRATE (SE), which encodes a protein of RNA-processing multi-protein complexes, as a likely enhancer. In confirmation of this notion, the se lcr and se bdg double mutants eradicate severe leaf deformations as well as the organ fusions that are typical of lcr and bdg and other cuticular mutants. Also, lcr does not confer resistance to Botrytis cinerea in a se mutant background. We propose that there is a role for SERRATE-mediated RNA signaling in the cuticle integrity pathway.
Abstract:
One of the key mechanisms linking cell signaling and control of gene expression is reversible phosphorylation of transcription factors. FOXC2 is a forkhead transcription factor that is mutated in the human vascular disease lymphedema-distichiasis and plays an essential role in lymphatic vascular development. However, the mechanisms regulating FOXC2 transcriptional activity are not well understood. We report here that FOXC2 is phosphorylated on eight evolutionarily conserved proline-directed serine/threonine residues. Loss of phosphorylation at these sites triggers substantial changes in the FOXC2 transcriptional program. Through genome-wide location analysis in lymphatic endothelial cells, we demonstrate that the changes are due to selective inhibition of FOXC2 recruitment to chromatin. The extent of the inhibition varied between individual binding sites, suggesting a novel rheostat-like mechanism by which expression of specific genes can be differentially regulated by FOXC2 phosphorylation. Furthermore, unlike the wild-type protein, the phosphorylation-deficient mutant of FOXC2 failed to induce vascular remodeling in vivo. Collectively, our results point to the pivotal role of phosphorylation in the regulation of FOXC2-mediated transcription in lymphatic endothelial cells and underscore the importance of FOXC2 phosphorylation in vascular development.
Abstract:
The gene encoding type I signal peptidase (Lmjsp) has been cloned from Leishmania major. Lmjsp encodes a protein of 180 amino acid residues with a predicted molecular mass of 20.5 kDa. Comparison of the protein sequence with those of known type I signal peptidases indicates homology in five conserved domains A-E which are known to be important, or essential, for catalytic activity. Southern blot hybridisation analysis indicates that there is a single copy of the Lmjsp gene. A recombinant SPase protein and a synthetic peptide of the L. major signal peptidase were used to examine the presence of specific antibodies in sera from individuals with either recovered or active cutaneous or visceral leishmaniasis. This evaluation demonstrated that sera from cutaneous and visceral forms of leishmaniasis are highly reactive to both the recombinant and synthetic signal peptidase antigens. Therefore, the Leishmania signal peptidase, albeit localised intracellularly, is a significant target of the Leishmania-specific immune response, which highlights its potential use for serodiagnosis of cutaneous and visceral leishmaniasis.
Abstract:
Secreted proteases constitute potential virulence factors of dermatophytes. A total of seven genes encoding putative serine proteases of the subtilisin family (SUB) were isolated in Trichophyton rubrum. Based on sequence data and intron-exon structure, a phylogenetic analysis of subtilisins from T. rubrum and other fungi revealed a presumed ancestral lineage comprising T. rubrum SUB2 and Aspergillus SUBs. All other SUBs (SUB1, SUB3-7) are dermatophyte-specific and have apparently emerged more recently, through successive gene duplication events. We showed that two subtilisins, Sub3 and Sub4, were detected in culture supernatants of T. rubrum grown in a medium containing soy protein as a sole nitrogen source. Both recombinant enzymes produced in Pichia pastoris are highly active on keratin azure suggesting that these proteases play an important role in invasion of keratinised tissues by the fungus. The set of deduced amino acid sequences of T. rubrum SUB ORFs allowed the identification of orthologous Subs secreted by other dermatophyte species using proteolysis and mass spectrometry.
Abstract:
Summary [French summary: see below] Since the beginning of the 20th century the world population has been confronted with human immunodeficiency virus 1 (HIV-1). This virus mutates particularly fast and can thus evade and adapt to the human host. Our closest evolutionary relatives, the non-human primates, are less susceptible to HIV-1. In a broader sense, primates are differentially susceptible to various retroviruses. Species specificity may be due to genetic differences among primates. In the present study we applied evolutionary and comparative genetic techniques to characterize the evolutionary pattern of host cellular determinants of HIV-1 pathogenesis. The study of the evolution of genes coding for proteins participating in the restriction or pathogenesis of HIV-1 may help in understanding the genetic basis of modern human susceptibility to infection. To perform comparative genetic analysis, we assembled a collection of primate DNA and RNA to allow generation of de novo sequences of gene orthologs. More recently, the release to the public domain of two new complete primate genomes (Bornean orang-utan and common marmoset), in addition to the three previously available genomes (human, chimpanzee and rhesus monkey), has helped scale up the evolutionary and comparative genome analysis. Sequence analysis used phylogenetic and statistical methods for detecting molecular adaptation. We identified different selective pressures acting on host proteins involved in HIV-1 pathogenesis. Proteins with HIV-1 restriction properties in non-human primates were under strong positive selection, in particular in regions of interaction with viral proteins. These regions carried key residues for the antiviral activity. Proteins of the innate immunity presented an evolutionary pattern of conservation (purifying selection) but with signals of relaxed constraint when compared with the average profile of purifying selection of the primate genomes. 
Large-scale analysis resulted in patterns of evolutionary pressures according to molecular function, biological process and cellular distribution. The data generated by the various analyses served to guide the ancestral reconstruction of TRIM5a, a potent antiviral host factor. The resurrected TRIM5a from the common ancestor of Old World monkeys was effective against HIV-1, and the more recent resurrected hominoid variants were more effective against other retroviruses. Thus, as the result of trade-offs in the ability to restrict different retroviruses, humans might have been exposed to HIV-1 at a time when TRIM5a lacked the appropriate specific restriction activity. The application of evolutionary and comparative genetic tools should be considered for the systematic assessment of host proteins relevant in viral pathogenesis, and to guide biological and functional studies. Summary (translated from the French) Since the beginning of the twentieth century, the world population has been confronted with human immunodeficiency virus 1 (HIV-1). This virus has a particularly high mutation rate and can therefore evade and adapt to its host very efficiently. The organisms evolutionarily closest to humans, the non-human primates, are less susceptible to HIV-1. More generally, primates respond differently to retroviruses. This species specificity must lie in the genetic differences among primates. In this study we applied evolutionary and comparative genetic techniques to characterize the evolutionary pattern of the cellular determinants involved in HIV-1 pathogenesis. Studying the evolution of genes coding for proteins involved in the restriction or pathogenesis of HIV-1 will help in understanding the genetic basis that recently made humans susceptible. For the comparative genetic analyses, we assembled a collection of primate DNA and RNA in order to obtain new sequences of orthologous genes. 
Recently, two new complete genomes were published (the Bornean orang-utan and the common marmoset), in addition to the three genomes already available (human, chimpanzee and rhesus macaque). This considerably extended the scope of the analysis. To detect molecular adaptation, we analyzed the sequences using phylogenetic and statistical methods. We identified different selective pressures acting on the proteins involved in HIV-1 pathogenesis. Proteins with HIV-1 restriction properties in non-human primates show a particularly high rate of amino acid replacement (positive selection), in particular in the regions that interact with viral proteins. These regions include key amino acids for restriction activity. Proteins belonging to innate immunity show an evolutionary pattern of conservation (purifying selection), but with traces of "relaxation" compared with the general profile of purifying selection in the primate genome. A large-scale analysis made it possible to classify the patterns of evolutionary pressure by molecular function, biological process and cellular distribution. The data generated by the various analyses enabled the ancestral reconstruction of TRIM5a, a potent antiretroviral factor. The resurrected TRIM5a corresponding to the common ancestor of the great apes and the catarrhine group is effective against modern HIV-1. The more recent resurrected TRIM5a variants, corresponding to the ancestors of the great apes, are more effective against other retroviruses. Thus, as a result of a trade-off in the ability to restrict different retroviruses, humans would have been exposed to HIV-1 at a time when TRIM5a lacked specific restriction activity against it. 
The application of evolutionary and comparative genetic techniques should be considered for the systematic evaluation of proteins involved in viral pathogenesis, as well as to guide biological and functional studies.