968 resultados para Protein Sequence Analysis
Resumo:
BACKGROUND: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. RESULTS: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. CONCLUSION: This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.
Resumo:
Highly quantitative biomarkers of neurodegenerative disease remain an important need in the urgent quest for disease-modifying therapies. For Huntington's disease (HD), a genetic test is available (trait marker), but necessary state markers are still in development. In this report, we describe a large battery of transcriptomic tests explored as state biomarker candidates. In an attempt to exploit the known neuroinflammatory and transcriptional perturbations of disease, we measured relevant mRNAs in peripheral blood cells. The performance of these potential markers was weak overall, with only one mRNA, immediate early response 3 (IER3), showing a modest but significant increase of 32% in HD samples compared with controls. No statistically significant differences were found for any other mRNAs tested, including a panel of 12 RNA biomarkers identified in a previous report [Borovecki F, Lovrecic L, Zhou J, Jeong H, Then F, Rosas HD, Hersch SM, Hogarth P, Bouzou B, Jensen RV, et al. (2005) Proc Natl Acad Sci USA 102:11023-11028]. The present results may nonetheless inform the future design and testing of HD biomarker strategies.
Resumo:
DNA-binding proteins mediate a variety of crucial molecular functions, such as transcriptional regulation and chromosome maintenance, replication and repair, which in turn control cell division and differentiation. The roles of these proteins in disease are currently being investigated using microarray-based approaches. However, these assays can be difficult to adapt to routine diagnosis of complex diseases such as cancer. Here, we review promising alternative approaches involving protein-binding microarrays (PBMs) that probe the interaction of proteins from crude cell or tissue extracts with large collections of synthetic or natural DNA sequences. Recent studies have demonstrated the use of these novel PBM approaches to provide rapid and unbiased characterization of DNA-binding proteins as molecular markers of disease, for example cancer progression or infectious diseases.
Resumo:
A key element for the development of suitable anti-cancer drugs is the identification of cancer-specific enzymatic activities that can be therapeutically targeted. Mucosa-associated lymphoid tissue transformation protein 1 (MALT1) is a proto-oncogene that contributes to tumorigenesis in diffuse large B-cell lymphoma (DLBCL) of the activated B-cell (ABC) subtype, the least curable subtype of DLBCL. Recent data suggest that MALT1 has proteolytic activity, but it is unknown whether this activity is relevant for tumor growth. Here we report that MALT1 is constitutively active in DLBCL lines of the ABC but not the GCB subtype. Inhibition of the MALT1 proteolytic activity led to reduced expression of growth factors and apoptosis inhibitors, and specifically affected the growth and survival of ABC DLBCL lines. These results demonstrate a key role for the proteolytic activity of MALT1 in DLBCL of the ABC subtype, and provide a rationale for the development of pharmacological inhibitors of MALT1 in DLBCL therapy.
Resumo:
Background: The RPS4 gene codifies for ribosomal protein S4, a very well-conserved protein present in all kingdoms. In primates, RPS4 is codified by two functional genes located on both sex chromosomes: the RPS4X and RPS4Y genes. In humans, RPS4Y is duplicated and the Y chromosome therefore carries a third functional paralog: RPS4Y2, which presents a testis-specific expression pattern. Results: DNA sequence analysis of the intronic and cDNA regions of RPS4Y genes from species covering the entire primate phylogeny showed that the duplication event leading to the second Y-linked copy occurred after the divergence of New World monkeys, about 35 million years ago. Maximum likelihood analyses of the synonymous and non-synonymous substitutions revealed that positive selection was acting on RPS4Y2 gene in the human lineage, which represents the first evidence of positive selection on a ribosomal protein gene. Putative positive amino acid replacements affected the three domains of the protein: one of these changes is located in the KOW protein domain and affects the unique invariable position of this motif, and might thus have a dramatic effect on the protein function.Conclusion: Here, we shed new light on the evolutionary history of RPS4Y gene family, especially on that of RPS4Y2. The results point that the RPS4Y1 gene might be maintained to compensate gene dosage between sexes, while RPS4Y2 might have acquired a new function, at least in the lineage leading to humans.
Resumo:
OBJECTIVES: To assess the in vitro susceptibility of Actinobaculum schaalii to 12 antimicrobial agents as well as to dissect the genetic basis of fluoroquinolone resistance. METHODS: Forty-eight human clinical isolates of A. schaalii collected in Switzerland and France were studied. Each isolate was identified by 16S rRNA sequencing. MICs of amoxicillin, ceftriaxone, gentamicin, vancomycin, clindamycin, linezolid, ciprofloxacin, levofloxacin, moxifloxacin, co-trimoxazole, nitrofurantoin and metronidazole were determined using the Etest method. Interpretation of results was made according to EUCAST clinical breakpoints. The quinolone-resistance-determining regions (QRDRs) of gyrA and parC genes were also identified and sequence analysis was performed for all 48 strains. RESULTS: All isolates were susceptible to amoxicillin, ceftriaxone, gentamicin, clindamycin (except three), vancomycin, linezolid and nitrofurantoin, whereas 100% and 85% were resistant to ciprofloxacin/metronidazole and co-trimoxazole, respectively. Greater than or equal to 90% of isolates were susceptible to the other tested fluoroquinolones, and only one strain was highly resistant to levofloxacin (MIC ?32 mg/L) and moxifloxacin (MIC 8 mg/L). All isolates that were susceptible or low-level resistant to levofloxacin/moxifloxacin (n?=?47) showed identical GyrA and ParC amino acid QRDR sequences. In contrast, the isolate exhibiting high-level resistance to levofloxacin and moxifloxacin possessed a unique mutation in GyrA, Ala83Val (Escherichia coli numbering), whereas no mutation was present in ParC. CONCLUSIONS: When an infection caused by A. schaalii is suspected, there is a risk of clinical failure by treating with ciprofloxacin or co-trimoxazole, and ?-lactams should be preferred. In addition, acquired resistance to fluoroquinolones more active against Gram-positive bacteria is possible.
Resumo:
Background: Annotations of completely sequenced genomes reveal that nearly half of the genes identified are of unknown function, and that some belong to uncharacterized gene families. To help resolve such issues, information can be obtained from the comparative analysis of homologous genes in model organisms. Results: While characterizing genes from the retinitis pigmentosa locus RP26 at 2q31-q33, we have identified a new gene, ORMDL1, that belongs to a novel gene family comprising three genes in humans (ORMDL1, ORMDL2 and ORMDL3), and homologs in yeast, microsporidia, plants, Drosophila, urochordates and vertebrates. The human genes are expressed ubiquitously in adult and fetal tissues. The Drosophila ORMDL homolog is also expressed throughout embryonic and larval stages, particularly in ectodermally derived tissues. The ORMDL genes encode transmembrane proteins anchored in the endoplasmic reticulum (ER). Double knockout of the two Saccharomyces cerevisiae homologs leads to decreased growth rate and greater sensitivity to tunicamycin and dithiothreitol. Yeast mutants can be rescued by human ORMDL homologs. Conclusions: From protein sequence comparisons we have defined a novel gene family, not previously recognized because of the absence of a characterized functional signature. The sequence conservation of this family from yeast to vertebrates, the maintenance of duplicate copies in different lineages, the ubiquitous pattern of expression in human and Drosophila, the partial functional redundancy of the yeast homologs and phenotypic rescue by the human homologs, strongly support functional conservation. Subcellular localization and the response of yeast mutants to specific agents point to the involvement of ORMDL in protein folding in the ER.
Resumo:
Adenoviruses of primates include human (HAdV) and simian (SAdV) isolates classified into 8 species (Human Adenovirus A to G, and Simian Adenovirus A). In this study, a novel adenovirus was isolated from a colony of cynomolgus macaques (Macaca fascicularis) and subcultured in VERO cells. Its complete genome was purified and a region encompassing the hexon gene, the protease gene, the DNA binding protein (DBP) and the 100 kDa protein was amplified by PCR and sequenced by primer walking. Sequence analysis of these four genes showed that the new isolate had 80% identity to other primate adenoviruses and lacked recombination events. The study of the evolutionary relationships of this new monkey AdV based on the combined sequences of the four genes supported a close relationship to SAdV-3 and SAdV-6, lineages isolated from Rhesus monkeys. The clade formed by these three types is separated from the remaining clades and establishes a novel branch that is related to species HAdV-A, F and G. However, the genetic distance corresponding to the newly isolated monkey AdV considerably differs from these as to belong to a new, not yet established species. Results presented here widen our knowledge on SAdV and represents an important contribution to the understanding of the evolutionary history of primate adenoviruses.
Resumo:
Adenoviruses of primates include human (HAdV) and simian (SAdV) isolates classified into 8 species (Human Adenovirus A to G, and Simian Adenovirus A). In this study, a novel adenovirus was isolated from a colony of cynomolgus macaques (Macaca fascicularis) and subcultured in VERO cells. Its complete genome was purified and a region encompassing the hexon gene, the protease gene, the DNA binding protein (DBP) and the 100 kDa protein was amplified by PCR and sequenced by primer walking. Sequence analysis of these four genes showed that the new isolate had 80% identity to other primate adenoviruses and lacked recombination events. The study of the evolutionary relationships of this new monkey AdV based on the combined sequences of the four genes supported a close relationship to SAdV-3 and SAdV-6, lineages isolated from Rhesus monkeys. The clade formed by these three types is separated from the remaining clades and establishes a novel branch that is related to species HAdV-A, F and G. However, the genetic distance corresponding to the newly isolated monkey AdV considerably differs from these as to belong to a new, not yet established species. Results presented here widen our knowledge on SAdV and represents an important contribution to the understanding of the evolutionary history of primate adenoviruses.
Resumo:
The function of DNA-binding proteins is controlled not just by their abundance, but mainly at the level of their activity in terms of their interactions with DNA and protein targets. Moreover, the affinity of such transcription factors to their target sequences is often controlled by co-factors and/or modifications that are not easily assessed from biological samples. Here, we describe a scalable method for monitoring protein-DNA interactions on a microarray surface. This approach was designed to determine the DNA-binding activity of proteins in crude cell extracts, complementing conventional expression profiling arrays. Enzymatic labeling of DNA enables direct normalization of the protein binding to the microarray, allowing the estimation of relative binding affinities. Using DNA sequences covering a range of affinities, we show that the new microarray-based method yields binding strength estimates similar to low-throughput gel mobility-shift assays. The microarray is also of high sensitivity, as it allows the detection of a rare DNA-binding protein from breast cancer cells, the human tumor suppressor AP-2. This approach thus mediates precise and robust assessment of the activity of DNA-binding proteins and takes present DNA-binding assays to a high throughput level.
Resumo:
The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.
Resumo:
We investigate the evolutionary history of the greater white-toothed shrew across its distribution in northern Africa and mainland Europe using sex-specific (mtDNA and Y chromosome) and biparental (X chromosome) markers. All three loci confirm a large divergence between eastern (Tunisia and Sardinia) and western (Morocco and mainland Europe) lineages, and application of a molecular clock to mtDNA divergence estimates indicates a more ancient separation (2.25 M yr ago) than described by some previous studies, supporting claims for taxonomic revision. Moroccan ancestry for the mainland European population is inconclusive from phylogenetic trees, but is supported by greater nucleotide diversity and a more ancient population expansion in Morocco than in Europe. Signatures of rapid population expansion in mtDNA, combined with low X and Y chromosome diversity, suggest a single colonization of mainland Europe by a small number of Moroccan shrews >38 K yr ago. This study illustrates that multilocus genetic analyses can facilitate the interpretation of species' evolutionary history but that phylogeographic inference using X and Y chromosomes is restricted by low levels of observed polymorphism.
Resumo:
EMBnet is a consortium of collaborating bioinformatics groups located mainly within Europe (http://www.embnet.org). Each member country is represented by a 'node', a group responsible for the maintenance of local services for their users (e.g. education, training, software, database distribution, technical support, helpdesk). Among these services a web portal with links and access to locally developed and maintained software is essential and different for each node. Our web portal targets biomedical scientists in Switzerland and elsewhere, offering them access to a collection of important sequence analysis tools mirrored from other sites or developed locally. We describe here the Swiss EMBnet node web site (http://www.ch.embnet.org), which presents a number of original services not available anywhere else.
Resumo:
Approximately 520 Wilson disease-causing mutations in the ATP7B gene have been described to date. In this study we report DNA and RNA analyses carried out for molecular characterization of a consensus sequence splicing mutation found in homozygosity in a Swiss Wilson disease patient. RNA analysis of 1946 +6 T→C in both the peripheral lymphoblasts and liver resulted in the production in the propositus of only an alternative transcript lacking exons 6, 7, and 8 resulting most likely in alterations of cell biochemistry and disease. The patient presents an early form of severe hepatic disease characterized by hepatosplenomegaly, reduced hepatic function, anemia and thrombocytopenia indicating that 1946 +6 T→C is a severe mutation. Since identical results were obtained from both peripheral lymphoblasts and liver they also suggest that RNA studies of illegitimate transcripts can be safely used for molecular characterization of ATP7B splicing mutations, thus improving genetic counseling and diagnosis of Wilson disease. Moreover these studies, contribute to reveal the exact molecular mechanisms producing Wilson disease.
Resumo:
PPARs (peroxisome-proliferator-activated receptors) alpha, beta/delta and gamma are a group of transcription factors that are involved in numerous processes, including lipid metabolism and adipogenesis. By comparing liver mRNAs of wild-type and PPARalpha-null mice using microarrays, a novel putative target gene of PPARalpha, G0S2 (G0/G1 switch gene 2), was identified. Hepatic expression of G0S2 was up-regulated by fasting and by the PPARalpha agonist Wy14643 in a PPARalpha-dependent manner. Surprisingly, the G0S2 mRNA level was highest in brown and white adipose tissue and was greatly up-regulated during mouse 3T3-L1 and human SGBS (Simpson-Golabi-Behmel syndrome) adipogenesis. Transactivation, gel shift and chromatin immunoprecipitation assays indicated that G0S2 is a direct PPARgamma and probable PPARalpha target gene with a functional PPRE (PPAR-responsive element) in its promoter. Up-regulation of G0S2 mRNA seemed to be specific for adipogenesis, and was not observed during osteogenesis or myogenesis. In 3T3-L1 fibroblasts, expression of G0S2 was associated with growth arrest, which is required for 3T3-L1 adipogenesis. Together, these data indicate that G0S2 is a novel target gene of PPARs that may be involved in adipocyte differentiation.