931 resultados para Bioinformatics


Relevância:

10.00% 10.00%

Publicador:

Resumo:

We consider the problem of assessing the number of clusters in a limited number of tissue samples containing gene expressions for possibly several thousands of genes. It is proposed to use a normal mixture model-based approach to the clustering of the tissue samples. One advantage of this approach is that the question on the number of clusters in the data can be formulated in terms of a test on the smallest number of components in the mixture model compatible with the data. This test can be carried out on the basis of the likelihood ratio test statistic, using resampling to assess its null distribution. The effectiveness of this approach is demonstrated on simulated data and on some microarray datasets, as considered previously in the bioinformatics literature. (C) 2004 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

DNA Microarray is a powerful tool to measure the level of a mixed population of nucleic acids at one time, which has great impact in many aspects of life sciences research. In order to distinguish nucleic acids with very similar composition by hybridization, it is necessary to design microarray probes with high specificities and sensitivities. Highly specific probes correspond to probes having unique DNA sequences; whereas highly sensitive probes correspond to those with melting temperature within a desired range and having no secondary structure. The selection of these probes from a set of functional DNA sequences (exons) constitutes a computationally expensive discrete non-linear search problem. We delegate the search task to a simple yet effective Evolution Strategy algorithm. The computational efficiency is also greatly improved by making use of an available bioinformatics tool.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Modern toxicology investigates a wide array of both old and new health hazards. Priority setting is needed to select agents for research from the plethora of exposure circumstances. The changing societies and a growing fraction of the aged have to be taken into consideration. A precise exposure assessment is of importance for risk estimation and regulation. Toxicology contributes to the exploration of pathomechanisms to specify the exposure metrics for risk estimation. Combined effects of co-existing agents are not yet sufficiently understood. Animal experiments allow a separate administration of agents which can not be disentangled by epidemiological means, but their value is limited for low exposure levels in many of today's settings. As an experimental science, toxicology has to keep pace with the rapidly growing knowledge about the language of the genome and the changing paradigms in cancer development. During the pioneer era of assembling a working draft of the human genome, toxicogenomics has been developed. Gene and pathway complexity have to be considered when investigating gene-environment interactions. For a best conduct of studies, modem toxicology needs a close liaison with many other disciplines like epidemiology and bioinformatics. (C) 2004 Elsevier Ireland Ltd. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Scorpion toxins are common experimental tools for studies of biochemical and pharmacological properties of ion channels. The number of functionally annotated scorpion toxins is steadily growing, but the number of identified toxin sequences is increasing at much faster pace. With an estimated 100,000 different variants, bioinformatic analysis of scorpion toxins is becoming a necessary tool for their systematic functional analysis. Here, we report a bioinformatics-driven system involving scorpion toxin structural classification, functional annotation, database technology, sequence comparison, nearest neighbour analysis, and decision rules which produces highly accurate predictions of scorpion toxin functional properties. (c) 2005 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A novel member of the human relaxin subclass of the insulin superfamily was recently discovered during a genomics database search and named relaxin-3. Like human relaxin-1 and relaxin-2, relaxin-3 is predicted to consist of a two-chain structure and three disulfide bonds in a disposition identical to that of insulin. To undertake detailed biophysical and biological characterization of the peptide, its chemical synthesis was undertaken. In contrast to human relaxin-1 and relaxin-2, however, relaxin-3 could not be successfully prepared by simple combination of the individual chains, thus necessitating recourse to the use of a regioselective disulfide bond formation strategy. Solid phase synthesis of the separate, selectively S-protected A and B chains followed by their purification and the subsequent stepwise formation of each of the three disulfides led to the successful acquisition of human relaxin-3. Comprehensive chemical characterization confirmed both the correct chain orientation and the integrity of the synthetic product. Relaxin-3 was found to bind to and activate native relaxin receptors in vitro and stimulate water drinking through central relaxin receptors in vivo. Recent studies have demonstrated that relaxin-3 will bind to and activate human LGR7, but not LGR8, in vitro. Secondary structural analysis showed it to adopt a less ordered confirmation than either relaxin-1 or relaxin-2, reflecting the presence in the former of a greater percentage of nonhelical forming amino acids. NMR spectroscopy and simulated annealing calculations were used to determine the three-dimensional structure of relaxin-3 and to identify key structural differences between the human relaxins.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The mammalian retromer protein complex, which consists of three proteins - Vps26, Vps29, and Vps35 - in association with members of the sorting nexin family of proteins, has been implicated in the trafficking of receptors and their ligands within the endosomal/lysosomal system of mammalian cells. A bioinformatic analysis of the mouse genome identified an additional transcribed paralog of the Vps26 retromer protein, which we termed Vps26B. No paralogs were identified for Vps29 and Vps35. Phylogenetic studies indicate that the two paralogs of Vps26 become evident after the evolution of the chordates. We propose that the chordate Vps26-like gene published previously be renamed Vps26A to differentiate it from Vps26B. As for Vps26A, biochemical characterization of Vps26B established that this novel 336 amino acid residue protein is a peripheral membrane protein. Vps26B co-precipitated with Vps35 from transfected cells and the direct interaction between these two proteins was confirmed by yeast 2-hybrid analysis, thereby establishing Vps26B as a subunit of the retromer complex. Within HeLa cells, Vps26B was found in the cytoplasm with low levels at the plasma membrane, while Vps26A was predominantly associated with endosomal membranes. Within A549 cells, both Vps26A and Vps26B co-localized with actin-rich lamellipodia at the cell surface. These structures also co-localized with Vps35. Total internal reflection fluorescence microscopy confirmed the association of Vps26B with the plasma membrane in a stable HEK293 cell line expressing cyan fluorescent protein (CFP)-Vps26B. Based on these observations, we propose that the mammalian retromer complex is located at both endosomes and the plasma membrane in some cell types.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Proteins secreted by and anchored on the surfaces of parasites are in intimate contact with host tissues. The transcriptome of infective cercariae of the blood fluke, Schistosoma mansoni, was screened using signal sequence trap to isolate cDNAs encoding predicted proteins with an N-terminal signal peptide. Twenty cDNA fragments were identified, most of which contained predicted signal peptides or transmembrane regions, including a novel putative seven-transmembrane receptor and a membrane-associated mitogen-activated protein kinase. The developmental expression pattern within different life-cycle stages ranged from ubiquitous to a transcript that was highly upregulated in the cercaria. A bioinformatics-based comparison of 100 signal peptides from each of schistosomes, humans, a parasitic nematode and Escherichia coli showed that differences in the sequence composition of signal peptides, notably the residues flanking the predicted cleavage site, might account for the negative bias exhibited in the processing of schistosome signal peptides in mammalian cells. (c) 2005 Federation of European Microbiological Societies. Published by Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background: The multitude of motif detection algorithms developed to date have largely focused on the detection of patterns in primary sequence. Since sequence-dependent DNA structure and flexibility may also play a role in protein-DNA interactions, the simultaneous exploration of sequence-and structure-based hypotheses about the composition of binding sites and the ordering of features in a regulatory region should be considered as well. The consideration of structural features requires the development of new detection tools that can deal with data types other than primary sequence. Results: GANN ( available at http://bioinformatics.org.au/gann) is a machine learning tool for the detection of conserved features in DNA. The software suite contains programs to extract different regions of genomic DNA from flat files and convert these sequences to indices that reflect sequence and structural composition or the presence of specific protein binding sites. The machine learning component allows the classification of different types of sequences based on subsamples of these indices, and can identify the best combinations of indices and machine learning architecture for sequence discrimination. Another key feature of GANN is the replicated splitting of data into training and test sets, and the implementation of negative controls. In validation experiments, GANN successfully merged important sequence and structural features to yield good predictive models for synthetic and real regulatory regions. Conclusion: GANN is a flexible tool that can search through large sets of sequence and structural feature combinations to identify those that best characterize a set of sequences.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Structural similarity among proteins is reflected in the distribution of hydropathicity along the amino acids in the protein sequence. Similarities in the hydropathy distributions are obvious for homologous proteins within a protein family. They also were observed for proteins with related structures, even when sequence similarities were undetectable. Here we present a novel method that employs the hydropathy distribution in proteins for identification of (sub)families in a set of (homologous) proteins. We represent proteins as points in a generalized hydropathy space, represented by vectors of specifically defined features. The features are derived from hydropathy of the individual amino acids. Projection of this space onto principal axes reveals groups of proteins with related hydropathy distributions. The groups identified correspond well to families of structurally and functionally related proteins. We found that this method accurately identifies protein families in a set of proteins, or subfamilies in a set of homologous proteins. Our results show that protein families can be identified by the analysis of hydropathy distribution, without the need for sequence alignment. (C) 2005 Wiley-Liss, Inc.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

To ensure signalling fidelity, kinases must act only on a defined subset of cellular targets. Appreciating the basis for this substrate specificity is essential for understanding the role of an individual protein kinase in a particular cellular process. The specificity in the cell is determined by a combination of peptide specificity of the kinase (the molecular recognition of the sequence surrounding the phosphorylation site), substrate recruitment and phosphatase activity. Peptide specificity plays a crucial role and depends on the complementarity between the kinase and the substrate and therefore on their three-dimensional structures. Methods for experimental identification of kinase substrates and characterization of specificity are expensive and laborious, therefore, computational approaches are being developed to reduce the amount of experimental work required in substrate identification. We discuss the structural basis of substrate specificity of protein kinases and review the experimental and computational methods used to obtain specificity information. (c) 2005 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Candida albicans is a pathogen commonly infecting patients who receive immunosuppressive drug therapy, long-term catheterization, or those who suffer from acquired immune deficiency syndrome (AIDS). The major factor accountable for pathogenicity of C. albicans is host immune status. Various virulence molecules, or factors, of are also responsible for the disease progression. Virulence proteins are published in public databases but they normally lack detailed functional annotations. We have developed CandiVF, a specialized database of C. albicans virulence factors (http://antigen.i2r.a-star.edu.sg/Templar/DB/CandiVF/) to facilitate efficient extraction and analysis of data aimed to assist research on immune responses, pathogenesis, prevention, and control of candidiasis. CandiVF contains a large number of annotated virulence proteins, including secretory, cell wall-associated, membrane, cytoplasmic, and nuclear proteins. This database has in-built bioinformatics tools including keyword and BLAST search, visualization of 3D-structures, HLA-DR epitope prediction, virulence descriptors, and virulence factors ontology.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Integrating information in the molecular biosciences involves more than the cross-referencing of sequences or structures. Experimental protocols, results of computational analyses, annotations and links to relevant literature form integral parts of this information, and impart meaning to sequence or structure. In this review, we examine some existing approaches to integrating information in the molecular biosciences. We consider not only technical issues concerning the integration of heterogeneous data sources and the corresponding semantic implications, but also the integration of analytical results. Within the broad range of strategies for integration of data and information, we distinguish between platforms and developments. We discuss two current platforms and six current developments, and identify what we believe to be their strengths and limitations. We identify key unsolved problems in integrating information in the molecular biosciences, and discuss possible strategies for addressing them including semantic integration using ontologies, XML as a data model, and graphical user interfaces as integrative environments.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Adenosylhomocysteine hydrolase-like protein 1 (AHCYL1) is a novel intracellular protein with similar to 50% protein identity to adenosyl homocysteine hydrolase (AHCY), an important enzyme for metabolizing S-adenosyl-L-homocysteine, the by-product of S-adenosyl-L-homomethionine-dependent methylation. AHCYL1 binds to the inositol 1,4,5-trisphosphate receptor, suggesting that AHCYL1 is involved in intracellular calcium release. We identified two zebrafish AHCYL1 orthologs(zAHCYL1A and -B) by bioinformatics and reverse transcription-PCR. Unlike the ubiquitously present AHCY genes, AHCYL1 genes were only detected in segmented animals, and AHCYL1 proteins were highly conserved among species. Phylogenic analysis suggested that the AHCYL1 gene diverged early from AHCY and evolved independently. Quantitative reverse transcription-PCR showed that zAHCYL1A and -B mRNA expression was regulated differently from the other AHCY-like protein zAHCYL2 and zAHCY during zebrafish embryogenesis. Injection of morpholino antisense oligonucleotides against zAHCYL1A and -B into zebrafish embryos inhibited zAHCYL1A and -B mRNA translation specifically and induced ventralized morphologies. Conversely, human and zebrafish AHCYL1A mRNA injection into zebrafish embryos induced dorsalized morphologies that were similar to those obtained by depleting intracellular calcium with thapsigargin. Human AHCY mRNA injection showed little effect on the embryos. These data suggest that AHCYL1 has a different function from AHCY and plays an important role in embryogenesis by modulating inositol 1,4,5-trisphosphate receptor function for the intracellular calcium release.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Several pathogenic strains of Escherichia coli exploit type III secretion to inject effector proteins into human cells, which then subvert eukaryotic cell biology to the bacterium's advantage. We have exploited bioinformatics and experimental approaches to establish that the effector repertoire in the Sakai strain of enterohemorrhagic E. coli (EHEC) O157:H7 is much larger than previously thought. Homology searches led to the identification of > 60 putative effector genes. Thirteen of these were judged to be likely pseudogenes, whereas 49 were judged to be potentially functional. In total, 39 proteins were confirmed experimentally as effectors: 31 through proteomics and 28 through translocation assays. At the protein level, the EHEC effector sequences fall into > 20 families. The largest family, the NleG family, contains 14 members in the Sakai strain alone. EHEC also harbors functional homologs of effectors from plant pathogens (HopPtoH, HopW, AvrA) and from Shigella (OspD, OspE, OspG), and two additional members of the Map/IpgB family. Genes encoding proven or predicted effectors occur in > 20 exchangeable effector loci scattered throughout the chromosome. Crucially, the majority of functional effector genes are encoded by nine exchangeable effector loci that lie within lambdoid prophages. Thus, type III secretion in E. coli is linked to a vast phage metagenome, acting as a crucible for the evolution of pathogenicity.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This study describes the identification of outer membrane proteins (OMPs) of the bacterial pathogen Pasteurella multocida and an analysis of how the expression of these proteins changes during infection of the natural host. We analysed the sarcosine-insoluble membrane fractions, which are highly enriched for OMPs, from bacteria grown under a range of conditions. Initially, the OMP-containing fractions were resolved by 2-DE and the proteins identified by MALDI-TOF MS. In addition, the OMP-containing fractions were separated by 1-D SDS-PAGE and protein identifications were made using nano LC MS/MS. Using these two methods a total of 35 proteins was identified from samples obtained from organisms grown in rich culture medium. Six of the proteins were identified only by 2-DE MALDI-TOF MS, whilst 17 proteins were identified only by 1-D LC MS/MS. We then analysed the OMPs from P. multocida which had been isolated from the bloodstream of infected chickens (a natural host) or grown in iron-depleted medium. Three proteins were found to be significantly up-regulated during growth in vivo and one of these (Pm0803) was also up-regulated during growth in iron-depleted medium. After bioinformatic analysis of the protein matches, it was predicted that over one third of the combined OMPs predicted by the bioinformatics sub-cellular localisation tools PSORTB and Proteome Analyst, had been identified during this study. This is the first comprehensive proteomic analysis of the P. multocida outer membrane and the first proteomic analysis of how a bacterial pathogen modifies its outer membrane proteome during infection.