6 results for Support Vector Machines and Naive Bayes Classifier
in National Center for Biotechnology Information - NCBI
Abstract:
We introduce a method of functionally classifying genes by using gene expression data from DNA microarray hybridization experiments. The method is based on the theory of support vector machines (SVMs). SVMs are considered a supervised computer learning method because they exploit prior knowledge of gene function to identify unknown genes of similar function from expression data. SVMs avoid several problems associated with unsupervised clustering methods, such as hierarchical clustering and self-organizing maps. SVMs have many mathematical features that make them attractive for gene expression analysis, including their flexibility in choosing a similarity function, sparseness of solution when dealing with large data sets, the ability to handle large feature spaces, and the ability to identify outliers. We test several SVMs that use different similarity metrics, as well as some other supervised learning methods, and find that the SVMs best identify sets of genes with a common function using expression data. Finally, we use SVMs to predict functional roles for uncharacterized yeast ORFs based on their expression data.
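The idea of training a supervised classifier on expression profiles while swapping the similarity function can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the expression values, the class labels, and the function names (`linear_kernel`, `rbf_kernel`, `train_kernel_classifier`, `predict`) are all hypothetical, and the training loop is a simple kernel-Adatron-style coordinate update standing in for a real SVM solver. The `+ 1.0` added to each kernel value is an assumed shortcut that absorbs the bias term.

```python
import math

def linear_kernel(a, b):
    # Dot product: the simplest similarity between two expression profiles.
    return sum(x * y for x, y in zip(a, b))

def rbf_kernel(a, b, gamma=0.5):
    # Radial basis function: similarity decays with squared distance.
    return math.exp(-gamma * sum((x - y) ** 2 for x, y in zip(a, b)))

def train_kernel_classifier(X, y, kernel, C=10.0, lr=0.1, epochs=200):
    """Kernel-Adatron-style training (an assumed stand-in for an SVM solver).

    Returns one weight per training example; examples whose weight stays
    at zero do not contribute to predictions (the sparse-solution property).
    """
    n = len(X)
    # Precompute the kernel matrix; "+ 1.0" absorbs the bias term.
    K = [[kernel(X[i], X[j]) + 1.0 for j in range(n)] for i in range(n)]
    alpha = [0.0] * n
    for _ in range(epochs):
        for i in range(n):
            margin = y[i] * sum(alpha[j] * y[j] * K[i][j] for j in range(n))
            # Raise alpha while the margin is below 1, lower it otherwise,
            # keeping it inside the box [0, C].
            alpha[i] = min(C, max(0.0, alpha[i] + lr * (1.0 - margin)))
    return alpha

def predict(x, X, y, alpha, kernel):
    # Sign of the kernel-weighted vote of the training examples.
    score = sum(a * yi * (kernel(xi, x) + 1.0)
                for a, yi, xi in zip(alpha, y, X) if a > 0.0)
    return 1 if score >= 0 else -1

# Hypothetical expression profiles over four conditions.
# +1: genes known to belong to a functional class; -1: genes known not to.
X = [
    [0.1, 0.5, 0.9, 1.2],
    [0.0, 0.4, 1.0, 1.1],
    [0.2, 0.6, 0.8, 1.3],
    [1.0, 0.6, 0.2, -0.1],
    [1.1, 0.5, 0.1, 0.0],
    [0.9, 0.7, 0.3, -0.2],
]
y = [1, 1, 1, -1, -1, -1]

# Two uncharacterized profiles (also hypothetical).
rising = [0.0, 0.5, 0.9, 1.0]
falling = [1.0, 0.5, 0.2, 0.0]

for name, kernel in [("linear", linear_kernel), ("rbf", rbf_kernel)]:
    alpha = train_kernel_classifier(X, y, kernel)
    print(name, predict(rising, X, y, alpha, kernel),
          predict(falling, X, y, alpha, kernel))
```

Changing the similarity metric only requires passing a different `kernel` function; the training and prediction code stays the same, which is the flexibility the abstract refers to.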
Abstract:
Inteins are protein-splicing elements, most of which contain conserved sequence blocks that define a family of homing endonucleases. Like group I introns that encode such endonucleases, inteins are mobile genetic elements. Recent crystallography and computer modeling studies suggest that inteins consist of two structural domains that correspond to the endonuclease and the protein-splicing elements. To determine whether the bipartite structure of inteins is mirrored by the functional independence of the protein-splicing domain, the entire endonuclease component was deleted from the Mycobacterium tuberculosis recA intein. Guided by computer modeling studies, and taking advantage of genetic systems designed to monitor intein function, the 440-aa Mtu recA intein was reduced to a functional mini-intein of 137 aa. The accuracy of splicing of several mini-inteins was verified. This work not only substantiates structure predictions for intein function but also supports the hypothesis that, like group I introns, mobile inteins arose by an endonuclease gene invading a sequence encoding a small, functional splicing element.
Abstract:
The mosquito midgut plays a central role in the sporogonic development of malaria parasites. We have found that polyclonal sera, produced against mosquito midguts, blocked the passage of Plasmodium falciparum ookinetes across the midgut, leading to a significant reduction of infections in mosquitoes. Anti-midgut mAbs were produced that display broad-spectrum activity, blocking parasite development of both P. falciparum and Plasmodium vivax parasites in five different species of mosquitoes. In addition to their parasite transmission-blocking activity, these mAbs also reduced mosquito survivorship and fecundity. These results reveal that mosquito midgut-based antibodies have the potential to reduce malaria transmission in a synergistic manner by lowering both vector competence, through transmission-blocking effects on parasite development, and vector abundance, by decreasing mosquito survivorship and egg laying capacity. Because the intervention can block transmission of different malaria parasite species in various species of mosquitoes, vaccines against such midgut receptors may block malaria transmission worldwide.
Abstract:
Adenovirus (Ad) vectors have been extensively used to deliver recombinant genes to a great variety of cell types in vitro and in vivo. Ad-based vectors are available that replace the Ad early region 1 (E1) with recombinant foreign genes. The resultant E1-deleted vectors can then be propagated on 293 cells, a human embryonal kidney cell line that constitutively expresses the E1 genes. Unfortunately, infection of cells and tissues in vivo results in low-level expression of Ad early and late proteins (despite the absence of E1 activity) resulting in immune recognition of virally infected cells. The infected cells are subsequently eliminated, resulting in only a transient expression of foreign genes in vivo. We hypothesize that a second-generation Ad vector with a deletion of viral genes necessary for Ad genome replication should block viral DNA replication and decrease viral protein production, resulting in a diminished immune response and extended duration of foreign gene expression in vivo. As a first step toward the generation of such a modified vector, we report the construction of cell lines that not only express the E1 genes but also constitutively express the Ad serotype 2 140-kDa DNA polymerase protein, one of three virally encoded proteins essential for Ad genome replication. The Ad polymerase-expressing cell lines support the replication and growth of H5ts36, an Ad with a temperature-sensitive mutation of the Ad polymerase protein. These packaging cell lines can be used to prepare Ad vectors deleted for the E1 and polymerase functions, which should facilitate development of viral vectors for gene therapy of human diseases.