62 resultados para discovery of a similarity


Relevância:

90.00% 90.00%

Publicador:

Resumo:

The recent upsurge in microbial genome data has revealed that hemoglobin-like (HbL) proteins may be widely distributed among bacteria and that some organisms may carry more than one HbL encoding gene. However, the discovery of HbL proteins has been limited to a small number of bacteria only. This study describes the prediction of HbL proteins and their domain classification using a machine learning approach. Support vector machine (SVM) models were developed for predicting HbL proteins based upon amino acid composition (AC), dipeptide composition (DC), hybrid method (AC + DC), and position specific scoring matrix (PSSM). In addition, we introduce for the first time a new prediction method based on max to min amino acid residue (MM) profiles. The average accuracy, standard deviation (SD), false positive rate (FPR), confusion matrix, and receiver operating characteristic (ROC) were analyzed. We also compared the performance of our proposed models in homology detection databases. The performance of the different approaches was estimated using fivefold cross-validation techniques. Prediction accuracy was further investigated through confusion matrix and ROC curve analysis. All experimental results indicate that the proposed BacHbpred can be a perspective predictor for determination of HbL related proteins. BacHbpred, a web tool, has been developed for HbL prediction.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Entry profiles can be generated before children with Autism Spectrum Disorders (ASD) begin to traverse an intervention program. They can help evaluate the progress of each child on the dedicated syllabus in addition to enabling narrowing down the best intervention course over time. However, the traits of ASD are expressed in different ways in every individual affected. The resulting spectrum nature of the disorder makes it challenging to discover profiles of children with ASD. Using data from 491 children, traversing the syllabus of a comprehensive intervention program on iPad called TOBY Playpad, we learn the entry profiles of the children based on their age, sex and performance on their first skills of the syllabus. Mixed-variate restricted Boltzmann machines allow us to integrate the heterogeneous data into one model making it a suitable technique. The data based discovery of entry profiles may assist in developing systems that can automatically suggest best suitable paths through the syllabus by clustering the children based on the characteristics they present at the beginning of the program. This may open the pathway for personalised intervention.