Predictive models for chronic renal disease using decision trees, naïve bayes and case-based methods


Autoria(s): Khan, Saqib Hussain
Data(s)

2010

Resumo

Data mining can be used in healthcare industry to “mine” clinical data to discover hidden information for intelligent and affective decision making. Discovery of hidden patterns and relationships often goes intact, yet advanced data mining techniques can be helpful as remedy to this scenario. This thesis mainly deals with Intelligent Prediction of Chronic Renal Disease (IPCRD). Data covers blood, urine test, and external symptoms applied to predict chronic renal disease. Data from the database is initially transformed to Weka (3.6) and Chi-Square method is used for features section. After normalizing data, three classifiers were applied and efficiency of output is evaluated. Mainly, three classifiers are analyzed: Decision Tree, Naïve Bayes, K-Nearest Neighbour algorithm. Results show that each technique has its unique strength in realizing the objectives of the defined mining goals. Efficiency of Decision Tree and KNN was almost same but Naïve Bayes proved a comparative edge over others. Further sensitivity and specificity tests are used as statistical measures to examine the performance of a binary classification. Sensitivity (also called recall rate in some fields) measures the proportion of actual positives which are correctly identified while Specificity measures the proportion of negatives which are correctly identified. CRISP-DM methodology is applied to build the mining models. It consists of six major phases: business understanding, data understanding, data preparation, modeling, evaluation, and deployment.

Formato

application/pdf

Identificador

http://urn.kb.se/resolve?urn=urn:nbn:se:du-20790

Idioma(s)

eng

Publicador

Högskolan Dalarna, Datateknik

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #naïve bayes #decision trees #KNN #Computer and Information Sciences #Data- och informationsvetenskap
Tipo

Student thesis

info:eu-repo/semantics/bachelorThesis

text