946 results for Clinical Data Warehousing
Abstract:
Retrospective clinical data presents many challenges for data mining and machine learning. The transcription of patient records from paper charts and subsequent manipulation of data often results in high volumes of noise as well as a loss of other important information. In addition, such datasets often fail to represent expert medical knowledge and reasoning in any explicit manner. In this research we describe applying data mining methods to retrospective clinical data to build a prediction model for asthma exacerbation severity for pediatric patients in the emergency department. Difficulties in building such a model forced us to investigate alternative strategies for analyzing and processing retrospective data. This paper describes this process together with an approach to mining retrospective clinical data by incorporating formalized external expert knowledge (secondary knowledge sources) into the classification task. This knowledge is used to partition the data into a number of coherent sets, where each set is explicitly described in terms of the secondary knowledge source. Instances from each set are then classified in a manner appropriate for the characteristics of the particular set. We present our methodology and outline a set of experimental results that demonstrate some advantages and some limitations of our approach. © 2008 Springer-Verlag Berlin Heidelberg.
Abstract:
Construction organizations typically deal with large volumes of project data containing valuable information. It is found that these organizations do not use these data effectively for planning and decision-making. There are two reasons. First, the information systems in construction organizations are designed to support day-to-day construction operations. The data stored in these systems are often non-validated and non-integrated, and are available in a format that makes it difficult for decision makers to use them to make timely decisions. Second, the organizational structure and the IT infrastructure are often not compatible with the information systems, resulting in higher operational costs and lower productivity. These two issues have been investigated in this research with the objective of developing systems that are structured for effective decision-making. A framework was developed to guide storage and retrieval of validated and integrated data for timely decision-making and to enable construction organizations to redesign their organizational structure and IT infrastructure to match information system capabilities. The research was focused on construction owner organizations that were continuously involved in multiple construction projects. Action research and data warehousing techniques were used to develop the framework. One hundred and sixty-three construction owner organizations were surveyed in order to assess their data needs, data management practices and extent of use of information systems in planning and decision-making. For in-depth analysis, Miami-Dade Transit (MDT) was selected, which is in charge of all transportation-related construction projects in Miami-Dade County. A functional model and a prototype system were developed to test the framework.
The results revealed significant improvements in data management and decision-support operations that were examined through various qualitative (ease in data access, data quality, response time, productivity improvement, etc.) and quantitative (time savings and operational cost savings) measures. The research results were first validated by MDT and then by a representative group of twenty construction owner organizations involved in various types of construction projects.
Abstract:
The aim is to develop a Data Warehouse for a business group made up of four companies, with the primary goal of consolidating information. Consolidating information is extremely useful, since the companies may hold common data, such as products or customers. The main goal of analytical systems is to allow analysis of the data in the organization's transactional systems, so that users who know nothing about these systems can get support for decision-making in a simple and effective way. Using the Data Warehouse is useful for decision support because it makes users autonomous in carrying out analyses. Users no longer depend on IT specialists to run their queries and can run them themselves. As a result, a query through the Data Warehouse executes in a few seconds, unlike the queries previously created by the specialists, which sometimes took hours to run.
Abstract:
Laboratory safety data are routinely collected in clinical studies for safety monitoring and assessment. We have developed a truncated robust multivariate outlier detection method for identifying subjects with clinically relevant abnormal laboratory measurements. The proposed method can be applied to historical clinical data to establish a multivariate decision boundary that can then be used for future clinical trial laboratory safety data monitoring and assessment. Simulations demonstrate that the proposed method has the ability to detect relevant outliers while automatically excluding irrelevant outliers. Two examples from actual clinical studies are used to illustrate the use of this method for identifying clinically relevant outliers.
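The idea of a multivariate decision boundary fitted on historical data and then applied to new subjects can be illustrated with a minimal sketch. It uses the classical (non-robust, non-truncated) Mahalanobis distance on two laboratory variables as a stand-in, so it is not the authors' method; the function names, the sample values and the 97.5% cutoff are all assumptions made for illustration.

```python
import math

def mahalanobis_sq_2d(reference, point):
    """Squared Mahalanobis distance of `point` from the 2-D `reference` sample."""
    n = len(reference)
    mx = sum(x for x, _ in reference) / n
    my = sum(y for _, y in reference) / n
    # Sample variances and covariance (n - 1 denominator).
    sxx = sum((x - mx) ** 2 for x, _ in reference) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in reference) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in reference) / (n - 1)
    det = sxx * syy - sxy ** 2
    dx, dy = point[0] - mx, point[1] - my
    # Quadratic form with the closed-form inverse of the 2x2 covariance matrix.
    return (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det

def is_outlier(reference, point, level=0.975):
    """Flag `point` if it falls outside the chi-square (2 d.o.f.) boundary."""
    # For 2 degrees of freedom the chi-square quantile is -2 * ln(1 - level).
    cutoff = -2.0 * math.log(1.0 - level)
    return mahalanobis_sq_2d(reference, point) > cutoff

# Hypothetical "historical" measurements of two laboratory variables.
historical = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6), (6, 5)]
print(is_outlier(historical, (4, 4)))  # close to the bulk of the data
print(is_outlier(historical, (1, 6)))  # against the correlation pattern
```

A robust variant would replace the sample mean and covariance with estimates that are less sensitive to contaminated historical data; the decision rule itself would keep the same shape.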
Abstract:
Purpose: To evaluate whether the correlation between in vitro bond strength data and estimated clinical retention rates of cervical restorations after two years depends on pooled data obtained from multicenter studies or single-test data. Materials and Methods: Pooled mean data for six dentin adhesive systems (Adper Prompt L-Pop, Clearfil SE, OptiBond FL, Prime & Bond NT, Single Bond, and Scotchbond Multipurpose) and four laboratory methods (macroshear, microshear, macrotensile and microtensile bond strength test) (Scherrer et al, 2010) were correlated to estimated pooled two-year retention rates of Class V restorations using the same adhesive systems. For bond strength data from a single test institute, the literature search in SCOPUS revealed one study that tested all six adhesive systems (microtensile) and two that tested five of the six systems (microtensile, macroshear). The correlation was determined with a database designed to perform a meta-analysis on the clinical performance of cervical restorations (Heintze et al, 2010). The clinical data were pooled and adjusted in a linear mixed model, taking the study effect, dentin preparation, type of isolation and bevelling of enamel into account. A regression analysis was carried out to evaluate the correlation between clinical and laboratory findings. Results: The results of the regression analysis for the pooled data revealed that only the macrotensile (adjusted R² = 0.86) and microtensile tests (adjusted R² = 0.64), but not the shear and the microshear tests, correlated well with the clinical findings. As regards the data from a single-test institute, the correlation was not statistically significant. Conclusion: Macrotensile and microtensile bond strength tests showed an adequate correlation with the retention rate of cervical restorations after two years.
Bond strength tests should be carried out by different operators and/or research institutes to determine the reliability and technique sensitivity of the material under investigation.
Abstract:
Advances in biomedical signal acquisition systems for motion analysis have led to low-cost and ubiquitous wearable sensors which can be used to record movement data in different settings. This implies the potential availability of large amounts of quantitative data. It is then crucial to identify and to extract the information of clinical relevance from the large amount of available data. This quantitative and objective information can be an important aid for clinical decision making. Data mining is the process of discovering such information in databases through data processing, selection of informative data, and identification of relevant patterns. The databases considered in this thesis store motion data from wearable sensors (specifically accelerometers) and clinical information (clinical data, scores, tests). The main goal of this thesis is to develop data mining tools which can provide quantitative information to the clinician in the field of movement disorders. This thesis will focus on motor impairment in Parkinson's disease (PD). Different databases related to Parkinson subjects in different stages of the disease were considered for this thesis. Each database is characterized by the data recorded during a specific motor task performed by different groups of subjects. The data mining techniques that were used in this thesis are feature selection (a technique which was used to find relevant information and to discard useless or redundant data), classification, clustering, and regression. The aims were to identify high risk subjects for PD, characterize the differences between early PD subjects and healthy ones, characterize PD subtypes and automatically assess the severity of symptoms in the home setting.
Abstract:
This paper considers a model-based approach to the clustering of tissue samples of a very large number of genes from microarray experiments. It is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. Frequently in practice, there are also clinical data available on those cases on which the tissue samples have been obtained. Here we investigate how to use the clinical data in conjunction with the microarray gene expression data to cluster the tissue samples. We propose two mixture model-based approaches in which the number of components in the mixture model corresponds to the number of clusters to be imposed on the tissue samples. One approach specifies the components of the mixture model to be the conditional distributions of the microarray data given the clinical data with the mixing proportions also conditioned on the latter data. Another takes the components of the mixture model to represent the joint distributions of the clinical and microarray data. The approaches are demonstrated on some breast cancer data, as studied recently in van't Veer et al. (2002).
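The two mixture formulations described above can be written out as follows. This is only a sketch of their general shape: the symbols are assumptions introduced here (x for the microarray expression vector of a tissue sample, z for its clinical data, g for the number of components and hence clusters, and π_i for the mixing proportions), and the abstract does not specify the component densities.

```latex
% Approach 1: components are conditional distributions of x given z,
% with mixing proportions that also depend on z.
f(x \mid z) = \sum_{i=1}^{g} \pi_i(z)\, f_i(x \mid z),
\qquad \sum_{i=1}^{g} \pi_i(z) = 1 .

% Approach 2: components are joint distributions of the clinical
% and microarray data.
f(x, z) = \sum_{i=1}^{g} \pi_i\, f_i(x, z),
\qquad \sum_{i=1}^{g} \pi_i = 1 .
```

In either form, a tissue sample would typically be assigned to the component with the highest fitted posterior probability.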
Abstract:
Tihomir Trifonov, Tsvetanka Georgieva-Trifonova - This article presents the bgBell/OLAP system for warehousing and online analytical processing of data about unique Bulgarian bells. The implemented system makes it possible to produce summary reports and to analyze various characteristics of the bells in order to extract previously unknown and potentially useful information.
Abstract:
Although cartilaginous tumors have low microvascular density, vessels are important for the provision of nutrition so that the tumor can grow and generate metastasis. The aim of this study was to assess the value of the vascular pattern classification as a prognostic tool in chondrosarcomas (CSs) and its relation with vascular endothelial growth factor (VEGF) expression. This was a retrospective study of 21 enchondromas and 57 conventional CSs. Clinical data and outcome were retrieved from medical files. CS histologic grades (on a scale of 1 to 3) were determined according to the World Health Organization classification. The vascular pattern (on a scale of A to C) was assessed through CD34, according to Kalinski. CD105 and VEGF were also evaluated. Poor outcome was significantly associated with vascular pattern groups B and C. Higher vascular patterns were 6.5 times more frequent in moderate-grade and high-grade CSs than in grade 1 CS. On multivariate analysis, a clear correlation was found between VEGF overexpression and B/C vascular patterns. Only 18 (benign and malignant) tumors stained for CD105. The results point to the use of the vascular pattern classification as a prognostic tool in CSs and to differentiate low-grade from moderate-grade/high-grade CSs. Vascular pattern might also be used to complement histologic grade, VEGF immunostaining, and microvascular density, for indicating a patient's prognosis. Low-grade CSs develop under low neoangiogenesis, which conforms to the slow growth rate of these tumors.
Abstract:
Background: RRP is a devastating disease in which papillomas in the airway cause hoarseness and breathing difficulty. The disease is caused by human papillomavirus (HPV) 6 or 11 and is very variable. Patients undergo multiple surgeries to maintain a patent airway and in order to communicate vocally. Several small studies have been published, most of which have noted that HPV 11 is associated with a more aggressive course. Methodology/Principal Findings: Papilloma biopsies were taken from patients undergoing surgical treatment of RRP and were subjected to HPV typing. 118 patients with juvenile-onset RRP with at least 1 year of clinical data and infected with a single HPV type were analyzed. HPV 11 was encountered in 40% of the patients. By our definition, most of the patients in the sample (81%) had run an aggressive course. The odds of a patient with HPV 11 running an aggressive course were 3.9 times higher than those of patients with HPV 6 (Fisher's exact p = 0.017). However, clinical course was more closely associated with age of the patient (at diagnosis and at the time of the current surgery) than with HPV type. Patients with HPV 11 were diagnosed at a younger age (2.4y) than were those with HPV 6 (3.4y) (p = 0.014). Both by multiple linear regression and by multiple logistic regression, HPV type was only weakly associated with metrics of disease course when simultaneously accounting for age. Conclusions/Significance: The course of RRP is variable and a quarter of the variability can be accounted for by the age of the patient. HPV 11 is more closely associated with a younger age at diagnosis than it is associated with an aggressive clinical course. These data suggest that there are factors other than HPV type and age of the patient that determine disease course.
A survey on clinical presentation and nutritional status of infants with suspected cow's milk allergy
Abstract:
Background: Cow's milk is the most common food allergen in infants and the diagnosis of cow's milk allergy is difficult, even with the use of several diagnostic tests. Therefore, elimination diets and challenge tests are essential for the diagnosis and treatment of this disorder. The aim of this study is to report the clinical presentation and nutritional status of children evaluated by pediatric gastroenterologists for the assessment of symptoms suggestive of cow's milk allergy. Methods: An observational cross-sectional study was performed among 9,478 patients evaluated by 30 pediatric gastroenterologists for 40 days in 5 different geographical regions in Brazil. Clinical data were collected from patients with symptoms suggestive of cow's milk allergy. The nutritional status of infants (age <= 24 months) seen for the first time was evaluated according to z-scores for weight-for-age, weight-for-height, and height-for-age. Epi-Info (CDC-NCHS, 2000) software was used to calculate z-scores. Results: The prevalence of suspected cow's milk allergy in the study population was 5.4% (513/9,478), and the incidence was 2.2% (211/9,478). Among 159 infants seen at first evaluation, 15.1% presented with a low weight-for-age z score (< -2.0 standard deviation - SD), 8.7% with a low weight-for-height z score (< -2.0 SD), and 23.9% with a low height-for-age z score (< -2.0 SD). Conclusion: The high prevalence of nutritional deficits among infants with symptoms suggestive of cow's milk allergy indicates that effective elimination diets should be prescribed to control allergy symptoms and to prevent or treat malnutrition.
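The reported percentages follow directly from the counts given in the Results. A minimal arithmetic check, using only the figures quoted above (the helper name `percent` is an assumption made for this sketch):

```python
def percent(count, total):
    """Return count / total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)

total_patients = 9478   # patients evaluated during the survey
suspected_cases = 513   # reported under "prevalence"
new_cases = 211         # reported under "incidence"

print(percent(suspected_cases, total_patients))  # prevalence, in percent
print(percent(new_cases, total_patients))        # incidence, in percent
```

Both values round to the figures stated in the abstract (5.4% and 2.2%).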
Abstract:
The objectives of this cross-sectional study were to evaluate the differences between males and females in the clinical and biochemical manifestations of primary antiphospholipid antibody syndrome (PAPS). The method involved 49 patients (38 premenopausal females and 11 males) diagnosed with PAPS (according to the Sapporo criteria) and their demographic data, clinical data, medications used and antiphospholipid antibodies were analyzed. The results of this study are as follows. Both the groups of patients were statistically similar regarding age, race, weight and body mass index. However, males were significantly taller than females (172 ± 8.9 vs. 159.2 ± 6.2 cm, p < 0.0001). The duration of disease was similar for females and males. The prevalence of pulmonary thromboembolism was higher in females than in males (34.2 vs. 0.0%, p = 0.024). Regarding other PAPS manifestations (arterial events, venous events, deep venous thrombosis, thrombocytopenia, acute myocardial infarction, angina, cerebrovascular accidents and Sneddon syndrome), comorbidities (arterial hypertension and dyslipidemia), lifestyle (physical activity, previous smoking and current smoking) and the use of medications (current and previous use of corticosteroids, as well as the use of statins or chloroquine), both groups were statistically similar (p > 0.05). More females than males tested positive for IgM anticardiolipin antibodies (76.3 vs. 36.4%, p = 0.025) or for at least one of the antiphospholipid antibodies tested (either IgM anticardiolipin or IgG anticardiolipin 84.2 vs. 45.5%, p = 0.016). However, both groups were similar regarding the frequency of positivity for lupus anticoagulant and isolated IgG anticardiolipin, as well as regarding mean levels of IgG and IgM anticardiolipin (p > 0.05). We concluded that, among PAPS patients, the frequency of pulmonary thromboembolism and of positivity for IgM anticardiolipin is higher in females than in males.
Our findings show that there are gender differences in PAPS, differences that might be related to alterations in sex hormones.